


Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
An introduction to linear regression, a statistical technique used to model the relationship between one or more input variables and a continuous outcome variable. Linear regression assumes a linear relationship between the input variables and the outcome variable, which can be transformed if necessary to achieve a linear relationship. The use cases of linear regression in various fields, including real estate, demand forecasting, and medical research. It also explains the foundations of linear regression modeling and the use of ordinary least squares (ols) to estimate the unknown parameters.
What you will learn
Typology: Lecture notes
1 / 4
This page cannot be seen from the preview
Don't miss anything!
Unit 3 Linear Regression
Use Cases Linear regression is often used in business, government, and other scenarios. Some common practical applications of linear regression in the real world inc lude the following:
Medical : A linear regression model can be used to analyze the effect of a proposed radiation treatment on reducing tumor sizes. Input variables might include duration of a single radiation treatment, frequency of radiation treatment, and patient attributes such as age or weight.
Model Description As the name of this technique suggests, the linear regression model assumes that there is a linear relationship between the input variables and the outcome variable. This relationship can be expressed as shown in Equation 6-1.
where: y is the outcome variable xi are the input variables, for j = 1, 2, ... , p- 1 {30 is the value of y when each xi equals zero f]i is the change in y based on a unit change in xi' for j = 1, 2, .. • , p- 1 Eisa random error term that represents the difference in the linear model and a particular observed value for y.
Suppose it is desired to build a linear regression model that estimates a person's annual income as a function of two variables-age and education-both expressed in years. In this case, income is the outcome variable, and the input variables are age and education. Although it may be an over generalization, such a model seems intuitively correct in the sense that people's income should increase as their skill set and experience expand with age. Also, the employment opportunities and starting salaries would be expected to be greater for those who have attained more education. However, it is also obvious that there is considerable variation in income levels for a group of people with identical ages and years of education. This variation is represented by E in the model. So, in this example, the model would be expressed as shown in Equation 6-2.
In the linear model, the p is represent the unknown p parameters. The estimates for these unknown parameters are chosen so that, on average, the model provides a reasonable estimate of a person's income based on age and education. In other words, the fitted model should minimize the overall error between