Describe mathematical relationships and make predictions from experimental data
Linear regression is a statistical modeling technique used to describe a continuous response variable as a function of one or more predictor variables. It can help you understand and predict the behavior of complex systems or analyze experimental, financial, and biological data.
Linear regression techniques are used to create a linear model. The model describes the relationship between a dependent variable \(y\) (also called the response) as a function of one or more independent variables \(X_i\) (called the predictors). The general equation for a linear regression model is:
\[Y = \beta_0 + \sum \ \beta_i X_i + \epsilon_i\]
where \(\beta\) represents linear parameter estimates to be computed and \(\epsilon\) represents the error terms.
Types of linear regression
Simple linear regression: models using only one predictor. The general equation is:
\[Y = \beta_0 + \beta_i X+ \epsilon_i\]
Multiple linear regression: models using multiple predictors. This regression has multiple \(X_i\) to predict the response, \(Y\). An example of this equation is:
\[Y = \beta_0 + \beta_1 X_1 + \beta_2 X_2+ \epsilon\]
Multivariate linear regression: models for multiple response variables. This regression has multiple \(Y_i\)derived from the same data \(X\). They are expressed in different formulae. An example of this system with 2 equations is:
\[Y_1 = \beta_{01} + \beta_{11} X_1 + \epsilon_1\]
\[Y_2 = \beta_{02} + \beta_{1 2}X_1 + \epsilon_2\]
Multivariate multiple linear regression: models using multiple predictors for multiple response variables. This regression has multiple \(X_i\) to predict multiple responses \(Y_i\). A generalization of the equations is:
Applications of linear regression
Linear regressions have some properties that make them very interesting for the following applications :
- Prediction or forecasting – Use a regression model to build a forecast model for a specific data set. From the model, you can use regression to predict response values where only the predictors are known.
- Strength of the regression – Use a regression model to determine if there is a relationship between a variable and a predictor, and how strong this relationship is.
Linear regression with MATLAB
Engineers commonly create simple linear regression models with MATLAB. For multiple and multivariate linear regression, you can use the Statistics and Machine Learning Toolbox™ from MATLAB. It enables stepwise, robust, and multivariate regression to:
- Generate predictions
- Compare linear model fits
- Plot residuals
- Evaluate goodness-of-fit
- Detect outliers
To create a linear model that fits curves and surfaces to your data, see Curve Fitting Toolbox™.
Examples and How To
Software Reference
See also: Time Series Regression