The statistical model for multiple linear regression with response variable and explanatory variables is
where . The deviations are independent Normal random variables with mean 0 and a common standard deviation . The parameters of the model are , and .
The ’s are estimated by the coefficients of the multiple regression equation fitted to the data by the method of least squares. The parameter is estimated by the regression standard error
where the are the residuals,
A level confidence interval for the regression coefficient is
where is the value for the density curve with area between and .
Tests of the hypothesis are based on the individual statistic:
and the distribution.
The ANOVA table for a multiple linear regression gives the degrees of freedom, sum of squares, and mean squares for the regression and residual sources of variation. The ANOVA statistic is the ratio MSR/MSE and is used to test the null hypothesis
562
If is true, this statistic has the distribution.
The squared multiple correlation is given by the expression
and is interpreted as the proportion of the variability in the response variable that is explained by the explanatory variables in the multiple linear regression.