* Standardized
– In contrast to raw parameters (which are expressed in different units of measure, depending on the described variable, and are not directly comparable) the standardized estimates of the parameters of the model allow the comparison of the contribution of particular variables to the explanation of the variance of the dependent variable
.
. The coefficient is used for the study of the corrrelation of each pair of variables, without taking into consideration the effect of the remaining variables in the model.
and is the measure of correlation between the specific independent variable
(taking into account its correlation with the remaining variables in the model) and the dependent variable
(taking into account its correlation with the remaining variables in the model).
The square of that coefficient is the partial determination coefficient – it falls within the range
and defines the relation of only the variance of the given independent variable
with that variance of the dependent variable
which was not explained by other variables in the model.
The closer the value of those coefficients to 0, the more useless the information carried by the studied variable, which means the variable is redundant.
and is the measure of correlation between the specific independent variable
(taking into account its correlation with the remaining variables in the model) and the dependent variable
(NOT taking into account its correlation with the remaining variables in the model).
The square of that coefficient is the semipartial determination coefficient – it falls within the range
and defines the relation of only the variance of the given independent variable
with the complete variance of the dependent variable
.
The closer the value of those coefficients to 0, the more useless the information carried by the studied variable, which means the variable is redundants.
) – it represents the percentage of variance of the given independent variable
, explained by the remaining independent variables. The closer to value 1 the stronger the linear relation of the studied variable with the remaining independent variables, which can mean that the variable is a redundant one.
) – determines how much the variance of the estimated regression coefficient is increased due to collinearity. The closer the value is to 1, the lower the collinearity and the smaller its effect on the coefficient variance. It is assumed that strong collinearity occurs when the coefficient VIF>5 \cite{sheather}. f the variance inflation factor is 5 (
= 2.2), this means that the standard error for the coefficient of this variable is 2.2 times larger than if this variable had zero correlation with other variables
.
– it represents the percentage of variance of the given independent variable
, NOT explained by the remaining independent variables. The closer the value of tolerance is to 0 the stronger the linear relation of the studied variable with the remaining independent variables, which can mean that the variable is a redundant one.The comparison of the two model is made with by means of:
In the case of removing only one variable the results of both tests are identical.
If the difference between the compared models is statistically significant (the value
), the full model is significantly better than the reduced model. It means that the studied variable is not redundant, it has a significant effect on the given model and should not be removed from it.
The charts allow a subjective evaluation of linearity of the relation among the variables and an identification of outliers. Additionally, scatter plots can be useful in an analysis of model residuals.