EXAMPLE 11.1
Predicting early success in college. Our case study is based on data collected on science majors at a large university.1 The purpose of the study was to attempt to predict success in the early university years. Success was measured using the cumulative grade point average (GPA) after three semesters. The explanatory variables were achievement scores available at the time of enrollment in the university. These included their average high school grades in mathematics (HSM), science (HSS), and English (HSE).
609
We will use high school grades to predict the response variable GPA. There are p = 3 explanatory variables: x1 = HSM, x2 = HSS, and x3 = HSE. The high school grades are coded on a scale from 1 to 10, with 10 corresponding to A, 9 to A−, 8 to B+, and so on. These grades define the subpopulations. For example, the straigh
One possible multiple regression model for the subpopulation mean GPAs is
μGPA = β0 + β1HSM + β2HSS + β3HSE
For the straigh
μGPA = β0 + β14 + β24 + β34