Processing math: 100%

EXAMPLE 10.11 Prediction Intervals from a Spreadsheet

In Example 10.7 (pages 511512), we used statistical software to predict the log income of Alexander, who has EDUC=16 years of education. Suppose that we have only the Excel spreadsheet. The prediction interval then requires some additional work.

Step 1. From the Excel output in Figure 10.5 (page 492), we know that s=1.1146. Excel can also find the mean and variance of the EDUC x for the 100 entrepreneurs. They are ˉx=13.28 and s2x=5.901.

Step 2. We need the value of (xiˉx)2. Recalling the definition of the variance, we see that this is just

(xiˉx)2=(n1)s2x=(99)(5.901)=584.2

Step 3. The standard error for predicting Alexander’s log income from his years of education, x*=16, is

SEˆy=s1+1n+(x*ˉx)2(xiˉx)2=1.11461+1100+(1613.28)2584.2=1.11461+1100+7.3984584.2=(1.1146)(1.01127)=1.12716

Step 4. We predict Alexander’s log income from the least-squares line (Figure 10.5 again):

ˆy=8.2546+(0.1126)(16)=10.0562

This agrees with the “Fit” from software in Example 10.8. The 95% prediction interval requires the 95% critical value for t(98). For hand calculation we use t*=1.990 from Table D with df=80. The interval is

ˆy±t*SEˆy=10.0562±(1.990)(1.12716)=10.0562±2.2430=7.8132to12.2992

Page 520

This agrees with the software result in Example 10.8, with a small difference due to roundoff and especially to not having the exact t*.

[Leave] [Close]