TOPIC 8 - LEAST-SQUARES REGRESSION

8-1: Feeding Fido

(a) FAT = 100 - .3 (200) = 40 g

(b)  FAT = 100 - 3 (120) = 64 g

(c)

(d) decrease, by 3 g.

8-2 Air Fares (cont.)

(a) $140

(b) $240

(c) slope b = (140-240)/(300-1500) = 1/12 = .083

(d) intercept a = y1 - b x1 = 140 - .083 (300) = 115

(e) FARE = .083 (DISTANCE) + 115

8-3: Air Fares ( cont. )

 
Mean Std. dev. Correlation
Airfare (y) 166.9 59.5
.795
Distance (x) 713 413

(a) b = 0.117; a = 83.3

(b) airfare = 83.3 + .117 distance

(c) 118.50

(d) 259.30

(e)

(f) (Asks for prediction)

(g) 188.90

8-4: Air Fares ( cont. )

(a) 416.80

(b)
Distance 900 901 902 903
Predicted Airfare 188.90 189.02 189.13 189.25

(c) yes; 0.117; this number is the slope coefficient of the least squares line.

(d) 11.70

8-5: Air Fares ( cont. )

(a) 150.88

(b) 27.12

(c)
Destination Distance Airfare Fitted Residual
Atlanta 576 178 150.88 27.12
Boston 370 138 126.70 11.30
Chicago 612 94 155.10 -61.10
Dallas/Fort Worth 1216 278 226.00 52.00
Detroit 409 158 131.27 26.73
Denver 1502 258 259.57 -1.56
Miami 946 198 194.30 3.70
New Orleans 998 188 200.41 -12.41
New York 189 98 105.45 -7.45
Orlando 787 179 175.64 3.36
Pittsburgh 210 138 107.92 30.08
St. Louis 737 98 169.77 -71.77

(d) St. Louis; distance: 737 airfare: 98; residual: 71.77; overestimate

(e) greater

(f) below

(g) mean: 0; standard deviation: 36.1

(h) 0.368

(i) 0.632

(j) The sum equals one.

(k) .632

8-8 Car's Fuel Efficiency (cont.)

(a) b = -.959 (6.70)/590 = -.0109, a = 27.56 - (-.0109)(3208) = 62.53

(b) The least-squares line predicts a car's fuel efficiency to drop by 100(-.0109) = 1.09 miles per gallon for each 100 additional pounds of weight.

(c) r2 = (-.959)2 = .920, so 92% of the variability in cars' miles per gallon is explained by the weight variable.

8-9 Governor Salaries (cont.)

(a) Gov salary = 28569.7+2.70875 (Avg pay)

(b) r2 =.212, so 21.2% of the variability in Gov salary values is explained by Avg pay.

(c) North Carolina has the largest positive residual.  This means that NC has a unusually large governor salary given its average pay.

(d) Connecticut has the largest (in absolute value) negative residual.  This state has an usually small governor salary given its average pay.

(e) Alaska has the largest fitted value, since it has the largest Avg pay.

8-10 Basketball Rookie Salaries

(a) salary = 2657443 - 98957 (pick number)

(b) r2 = .787, so 78.7% of the variation in salaries is explained by pick number.

(c)  player drafted number 12 has fitted salary

2657443 - 98957 (12) = 1469959

His residual would be

residual  = observed salary - fitted salary = 1370000 - 1469959 = -99956

(d) fitted yearly salary for player drafted 15 would be 2657443 - 98957 (15) = 1173088

 fitted yearly salary for player drafted 25 would be 2657443 - 98957 (25) = 183518

(e)  the regression line predicts the salary to drop by $98957 for each additional pick number.

8-14 Beatles Hit Songs

(a)  There is a general negative relationship between WEEKS on the chart and PEAK position.

(b)  If a song peaks at #20, we would predict (using the regression line) that it would stay

WEEKS = 11.54 - .124 (20) = 9.06 approx 9

(c)  "Hey Jude" stayed 19 weeks on the chart and (since it peaked at 1) we would predict to stay

WEEKS = 11.54 - .124 (1) = 11.42

So the residual is

RESIDUAL = 19 - 11.42 = 7.58

(d) "If I Feel" has a positive residual around +2, "Don't Let Me Down" has a negative residual around 2.

(e) The negative residuals are located in the middle of the plot for moderate PEAK values; the positive residuals are located for very small and very large PEAK values.  This pattern suggests a line fit is not appropriate.