TIM7100 Model Simple Linear Regression ASSIGNMENT 7

TIM7100

DATA FILE 6

ASSIGNMENT 7

1. Baseball wisdom says if you can’t hit, you can’t win. But is the number of games won by a major-league baseball team in a season related to the team’s batting average?  The table below shows the number of games won and the batting averages for 14 games in one American League season.

 Team Games Won, y Team Batting Average, x Cleveland 99 .293 New York 92 .288 Boston 85 .283 Toronto 74 .259 Texas 90 .284 Detroit 53 .256 Minnesota 78 .288 Baltimore 88 .274 California 70 .276 Milwaukee 80 .279 Seattle 85 .287 Kansas City 75 .267 Oakland 78 .265 Chicago 85 .281

1. If you were to model the relationship between the mean (or expected) number of games won by a major-league team and the team’s batting average x, using a straight line, would you expect the slope of the line to be positive or negative? Explain.
2. Construct a scattergram of the data. Does the pattern revealed by the scattergram agree with your answer to part a?Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line?
3. Graph the least squares line on your scattergram. Does your least squares line seem to fit the points on your scattergram?
4. Does the mean or expected number of games won appear to be strongly related to a team’s batting average? Explain.
5. Interpret the values of and  in the words of the problem.

1. To improve the quality of the output of any production process, it is necessary first to understand the capabilities of the process. In a particular manufacturing process, the useful life of a cutting tool is related to the speed at which the tool is operated.  It’s necessary to understand this relationship in order to predict when the tool should be replaced and how many spare tools should be available.  The data in the table below were derived from life tests for the two different brands of cutting tools currently used in the production process.

 Cutting Speed (meters per minute) Useful Life (Hours) Brand A Brand B 30 4.5 6.0 30 3.5 6.5 30 5.2 5.0 40 5.2 6.0 40 4.0 4.5 40 2.5 5.0 50 4.4 4.5 50 2.8 4.0 50 1.0 3.7 60 4.0 3.8 60 2.0 3.0 60 1.1 2.4 70 1.1 1.5 70 0.5 2.0 70 3.0 1.0

1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line for Brand A and for Brand B?
2. For a cutting speed of 70 meters per minute, find ± 2s for each least squares line.
3. For which brand would you feel more confident in using the least squares line to predict useful life for a given cutting speed? Explain.

1. The expenses involved in a manufacturing operation may be categorized as being for raw material, direct labor, and overhead. Overhead refers to all expenses other than those for raw materials and direct labor that are involved with running the factory.  A manufacturer of 10-speed racing bicycles is interested in estimating the relationship between its monthly factory overhead and the total number of bicycles produced per month.  The estimate will be used to help develop the manufacturing budget for next year.  The data in the table below have been collected for the previous 12 months.
 Month Production Level (1,000s of units) Overhead (\$1,000s) 1 16.9 41.4 2 15.6 35.0 3 17.4 38.3 4 11.6 29.5 5 17.7 39.6 6 17.6 37.4 7 16.3 37.5 8 15.5 37.0 9 23.4 47.9 10 28.4 55.6 11 27.1 53.1 12 19.2 40.6

1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line?
2. Does the straight-line model contribute information for predicting overhead costs? Test at α = 0.05.
3. Which of the four assumptions we make about the random error ϵ may be inappropriate in this problem?

1. Describe the slope of the least squares line if
1. r = 0.7
2. r = -0.7
3. r = 0
4. r2 = 0.64

1. After the Minnesota Department of Transportation installed more than 20 years ago what was then a state-of-the-art weigh-in-motion scale in the concrete surface of the eastbound lanes of Interstate 94, a study was undertaken to determine whether the scale’s readings corresponded with the static weights of the vehicles being monitored. After some preliminary comparisons using a two-axle, six-tire truck carrying different loads, calibration adjustments were made in the scale’s software and the scales were reevaluated.  The data are in the table below.  Construct a simple linear regression of the data using SPSS.
1. What is the correlation coefficient for both sets of data?
2. How can the correlation coefficients be used to evaluate the weigh-in-motion scale?
 Trial Number Static Weight of Truck x (thousand pounds) Weigh in Motion Prior to Calibration Adjustment y1 (thousand pounds) Weigh in Motion After Calibration Adjustment y2 (thousand pounds) 1 27.9 26.0 27.8 2 29.1 29.0 29.1 3 38.0 39.5 37.8 4 27.0 25.1 27.1 5 30.3 31.6 30.6 6 34.5 36.2 34.3 7 27.8 25.1 26.9 8 29.6 31.0 29.6 9 33.1 35.6 33.0 10 35.5 40.2 35.0

1. Work standards specify time, cost, and efficiency norms for the performance of work tasks. They are typically used to monitor job performance.  In one distribution center, data were collected to develop work standards for the time to assemble or fill customer orders.  The table below contains data for a random sample of 9 orders.
 Time (mins.) Order Size 27 36 15 34 71 255 35 103 8 4 60 555 3 6 10 60 10 96

1. Construct a simple linear regression of the data using SPSS. What is the equation of the least squares line?  Use time as the dependent variable.
2. In general, we would expect the mean time to fill an order to increase with the size of the order. Do the data support this theory? Test using α = .05.

1. Firms planning to build new plants or make additions to existing facilities have become very conscious of the energy efficiency of proposed new structures and are interested in the relation between annual energy consumption and the number of square feet of building shell. The table below lists the energy consumption in British thermal units (BTUs) for 22 buildings that were all subjected to the same climatic conditions.
1. Construct a simple linear regression of the data using SPSS. What is the estimate of the intercept β0 and slope β1?
2. Is annual energy consumption positively linearly related to the shell area of the building? Test using α = .10.
3. What is the observed significance level of the test of part b? Interpret its value.
4. What is the coefficient of determination? Interpret its meaning.
5. A company wishes to build a new warehouse that will contain 8,000 sq. ft. of shell area. Find the predicted value of energy consumption.  Calculate a 95% prediction interval.  Comment on the usefulness of this interval.
 BTU/Year (thousands) Shell Area (square feet) 3,870,000 30,001 1,371,000 13,530 2,422,000 26,060 672,200 6,355 233,100 4,576 218,900 24,680 354,000 2,621 3,135,000 23,350 1,470,000 18,770 1,408,000 12,220 2,201,000 25,490 2,680,000 23,680 337,500 5,650 567,500 8,001 555,300 6,147 239,400 2,660 2,629,000 19,240 1,102,000 10,700 423,500 9,125 423,500 6,510 1,691,000 13,530 1,870,000 18,860