Baseball Pitcher Performance. Major League Baseball (MLB) consists of teams that play in the American League and the National League. MLB collects a wide variety of team and player statistics. Some of the statistics often used to evaluate pitching performance are as follows:
ERA: The average number of earned runs given up by the pitcher per nine innings. An earned run is any run that the opponent scores off a particular pitcher except for runs scored as a result of errors.
SO/IP: The average number of strikeouts per inning pitched.
HR/IP: The average number of home runs per inning pitched.
R/IP: The number of runs given up per inning pitched.
The following data show values for these statistics for a random sample of 20 pitchers from the American League for a full season.
- a. Develop an estimated regression equation that can be used to predict the average number of runs given up per inning given the average number of strikeouts per inning pitched.
- b. Develop an estimated regression equation that can be used to predict the average number of runs given up per inning given the average number of home runs per inning pitched.
- c. Develop an estimated regression equation that can be used to predict the average number of runs given up per inning given the average number of strikeouts per inning pitched and the average number of home runs per inning pitched.
- d. A. J. Burnett, a pitcher for the New York Yankees, had an average number of strikeouts per inning pitched of .91 and an average number of home runs per inning of .16. Use the estimated regression equation developed in part (c) to predict the average number of runs given up per inning for A. J. Burnett. (Note: The actual value for R/IP was .6.)
- e. Suppose a suggestion was made to also use the earned run average as another independent variable in part (c). What do you think of this suggestion?
a.

Find the estimated regression equation that could be used to predict the average number of runs given up per inning, when the average number of strikeouts per innings pitched is given.
Answer to Problem 10E
The estimated regression equation that could be used to predict the average number of runs given up per inning, when the average number of strikeouts per innings pitched is given is
Explanation of Solution
Calculation:
The Major League Baseball (MLB) dataset consist of statistics regarding the average number of earned runs given up by the pitcher per nine innings (ERA), the average number of strikeouts per innings pitched (SO/IP), the average number of home runs per innings pitched (HR/IP) and the number of runs given up per innings pitched (R/IP).
Multiple linear regression model:
A multiple linear regression model is given as
Regression:
Software procedure:
Step-by-step procedure to get the regression equation using EXCEL software:
- Open an EXCEL sheet and enter the data on W, L, ERA, SO/IP, HR/IP and R/IP corresponding to columns A, B, C, D, E and F, respectively.
- Select Data > Data Analysis > Regression.
- Click OK.
- Under Input Y Range enter $F$1:$F$21.
- Under Input X Range enter $D$1:$D$21.
- Click the box of Labels.
- Under Output Range enter $I$1.
- Click OK.
The output using EXCEL software is given as follows:
Thus, the estimated regression equation that could be used to predict the average number of runs given up per inning, when the average number of strikeouts per innings pitched is given is
b.

Find the estimated regression equation that could be used to predict the average number of runs given the average number of home runs per innings pitched.
Answer to Problem 10E
The estimated regression equation that could be used to predict the average number of runs given the average number of home runs per innings pitched is
Explanation of Solution
Calculation:
The regression equation can be obtained using EXCEL software.
Software procedure:
Step-by-step procedure to get regression equation using EXCEL software:
- Open an EXCEL sheet and enter the data of W, L, ERA, SO/IP, HR/IP and R/IP corresponding to columns A, B, C, D, E and F, respectively.
- Select Data > Data Analysis > Regression.
- Click OK.
- Under Input Y Range enter $F$1:$F$21.
- Under Input X Range enter $E$1:$E$21.
- Click the box of Labels.
- Under Output Range enter $L$1.
- Click OK.
The output using EXCEL software is given as,
Thus, the estimated regression equation that could be used to predict the average number of runs given the average number of home runs per innings pitched is
c.

Find the estimated regression equation that could be used to predict the average number of runs given the average number of strikeouts per innings pitched and the average number of home runs per innings pitched.
Answer to Problem 10E
The estimated regression equation that could be used to predict the average number of runs given the average number of strikeouts per innings pitched and the average number of home runs per innings pitched is
Explanation of Solution
Calculation:
The regression equation can be obtained using EXCEL software.
Software procedure:
Step-by-step procedure to get regression equation using EXCEL software:
- Open an EXCEL sheet and enter the data of W, L, ERA, SO/IP, HR/IP and R/IP corresponding to columns A, B, C, D, E and F, respectively.
- Select Data > Data Analysis > Regression.
- Click OK.
- Under Input Y Range enter $F$1:$F$21.
- Under Input X Range enter $D$1:$E$21.
- Click the box of Labels.
- Under Output Range enter $O$1.
- Click OK.
The output using EXCEL software is given as,
Thus, the estimated regression equation that could be used to predict the average number of runs given the average number of strikeouts per innings pitched and the average number of home runs per innings pitched is
d.

Predict the average number of runs given up per inning for Pitcher A using the regression equation of Part c.
Answer to Problem 10E
The predicted average number of runs given up per inning for Pitcher A is 0.4757.
Explanation of Solution
Calculation:
The average number of strikeouts and the average number of home runs per innings pitched for Pitcher A are 0.91 and 0.16, respectively.
From Part (c), it is found that estimated regression equation that could be used to predict the average number of runs given the average number of strikeouts per innings pitched and the average number of home runs per innings pitched is
Thus, using the given information, the predicted average number of runs given up per inning for Pitcher At is calculated below.
Thus, the predicted average number of runs given up per inning for Pitcher A is 0.4757.
It is to be noted that the actual average number of runs given up per inning for Pitcher A was 0.6.
Thus, the predicted value is slightly less than the actual value. Hence, the residual is positive, which underestimates the model slightly.
e.

Explain about the suggestion to use the earned run average as another independent variable in the regression model developed in Part c.
Explanation of Solution
Calculation:
It would be useful to use the earned run average or ERA as a third independent variable, only if it has a strong linear association with the response variable, R/IP. The correlation coefficient measures the strength of the linear relation between two variables.
Software procedure:
Step-by-step procedure to get correlation coefficient using EXCEL software:
- Open an EXCEL sheet and enter the data of W, L, ERA, SO/IP, HR/IP and R/IP corresponding to columns A, B, C, D, E and F, respectively.
- In cell I1 enter the formula =CORREL(F1:F21,C1:C21).
- Press Enter.
The EXCEL output is given as follows:
Thus, the correlation coefficient between R/IO and ERA is approximately 0.964, which is very close to 1, indicating a very strong positive linear relationship between the variables.
Due to the high correlation coefficient it can be said that if a pitcher gives up more runs per inning, then the pitcher’s run average also increases. Thus, automatically there will be high value of the coefficient of determination in the regression analysis, if ERA is used as a predictor variable.
Thus, it is reasonable to use the earned run average as another independent variable in the regression model developed in Part c.
Want to see more full solutions like this?
Chapter 15 Solutions
Essentials Of Statistics For Business & Economics
- There is no difference(s) between the correlation and causation between two or more variable. If True, discuss. If False, discuss.arrow_forwardBookwork code: 6A Calculator allowed Work out the size of angle x. Write down the angle facts you 63° Search used. 134° Zoom Not drawn accurate Watch videoarrow_forwardDue: Tue 00 35 -- Resources Next Question HR MIN SEC X Question 17 of 20 > Macmillan Learning Veterinarians suspect that enteroliths, calcifications in the gut of horses, are related to diet-in particular, alfalfa. To investigate this suspicion, a group of veterinarians collected information on the diet of horses and whether the horses developed enteroliths. The table displays the findings. less than 25% alfalfa in diet 25%-50% alfalfa in diet greater than 50% alfalfa in die Total 3 Has enteroliths 0.05 Does not have enteroliths 0.40 Total 0.07 0.25 0.08 0.20 0.15 0.80 0.45 0.32 0.23 1 The probability that a randomly selected horse has enteroliths given that it is fed more than 50% alfalfa is: ○ 0.08. 0.23. ○ 0.2. ○ 0.348. 95 MacBook Air Dil 7 8 9 0 D F10 GEarrow_forward
- +900 pts /1100 Due: Tue Submit Answer Many young men in North America and Europe (but not in Asia) tend to think they need more muscle to be attractive. One study presented 200 young American men with 100 images of men with various levels of muscle. Researchers measure level of muscle in kilograms per square meter (kg/m²) of fat-free body mass. Typical young men have about 20 kg/m². Each subject chose two images, one that represented his own level of body muscle and one that he thought represented "what women prefer." The mean gap between self-image and "what women prefer" was 2.35 kg/m². Suppose that the "muscle gap" in the population of all young men has a Normal distribution with standard deviation 2.5 kg/m² Give a 90% confidence interval for the mean amount of muscle think they should add to be attractive to women (Enter your answers rounded to four decimal places.) young 00 ible lower limit = 00 upper limit = 00 00 100 100 3 # E 80 DOD F4 $ 94 55 6 MacBook Air R T Y kg/m² kg/m²…arrow_forwardIs the normal distribution important in statistics analysis? If yes, why? If no, why? Is the normal distribution always being defined by its mean and standard deviation? Discuss. Discuss with examples c) Graphs illustrating the problems and solutions are recommendedarrow_forwardFinal grades The data show the final grades for 16 randomly selected students in College Algebra and Statistics. Find the y- value for the specified x-value. Remember that no regression should be done when r is not significant. Student 2 3 4 5 6 7 8 College 89 56 99 61 74 57 98 95 HARRERS 75 55 64 62 80 70 93 91 Student 9 10 11 12 13 14 15 16 College 98 87 89 73 66 88 79 56 SABERS 92 71 89 79 76 78 68 57 Send data to Excel The correlation coefficient for the data is r=0.684 and a=0.01. Should regression analysis be done? The regression analysis should not be done. The regression analysis should be done. Find the equation of the regression line. Round the coefficients to at least three decimal places. y'=a+bx a = b= Find the predicted grade in Statistics when the College Algebra grade was 86. Round the answer to at least three decimal places. When a student gets 86 in College Algebra, we predict a, Statistics score of . Continue × G Subnet Assignment F4 F5 95 ཀུན DII D F6 F7 FB F9 F10…arrow_forward
- = 5 6 = 7 8 9 10 11 12 13 14 15 16 Emergency Escape Plan A random sample of 180 residents of the Southeastern United States found 117 had no emergency escape plan for severe weather. Find the 90% confidence interval of the true proportion of individuals living in the Southeastern United States who do not have an emergency escape plan for severe weather. Use a graphing calculator. Round your answers to three decimal places. X 1 G Continue Q * F2 A 2 N # *3 80 a F3 F4 FS $ 54 % 5 + 80 G H J 9 K B N M F11 יד F12 0 P } 1 L ... Λ . H command option.arrow_forwardCigarette Smoking A researcher found that a cigarette smoker smokes on average 31 cigarettes a day. She feels that this average is too high. She selected a random sample of 8 smokers and found that the mean number of cigarettes they smoked per day was 28. The sample standard deviation was 2.9. At a 0.01, is there enough evidence to support her claim? Assume that the population is approximately normally distributed. Use the critical value method and tables. Part: 0/5 Part 1 of 5 (a) State the hypotheses and identify the claim. μ ローロ 0arrow_forwardAccidents and Drivers' Ages For a random selection of drivers, their ages and the number of automobile accidents that they had over a 3-year period are shown. Is there a linear relationship between the variables? Use Age for the independent variable and Accidents for the dependent variable. 27 25 42 31 43 22 24 39 Age Accidents 4 3101562 Send data to Excel Part: 0/5 Part 1 of 5 (a) Draw the scatter plot for the variables. Drivers' Age and Accidents 10- Accidents 8- 7 3 2. 14 0 3 10 15 20 25 30 35 Age x Part: 1/5 Part 2 of 5 (b) Compute the value of the correlation coefficient. Round your answer to at least three decimal places. x Part: 2/5 Part 3 of 5 (c) State the hypotheses. Continue * 80 F3 a F4 # $ % G F5 < Submit Assignment 2025 McGraw Hill LLC. All Rights Reserved. Terms of Use Privacy Center 1 Accessibility F6 & ཊུ8 DII A F7 F8 F9 F10arrow_forwardFast Food A deli manager wishes to see if there is a preference for the four types of hoagies it sells. The manager selects a week at random and records the number of hoagies of each type that the store sold. At a 0.01, is there a preference in the type of hoagies sold? Use the critical value method with The Chi-Square Distribution Table. Italian 74 Steak 52 Meatball 88 Vegetable 86 Send data to Excel Part: 0/5 Part 1 of 5 (a) Identify the claim with the correct hypothesis. Ho: Customers don't show a preference in the type of hoagie ordered. (Choose one) H₁: Customers show a preference in the type of hoagie ordered. (Choose one) x G Part: 1/5 Part 2 of 5 (b) Find the critical value. Use The Chi-Square Distribution Table. Round the answer to at least three decimal places. The critical value is Part: 2/5 Part 3 of 5 (c) Compute the test statistic. Round the intermediate and final answers to at least three decimal places. 2-1 Start over Part: 3/5 Part 4 of 5 (d) Make the decision. (Choose…arrow_forward8 19 10 11 12 13 14 15 16 Physical Therapy A random sample of 10 physical therapy patients who had knee surgery shows the angle of knee bend before physical therapy and after 6 weeks of physical therapy. At a 0.10, has there been an increase in the angle range after the therapy? Assume that all variables are normally distributed; use μ, for the "before" values and "D"₁₂. Use the critical value method and tables. Patient 1 2 3 4 S 6 7 8 9 10 Degrees before 81 84 91 66 67 92 90 93 87 62 Degrees after 120 125 129 104 106 122 119 126 115 136 Send data to Excel Part: 0/5 Part 1 of 5 (a) State the hypotheses and identify the claim. Ho (Choose one) H₁: (Choose one) This hypothesis test is a (Choose one) test. 0<0 Darrow_forward8 Español 10 11 12 13 14 15 Disabling Injuries Is the type of injury related to the age of the individual at a=0.05? Type Motor vehicle Home Work Public Under 35 35 and 26 24 29 26 11 18 36 34 I over Send data to Excel Part: 0/5 Part 1 of 5 (a) State the hypotheses and identify the claim. Ho: The type of injury is (Choose one) age. (Choose one) X H₁: The type of injury is (Choose one) age. (Choose one) Part: 1/5 Part 2 of 5 (b) Find the critical value. Round the answer to at least three decimal places. Use The Chi-Square Distribution Table. Critical value- x Part: 2/5 Part 3 of 5 (c) Compute the test statistic. Round the intermediate and final answers to at least three decimal places. 7-0 x の Part: 3/5 Part 4 of 5 (d) Make the decision. (Choose one) the null hypothesis. Part: 4/5 20 F3 a F4 G Reserved. Terms of Use Submit Assignment Privacy Center | Accessibility F5 95 8 DII F6 F7 F8 F9 F10 TAarrow_forwardarrow_back_iosSEE MORE QUESTIONSarrow_forward_iosRecommended textbooks for you
- Holt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin Harcourt
- Algebra & Trigonometry with Analytic GeometryAlgebraISBN:9781133382119Author:SwokowskiPublisher:CengageAlgebra: Structure And Method, Book 1AlgebraISBN:9780395977224Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. ColePublisher:McDougal Littell
Holt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillBig Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtAlgebra & Trigonometry with Analytic GeometryAlgebraISBN:9781133382119Author:SwokowskiPublisher:CengageAlgebra: Structure And Method, Book 1AlgebraISBN:9780395977224Author:Richard G. Brown, Mary P. Dolciani, Robert H. Sorgenfrey, William L. ColePublisher:McDougal Littell