Probability and Statistics for Engineering and the Sciences
Probability and Statistics for Engineering and the Sciences
9th Edition
ISBN: 9781305251809
Author: Jay L. Devore
Publisher: Cengage Learning
bartleby

Concept explainers

bartleby

Videos

Textbook Question
Book Icon
Chapter 12.3, Problem 35E

Refer back to the data in Exercise 4, in which y = ammonium concentration (mg/L) and x = transpiration (ml/h). Summary quantities include n = 13, Σxi = 303.7, Σyi = 52.8, Sxx = 1585.230769, Sv = −341.959231. and Syy = 77.270769.

  1. a. Obtain the equation of the estimated regression line and use it to calculate a point prediction of ammonium concentration for a future observation made when ammonium concentration is 25 ml/h.
  2. b. What happens if the estimated regression line is used to calculate a point estimate of true average concentration when transpiration is 45 ml/h? Why does it not make sense to calculate this point estimate?
  3. c. Calculate and interpret s.
  4. d. Do you think the simple linear regression model does a good job of explaining observed variation in concentration? Explain.

a.

Expert Solution
Check Mark
To determine

Find the interval estimate for the slope of the population regression.

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression is 0.632β12.440_.

Explanation of Solution

Given info:

The summary statistics of the data correspond to the variables motion sickness dose (x) and % reported nausea (y). The results of the summary statistics are n=17, i=1nxi=222.1, i=1nyi=193, i=1nyi2=2,975,i=1nxi2=3,056.69 and i=1nxiyi=2,759.6. The range of the values of the variable motion sickness dose is 6.0 to 17.6.

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯β^1x¯=iyiβ^1ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[ixiyi(ixi)(iyi)n]ixi2(ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[ixiyi(ixi)(iyi)n]ixi2(ixi)2n=2759.6222.1×193173,056.69222.1×222.117=2759.62,521.4883,056.692,901.6712=238.112155.0188

=1.536

Thus, the point estimate of the slope is β^1=1.536.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=i(yiy¯)2 where yi be the ith observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=iyi2(iyi)2n=2,975(193×193)17=783.8824

Therefore, the total sum of squares is SST=Syy=783.8824

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=i(y^iy¯)2 where y^i be the predicted value of the ith observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[ixiyi(ixi)(iyi)n]2ixi2(ixi)2n=(2759.6222.1×19317)23,056.69222.1×222.117=(2759.62,521.488)23,056.692,901.6712=238.1122155.0188.

=365.7448

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=i(yiy^)2 where yi be the predicted value of the ith observation and y¯ be the predicted value for the ith observation.

The general formula to obtain error sum of square is,

SSE=SSTSSR.

The error sum of squares is obtained as,

SSE=SSTSSR=783.8824365.7448=418.1376

Therefore, the error sum of squares is SSE=418.1376

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn2.

The estimate of error standard deviation is obtained as,

s=SSEn2=418.1376172=5.28

Thus, the estimate of error standard deviation is s=5.28_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=i(yiy^)2 where yi be the predicted value of the ith observation and y¯ be the predicted value for the ith observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=ixi2(ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=sixi2(ixi)2n=5.283,056.69222.1×222.117=0.424

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.424_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1α=10.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n2=172=15

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

The 95% confidence interval is,

C.I=β^1(ta/2×sβ^1)β1β^1+(ta/2×sβ^1)=(1.536(2.131×0.424)β11.536+(2.131×0.424))=(1.536±0.903544)(0.632,2.440)

Thus, the 95% confidence interval for the slope of the population regression is 0.632β12.440_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.632 and 2.440.

b.

Expert Solution
Check Mark
To determine

Test whether there is enough evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Answer to Problem 35E

There is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

Explanation of Solution

Calculation:

From part (a), the slope coefficient of the regression line is β^1=1.536.

The test hypotheses are given below:

Null hypothesis:

 H0:β1=0

That is, there is no useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

Alternative hypothesis:

 H1:β10

That is, there is useful relationship between the variables motion sickness dose (y) and % reported nausea (x).

T-test statistic:

The test statistic is,

t=β^1β1sβ^1t(n2)

Degrees of freedom:

The sample size is n=17

The degrees of freedom is,

d.f=n2=172=15

Thus, the degree of freedom is 15.

Level of significance:

Here, level of significance is not given.

So, the prior level of significance α=0.05 can be used.

For the level of significance α=0.05,

α2=0.052=0.025

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 15 degrees of freedom is 2.131.

Thus, the critical value is (t0.025,15)=2.131.

From part (a), the estimate of error standard deviation of slope coefficient is sβ^1=0.424.

Test statistic under null hypothesis:

Under the null hypothesis, the test statistic is obtained as follows:

t=β^1β1sβ^1=1.53600.424=3.6226

Thus, the test statistic is 3.6226.

Decision criteria for the classical approach:

If |t|>tα2(teststatistic> criticalvalue), then reject the null hypothesis (H0).

Conclusion:

Here, the test statistic is 3.6226 and critical value is 2.131.

The t statistic is greater than the critical value.

That is, 3.6226(=test statistic)>2.131(=critical value)

Based on the decision rule, the null hypothesis is rejected.

Hence, there is a linear relationship between the predictor variable % reported nausea and the response variable motion sickness dose.

Therefore, there is sufficient evidence to conclude that the predictor variable motion sickness dose is useful for predicting the value of the response variable % reported nausea.

c.

Expert Solution
Check Mark
To determine

Check whether it is plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Answer to Problem 35E

No, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

Explanation of Solution

Calculation:

Linear regression model:

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯β^1x¯=iyiβ^1ixin

The y-intercept of the regression model is obtained as follows:

β^0=iyiβ^1ixin=1931.536×222.117=8.715

Thus, the y-intercept of the regression model is β^0=8.715.

From part (a), the slope coefficient of the regression line is β^1=1.536.

Therefore, the regression equation of the variables motion sickness dose (x) and % reported nausea (y) is y=8.715+1.536x.

Predicted value of % reported nausea when the motion sickness dose is 5.0:

The predicted value of % reported nausea when the motion sickness dose is 5.0 is obtained as follows:

y=8.715+1.536x=8.715+1.536×0.5=7.947

Thus, the predicted value of % reported nausea for 5.0 motion sickness dose is –7.947.

Here, the % reported nausea is resulted as a negative value, which is not possible in reality.

Thus, the predicted value is a flaw.

Moreover, it is given that the range of the values of the variable motion sickness dose is 6.0 to 17.6.

The value 5.0 is outside the range of the variable motion sickness dose. That is, the observation 5.0 is not available.

Hence, the regression line may not give good estimate of expected % reported nausea when the motion sickness dose is 5.0.

Therefore, it is not plausible to estimate the expected % reported nausea when the motion sickness dose is 5.0 using the obtained regression line.

d.

Expert Solution
Check Mark
To determine

Find the interval estimate for the slope of the population regression after eliminating the observation (6.0,2.50).

Comment whether the observation (6.0,2.50) have a substantial impact on the regression model

Answer to Problem 35E

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719β12.7301_.

Yes, the observation (6.0,2.50) has a substantial impact on the regression model

Explanation of Solution

Calculation:

Linear regression model:

In a linear equation y=b0+b1xi the constant b1 be the slope and b0 be the y-intercept and x is the independent variable and y is the independent variable.

A linear regression model is given as y^=β^0+β^1x where y^ be the predicted values of response variable and x be the predictor variable. The β^1 be the estimate of slope and β^0 be the estimate of intercept of the line.

Here, the observation (6.0,2.50) has to be removed from the data set.

That is, the value 6.0 has to be removed from the variable motion sickness dose (x) and 2.50 has to be removed from the variable % reported nausea (y).

The results of the summary statistics after eliminating the observation (6.0,2.50) from the data set are as follows:

Sample size:

n=171=16.

Sum of the variable:

i=1nxi=222.16=216.1,i=1nyi=1932.50=191.5.

Sum of squares of the variable:

i=1nxi2=3,056.6962=3,020.69,i=1nyi2=2,9752.52=2,968.75,and i=1nxiyi=2,759.66×2.5=2,7444.6.

Y-intercept:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain y-intercept is,

β^0=y¯β^1x¯=iyiβ^1ixin

Slope:

In a linear equation y^=β^0+β^1x the constant b1 be the slope and b0 be the y-intercept form and x is the independent variable and y is the independent variable.

The general formula to obtain slope is,

β^1=SxySxx=[ixiyi(ixi)(iyi)n]ixi2(ixi)2n

The slope coefficient of the simple linear regression is,

β^1=[ixiyi(ixi)(iyi)n]ixi2(ixi)2n=2,744.6216.1×191.5163,020.69216.1×216.116=2,744.62,586.4473,020.692,918.701=1.551

Thus, the point estimate of the slope is β^1=1.551.

Total sum of square: (SST)

The total variation in the observed values of the response variable is defined as the total sum of squares. The formula for total sum of square is SST=i(yiy¯)2 where yi be the ith observation value and y¯ be the sample mean.

The total sum of square is obtained as ,

SST=Syy=iyi2(iyi)2n=2,968.75(191.5×191.5)16=676.7344

Therefore, the total sum of squares is SST=Syy=676.7344

Regression sum of square: (SSR)

The variation in the observed values of the response variable explained by the regression is defined as the regression sum of squares. The formula for regression sum of square is SSR=i(y^iy¯)2 where y^i be the predicted value of the ith observation and y¯ be the sample mean.

The regression sum of squares is obtained as is,

SSR=Sxy2Sxx=[ixiyi(ixi)(iyi)n]2ixi2(ixi)2n=(2,744.6216.1×191.516)23,020.69216.1×216.116=(2,744.62,586.447)23,020.692,918.701=2,5012.41101.9894.

=245.2453

Error sum of square: (SSE)

The variation in the observed values of the response variable which is not explained by the regression is defined as the error sum of squares. The formula for error sum of square is SSE=i(yiy^)2 where yi be the predicted value of the ith observation and y¯ be the predicted value for the ith observation.

The general formula to obtain error sum of square is,

SSE=SSTSSR.

The error sum of squares is obtained as,

SSE=SSTSSR=676.7344245.2453=431.4891

Therefore, the error sum of squares is SSE=431.4891

Estimate of error standard deviation:

The general formula for the estimate of error standard deviation is,

σ=s=SSEn2.

The estimate of error standard deviation is obtained as,

s=SSEn2=431.4891162=5.552

Thus, the estimate of error standard deviation is s=5.552_.

Error sum of square: (SSE)

The variation in the observed values of the response variable that is not explained by the regression is defined as the regression sum of squares. The formula for error sum of square is SSE=i(yiy^)2 where yi be the predicted value of the ith observation and y¯ be the predicted value for the ith observation.

Estimate of error standard deviation of slope coefficient:

The general formula for the estimate of error standard deviation of slope coefficient is,

σβ^1=σSxx,

The defining formula for Sxx is,

Sxx=ixi2(ixi)2n

The estimate of error standard deviation of slope coefficient is,

sβ^1=sixi2(ixi)2n=5.5523,020.69216.1×216.116=0.5497

Thus, the estimate of error standard deviation of slope coefficient is sβ^1=0.5497_.

Confidence interval:

The general formula for the confidence interval for the slope of the regression line is,

CI=β^1±ta/2,(n2)×sβ^1

Where, β^1 be the slope of the sample regression line, sβ^1 be the estimate of error standard deviation of slope coefficient.

Since, the level of confidence is not specified. The prior confidence level 95% can be used.

Critical value:

For 95% confidence level,

1α=10.95α=0.05α2=0.052=0.025

Degrees of freedom:

The sample size is n=16

The degrees of freedom is,

d.f=n2=162=14

From Table A.5 of the t-distribution in Appendix A, the critical value corresponding to the right tail area 0.025 and 14 degrees of freedom is 2.145.

Thus, the critical value is (t0.025,14)=2.145.

The 95% confidence interval is,

C.I=β^1(ta/2×sβ^1)β1β^1+(ta/2×sβ^1)=(1.551(2.145×0.5497)β11.551+(2.145×0.5497))=(1.551±1.1791)(0.3719,2.7301)

Thus, the 95% confidence interval for the slope of the population regression is 0.3719β12.7301_.

Interpretation:

There is 95% confident, that the expected change in % reported nausea associated with 1 unit increase in motion sickness dose lies between 0.3719 and 2..7301.

Comparison:

The 95% confidence interval for the slope of the population regression with the observation (6.0,2.50) is 0.632β12.440_.

The 95% confidence interval for the slope of the population regression after eliminating the observation (6.0,2.50) is 0.3719β12.7301_.

Here, by observing both the intervals it is clear that the (6.0,2.50) has an impact on the slope coefficient of the regression line.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
The marketing manager of a supermarket chain would like to determine the effect of shelf spaceon the sales of pet food. A random sample of 10 stores was selected, and the results are presentedbelow. Store shelf space in cm weekly sales in thousand pesos 1 45 18 2 45 21 3 75 15 4 80 18 5 95 23 6 100 26 7 135 22 8 140 27 9 185 25 10 190 28   d. Using the estimated simple linear regression equation Y=15.6414+0.0611X, estimate the weekly sales when theshelf space is 230cm? 250cm? e. Compute the coefficient of determination and interpret its value.
The following table displays the mathematics test scores for a random sample of college students, along with their final SY16C grades. a. Fit the regression line y = a+bx to the data and interpret the results. b. Use the regression equation to determine the SY16C grade for a college student who scored60 on their achievement test. What would their SY16C grade be?
A sample of X and Y scores is taken, and a regression line is used to predict Y from X. If SSY' = 300, SSE = 500, and N = 50, what is(a) SSY?(b) the standard error of the estimate?(c) R2?

Chapter 12 Solutions

Probability and Statistics for Engineering and the Sciences

Ch. 12.1 - Suppose that in a certain chemical process the...Ch. 12.2 - Refer back to the data in Exercise 4, in which y =...Ch. 12.2 - The accompanying data on y = ammonium...Ch. 12.2 - Refer to the lank temperature-efficiency ratio...Ch. 12.2 - Values of modulus of elasticity (MOE, the ratio of...Ch. 12.2 - The article Characterization of Highway Runoff in...Ch. 12.2 - For the past decade, rubber powder has been used...Ch. 12.2 - For the past decade, rubber powder has been used...Ch. 12.2 - The following data is representative of that...Ch. 12.2 - The bond behavior of reinforcing bars is an...Ch. 12.2 - Wrinkle recovery angle and tensile strength are...Ch. 12.2 - Calcium phosphate cement is gaining increasing...Ch. 12.2 - a. Obtain SSE for the data in Exercise 19 from the...Ch. 12.2 - The invasive diatom species Didymosphenia geminata...Ch. 12.2 - Prob. 25ECh. 12.2 - Show that the point of averages (x,y) lies on the...Ch. 12.2 - Prob. 27ECh. 12.2 - a. Consider the data in Exercise 20. Suppose that...Ch. 12.2 - Consider the following three data sets, in which...Ch. 12.3 - Reconsider the situation described in Exercise 7,...Ch. 12.3 - During oil drilling operations, components of the...Ch. 12.3 - Exercise 16 of Section 12.2 gave data on x =...Ch. 12.3 - During oil drilling operations, components of the...Ch. 12.3 - For the past decade, rubber powder has been used...Ch. 12.3 - Refer back to the data in Exercise 4, in which y =...Ch. 12.3 - Misi (airborne droplets or aerosols) is generated...Ch. 12.3 - Prob. 37ECh. 12.3 - Refer to the data on x = liberation rate and y =...Ch. 12.3 - Carry out the model utility test using the ANOVA...Ch. 12.3 - Prob. 40ECh. 12.3 - Prob. 41ECh. 12.3 - Verify that if each xi is multiplied by a positive...Ch. 12.3 - Prob. 43ECh. 12.4 - Fitting the simple linear regression model to the...Ch. 12.4 - Reconsider the filtration ratemoisture content...Ch. 12.4 - Astringency is the quality in a wine that makes...Ch. 12.4 - The simple linear regression model provides a very...Ch. 12.4 - Prob. 48ECh. 12.4 - You are told that a 95% CI for expected lead...Ch. 12.4 - Prob. 50ECh. 12.4 - Refer to Example 12.12 in which x = test track...Ch. 12.4 - Plasma etching is essential to the fine-line...Ch. 12.4 - Consider the following four intervals based on the...Ch. 12.4 - The height of a patient is useful for a variety of...Ch. 12.4 - Prob. 55ECh. 12.4 - The article Bone Density and Insertion Torque as...Ch. 12.5 - The article Behavioural Effects of Mobile...Ch. 12.5 - The Turbine Oil Oxidation Test (TOST) and the...Ch. 12.5 - Toughness and fibrousness of asparagus are major...Ch. 12.5 - Head movement evaluations are important because...Ch. 12.5 - Prob. 61ECh. 12.5 - Prob. 62ECh. 12.5 - Prob. 63ECh. 12.5 - The accompanying data on x = UV transparency index...Ch. 12.5 - Torsion during hip external rotation and extension...Ch. 12.5 - Prob. 66ECh. 12.5 - Prob. 67ECh. 12 - The appraisal of a warehouse can appear...Ch. 12 - Prob. 69SECh. 12 - Forensic scientists are often interested in making...Ch. 12 - Phenolic compounds are found in the effluents of...Ch. 12 - The SAS output at the bottom of this page is based...Ch. 12 - The presence of hard alloy carbides in high...Ch. 12 - The accompanying data was read from a scatterplot...Ch. 12 - An investigation was carried out to study the...Ch. 12 - Prob. 76SECh. 12 - Open water oil spills can wreak terrible...Ch. 12 - In Section 12.4, we presented a formula for...Ch. 12 - Show that SSE=Syy1Sxy, which gives an alternative...Ch. 12 - Suppose that x and y are positive variables and...Ch. 12 - Let sx and sy denote the sample standard...Ch. 12 - Verify that the t statistic for testing H0: 1 = 0...Ch. 12 - Use the formula for computing SSE to verify that...Ch. 12 - In biofiltration of wastewater, air discharged...Ch. 12 - Normal hatchery processes in aquaculture...Ch. 12 - Prob. 86SECh. 12 - Prob. 87SE
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
Text book image
Algebra & Trigonometry with Analytic Geometry
Algebra
ISBN:9781133382119
Author:Swokowski
Publisher:Cengage
Text book image
Functions and Change: A Modeling Approach to Coll...
Algebra
ISBN:9781337111348
Author:Bruce Crauder, Benny Evans, Alan Noell
Publisher:Cengage Learning
Correlation Vs Regression: Difference Between them with definition & Comparison Chart; Author: Key Differences;https://www.youtube.com/watch?v=Ou2QGSJVd0U;License: Standard YouTube License, CC-BY
Correlation and Regression: Concepts with Illustrative examples; Author: LEARN & APPLY : Lean and Six Sigma;https://www.youtube.com/watch?v=xTpHD5WLuoA;License: Standard YouTube License, CC-BY