EBK STATISTICAL TECHNIQUES IN BUSINESS
EBK STATISTICAL TECHNIQUES IN BUSINESS
17th Edition
ISBN: 9781259924163
Author: Lind
Publisher: MCGRAW HILL BOOK COMPANY
bartleby

Videos

Textbook Question
Book Icon
Chapter 14, Problem 23CE

Great Plains Distributors, Inc. sells roofing and siding products to home improvement retailers, such as Lowe’s and Home Depot, and commercial contractors. The owner is interested in studying the effects of several variables on the sales volume of fiber-cement siding products.

The company has 26 marketing districts across the United States. In each district, it collected information on the following variables: sales volume (in thousands of dollars), advertising dollars (in thousands), number of active accounts, number of competing brands, and a rating of market potential.

Chapter 14, Problem 23CE, Great Plains Distributors, Inc. sells roofing and siding products to home improvement retailers,

Conduct a multiple regression analysis to find the best predictors of sales.

  1. a. Draw a scatter diagram comparing sales volume with each of the independent variables. Comment on the results.
  2. b. Develop a correlation matrix. Do you see any problems? Does it appear there are any redundant independent variables?
  3. c. Develop a regression equation. Conduct the global test. Can we conclude that some of the independent variables are useful in explaining the variation in the dependent variable?
  4. d. Conduct a test of each of the independent variables. Are there any that should be dropped?
  5. e. Refine the regression equation so the remaining variables are all significant.
  6. f. Develop a histogram of the residuals and a normal probability plot. Are there any problems?
  7. g. Determine the variance inflation factor for each of the independent variables. Are there any problems?

a.

Expert Solution
Check Mark
To determine

Make a scatter diagram that compares sales with each of the independent variables.

Explain the results.

Answer to Problem 23CE

The scatter diagram is obtained below,

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  1

Explanation of Solution

Step by step procedure to obtain the correlation matrix using MINITAB software is given below:

  • Choose Graph > Matrix plot.
  • Under Each Y versus each X, select Simple.
  • Under Y variable select the column of Sales.
  • Under X variables select the columns of Ad Dollars, Number of accounts, Number of Competitors, and Potential.
  • Click OK.

The MINITAB output is obtained.

According to the output, as the number of competitors increases the sales decreases. In other hand, as the number of accounts and the rating of market potential increases, the sales also increase. However, there is no linear relationship between the Ad Dollars with the sales.

b.

Expert Solution
Check Mark
To determine

Make the correlation matrix.

Explain whether there is any problem due to redundant independent variables.

Answer to Problem 23CE

The correlation matrix is obtained as,

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  2

Explanation of Solution

Multiple linear regression model:

A multiple linear regression model is given as y^=a+b1x1+b2x2+b3x3+...+bkxk where y is the response or dependent variable, and x1,x2,...,xk are the k quantitative independent variables where k is a positive integer.

Here, a is the intercept term of the regression model, that is, the value of predicted value of y when X’s are 0 and bi’s are the slopes, that is, the amount of change of the predicted value of y for one unit increase in xi when all other independent variables are constant.

In the given problem the predicted dependent variable y is the sales. The years of advertising dollars, the number of accounts, the number of competitors and the rating of market potential, are defined as x1,x2,x3andx4, respectively.

Step by step procedure to obtain the correlation matrix using MINITAB software is given below:

  • Choose Stat > Basic Statistics > Correlation.
  • Select the columns of Sales, Ad Dollars, Number of accounts, Number of Competitors, and Potential under Variables tab.
  • Click OK.

The MINITAB output is obtained.

Multicollinearity:

In a multiple regression model, when there is high correlation between two or more independent variables, then multicollinearity occurs.

There is moderate correlation between the independent variables “Potential” and “Number of Accounts”.

Thus, there is no chance of occurrence of multicollinearity in the regression model.

c.

Expert Solution
Check Mark
To determine

Find the regression equation and perform a global test.

Answer to Problem 23CE

The regression equation is y^=178.3+1.81x1+3.318x221.185x3+0.325x4_.

Some of the independent variables are useful in explaining the variation in the dependent variable at 0.05 significance level.

Explanation of Solution

Calculation:

Step by step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Sales.
  • Under Continuous predictors, enter the columns of Ad Dollars, Number of accounts, Number of Competitors, and Potential.
  • Click OK.

Output using MINITAB software is given below:

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  3

Thus, the regression equation is y^=178.3+1.81x1+3.318x221.185x3+0.325x4_.

Consider that y is dependent variable and xi's are the independent variables where βi's are the corresponding population regression coefficient for all i=1,2,3,4.

State the hypotheses:

Null hypothesis:

H0:β1=β2=β3=β4=0.

That is, the model is not significant.

Alternative hypothesis:

H1:At least one βi is not equal to 0.

That is, the model is significant.

In case of global test the F test statistic is defined as,

F=SSRkSSEnk1, where SSR, SSE, n and k are the regression sum of square, error sum of square, sample size and the number of independent variables.

According to the output the value of F statistic is 479.1 with numerator degrees of freedom 4 and denominator degrees of freedom 21.

Consider that, the level of significance is α=0.05.

Decision rule:

  • If p-valueα, then reject the null hypothesis.
  • Otherwise failed to reject the null hypothesis.

Conclusion:

Here, p-value corresponding to the global test is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that some of the independent variables are useful in explaining the variation in the dependent variable at 0.05 significance level.

d.

Expert Solution
Check Mark
To determine

Perform individual tests of each independent variables.

Explain whether any variable should be dropped.

Answer to Problem 23CE

There is significant relationship between y and x2andx3, whereas there is no significant relationship between y and x1andx4.

It is better to drop these two variables x1andx4 and perform the regression analysis using x2andx3 as independent variables.

Explanation of Solution

Calculation:

For independent variable x1:

Consider that β1 is the population regression coefficient of independent variable x1.

State the hypotheses:

Null hypothesis:

H0:β1=0.

That is, there is no significant relationship between y and x1.

Alternative hypothesis:

H1:β10.

That is, there is significant relationship between y and x1.

In case of individual regression coefficient test the t test statistic is defined as,

t=bisbi, where bi and sbi are the ith regression coefficient and the standard deviation of the ith regression coefficient.

According to the output in Part (c) the t statistic value corresponding to x1 is 1.67 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Ad Dollars”(x1) is 0.109.

Hence, p-value(=0.109)>α(=0.05).

That is, the p-value is greater than the level of significance.

Therefore, fail to reject the null hypothesis.

Hence, it can be concluded that there is no significant relationship between y and x1.

For independent variable x2:

Consider that β2 is the population regression coefficient of independent variable x2.

State the hypotheses:

Null hypothesis:

H0:β2=0.

That is, there is no significant relationship between y and x2.

Alternative hypothesis:

H1:β20.

That is, there is significant relationship between y and x2.

According to the output in Part (c) the value of t test statistic corresponding to x2 is 20.37 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of accounts”(x2) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x2.

For independent variable x3:

Consider that β3 is the population regression coefficient of independent variable x3.

State the hypotheses:

Null hypothesis:

H0:β3=0.

That is, there is no significant relationship between y and x3.

Alternative hypothesis:

H1:β30.

That is, there is significant relationship between y and x3.

According to the output in Part (c) the value of t test statistic corresponding to x3 is –26.89 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of competitors”(x3) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x3.

For independent variable x4:

Consider that β4 is the population regression coefficient of independent variable x4.

State the hypotheses:

Null hypothesis:

H0:β4=0.

That is, there is no significant relationship between y and x4.

Alternative hypothesis:

H1:β40.

That is, there is significant relationship between y and x4.

According to the output in Part (c) the value of t test statistic corresponding to x4 is 0.69 with 21 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Potential”(x4) is 0.495.

Hence, p-value(=0.495)>α(=0.05).

That is, the p-value is greater than the level of significance.

Therefore, fail to reject the null hypothesis.

Hence, it can be concluded that there is no significant relationship between y and x4.

Hence, it can be said that as there is no significant relationship between “Sales” and “Ad Dollars” and between “Sales” and “Potential”, thus it is better to omit these two variables and perform the regression analysis using “Number of accounts” and “Number of competitors” as independent variables.

e.

Expert Solution
Check Mark
To determine

Refine the regression equation so the remaining variables are all significant.

Answer to Problem 23CE

The refined regression equation is y^=186.7+3.408x121.193x2.

Explanation of Solution

Calculation:

In this part the dependent variable is sales (y) and the independent variables are the number of accounts (x1) and the number of competitors (x2).

Step by step procedure to obtain the regression equation using MINITAB software:

  • Choose Stat > Regression > Regression > Fit Regression Model.
  • Under Responses, enter the column of Sales.
  • Under Continuous predictors, enter the columns of Number of accounts, and Number of Competitors.
  • Choose Graphs.
  • Under Residual plot select Histogram of residuals, Normal probability plot of residuals.
  • Click OK.
  • Click OK.

Output using MINITAB software is given below:

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  4

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  5

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  6

Hence, the regression equation is y^=186.7+3.408x121.193x2.

For independent variable x1:

Consider that β1 is the population regression coefficient of independent variable x1.

State the hypotheses:

Null hypothesis:

H0:β1=0.

That is, there is no significant relationship between y and x1.

Alternative hypothesis:

H1:β10.

That is, there is significant relationship between y and x1.

According to the output the t statistic value corresponding to x1 is 23.37 with 23 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of Accounts”(x1) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x1.

For independent variable x2:

Consider that β2 is the population regression coefficient of independent variable x2.

State the hypotheses:

Null hypothesis:

H0:β2=0.

That is, there is no significant relationship between y and x2.

Alternative hypothesis:

H1:β20.

That is, there is significant relationship between y and x2.

According to the output in the value of t test statistic corresponding to x2 is –26.4 with 23 degrees of freedom.

Conclusion:

Here, p-value corresponding to the “Number of competitors”(x2) is 0.

Hence, p-value(=0)<α(=0.05).

That is, the p-value is less than the level of significance.

Therefore, reject the null hypothesis.

Hence, it can be concluded that there is significant relationship between y and x2.

Hence, it can be said that as there is significant relationship between “Sales” and “Number of accounts” and “Number of competitors”.

f.

Expert Solution
Check Mark
To determine

Provide a histogram and normal probability plot for residuals.

Also explain whether there are any problems.

Explanation of Solution

Calculation:

Histogram:

From Part (e), the histogram and normal probability plot is obtained as,

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  7

EBK STATISTICAL TECHNIQUES IN BUSINESS, Chapter 14, Problem 23CE , additional homework tip  8

Assumption of normality from histogram:

  • The majority of the observation in the middle and centered on the mean of 0.
  • There are lower frequencies on the tails of the distributions.

According to the given histogram, the most of the observations are centered on the mean of 0 and there are less frequencies on the tails of the distributions. In addition, it can be considered as perfect symmetric.

According to the normal probability plot, all of the residuals are linear.

Hence, there are no problems in normality assumptions.

g.

Expert Solution
Check Mark
To determine

Find the variation inflection factor for each of the independent variables.

Explain whether there is any problem.

Explanation of Solution

Variation inflation factor(VIF):

The variation inflation factor is defined as,

VIF=11Ri2 , where Ri2 is the coefficient of determination of ith independent variable.

It is used to detect multicollinearity in the regression. If VIF<10, then three is no presence of multicollinearity.

From Part (e), the variation inflection factor corresponding to both variables are 1.12, which is less than 10.

Hence, there is no presence of multicollinearity.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
The U.S. Department of Energy’s Fuel Economy Guide provides fuel efficiency datafor cars and trucks (Fuel Economy website, September, 2012). A portion of the data for149 compact, midsize, and large cars is shown in Table 2.13. The data set contains thefollowing variables:Size: Compact, Midsize, and LargeDisplacement: Engine size in litersCylinders: Number of cylinders in the engineDrive: All wheel (A), front wheel (F), and rear wheel (R)Fuel Type: Premium (P) or regular (R) fuelCity MPG: Fuel efficiency rating for city driving in terms of miles per gallonHwy MPG: Fuel efficiency rating for highway driving in terms of miles per gallonThe complete data set is contained in the file named FuelData2012.a. Prepare a crosstabulation of the data on Size (rows) and Hwy MPG (columns). Useclasses of 15–19, 20–24, 25–29, 30–34, 35–39, and 40–44 for Hwy MPG.b. Comment on the relationship beween Size and Hwy MPG.
The U.S. Department of Energy’s Fuel Economy Guide provides fuel efficiency datafor cars and trucks (Fuel Economy website, September, 2012). A portion of the data for149 compact, midsize, and large cars is shown in Table 2.13. The data set contains thefollowing variables:Size: Compact, Midsize, and LargeDisplacement: Engine size in litersCylinders: Number of cylinders in the engineDrive: All wheel (A), front wheel (F), and rear wheel (R)Fuel Type: Premium (P) or regular (R) fuelCity MPG: Fuel efficiency rating for city driving in terms of miles per gallonHwy MPG: Fuel efficiency rating for highway driving in terms of miles per gallonThe complete data set is contained in the file named FuelData2012.a. Prepare a crosstabulation of the data on Size (rows) and Hwy MPG (columns). Useclasses of 15–19, 20–24, 25–29, 30–34, 35–39, and 40–44 for Hwy MPG.b. Comment on the relationship beween Size and Hwy MPG.c. Prepare a crosstabulation of the data on Drive (rows) and City MPG (columns).…
Is it possible to have several dependent variables in a model?   Are several variables possible in a decision-making problem?   Which model—descriptive, predictive, or prescriptive—of a particular choice issue would you be most likely to employ if it were offered to you? Why?   Give an example of a well-known corporate, political, or military leader who made a choice that either had a positive or negative impact.   What distinguishes a computer model from a spreadsheet model?

Chapter 14 Solutions

EBK STATISTICAL TECHNIQUES IN BUSINESS

Ch. 14 - The following regression output was obtained from...Ch. 14 - A study by the American Realtors Association...Ch. 14 - The manager of High Point Sofa and Chair, a large...Ch. 14 - Prob. 10ECh. 14 - Prob. 11ECh. 14 - A real estate developer wishes to study the...Ch. 14 - Prob. 13CECh. 14 - Prob. 14CECh. 14 - Prob. 15CECh. 14 - Prob. 16CECh. 14 - The district manager of Jasons, a large discount...Ch. 14 - Suppose that the sales manager of a large...Ch. 14 - The administrator of a new paralegal program at...Ch. 14 - Prob. 20CECh. 14 - Prob. 21CECh. 14 - A regional planner is studying the demographics of...Ch. 14 - Great Plains Distributors, Inc. sells roofing and...Ch. 14 - Prob. 24CECh. 14 - Prob. 25CECh. 14 - Prob. 26CECh. 14 - An investment advisor is studying the relationship...Ch. 14 - Prob. 28CECh. 14 - Prob. 29CECh. 14 - The director of special events for Sun City...Ch. 14 - Prob. 31CECh. 14 - Prob. 32CECh. 14 - Prob. 33DACh. 14 - Prob. 34DACh. 14 - Prob. 35DACh. 14 - Prob. 1PCh. 14 - Quick-print firms in a large downtown business...Ch. 14 - The following ANOVA output is given. a. Compute...Ch. 14 - Prob. 1CCh. 14 - Prob. 2CCh. 14 - Prob. 3CCh. 14 - In a scatter diagram, the dependent variable is...Ch. 14 - What level of measurement is required to compute...Ch. 14 - If there is no correlation between two variables,...Ch. 14 - Which of the following values indicates the...Ch. 14 - Under what conditions will the coefficient of...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Given the following regression equation, = 7 ...Ch. 14 - Prob. 1.9PTCh. 14 - In a multiple regression equation, what is the...Ch. 14 - Prob. 1.11PTCh. 14 - Prob. 1.12PTCh. 14 - For a dummy variable, such as gender, how many...Ch. 14 - What is the term given to a table that shows all...Ch. 14 - If there is a linear relationship between the...Ch. 14 - Given the following regression analysis output: a....Ch. 14 - Given the following regression analysis output. a....
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
Text book image
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Text book image
Big Ideas Math A Bridge To Success Algebra 1: Stu...
Algebra
ISBN:9781680331141
Author:HOUGHTON MIFFLIN HARCOURT
Publisher:Houghton Mifflin Harcourt
F- Test or F- statistic (F- Test of Equality of Variance); Author: Prof. Arvind Kumar Sing;https://www.youtube.com/watch?v=PdUt7InTyc8;License: Standard Youtube License
Statistics 101: F-ratio Test for Two Equal Variances; Author: Brandon Foltz;https://www.youtube.com/watch?v=UWQO4gX7-lE;License: Standard YouTube License, CC-BY
Hypothesis Testing and Confidence Intervals (FRM Part 1 – Book 2 – Chapter 5); Author: Analystprep;https://www.youtube.com/watch?v=vth3yZIUlGQ;License: Standard YouTube License, CC-BY
Understanding the Levene's Test for Equality of Variances in SPSS; Author: Dr. Todd Grande;https://www.youtube.com/watch?v=udJr8V2P8Xo;License: Standard Youtube License