Concept explainers
Household Incomes. The following data represent a sample of 14 household incomes ($1000s). Answer the following questions based on this sample.
- a. What is the
median household income for these sample data? - b. According to a previous survey, the median annual household income five years ago was $55,000. Based on the sample data above, estimate the percentage change in the median household income from five years ago to today.
- c. Compute the first and third
quartiles . - d. Provide a five-number summary.
- e. Using the z-score approach, do the data contain any outliers? Does the approach that uses the values of the first and third quartiles and the
interquartile range to detect outliers provide the same results?
a.

Find the median household income for the given sample.
Answer to Problem 68SE
The median household income for the given sample is $52,100.
Explanation of Solution
Calculation:
The data represent the incomes (in $1,000s) for a sample of 14 households. Five years ago the median annual household income was $55,500.
Software procedure:
Step by step procedure to obtain the descriptive statistics using EXCEL is as follows:
- In an EXCEL sheet enter the values of the sample and label it as Sample.
- Go to Data > Data Analysis (in case it is not default, take the Analysis ToolPak from Excel Add Ins) > Descriptive statistics.
- Enter Input Range as $A$2:$A$15, select Columns in Grouped By, tick on Summary statistics.
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the median is 52.1.
Thus, the median household income for the given sample is $52,100.
b.

Find the percentage change in the median household income from five years ago to today.
Answer to Problem 68SE
The percentage change in the median household income from five years ago to today is decreased by 6.1%.
Explanation of Solution
Calculation:
Five years ago, the median annual household income was $55,500 (55.5).
The median household income for the given sample is $52,100 (52.1).
The percentage change in the median household income from five years ago to today can be obtained as given below:
Thus, the percentage change in the median household income from five years ago to today is decreased by 6.1%.
c.

Find the first and third quartiles.
Answer to Problem 68SE
The first and third quartiles are 50.75, and 52.6, respectively.
Explanation of Solution
Calculation:
First quartile:
The EXCEL function to compute first quartile is
Software Procedure:
Step by step procedure to obtain the first quartile using EXCEL is as follows:
- Open an EXCEL file.
- Enter the data in the column J in cells J1 to J14.
- In a cell A1, enter the formula QUARTILE.EXC (J1:J14,1).
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the first quartile of the sample data is 50.75.
Third quartile:
The EXCEL function to compute third quartile is
Software Procedure:
Step by step procedure to obtain the third quartile using EXCEL is as follows:
- Open an EXCEL file.
- Enter the data in the column J in cells J1 to J14.
- In a cell A1, enter the formula QUARTILE.EXC (J1:J14,3).
- Click on OK.
Output using EXCEL is given as follows:
From the EXCEL output, the third quartile of the sample data is 52.6.
Thus, the first and third quartiles are 50.75, and 52.6, respectively.
d.

Find the five number summary for the data.
Answer to Problem 68SE
The five-number summary of the data is 46.5, 50.75, 52.1, 52.6 and 64.5.
Explanation of Solution
Calculation:
The five number summary consists the values of minimum, first quartile, second quartile, third quartile, maximum.
From the EXCEL output obtained in Part (a), the maximum, median, and minimum values are 64.5, and 46.5, respectively.
From Part (c), the first and third quartiles of the dataset are 50.75 and 52.6, respectively.
The quartiles of the data set are
Thus, the five-number summary of the dataset is given below:
- Minimum: 46.5,
- First quartile: 50.75,
- Median: 52.1,
- Third quartile: 52.6,
- Maximum: 64.5.
e.

Check for the outliers in the dataset by using the z-score approach.
Check for the outliers in the dataset by using quartiles and interquartile range.
Check whether or not result obtained using z-score approach matches with the result obtained using quartiles and interquartile range.
Answer to Problem 68SE
The outlier using z-score approach is 64.5.
The outliers using quartiles and interquartile range are 49.4, and 64.5.
The result obtained using z-score approach does not matches with the result obtained using quartiles and interquartile range.
Explanation of Solution
Calculation:
From the EXCEL output obtained in Part (a), the mean and standard deviation of the dataset are 52.2 and 4, respectively.
The formula for z-score is given below:
Where,
In a z-score approach, the data points with the z-score above +3 and the data points with the z-score below –3 are considered as outliers.
The z-score corresponding to the data point 49.4 can be obtained as follows:
Substitute
Thus, the z-score corresponding to 49.4 is –0.70.
Similarly, the z-score corresponding to other data points can be obtained as follows:
Data points | z-score |
46.5 | –1.42 |
48.7 | –0.87 |
49.4 | –0.70 |
51.2 | –0.25 |
51.3 | –0.22 |
51.6 | –0.15 |
52.1 | –0.02 |
52.1 | –0.02 |
52.2 | 0.00 |
52.4 | 0.05 |
52.5 | 0.07 |
52.9 | 0.17 |
53.4 | 0.30 |
64.5 | 3.07 |
From the table, it can be seen that the z-score corresponding 64.5 is greater than 3 standard deviations. Thus, it can be considered as outliers an outlier.
The IQR can be obtained as follows:
Substitute
Thus, the interquartile range is 1.85.
The formula for lower limit is obtained as follows:
Here,
Substitute
Thus, the lower limit is 49.825.
The formula for upper limit is obtained as follows:
Substitute
Thus, the upper limit is 55.375.
Outliers:
The outlier is the observational point that is distant from the remaining observational points. In other words outlier is an observation that lies in an abnormal distance from the remaining values.
In the present scenario, the data points that are less than lower limit (49.825) and the data points that are greater than upper limit (55.375) are considered as outliers.
The data point (49.4) is less than 49.825. Thus, it can be considered as an outlier.
The last observation (64.5) is greater than 55.375. Thus, it can also be considered as an outlier.
Hence, the dataset consists of two outliers, 49.4 and 64.5.
Using the z-score approach, there exists only one outlier (64.5). In the second approach there exist two outliers, 49.4 and 64.5.
Thus, the result obtained using z-score approach does not match with the result obtained using quartiles and interquartile range.
Want to see more full solutions like this?
Chapter 3 Solutions
Essentials Of Statistics For Business & Economics
- 4. Dynamic regression (adapted from Q10.4 in Hyndman & Athanasopoulos) This exercise concerns aus_accommodation: the total quarterly takings from accommodation and the room occupancy level for hotels, motels, and guest houses in Australia, between January 1998 and June 2016. Total quarterly takings are in millions of Australian dollars. a. Perform inflation adjustment for Takings (using the CPI column), creating a new column in the tsibble called Adj Takings. b. For each state, fit a dynamic regression model of Adj Takings with seasonal dummy variables, a piecewise linear time trend with one knot at 2008 Q1, and ARIMA errors. c. What model was fitted for the state of Victoria? Does the time series exhibit constant seasonality? d. Check that the residuals of the model in c) look like white noise.arrow_forwardce- 216 Answer the following, using the figures and tables from the age versus bone loss data in 2010 Questions 2 and 12: a. For what ages is it reasonable to use the regression line to predict bone loss? b. Interpret the slope in the context of this wolf X problem. y min ball bas oft c. Using the data from the study, can you say that age causes bone loss? srls to sqota bri vo X 1931s aqsini-Y ST.0 0 Isups Iq nsalst ever tom vam noboslios tsb a ti segood insvla villemari aixs-Yediarrow_forward120 110 110 100 90 80 Total Score Scatterplot of Total Score vs. Putts grit bas 70- 20 25 30 35 40 45 50 Puttsarrow_forward
- 10 15 Answer the following, using the figures and tables from the temperature versus coffee sales data from Questions 1 and 11: a. How many coffees should the manager prepare to make if the temperature is 32°F? b. As the temperature drops, how much more coffee will consumers purchase?ov (Hint: Use the slope.) 21 bru sug c. For what temperature values does the voy marw regression line make the best predictions? al X al 1090391-Yrit,vewolf 30-X Inlog arts bauoxs 268 PART 4 Statistical Studies and the Hunt forarrow_forward18 Using the results from the rainfall versus corn production data in Question 14, answer DOV 15 the following: a. Find and interpret the slope in the con- text of this problem. 79 b. Find the Y-intercept in the context of this problem. alb to sig c. Can the Y-intercept be interpreted here? (.ob or grinisiques xs as 101 gniwollol edt 958 orb sz) asiques sich ed: flow wo PEMAIarrow_forwardVariable Total score (Y) Putts hit (X) Mean. 93.900 35.780 Standard Deviation 7.717 4.554 Correlation 0.896arrow_forward
- 17 Referring to the figures and tables from the golf data in Questions 3 and 13, what hap- pens as you keep increasing X? Does Y increase forever? Explain. comis word ே om zol 6 svari woy wol visy alto su and vibed si s'ablow it bas akiog vino b tad) beil Bopara Aon csu How wod griz -do 30 義arrow_forwardVariable Temperature (X) Coffees sold (Y) Mean 35.08 29,913 Standard Deviation 16.29 12,174 Correlation -0.741arrow_forward13 A golf analyst measures the total score and number of putts hit for 100 rounds of golf an amateur plays; you can see the summary of statistics in the following table. (See the figure in Question 3 for a scatterplot of this data.)noitoloqpics bella a. Is it reasonable to use a line to fit this data? Explain. 101 250 b. Find the equation of the best fitting 15er regression line. ad aufstuess som 'moob Y lo esulav in X ni ognado a tad Variable on Mean Standard Correlation 92 Deviation Total score (Y) 93.900 7.717 0.896 Putts hit (X) 35.780 4.554 totenololbenq axlam riso voy X to asulisy datdw gribol anil er 08,080.0 zl noitsism.A How atharrow_forward
- Variable Bone loss (Y) Age (X) Mean 35.008. 67.992 Standard Deviation 7.684 10.673 Correlation 0.574arrow_forward50 Bone Loss 30 40 20 Scatterplot of Bone Loss vs. Age . [902) 10 50 60 70 80 90 Age a sub adi u xinq (20) E 4 adw I- nyd med ivia .0 What does a scatterplot that shows no linear relationship between X and Y look like?arrow_forwardVariable Temperature (X) Coffees sold (Y) Mean 35.08 29,913 Standard Deviation 16.29 12,174 Correlation -0.741arrow_forward
- Big Ideas Math A Bridge To Success Algebra 1: Stu...AlgebraISBN:9781680331141Author:HOUGHTON MIFFLIN HARCOURTPublisher:Houghton Mifflin HarcourtGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw HillHolt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGAL


