32206ED9-5A09-4F73-B833-2148DE43CB8C

.jpeg

School

Grant MacEwan University *

*We aren’t endorsed by this school

Course

151

Subject

Industrial Engineering

Date

Jan 9, 2024

Type

jpeg

Pages

1

Uploaded by BaronValor12859

Report
0 500000 1500000 2500000 3500000 VALUE IN THOUSANDS b) Create and paste a boxplot that summarizes the assessed residential dwelling values for the HIGHLANDS dwellings. What can you tell about the distribution of the data? (4 marks) The bulk of the data, between Q1 and Q3, falls between about 350K to 500K (very roughly, from the boxplot). There is a short tail to the left that shows some less expensive properties were assessed and a much larger tail to the right that shows several expensive properties were assessed, including one extreme outlier at 3.5 million. In this case, the mean will be pulled above the median as there are many more outliers to the right than the lefi. HIGHLANDS ASSESSED RESIDENTIAL DWELLING VALUES 1116 o 3500000 1 2500000 | THOUSANDS 1500000 1 = o6 =~ om 500000 1988 1 0 1 ¢)Find and paste a full set of descriptive measures for the entire population of assessed HIGHLANDS residential dwelling values. Choose the most appropriate descriptive measures and explain your choice (with reference to the shape of the data distribution). (4 marks) mean sd IQR 0% 25% 50% 75% 100% n 440340.1 207868 163250 500 339625 401000 502875 3422000 1l1le6 As the data is right skewed, the median of 401K is the most appropriate measure of centrality, and the IQR of 520K-340K = 162K is the most appropriate measure of spread. 50% of the assessed home values will fall between 340K and 502K. The distance from the minimum to Q1 is quite notable (about 338K), as is the very much larger distance of about 3500K from Q3 to the maximum. The distance from Q1 to the median is about 60K, while the distance from the median to Q3 is about 101K. The mean of 440K is pulled above the median of 401K by the values of the outlying highly assessed homes. Overall, if one ignores the outlying values, one can likely get a fairly nice home for around 400K in this neighbourhood. d) You will notice that the histogram distribution of assessed residential dwelling values that you found for the Highlands in the sample of size 15 taken in Part A does not match the shape of the distribution you found when you used the HIGHLANDS datafile with the assessed dwelling values for all residential dwellings in the Highlands in Part B. Furthermore, the boxplot of assessed residential dwelling values that you found for the Highlands in the sample of size 15 taken in Part A does not match the shape of the boxplot you found when you used the HIGHLANDS datafile with the assessed dwelling values for all residential dwellings in the Highlands. State how the shapes differ and explain why this may have happened. (4 marks) The sample did not obtain any of the very high values in the dataset, but did obtain a lower value from the dataset. Thus, the small sample (size 15) ended up with a left skewed distribution in spite of the actual population of all assessed values being right skewed!! Even though it was indicated that the sample was taken randomly (as appropriate), the random
Discover more documents: Sign up today!
Unlock a world of knowledge! Explore tailored content for a richer learning experience. Here's what you'll get:
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help