Question 3 and 4

.docx

School

Western Sydney University *

*We aren’t endorsed by this school

Course

200360

Subject

Statistics

Date

Apr 3, 2024

Type

docx

Pages

11

Uploaded by 19702001CD

Report
APPLIED PROJECT PART B – TERM 3 2021 STUDENT NAME STUDENT ID SIGNATURE Kristina Ubiparipovic 19828972 Chris Doughaim 20132864 Djurdja Saric 20219534 Description of the Data files
With your group, pick one dataset from Project Part A that you will use to answer Part B. If you are not sure of this talk to your teacher. : Provide a reason for choosing this dataset over the other datasets from Project Part A in your group The reason why we have chosen this dataset, is because we decided as a group which project would suit these questions and give us accurate results. As this dataset shows the right amount of information that gives us an understanding on how to answer each question correctly. The other datasets we could have chosen didn't have the right amount of information and we believed as a group it wouldn't have given us enough information to answer the following questions. Question 1 (7 marks) Marks All the working in EXCEL for this question must be submitted with the corresponding datafile. a. Using Excel, obtain a Descriptive Statistics output for the numerical variable chosen in the dataset. (Mean, Mode, Range and Standard Deviation)*. Write one paragraph describing the data set using the information about the mean, mode, range, standard deviation. Reading score
The standard deviation is the measure of how the data is related to the mean. As this standard deviation means high indicates data is more spread out. The range is the spread of the data from the highest to lowest of the distribution; our range on 66 is above average as it means it could have high variability or a low distribution. The mode score is 65 which means it's an avenge score meaning that the higher the mean score the higher the expectation. b. For the variable selected in part a) (using Excel to construct a histogram 8 classes)*. Write one paragraph describing the data set using the histogram. Comment on the shape of the distribution of the data. [3] The shape of the histogram is a bell curve, which is depicting the normal distribution also has a shape of a bell. The top of the curve shows the mean, mode, and median of the data collected. Its standard deviation depicts the bell curve's relative width around the mean.
c. Compare the median and mean in part a). Is there a link between your finding with your comments in Part b)? [1] Mean 66.109, Median 66 The distribution of the data is symmetric as the Mean, Median score is approximately equal to each other. d. Using an appropriate Excel output, construct a 90% confidence interval for the numerical variable chosen in part a). writing score reading score math score Mean 68.054 Mean 69.169 Mean 66.089 Standard Error 0.480529 Standard Error 0.461699 Standard Error 0.479499 Median 69 Median 70 Median 66 Mode 74 Mode 72 Mode 65 Standard Deviation 15.19566 Standard Deviation 14.60019 Standard Deviation 15.16308 Sample Variance 230.908 Sample Variance 213.1656 Sample Variance 229.919 Kurtosis -0.03336 Kurtosis -0.06827 Kurtosis 0.274964 Skewness -0.28944 Skewness -0.2591 Skewness -0.27894 Range 90 Range 83 Range 100 Minimum 10 Minimum 17 Minimum 0 Maximum 100 Maximum 100 Maximum 100
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help