For our Honors Math fall final this year we were assigned a task to collect 100 pieces of data from a survey question of our choice. With that question we surveyed 100 students from the same population. After collecting the responses we input the data into a spreadsheet to find the frequencies for each answer given. With those frequencies we then found the mean, median, mode, range, quartile 1, quartile 2, quartile 3, the interquartile range, maximum, minimum, outliers, standard deviation, and a z-score from the data. Further we were to create a hand drawn histogram and a box and whisker plot. Finding all of these is helping us come to a conclusion as to whether the responses to the survey question is normally distributed or not and if …show more content…
The next thing I found was the range. To find the range you take the maximum number in the set minus the minimum number in the set. In my data the range was 60. Next, I found the quartiles that would be used for my box and whisker plot. To find Quartile 1 you have to find the median of the data that falls below the median of the entire data. In my data Quartile 1 was 0. You do almost the same to find Quartile 3. You find the median of the upper half of the data. You can not use the median to find Quartile 3 so you have to use everything above it. Quartile 3 in my set of data was 14. Finding Quartile 2 is easy because it is the same as your median. To find any outliers you have to find the upper limits and the lower limits. To do this you must find the interquartile range first. To do that, you subtract Quartile 1 from Quartile 3. With the interquartile range, you then can add it to Quartile 3 and subtract it from Quartile 1. By adding the interquartile range to Quartile 3 you are creating the upper limit and by subtracting the interquartile range from Quartile 3 you are finding the lower limits. In this case all four of my outliers fell above the maximum limit of 36 because you can not fly negative times and 0 falls within the limits. Lastly, I found the standard deviation of my entire 100 pieces of data. To do this I wrote all the numbers that would be found of the x-axis ( 0 - 60 ). Then I found the average of all of
2. Based on the scale of measurement for each variable listed below, which measure of central tendency is most appropriate for describing the data?
5. In HANESS, the men age 18 and over had an average height of 69 inches and an SD of 3 inches. The histograms is show below, with a normal curve. The percentage of men with heights between 66 inches and 72 inches is exactly equal to the area between (a) and (b) under (c). This percentage is approximately equal to the area between (d) and (e) under the (f). Fill in the blanks.
Theoretically from the recorded data the calculated mean, median, and mode will be the most accurate representation of the real world value. The difference between the highest recorded value and lowest recorded value is the range in the set of data. Standard deviation (s) is a quantity calculated to indicate an extend of deviation for a group of data as a whole (Marshall). This is calculated using:
4. Calculate the following measures of central tendency for the set of cube measurement data. Show your work or explain your procedure for each.
5. Give the standard deviation for the mean and median column. Compare these and be sure to identify which has the least variability?
5. Give the standard deviation for the mean and median column. Compare these and be sure to identify which has the least variability?
2. Calculate descriptive statistics for the variable where students flipped a coin 10 times. Pull up Stat > Basic Statistics > Display Descriptive Statistics and set Variables: to the coin. The output will show up in your Session Window. Type the mean and the standard deviation here.
The data that I have is accurate because by looking at the histogram the graphs are perfectly skewed and, the data make sense, if I compare the data to the median and the IQR. And to find IQR subtract Quartile 3 from Quartile 1. The purpose of using mean value to see if the graph is symmetric or not but, in my case I do not have symmetrical graph or Boxplot, because it is sensitive to outliers. I choose to use the IQR instead of the standard deviation because it shows what is below the 1st quartile and above the 3rd quartile. To summarize, median and IQR are the best option because they resistant to outliers.
12. For the following scores, find the mean, median, sum of squared deviations, variance, and standard deviation:
standard deviation standardized value rescaling z-score normal model parameter statistic standard Normal model 68-95-99.7 Rule normal probability plot
a. What is a Z score for a car with a price of $ 33,000?
Mean would be the most appropriate measure of central tendency to describe this data. This is because the mean is the average of all scores in the data set. If Dr. Williams were to graph the data into a bell shaped distribution, then the mean would be in the center where most of the scores are located. The mean is calculated using all information of the data set, and is the best score to use if you want to predict an individual score.
2. Based on the scale of measurement for each variable listed below, which measure of central tendency is most appropriate for describing the data?
Q7: Create a bar graph or pie chart of the data you have collected. For example, you could create a bar graph that shows the employee turnover rate in organizations that do offer flexible work arrangement compared to organizations that do not. Alternately, you could create a pie chart that shows which types of flexible work arrangement workers want the most.
The data set seems to have many outliers and what is more the Q1 and Q3 – the quartiles are of a different distance from the median, which makes the whole distribution unsymmetrical. What is more, the whiskers are of a different length except for plasticizer manufacturing, which is generally the most symmetrical one, and is the closest to the