practice questions solution

.pdf

School

University of Washington *

*We aren’t endorsed by this school

Course

111

Subject

Statistics

Date

Feb 20, 2024

Type

pdf

Pages

5

Uploaded by MagistrateGoatMaster488

Report
1. Bags of M&Ms have six different colors: red, orange, yellow, green, blue, and brown. We are interested to see if all six colors occur in equal proporAon. We take a simple random sample of 1000 candies: 111 are blue, 225 are orange, 300 are green, 189 are red, 80 are yellow, and 95 are brown. To test the hypotheses, we have: Ho: All of the colors occur in equal proporAon (1/6) Ha: At least one of the colors occurs with proporAon not equal to 1/6 What are the expected counts for each color of M&M? _________ 2. In order to correctly perform a Chi-squared test for Goodness of Fit, the categories must be: a) ordered b) mutually exclusive c) small d) conAnuous 3. The regression equaAon for predicAng cost of a flight was fiZed using the following data The regression line is determined to be: predicted cost = 0.21(distance) + 57.18 What is true about the predicted value of the cost of a flight of 50 miles? a) It is expressed as: 0.21(50) + 57.18 = $67 b) The predicted price would be very realisAc because some of the airfares were less than $90 c) It is not appropriate to predict the flight's cost because the distance is out of the range of the observed distances. d) It is less than $106 4. Give a range of probabiliAes for the event that a t-distributed variable with ten degrees of freedom has an absolute value greater than 2.5. Put each number in decimal format with three decimal places, for example "0.125". (And be sure to put them in the correct order - the lower bound should be first!) _____< p < _____ 166.67 1000 166.67 chi Squared GOF is not appropriate for continuous data 0.01 0 o u d f 10 check the time 2228 t L 2.764
5. Julia is studying the cocoon length of a certain type of buZerfly, she manages to find a simple random sample of 84 cocoons and creates a 95% confidence interval of their lengths. When interpreAng the interval, she writes, "There is a 95% chance that the populaAon mean length is between 3.3 and 4.2 cenAmeters." Which of the following statements is correct? a) This interpretaAon is incorrect because once a sample is taken, the confidence interval will either contain the parameter or it will not, without reference to chance. b) This interpretaAon is incorrect because a confidence interval should not be constructed when we have a simple random sample. c) This interpretaAon is incorrect because a confidence interval predicts the sample mean length, not the populaAon mean length. d) This interpretaAon is correct. 6. Ager conducAng an internal review, a company finds that the event of having an above- average score on the interview is independent of having an above-average performance on the job. Which of the following is true? a) Having an above-average performance on the job and an above-average interview are mutually exclusive. b) The probability of having both an above-average interview and an above-average performance on the job is equal to the product of each individual probability. c) The probability of having an above-average performance on the job changes when adding the condiAon of having an above-average interview. d) The probability of having an above-average interview or above-average performance on the job is equal to the sum of each individual probability. 7. What is the probability of a standard normal variable having a value between -0.5 and 2.5? _______. 8. Which of the following must always be true regarding the data set used to create the box plot below? a) the data is normally distributed b) the median is less than the mean c) the median is about 70 d) none of the above is true We use confidence intervalto predict Population meaninstead of samplemean A j B We denote two eventsto be A B mutually exclusive givesus PCA n BE o ou Pl At B PIA t Pl B P An BkPIAPl B defof independence Mutually exclusive two events cannotoccur at the same time Independent turn they canoccur at the same time Checking zone 0.6835 o 9938 o 3085 0.6835 Z 2.5 P o9938 z 0.5 p o 3085 The box plot will show if a statistical data set is symmetric or skewed. When the median is in the middle of the box, and the whiskers are about the same on both sides of the box, then the distribution is symmetric. Additionally, the normal is not the only symmetric distribution, others distributions could present this box plot.
9. A dairy is trying to measure the weights of their cows to test whether their mean weight has dropped below a healthy value on a new feed. They took a sample size of 16 cows; they knew that the populaAon distribuAon was normal and decided to use the known populaAon standard deviaAon in their test. Which of the following describes which type of test they should use? a) Chi-Squared GOF b) Z Test c) T test d) Paired Test 10. A recent study focused on high-speed internet costs (in dollars) for households in a certain populaAon. Suppose it is known that for this populaAon the mean monthly high-speed internet cost is $50 and the populaAon standard deviaAon is $6. Suppose we are planning to take a random sample of 36 households from this populaAon and will be compuAng the sample mean monthly high-speed internet cost for these 36 households. Which of the following graphs provides the approximated distribuAon for the possible values of the sample mean monthly cost? a) Graph A b) Graph B c) Graph C 11. An athleAc trainer knows that the relaAonship between an athlete's resAng heart rate and heart rate ager exercise is posiAvely correlated. From a sample of 12 athletes, the athleAc trainer recorded each athlete's resAng heart rate, and heart rate, in beats per minute (bpm), ager 5 minutes of moderate exercise. On average, the resAng heart rate is 50, with an SD of 6 bpm. The heart rate ager exercise is on average 68 with an SD of 12 bpm. Complete the linear regression equaAon with the esAmated intercept. predicted heart rate ager exercise = 1.3 * resAng heart rate + ______ 12. A staAsAcian calculates the correlaAon between two variables, X and Y, and finds that it is very close to 1. Can we conclude that having a higher value of X will cause the value of Y to be larger? Explain. SE 1 so eachinterval Should be I 3 slope 1.3 intercept 68 1.3x50 3 No, because we have established that there is a high correlation doesn’t mean that one variable causes the value of the other to be larger
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help