11

.pdf

School

University of Guelph *

*We aren’t endorsed by this school

Course

2040

Subject

Statistics

Date

Apr 3, 2024

Type

pdf

Pages

9

Uploaded by CoachTeamWasp34

Report
Introductory Statistics Explained (1.11) Exercises Inference for Proportions © 2022, 2023, 2024 Jeremy Balka Chapter 11: Inference for one or two proportions. v1.11 W24 Draft J.B.’s strongly suggested exercises: 2 , 5 , 6 , 8 , 10 , 12 , 16 , 17 , 19 , 20 , 24 , 25 , 26 , 27 NB The section titles and numbers are not yet synced up with the text. 1 Introduction 2 The Sampling Distribution of ˆ p 1. What are the mean and variance of the sampling distribution of ˆ p ? 2. For what values of n and p does the normal approximation to the distribution of ˆ p work best? For what values of n and p is the normal approximation very poor? 3 Confidence Intervals and Hypothesis Tests for the Popula- tion Proportion p 3. When do we create a confidence interval for ˆ p ? 4. A random sample of 200 observations from a population revealed that 62 individuals had a certain characteristic. (a) What is ˆ p ? (b) What is SE p ) ? (c) What is a 95% confidence interval for p ? (d) If we wish to test the null hypothesis that p = 0 . 5 , what is SE 0 p ) ? (e) What is the value of the appropriate test statistic? 5. In each of the following scenarios, state if it is reasonable to use the normal approximation to calculate a confidence interval for p . (Use the n ˆ p 15 , n (1 - ˆ p ) 15 guideline.) (a) ˆ p = 0 . 70 , n = 10 . (b) ˆ p = 0 . 70 , n = 200 . (c) ˆ p = 0 . 99 , n = 200 . 1 Best n is large Das worst nis small p close to 0 or 1 Nevercan only do CI for parameters up to 0.7 7 nil p roll on 3 not both is so donot use normal approx
(d) ˆ p = 0 . 01 , n = 10,000. (e) ˆ p = 0 , n = 200 . 6. A study 1 of births in Liverpool, UK, investigated a possible relationship between parental smoking status during pregnancy and the likelihood of a male birth. In one part of the study, researchers drew a sample of 363 births in which both parents were heavy smokers during the pregnancy. Of the 363 babies born to these couples, 158 were male. Suppose we wish to construct a 95% confidence interval for the population proportion of male births to heavy-smoking parents in Liverpool. (a) What does p represent in this case? (b) What is the point estimate of p ? (c) Is it reasonable to use large sample methods to calculate a confidence interval for p here? (d) What is the standard error of the sample proportion? (e) What is a 95% confidence interval for p ? (f) Give an interpretation of the 95% confidence interval for p . (g) Test the null hypothesis that the population proportion of male births to heavy-smoking parents is 0.50, against a two sided alternative. Give appropriate hypotheses, standard error, value of the test statistic, p -value, and conclusion at = 0 . 05 . (h) To what population do your conclusions apply? 4 Determining the Minimum Sample Size n 7. Find the minimum sample size required in each of the following situations. (a) We wish to estimate p within 0.03 with 95% confidence, and we have no reasonable estimate of p beforehand. (b) We wish to estimate p within 0.03 with 95% confidence, and from prior information we feel strongly that p is approximately 0.20. (c) We wish to estimate p within 0.01 with 90% confidence, and we have no reasonable estimate of p beforehand. (d) We wish to estimate p within 0.01 with 99% confidence, and we have no reasonable estimate of p beforehand. (e) We wish to estimate p within 0.01 with 99% confidence, and we know for certain that p lies between 0.1 and 0.2. 5 Inference Procedures for the Di ff erence Between Two Pop- ulation Proportions 5.1 The Sampling Distribution of ˆ p 1 - ˆ p 2 5.2 Confidence Intervals and Hypothesis Tests for p 1 - p 2 8. In words, what is the meaning of SE p 1 - ˆ p 2 ) ? 9. A random sample of 400 observations from population 1 revealed that 82 individuals had a certain characteristic. A random sample of 400 observations from population 2 revealed that 104 individuals had that characteristic. 1 Koshy et al. (2010). Parental smoking and increased likelihood of female births. Annals of Human Biology , 37(6):789–800. np o so 100000000 g 178 Effort gg male births to smokers in Liverpool in time frame or study tassen s on t's Ho p as Ha p prove normal 2.48111 2 I I I
(a) What are the values of ˆ p 1 and ˆ p 2 ? (b) What is SE p 1 - ˆ p 2 ) ? (c) Calculate a 95% confidence interval for p 1 - p 2 . (d) Test the null hypothesis that the population proportions are equal, against a two-sided alter- native hypothesis. Give appropriate hypotheses, value of the pooled proportion ˆ p , value of the standard error, test statistic and p -value. Is there significant evidence against the null hypothesis at = 0 . 05 ? 10. Much research has gone into studying how homing pigeons are able to navigate to their home loft from an unfamiliar release point, but the precise mechanisms are still unknown. It is known that homing pigeons can detect the earth’s magnetic field, and that is likely a contributing factor in their ability to navigate. A study 2 investigated this in an experiment involving 77 homing pigeons. The pigeons were randomly divided into a magnetic pulse group (group M) and a control group (group C). The 38 control group pigeons were released at a location 106 km from the home loft, and 22 found their way home. The 39 members of the magnetic pulse group received a strong magnetic pulse (perpendicular to the earth’s magnetic field) before being released from the same location. Twenty-one of the 39 magnetic pulse group pigeons made it back to the home loft. (a) Calculate a 95% confidence interval for the di ff erence between the population proportion of pigeons that navigate home after being subjected to the magnetic pulse, and the population proportion for the control group. (Hint to ease the calculation burden: SE p M - ˆ p C ) = 0 . 1131 .) (b) Give an appropriate interpretation of the interval found in 10a . (c) Perform a hypothesis test of the null hypothesis that the two groups have the same likeli- hood of making it back to their home loft. Give appropriate hypotheses, value of the test statistic, p -value, and conclusion. (One could make an argument for the one-sided alterna- tive hypothesis that the magnetic pulse reduces the probability of a pigeon arriving home, but play it safe and use a two-sided alternative hypothesis.) (Hint to ease the calculation burden: SE 0 p M - ˆ p C ) = 0 . 1132 .) 6 Chapter Exercises 6.1 Basic Calculations 11. Calculate the p -value in the following situations. (a) H 0 : p = 0 . 3 , H a : p > 0 . 3 , Z = - 1 . 40 . (b) H 0 : p = 0 . 3 , H a : p < 0 . 3 , Z = - 1 . 40 . (c) H 0 : p = 0 . 3 , H a : p 6 = 0 . 3 , Z = - 1 . 40 . (d) H 0 : p = 0 . 6 , H a : p < 0 . 6 , Z = - 1 . 88 . (e) H 0 : p = 0 . 6 , H a : p 6 = 0 . 6 , Z = - 1 . 88 . 6.2 Concepts 12. What is the di ff erence in meaning of the symbols ˆ p and p ? 13. Would it ever make sense to test H 0 : ˆ p = 0 . 25 ? 14. What is the meaning of the term SE p ) ? Does the term SE ( p ) have a similar meaning? 2 Holland et al. (2013). A magnetic pulse does not a ff ect homing pigeon navigation: a gps tracking experiment. The Journal of Experimental Biology , 216:2192–2200. Pc 22138 0 s.IE.IE iEiiE i ii ii iii iii iii iii iiiiiii.fi no evidence be small prime iii iii iii p.si proportion p the portion for entire population ly Nowe test hypothesis about parameters not statistics estima
15. In each of the following scenarios, state whether using the normal approximation to carry out a hypothesis test is reasonable. (Use the np 0 15 , n (1 - p 0 ) 15 guideline.) (a) H 0 : p = 0 . 20 , n = 10 . (b) H 0 : p = 0 . 20 , n = 200 . (c) H 0 : p = 0 . 9980 , n = 10 . (d) H 0 : p = 0 . 9980 , n = 1,000. (e) H 0 : p = 0 . 9980 , n = 100,000. 16. An Ipsos-Reid poll asked Canadians whether schools in their community have become “less safe” than they were five years ago. Fifty percent of the respondents said schools were less safe than 5 years ago. The corresponding confidence interval for p was found to be (0 . 46 , 0 . 54) . There are some potential sources of bias in surveys like this one. For example, the question could be biased (e.g. “With the increase in violent behaviour, schools have become less safe, wouldn’t you agree?”) The folks at Ipsos are professionals, and wouldn’t give questions this blatantly biased. But if our hope is to estimate the proportion of all Canadians who feel that schools have becomes less safe, what are some other potential sources of bias? 17. Test your conceptual understanding: Which of the following statements are true? You should be able to explain why a statement is true or why a statement is false. (a) ˆ p is an unbiased estimator of p . (b) When n < 30 we should use the t distribution when calculating confidence intervals for p . (c) The true distribution of ˆ p is based on the binomial distribution. (d) The true standard deviation of the sampling distribution of ˆ p depends on the value of p . (e) The sampling distribution of ˆ p is perfectly normal for large sample sizes. 18. Test your conceptual understanding: Which of the following statements are true? You should be able to explain why a statement is true or why a statement is false. (a) The normal approximation to the sampling distribution of ˆ p works best when we have a large sample size and p = 0 . 5 . (b) The sampling distribution of ˆ p becomes more normal as p tends to 1. (c) The sampling distribution of p is approximately normal for large sample sizes. (d) All else being equal, the value of SE p ) decreases as the sample size increases. (e) In repeated sampling, exactly 95% of 95% confidence intervals for p will capture p . 6.3 Applications 19. Consider again the information in Question 10 . In part of the experiment, 38 control group pigeons were released from an unfamiliar site 106 km from their home loft, and 22 of these pigeons were able to successfully navigate home. Suppose we wish to construct a confidence interval for the true proportion of control group pigeons that will find their way to their home loft (under the conditions of the study). (a) What does p represent in this case? (b) What is the point estimate of p ? (c) Is it reasonable to use large sample methods to calculate a confidence interval for p here? (d) What is the standard error of the sample proportion? (e) What is a 95% confidence interval for p ? phone accessibility doesnt incural to that mug up a III onus T sine Eli P p m binomial diff f ftp.npfd P F itsan approximation me proportion ofthis type of pigeon under these conditions F 2238 0.5789 22 is so yes SE E as.IE I i 0.42 o.nu
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help