2
.pdf
keyboard_arrow_up
School
University of Guelph *
*We aren’t endorsed by this school
Course
2040
Subject
Statistics
Date
Apr 3, 2024
Type
Pages
9
Uploaded by CoachTeamWasp34
Introductory Statistics Explained:
Some Fundamental Concepts in Sampling
Exercises
©
2022, 2023, 2024 Jeremy Balka
Chapter 2: Some Fundamental Concepts in Sampling v1.11 W23 Draft
J.B.’s strongly suggested exercises:
1
,
2
,
3
,
4
,
5
,
6
,
8
,
9
,
12
,
15
,
17
,
18
,
21
,
24
,
29
,
31
,
32
NB The section titles and numbers are not yet synced up with the text.
1
Populations and Samples, Parameters and Statistics
1. Succinctly state the di
ff
erence between a parameter and a statistic.
2. Researchers are interested in estimating the average weight of three-year-old walleye in a lake.
A
netting program yielded 52 three-year-old walleye, with an average weight of 273 grams.
(a) What is the population of interest?
(b) What is the sample?
(c) Is the value 273 a parameter or a statistic?
(d) What is the parameter of interest in this scenario?
(e) Is it possible to know the value of the parameter of interest in this scenario?
2
Types of Sampling
2.1
Simple Random Sampling
3. An importer of used medical equipment wants to estimate the percentage of defective pacemakers
in a shipment of 200 pacemakers. They open the shipping container, remove the top 8 pacemakers,
and find that 0% of of these 8 pacemakers are defective.
Unknown to the importer, 10% of the
pacemakers in the entire shipment are defective.
(a) What is the population of interest?
(b) What is the sample?
(c) What value given above is a
statistic
?
(d) What value given above is a
parameter
?
(e) Would this sample above constitute a
simple random sample
?
1
3
yr
old
walleye
52
3ry
old
walleye
statistic
mean
or
yr
old
walleye
no
we
need
to
catch
them
all
200
pacemakers
8
No
only
top
were
sampled
4. Suppose a professor wishes to estimate the proportion of female students in a class of 50 students.
They could find out the true proportion without too much di
ffi
culty, but they decide instead to draw
a sample of 10 students. Each student’s ID number is put on a marble. The 50 marbles are put into
a box, and the box is shaken vigorously. Suppose after the vigorous shaking that the marbles can
be considered to be randomly distributed in the box. Without looking at the marbles, the professor
reaches into this box and pulls out 10 marbles without replacement. The 10 students with these ID
numbers are a sample from the population of 50 students. Which, if any, of the following statements
are true?
(a) The 50 students represent the population.
(b) The sample can be considered to be a simple random sample from the population.
(c) Each possible sample of size 10 has the same chance of being selected.
(d) The probability any individual in the population will be in the sample is 0.20.
2.2
Other Types of Random Sampling
5. A professor is interested in learning more about her students, and she decides that she is going to
draw a sample of 10 students from her population of 50.
She knows she should be drawing some
sort of random sample, but doesn’t quite know how to go about it. She considers the following two
sampling designs.
I. Put all the names of students on small pieces of paper, put them into a hat, and mix them up
such that the names are randomly distributed. Then she will pick 10 names without replace-
ment.
II. To ensure equal representation of males and females, she will have separate hats for males and
females. In the first hat she will mix up the names of the male students, then draw 5 names
without replacement. She will carry out the same procedure with the females, drawing 5 female
names. She will then pool these names together into a group of 10.
Which of the following statements are true?
(a) Sampling method I results a simple random sample.
(b) Sampling method II results in a stratified random sample.
(c) Both sampling designs are poorly constructed and should never be used.
3
Experiments and Observational Studies
6. Researchers are investigating the relative e
ff
ectiveness of two procedures that may prolong life in pa-
tients with advanced heart disease. Twenty volunteers are randomly assigned to the two procedures,
10 to procedure
A
and 10 to procedure
B
. The time until death is recorded.
(a) Is this an experiment or an observational study?
(b) What are the experimental units?
(c) What are the treatments?
(d) What is the response variable?
(e) If it is found that those patients from procedure
B
tended to live much longer on average, is it
reasonable to conclude that procedure
B
caused this increase?
T
T
T
T
T
T
stratum
may
remace
where
pop
students
observation
researcher
unit
control
other
variables
imagine
É
p
need
to
compare
to
other
studies
no
guaranteed
causal
relationship
b
c
all
other
variables
arent
controlled
is
there
a
plausible
explanation
7. Researchers are interested in the yield of four types of corn. There are 20 plots of land available,
and the researchers randomly assign the types of corn to the 20 plots of land (5 plots for each type
of corn). After the growing season, the yield (kg) of corn is measured on each of the 20 plots of land.
(a) Is this an observational study or an experiment?
(b) What are the experimental units?
(c) What are the treatments?
(d) What is the response variable?
8. Suppose a long-term observational study finds that children who watch an average of more than
50 hours of television a week are much more likely to be arrested by age 18 than those children
who watch less television. Based on this study, is it reasonable to conclude that increased television
viewing causes an increase in the likelihood of arrest?
4
Chapter Exercises
9. A study
1
investigated fast food consumption among students on a college campus in the southern
United States.
The researchers recruited students by posting flyers on campus, o
ff
ering to give
participants small giveaways and a chance to win a $100 gift card from the campus bookstore.
The 152 student participants completed several surveys, and one of the surveys involved fast food
consumption. In the analysis of the results of one of the surveys, the researchers found that male
students tended to spend more per month on fast food than female students did.
(a) What is the population of interest?
(b) Is the sample of students a simple random sample?
(c) What types of bias might be present in a study like this?
(d) Would it be possible to conduct a simple random sample of students at this university to
investigate fast food consumption?
10. A study
2
investigated physical characteristics of two species of lizard (
Phrynocephalus frontalis
and
P. versicolor
) found in a region of Inner Mongolia. The researchers were mainly interested in the
di
ff
erences between males and females within each species.
Researchers captured these lizards by
hand or by noose, and measured various physical characteristics.
(a) What are the populations of interest?
(b) Did the researchers use a simple random sampling design?
(c) What types of bias might be present in this study?
11. A study
3
investigated a possible e
ff
ect of a vitamin C supplement on the frequency and duration
of colds.
One thousand volunteers were randomly assigned to two groups.
One group received a
vitamin C supplement of 1g per day, the other group received a placebo. The participants then kept
track of variables related to cold frequency and severity for the duration of the study (approximately
2 months).
1
Heidal et al. (2012). Cost and calorie analysis of fast food consumption in college students.
Food and Nutrition Sciences
,
3:942–946
2
Qu et al. (2011).
Sexual dimorphism and female reproduction in two sympatric toad-headed lizards,
Phrynocephalus
frontalis
and
P. versicolor
(Agamidae).
Animal Biology
, 61:139–151
3
Anderson, T., Reid, D., and Beaton, G. (1972). Vitamin C and the common cold: a double-blind trial.
Canadian Medical
Association Journal
, 107:503–508
observational
b
c
diff
growing
environments
butif
random
diff
type
Espouse
to
breed
ofcom
other
is
Ip
msn.mg
fmobs
study
students
on
caring
a
iii
T.to
ti
Ima.rst
y
Yes
in
they
were
randomly
secreted
from
the
whole
population
2
species
of
lizard
no
b
c
now
due
been
selection
bias
when
catching
selection
bias
(a) This experiment was conducted as a
double-blind
experiment (neither the study participant
nor the researchers involved knew whether the participant was receiving the vitamin C or the
placebo, until after the study was completed and the data was summarized). Why would the
researchers set up the experiment in this way?
(b) The study began with 1000 participants, but 182 people dropped out before the end of the
study.
The data for the individuals that dropped out was thrown out and the analysis was
based on the remaining 818 participants. How might this bias the results?
12. Researchers investigated a possible e
ff
ect of exercise on tumour growth in rats.
Thirty rats were
injected with cancerous cells that would develop into a tumour.
These rats were then randomly
assigned to one of two groups (15 to each group). Rats in one of the groups were forced to run on a
treadmill for 30 minutes per day. Rats in the other group had no forced exercise. After one month,
the rats were sacrificed and the weight of the tumour was measured.
(a) What is the response variable?
(b) What is the explanatory variable?
(c) On one of the days, a rat caught his foot on the back of the treadmill and was badly injured.
The rat had to be sacrificed and omitted from the experiment. How might this bias the results?
Does this render the results of the study invalid?
13. Consider the following list of Central American countries:
•
Belize
•
Costa Rica
•
El Salvador
•
Guatemala
•
Honduras
•
Nicaragua
•
Panama
(a) Number the countries from 1–7, then pick a simple random sample of size 3 from this group
using the following random numbers:
98797 18334 57628 33215 32849.
What countries are selected? (Hint: We would ignore the first 2 numbers in the list (9,8), since
no country is labelled with a 9 or an 8, and the first country selected would be Panama (7).)
(b) Suppose instead of the above, we randomly pick a letter from A-Z using a random number
generator, then include the country in our sample if the country’s name begins with that letter.
If we continued in this fashion until 3 countries were selected, would that sample represent a
simple random sample
from the population?
14. None of the sampling methods in this question are perfect, but which one would yield the most
representative sample from its population? (This question requires a little thought, and the correct
answer is subject to debate.)
(a) A student wants to know the class average on a midterm exam that she recently wrote. At the
lecture following the midterm, on a Friday at 8:30 a.m., she arrives to class early and asks ten
students in the lecture room what their mark was on the test. She calculates her estimate of
the class average based on what these 10 students say they scored on the test.
(b) A student wants to investigate the average yearly income of the parents of university students.
While in line to obtain his student loan, he asks 10 people in line with him what their parents’
yearly income is, and takes an average of the responses.
so
no
biaswhen
reporting
data
uneth
ftp.tn.dk
dmhh
Y
au
w.is
er
EE
i
itiet
Ini
uneven
group
MAYBE
idk
3
Gutomats
Nicaragua
Belize
people
in
line
likely
r
poor
so
not
appropriate
representation
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
- Access to all documents
- Unlimited textbook solutions
- 24/7 expert homework help
Recommended textbooks for you
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill
Recommended textbooks for you
- Holt Mcdougal Larson Pre-algebra: Student Edition...AlgebraISBN:9780547587776Author:HOLT MCDOUGALPublisher:HOLT MCDOUGALGlencoe Algebra 1, Student Edition, 9780079039897...AlgebraISBN:9780079039897Author:CarterPublisher:McGraw Hill
Holt Mcdougal Larson Pre-algebra: Student Edition...
Algebra
ISBN:9780547587776
Author:HOLT MCDOUGAL
Publisher:HOLT MCDOUGAL
Glencoe Algebra 1, Student Edition, 9780079039897...
Algebra
ISBN:9780079039897
Author:Carter
Publisher:McGraw Hill