Introduction To Statistics And Data Analysis
Introduction To Statistics And Data Analysis
6th Edition
ISBN: 9781337793612
Author: PECK, Roxy.
Publisher: Cengage Learning,
bartleby

Videos

Textbook Question
Book Icon
Chapter 6.4, Problem 52E

The paper “Good for Women, Good for Men, Bad for People: Simpson’s Paradox and the Importance of Sex-Specific Analysis in Observational Studies” (Journal of Women’s Health and Gender-Based Medicine [2001]: 867-872) described the results of a medical study in which one treatment was shown to be better for men and better for women than a competing treatment. However, if the data for men and women are combined, it appears as though the competing treatment is better.

To see how this can happen, consider the accompanying data tables constructed from information in the paper. Subjects in the study were given either Treatment A or Treatment B, and survival was noted. Let S be the event that a patient selected at random survives, A be the event that a patient selected at random received Treatment A, and B be the event that a patient selected at random received Treatment B.

  1. a. The following table summarizes data for men and women combined:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  1

  1. i. Find P(S).
  2. ii. Find P(S|A).
  3. iii. Find P(S|B).
  4. iv. Which treatment appears to be better?
  5. b. Now consider the summary data for the men who participated in the study:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  2

  1. v. Find P(S).
  2. vi. Find P(S|A).
  3. vii. Find P(S|B).
  4. viii. Which treatment appears to be better?
  5. c. Now consider the summary data for the women who participated in the study:

Chapter 6.4, Problem 52E, The paper Good for Women, Good for Men, Bad for People: Simpsons Paradox and the Importance of , example  3

  1. ix. Find P(S). looks like Treatment B is better. This is an
  2. x. Find P(S|A).
  3. xi. Find P(S|B).
  4. xii. Which treatment appears to be better?
  5. d. You should have noticed from Parts (b) and (c) that for both men and women, Treatment A appears to be better. But in Part (a), when the data for men and women are combined, it looks like Treatment B is better. This is an example of what is called Simpson’s paradox. Write a brief explanation of why this apparent inconsistency occurs for this data set. (Hint: Do men and women respond similarly to the two treatments?)

a.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.76.

ii. The value of P(S|A) is 0.717.

iii. The value of P(S|B) is 0.803.

iv. Treatment B is better than Treatment A.

Explanation of Solution

Calculation:

The given information is the summary table of the survey. Event S denotes the event that a patient selected at random and survives, event A denotes that a patient selected at random received Treatment A, and B denotes the event that a patient selected at random and received Treatment B.

i.

The probability of any event A is given below:

P(A)=Number of outcomes in ATotal number of outcomes in the samplespace

The total number of randomly selected patient is 600.

The total number of patient selected at random survives is 456.

The probability of a randomly selected patients and who survive is calculated as follows:

P(S)=456600=0.76

Thus, the probability of a randomly selected patients  who survive is 0.76.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patient selected at random and received Treatment A is 300.

The number of patient selected at random and received Treatment A and survive is 215.

The probability that the selected patients at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=215300=0.717

Thus, the value of P(S|A) is equal to 0.717.

iii.

The total number of patient selected at random and received Treatment B is 300.

The number of patient selected at random that received Treatment B and survive is 241.

The probability that the selected patient at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=241300=0.803

Thus, the value of P(S|B) is equal to 0.803.

iv.

The probability of patient who received Treatment B survived more than that of Treatment A.

Thus, Treatment B is better than Treatment A.

b.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.583.

ii. The value of P(S|A) is 0.6.

iii. The value of P(S|B) is 0.5.

iv. Treatment A is better than Treatment B.

Explanation of Solution

Calculation:

The given information is the summary table of the survey.

i.

The total number of randomly selected patient is 240.

The total number of patient selected at random and survives is 140.

The probability of a randomly selected patients who survive is calculated as follows:

P(S)=140240=0.583

Thus, the probability of a randomly selected patients who survive is 0.583.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patients selected at random that received Treatment A is 200.

The number of patient selected at random that received Treatment A and survives is 120.

The probability that the selected patient at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=120200=0.6

Thus, the value of P(S|A) is equal to 0.6.

iii.

The total number of patients selected at random that received Treatment B is 40.

The number of patient selected at random that received Treatment B and survive is 20.

The probability that the selected patient at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=2040=0.5

Thus, the value of P(S|B) is equal to 0.5.

iv.

The probability of patient who received Treatment A survived more than that of Treatment B.

Thus, Treatment A is better than Treatment B.

c.

Expert Solution
Check Mark
To determine

i. Compute P(S).

ii. Obtain P(S|A).

iii. Calculate P(S|B).

iv. Find the better treatment.

Answer to Problem 52E

i. The value of P(S) is 0.878.

ii. The value of P(S|A) is 0.95.

iii. The value of P(S|B) is 0.85.

iv. Treatment A is better than Treatment B.

Explanation of Solution

Calculation:

The given information is the summary table of the survey.

i.

The total number of randomly selected patient is 360.

The total number of patient selected at random that survive is 316.

The probability of a randomly selected patients who survive is calculated as follows:

P(S)=316360=0.878

Thus, the probability of a randomly selected patients who survive is 0.878.

ii.

Conditional rule:

The formula for probability of E given F is, P(E|F)=n(EF)n(F).

The total number of patient selected at random that received Treatment A is 100.

The number of patient selected at random that received Treatment A and survive is 95.

The probability that the selected patients at random received Treatment A, given that the patient selected at random survives. It is calculated as follows:

P(S|A)=95100=0.95

Thus, the value of P(S|A) is equal to 0.95.

iii.

The total number of patient selected at random that received Treatment B is 260.

The number of patient selected at random that received Treatment B and survive is 221.

The probability that the selected patients at random received Treatment B, given that the patient selected at random survives. It is calculated as follows:

P(S|B)=221260=0.85

Thus, the value of P(S|B) is equal to 0.85.

iv.

The probability of patients who received Treatment A survived more than that of Treatment B.

Thus, Treatment A is better than Treatment B.

d.

Expert Solution
Check Mark
To determine

Explain the reason for the existence of apparent inconsistency in the data.

Explanation of Solution

From part (a), (b) and (c), it can be observed that Treatment A performs better than that of Treatment B, except part (a). In part (a), the data for men and women are combined. Thus, Treatment B performs better than that of Treatment A.

Want to see more full solutions like this?

Subscribe now to access step-by-step solutions to millions of textbook problems written by subject matter experts!
Students have asked these similar questions
A study in Sweden looked at former elite soccer players, people who had played soccer but not at the elite level, and people of the same age who did not play soccer. Here is a two-way table that classifies these subjects by whether or not they had arthritis of the hip or knee by their mid-fifties:   Elite Non-elite Did not play Arthritis 10 9 24 No arthritis 61 206 548   Based on this study, you can conclude that
In the book Business Research Methods (5th ed.), Donald R. Cooper and C. William Emory discuss studying the relationship between on-the-job accidents and smoking. Cooper and Emory describe the study as follows:   Suppose a manager implementing a smoke-free workplace policy is interested in whether smoking affects worker accidents. Since the company has complete reports of on-the-job accidents, she draws a sample of names of workers who were involved in accidents during the last year. A similar sample from among workers who had no reported accidents in the last year is drawn. She interviews members of both groups to determine if they are smokers or not.   The sample results are given in the following table.     On-the-Job Accident Smoker Yes No Row Total Heavy 12   5   17   Moderate 9   10   19   Nonsmoker 13   17   30   Column total 34   32   66       Expected counts are below observed counts   Accident No Accident Total Heavy 12   5   17     8.76   8.24…
A company institutes an exercise break for its workers to see if it will improve job​ satisfaction, as measured by a questionnaire that assesses​ workers' satisfaction before and after the implementation of the program. Using an appropriate nonparametric procedure and α=​0.05, does the data indicate that that an exercise break for the workers improved job​ satisfaction? Worker Number    1    2    3    4    5    6    7    8    9    10Before    36    28    30    46    26    26    24    16    15    28After    33    36    49    41    36    40    39    21    20    37 1. Using the Normal​ approximation, find the value of the test statistic. 2 .Find the​ P-value for the test statistic. 3. Choose the correct conclusion below.   A. The after exercise program job satisfaction scores are systematically higher when compared to the before exercise program job satisfaction scores.   B. The after exercise program job satisfaction scores seem to be systematically equal when compared to the…

Chapter 6 Solutions

Introduction To Statistics And Data Analysis

Ch. 6.1 - Refer to the previous exercise and now suppose...Ch. 6.1 - A family consisting of three peopleP1, P2, and...Ch. 6.1 - Prob. 13ECh. 6.1 - An engineering construction firm is currently...Ch. 6.1 - For the events described in the previous exercise,...Ch. 6.1 - Consider a Venn diagram picturing two events A and...Ch. 6.3 - A large department store offers online ordering....Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - The manager of an online music store has kept...Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - A bookstore sells two types of books (fiction and...Ch. 6.3 - Consider the chance experiment described in the...Ch. 6.3 - Medical insurance statuscovered (C) or not covered...Ch. 6.3 - Roulette is a game of chance that involves...Ch. 6.3 - Phoenix is a hub for a large airline. Suppose that...Ch. 6.3 - A customer satisfaction survey is planned. The...Ch. 6.3 - A professor assigns five problems to be completed...Ch. 6.3 - Refer to the following information on full-term...Ch. 6.3 - The report Teens, Social Media Technology...Ch. 6.3 - According to The Chronicle for Higher Education...Ch. 6.3 - The same issue of The Chronicle for Higher...Ch. 6.3 - A deck of 52 playing cards is mixed well, and 5...Ch. 6.3 - After all students have left the classroom, a...Ch. 6.3 - Use the information given in the previous exercise...Ch. 6.3 - The student council for a school of science and...Ch. 6.3 - A student placement center has requests from five...Ch. 6.3 - Suppose that a six-sided die is weighted so that...Ch. 6.4 - Two different airlines have a flight from Los...Ch. 6.4 - The article Chances Are You Know Someone with a...Ch. 6.4 - The accompanying data are from the article...Ch. 6.4 - Using the probabilities calculated in the previous...Ch. 6.4 - The following graphical display is similar to one...Ch. 6.4 - The article Americans Growing More Concerned About...Ch. 6.4 - The events E and T are defined as E = the event...Ch. 6.4 - The newspaper article Folic Acid Might Reduce Risk...Ch. 6.4 - Suppose that an individual is randomly selected...Ch. 6.4 - Is ultrasound a reliable method for determining...Ch. 6.4 - The paper Accuracy and Reliability of...Ch. 6.4 - The report 2015 Utah Seat Belt Use Survey (Utah...Ch. 6.4 - The National Highway Traffic Safety Administration...Ch. 6.4 - Use the information given in the previous exercise...Ch. 6.4 - The paper Good for Women, Good for Men, Bad for...Ch. 6.5 - Many fire stations handle emergency calls for...Ch. 6.5 - Refer to the information given in the previous...Ch. 6.5 - The paper Predictors of Complementary Therapy Use...Ch. 6.5 - The report TV Drama/Comedy Viewers and Health...Ch. 6.5 - The report Great Jobs, Great Lives. The...Ch. 6.5 - In a small city, approximately 15% of those...Ch. 6.5 - Jeanie is a bit forgetful, and if she doesnt make...Ch. 6.5 - Consider a system consisting of four components,...Ch. 6.5 - Consider the system described in the previous...Ch. 6.5 - In a January 2016 Harris Poll, each of 2252...Ch. 6.5 - Consider the following events: T = event that a...Ch. 6.5 - The following case study was reported in the...Ch. 6.5 - Three friends (A, B, and C) will participate in a...Ch. 6.5 - A store sells two different brands of dishwasher...Ch. 6.5 - The National Public Radio show Car Talk used to...Ch. 6.5 - Refer to the previous exercise. Suppose now that...Ch. 6.6 - A university has 10 vehicles available for use by...Ch. 6.6 - Prob. 70ECh. 6.6 - There are two traffic lights on Darlenes route...Ch. 6.6 - Let F denote the event that a randomly selected...Ch. 6.6 - According to a July 31, 2013 posting on cnn.com, a...Ch. 6.6 - Suppose that Blue Cab operates 15% of the taxis in...Ch. 6.6 - A large cable company reports the following: 80%...Ch. 6.6 - Refer to the information given in the previous...Ch. 6.6 - The authors of the paper Do Physicians Know When...Ch. 6.6 - A study of how people are using online services...Ch. 6.6 - The report Twitter in Higher Education: Usage...Ch. 6.6 - Use the information given in the previous exercise...Ch. 6.6 - Prob. 81ECh. 6.6 - Use the table of estimated probabilities from the...Ch. 6.6 - Suppose that we define the following events: C =...Ch. 6.6 - The article U.S. Investors Split Between Digital...Ch. 6.6 - Prob. 85ECh. 6.6 - The paper referenced in the previous exercise also...Ch. 6.6 - In an article that appears on the web site of the...Ch. 6.7 - The report Airline Quality Rating 2016...Ch. 6.7 - Five hundred first-year students at a state...Ch. 6.7 - Use the information given in the previous exercise...Ch. 6.7 - The table given below describes (approximately)...Ch. 6.7 - On April 1, 2010, the Bureau of the Census in the...Ch. 6.7 - Refer to the information given in the previous...Ch. 6.7 - Refer to the information given in Exercises 6.92...Ch. 6 - False positive results are not uncommon with...Ch. 6 - A company uses three different assembly linesA1,...Ch. 6 - Consider the following information about...Ch. 6 - Use the information given in the previous exercise...Ch. 6 - Use the information given in exercise 6.102 to...Ch. 6 - Prob. 105CRCh. 6 - The following table summarizing data on smoking...Ch. 6 - A study of the impact of seeking a second opinion...Ch. 6 - A company sends 40% of its overnight mail parcels...Ch. 6 - Prob. 109CRCh. 6 - Prob. 110CRCh. 6 - In a school machine shop, 60% of all machine...Ch. 6 - There are five faculty members in a certain...Ch. 6 - The general addition rule for three events states...Ch. 6 - A theater complex is currently showing four...Ch. 6 - Prob. 117CRCh. 6 - Suppose that a box contains 25 light bulbs, of...Ch. 6 - Return to Exercise 6.118, and suppose that 4 bulbs...Ch. 6 - A transmitter is sending a message using a binary...
Knowledge Booster
Background pattern image
Statistics
Learn more about
Need a deep-dive on the concept behind this application? Look no further. Learn more about this topic, statistics and related others by exploring similar questions and additional content below.
Similar questions
SEE MORE QUESTIONS
Recommended textbooks for you
Text book image
MATLAB: An Introduction with Applications
Statistics
ISBN:9781119256830
Author:Amos Gilat
Publisher:John Wiley & Sons Inc
Text book image
Probability and Statistics for Engineering and th...
Statistics
ISBN:9781305251809
Author:Jay L. Devore
Publisher:Cengage Learning
Text book image
Statistics for The Behavioral Sciences (MindTap C...
Statistics
ISBN:9781305504912
Author:Frederick J Gravetter, Larry B. Wallnau
Publisher:Cengage Learning
Text book image
Elementary Statistics: Picturing the World (7th E...
Statistics
ISBN:9780134683416
Author:Ron Larson, Betsy Farber
Publisher:PEARSON
Text book image
The Basic Practice of Statistics
Statistics
ISBN:9781319042578
Author:David S. Moore, William I. Notz, Michael A. Fligner
Publisher:W. H. Freeman
Text book image
Introduction to the Practice of Statistics
Statistics
ISBN:9781319013387
Author:David S. Moore, George P. McCabe, Bruce A. Craig
Publisher:W. H. Freeman
Hypothesis Testing using Confidence Interval Approach; Author: BUM2413 Applied Statistics UMP;https://www.youtube.com/watch?v=Hq1l3e9pLyY;License: Standard YouTube License, CC-BY
Hypothesis Testing - Difference of Two Means - Student's -Distribution & Normal Distribution; Author: The Organic Chemistry Tutor;https://www.youtube.com/watch?v=UcZwyzwWU7o;License: Standard Youtube License