Homework1PBHL-B302

.docx

School

Indiana University, Purdue University, Indianapolis *

*We aren’t endorsed by this school

Course

B302

Subject

Statistics

Date

Apr 3, 2024

Type

docx

Pages

5

Uploaded by JudgeTeam10483

Report
PBHL-B302 Homework #1 Name: Jacob Letterman Please note that many of these problems ask you to do two things - if you do only one, you can only be awarded half credit. 1. (12 pts) Explain (1) what is wrong with each of the following randomization procedures and (2) describe how you would do the randomization correctly. You should use your notes and think about things like sampling bias, and proper randomization schemes we discussed in the notes and how to employ them. (1) Ten subjects are to be assigned to two treatments, 5 to each. For each subject, a coin is tossed. If the coin comes up heads, the subject is assigned to the first treatment; if the coin comes up tails, the subject is assigned to the second treatment. Though the coin toss is somewhat random, this isn’t taking into account how after 10 flips, you may not get 5 heads and 5 tails, you may get 7 heads and 3 tails or other situations where neither are equal to each other. To fix this, just use a completely random design as it would give you a truly random group of five for each treatment. (2) Twenty students are to be used to evaluate a new treatment. Ten men are assigned to receive the treatment and 10 women are assigned to be the controls. The main thing wrong here is that the random blocked design isn’t right. What I mean is that with the design, if you have more than one treatment, then you have to have more groups within each block. An easy way to fix this would be by using a random block design, making four groups, 2 for control and 2 for treatment, with the 10 men and 10 women being divided evenly into each for accurate data. (3) An experiment will assign 40 rats to 4 different treatment conditions. The rats arrive from the supplier in batches of 10 rats and the treatment lasts two weeks. The first batch of 10 rats is randomly assigned to one of the four treatments, and data for these rats are collected. After a one-week break, another batch of 10 rats arrives and is assigned to one of the three remaining treatments. The process continues until the last batch of rats is given the treatment that has not been assigned to the three previous batches. The main problem here is that there is a chance the groups will have an uneven amount of people in them. Group 1 will, at most if you’re trying to split each batch evenly, have 3 rats, while group 4 will have at least 8, as the groups are closed off each new batch. The way that you fix is by having the groups be of equal size. You can keep the random assignments, but with having the groups it allows for not only have the same amount of data, but it keeps the 2. (12 pts) Explain (1) what is wrong with each of the following random selection procedures and (2) explain how you would do the random selection correctly. You need to use concepts from the notes, such as simple random sampling, stratified random sampling, to answer these questions. (1) A population of subjects is put in alphabetical order and a simple random sample of size 10 is taken by selecting the first 10 subjects in the list. The problem with this is that it isn’t really random, you are choosing your subjects based solely on the first ten people on a list. An easy fix for this is to just do a simple random sampling design. You can keep the alphabetical list, but by using actual randomization, it
reduces both the chance of people who have the same last name and people who are related/married. (2) To determine the reading level of an introductory statistics text, you evaluate all the written material in the third chapter. Though a random chapter was chosen, you don’t have enough data in order to justify any claim. A better way to do this was by doing simple random sampling on the entire textbook, and looking at several chapters, instead of just one. (3) You want to sample student opinions about a proposed change in procedures for changing majors. You hand out questionnaires to 100 students as they arrive for class at 7:30am. The problem with this is that you are only looking at one subset of students at a specific time. A way to fix this is by going at several different times and handing out a few surveys at a time so that you can get a wide variety of people who answered the survey, rather than just a small subset of them. 3. (6 pts) Two drugs were tested to see whether they helped women who had breast cancer without lymph node involvement. The drugs are called TAC (docetaxel, doxorubicin, and cyclophosphamide) and FAC (fluorouracil, doxorubicin, and cyclophosphamide). About half of the 1060 women with breast cancer without lymph node involvement were randomly assigned to TAC and the other half were assigned to FAC. After 77 months, 472 out of 541 of the women assigned to TAC were alive and 425 out of 524 women assigned to FAC were alive (Source: Martin et al., Adjuvant docetaxel for high- risk, non-negative breast cancer, New England Journal of Medicine 363(23): 2200-2210) (1) Find the sample percentages of survival and compare them descriptively. TAC Survival Rate: 87.2% FAC Survival Rate: 81.1% Though there is a difference in terms of total people using each drug, the difference doesn’t really do anything, it’s only a difference of 17 people, meaning the percentages would still be around the same. (2) Was this an observational study or an experiment? Explain why. From studies like these, can we conclude a cause-and-effect relationship between the drug type and the survival percentage? Why or why not? This was an experiment as the researchers were directly involved with treatment, they gave each person a different drug that helped with their breast cancer. For studies like these, we can’t say that it was a cause-and-effect as there are too many factors involved. What I mean is that there could have external reasons, like old age or car accident, that caused the patient to die that is not due to the drugs administered.
Download the data file Lab1Data.csv from Canvas. This file contains the first name and School of major for students in a certain class. 4. (2 point) Fill in the following table with the names and School of major (e.g. Medicine, Public Health, Nursing, Dentistry, Science, etc.) of five of your college friends or acquaintances. These friends or acquaintances you fill in the table below can be students at IUPUI or any other university. 1 2 3 4 5 Name Caleb Seth Mike Abby Harry School Engineerin g Biology Education Nursing Public Health 5. (2 points) How many were Public Health students? Find the proportion of Public Health students in your sample by taking = 1/5 = 20% 6. (3 points) Is the collection of names in #4 a simple random sample (SRS) of college students in the US? Why or why not? Do you think there is any bias involved? If so, how? Do you think your own major has anything to do with the names that you collected? This isn’t a simple random sample of students in the US as this was a sample of 5 people, meaning that there isn’t enough data to make a case for a sample. There might be some bias as though the question asked for it, these are five people that I know personally, meaning they were chosen to be included rather than anyone else. Only one name has had to do with my major, I was originally in engineering before data science, and that’s Caleb, as I probably wouldn’t have met him if I wasn’t in engineering, but the rest of the names are people that I have known for quite a while. For the remaining problems, we will assume that the 24 individuals in Lab1Data.csv form a “population”. In truth, this is not the case, but we are doing this exercise to illustrate the concepts of sampling and sampling variability.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help