Homework_Assignment_1 - 5

.docx

School

Pennsylvania State University *

*We aren’t endorsed by this school

Course

200

Subject

Computer Science

Date

Dec 6, 2023

Type

docx

Pages

9

Uploaded by GeneralSummer13484

Report
Homework Assignment 1 DS200: Introduction to Data Sciences 2022 fall Please complete this assignment by entering your answers in this document. You can submit this Word document or a PDF export on Canvas. Problem 1: Sampling [0.5 points] Suppose that you work at a hospital, and you have to recruit participants for a medical study to test a new heart disease medication. Match the three examples below to the three sampling approaches. Examples : 1. You try to recruit the 100 patients with the highest blood pressure. 2. You order all patients by age and try to recruit every 500 th patient, starting with a randomly chosen patient from the first 500. 3. You store patient identifiers in an array called patients , apply numpy.random. choice (patients, 100) , and try to recruit the patients returned by this function. Sampling approaches : deterministic systematic random simple random Answer: 1: deterministic sampling 2: systematic random sampling 3: simple random sampling
Problem 2: Distribution [1 point] Consider the following distribution of values: Identify the following: median outlier 1 st quartile 95% percentile Answer: a: 1 st quartile b: median c: 95% percentile d: outlier
Problem 3: Association [1 point] Consider the following three scatter plots: What kind of association can you observe between X and Y in each figure? Explain your answer. Hint: Possible kinds of association are: positive association negative association no association Answer: First figure: negative association Second figure: positive association Third figure: no association
Problem 4: Causality [1 point] Suppose that a positive association was observed between the following three variables: number of tooth cavities, ounces of sugary drinks consumed, weight in pounds. Which of these three variables might be a confounding variable? What spurious conclusion may it cause? Motivate your answer. Answer: Though the correlation between the number of tooth cavities and the weight in pounds may be positive, that does not mean that higher weight makes a greater number of cavities or that a greater number of tooth cavities makes higher. Ounces of sugary drinks consumed is a confound ing variable: drinking more ounces of sugary drinks makes both heavier and a greater number of teeth, on average. Even if there is a correlation for the two groups (number of cavities and weight) that do not meet the conditions of the confounding variable, it is considered highly likely to be a coincidence.
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help