F23_hw2

.docx

School

Rutgers University *

*We aren’t endorsed by this school

Course

225

Subject

Statistics

Date

Feb 20, 2024

Type

docx

Pages

7

Uploaded by LieutenantWildcat1908

Report
Homework 2 (82 pts) Data is accessible from the course website: Data and Resource > Data used in class> BikeProject.csv Note: 1. In the confidence interval problems, nota that components in a confidence interval include the point estimate, the critical value, the standard error, and the margin error. The result should be computed toward the end. For example, compute a CI as 5 ± 2 4 = 5 ± 8 =(− 3 , 13 ) 2. In computation problems, a basic rule is that you keep 3 or more significant decimal places for numbers during the working period and keep 2 or more significant decimal places at the number reported at the end. 3. In the fill-in-the-blank question, when denote or write the formula for a term, show both the general and the specific form based on the question . For example, the critical value for a one-sided t-test, H 0 μ = 0 , H a : μ > 0 is denoted by t ( 1 α ,n 1 ) = t ( 0.95 , 30 ) . The test statistic, t s , can be computed with a formula t s = Y s / n = 10 20 / 25 , and a value of 2.5, where the general form is t s = Y s / n , and the specific form is 10 20 / 25 , The p value can be computed with a formula Pr ( t > t s | μ 0 istrue ¿ = Pr ( t > 2.5 ,given μ 0 = 0 ) , where the general form is Pr ( t > t s | μ 0 istrue ¿ and the specific form is Pr ( t > 2.5 ,given μ 0 = 0 ) , Consider a simple linear regression Y ~ X, where X is the humidity and Y is the rental counts. The goal is to study the impact of X on Y. 1. (28) Complete the confidence interval questions. (1 pt each blank, no partial credit) a). (8) To estimate the mean response value of Y when X=0.5, the point estimate can be estimated as ^ Y h = β 0 + β 1 x h : ^ Y i = 378.88 303.59 ( 0.5 ) : (both the general formula and specific formula in this question)=_227.085 __(computed as this value). The standard error of this estimation is denoted _ S { ^ Y h } = s [ 1 n + ( x h X ) 2 Σ ( X i X ) 2 ] :171.66 [ 1 17379 + 0.016187 646.84 ] (both the general formula and specific formula in this question)=_ 1.559 _(computed as the value). At the significant level of 95%, the t-value is 1
denoted by t ( 1 α 2 ;n 2 ) : t ( 1 0.025 ; 17379 2 ) both the general formula and specific formula in this question)=___1.9601__(computed as this value). b). (8) To predict the single response (the next observation value), the point estimate can be estimated as __ ^ Y h = β 0 + β 1 x h : ^ Y i = 378.88 303.59 ( 0.5 ) : ___(both the general formula and specific formula in this question)=_ 227.085_(computed as this value). The standard error of this estimation is denoted S { Pred } = s [ 1 n + ( x h X ) 2 Σ ( X i X ) 2 + 1 ] : 171.66 [ 1 17379 + 0.016187 646.84 + 1 ] (both the general formula and specific formula in this question)=_171.66 _(computed as the value). At the significant level of 95%, the t-value is denoted by __ t ( 1 α 2 ;n 2 ) : t ( 1 0.025 ; 17379 2 ) _(both the general formula and specific formula in this question)=__1.9601___(computed as this value) c). (8) To predict the mean of m responses (the average of the next m observation values), the point estimate can be estimated as _ ^ Y h = β 0 + β 1 x h : ^ Y i = 378.88 303.59 ( 0.5 ) : (both the general formula and specific formula in this question)=_ 227.085 _(computed as this value, as m=3). The standard error of this estimation is denoted _ S Predmean 2 = s [ 1 n + ( x h X ) 2 Σ ( X i X ) 2 + 1 m ] :171.66 [ 1 17379 + 0.016187 646.84 + 1 3 ] __(both the general formula and specific formula in this question)=_ 99.12022_(computed as the value). At the significant level of 95%, the t-value is denoted by _ t ( 1 α 2 ;n 2 ) : t ( 1 0.025 ; 17379 2 ) _(both the general formula and specific formula in this question)=_1.644 _(computed as this value). d). (2) Answer this question without computation, when estimate the mean response value X=0.6, the corresponding standard error at is _Bigger _(bigger than/smaller than/the same as) at X=0.5 , because___you are moving farther from the center mean of the data which is farther than the expected mean allowing for more data in between to fluxuate the estimate ____. e). (2) Answer this question without computation, when estimate the mean of 10 responses, the corresponding standard error at is _Smaller __(bigger than/smaller than/the same as) estimate the mean of 3 responses at the same X level, because __As you increase sample size SE decreases ___. 2
2. (10 pts, no partial credit) (Compare the hypothesis test between the linear impact and linear correlation) Using the R-generated summary and ANOVA table for the model Y~X, answer the following questions. a) (6) For a two-sided hypothesis test on the linear impact, Ho : β 1 = 0 ,H a : β 1 0 if a T-test is used, the test statistic is computed with the formula: _ b 1 S ¿ b 1 }∨ r n 2 1 r 2 = 303.95 6.75 = ¿¿ , which is computed as _ ± 44.982_(value); The critical value has the notation of _ t ( 1 α 2 ;n 2 ) : t ( 1 0.025 ; 17379 2 ) ___, and a value of. 1.9601_. The p-value of the test can be computed with the formula _ Pr ( t > t s | β 1 is true ¿ = Pr ¿ ___, and the value is _p = 2*10^-16 = ~0 _. b) (2pts) Verify your answer by highlighting the corresponding p-values in the R output for the T-test . c) (2) Adjust the HT components from a two-sided test to a one-sided test. Consider the one-sided HT Ho : β 1 = 0 ,H a : β 1 > 0 , the test statistic is the same as the two-sided test, but the p-value needs to be adjusted with the formula_ Pr ( t > t s | β 1 istrue ¿ = Pr ¿ _ and computed as ; p = 2*10^-16 = ~0_in this question. Changing Critical value t ( 1 α 2 ;n 2 ) : t ( 1 0.05 ; 17379 2 ) = 1.6449 3
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help