AEMA 310_Final Fall 2022_FIN_2

pdf

School

McGill University *

*We aren’t endorsed by this school

Course

310

Subject

Statistics

Date

Jan 9, 2024

Type

pdf

Pages

12

Uploaded by ChancellorTitanium10733

Report
Final Examination Page <1> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part I: Multiple-choice and true/false questions Multiple-choice questions ( 2 pts. each): Indicate clearly the most appropriate answer by filling in the box to its left. True/false questions ( 1 pt. each): Fill in one of the boxes to the left of each statement. If you believe the statement is T rue, then fill in the box below the T ; if you believe the statement is F alse, then fill in the box below the F . If you want to change a choice, erase your former choice and indicate the final one clearly. No mark will be given if two boxes or more are checked. 1) In the case of a simple linear regression, which of the following statements is correct ? The expected value of Y for a given x can be calculated from the linear regression equation, as E(Y) = a + bx + . The ratio statistic ˆ b ˆ b/S has a “Student” t distribution with n 2 degrees of freedom. The estimate of the intercept must be equal to zero for the fitted linear regression model to be significant. The value of the estimated slope is comprised between 1 and 1 inclusively. 2) When testing H 0 : 1 2 = 2 2 against H 1 : 1 2 > 2 2 with n 1 = 6 and n 2 = 6, which of the following statements is correct ? H 0 is rejected at level when the observed test-statistic value is greater than χ 2 1- (10). H 0 is rejected at level when the observed test-statistic value is greater than t 1- (10). H 0 is rejected at level when the observed test-statistic value is greater than F 1- (5, 5). H 0 is rejected at level when the observed test-statistic value is greater than z 1- . 3) A researcher is aware that a concentration of 500 mg of heavy metals per kg of soil is the intervention threshold for decontamination. He wants to know how many samples would be required to detect with a 0.90 probability that the contamination level is above the threshold when it is in fact 50 above the threshold ( = 0.05 and 2 = 25). Which of the equations below can be correctly used for this purpose? 0.90 = (500 − 550)/√( 25 n ) + 1.282 1.282 = (500 − 550)/√( 25 n ) + 1.645 . 0.90 = (500 − 550)/ √( 25 n ) + 1.645 −1.282 = (500 − 550)/√( 25 n ) + 1.645 4) As a follow-up to Question 3), which of the following statements is not correct ? Increasing will increase the probability to detect a real difference between the acceptable concentration and the actual concentration of the heavy metal (Reject H 0 when it is false). Lowering the power of the test will increase the possibility that decontamination will not be done while it should be. Making a Type I error will cause unnecessary costs of decontamination. Increasing will lower the possibility that decontamination will not be done when necessary.
Final Examination Page <2> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part I (continued) 5) An experiment is conducted following an RCBD with one replicate per treatment per block, to test the effects of five treatments (for some disease) on five elderly men and five elderly women. In such an experimental situation, the Error df in the ANOVA decomposition is: 1 2 3 4 6) Consider an RCBD with p = 4, n = 4 and 1 replicate per treatment per block. Assume the conditions of application of a procedure of multiple pairwise comparisons of treatment means are satisfied. In this case, the LSD statistic ( = 0.05) is: 𝑡 0.95 (3)√𝐸𝑟𝑟𝑜𝑟 𝑀𝑆/2 𝑡 0.95 (3)√𝐸𝑟𝑟𝑜𝑟 𝑀𝑆/4 𝑡 0.975 (9)√𝐸𝑟𝑟𝑜𝑟 𝑀𝑆/4 𝑡 0.975 (9)√𝐸𝑟𝑟𝑜𝑟 𝑀𝑆/2
Final Examination Page <3> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part I (third and last page) 7) T F The ANOVA model for an RCBD with more than 1 replicate per treatment per block is written as X ijk = + a i + B j + (a i B j ) + ε ijk , where B j is the main effect of block j, a i B j is the effect of the treatment-by-block interaction, and both effects are fixed. In the ANOVA model for an RCBD with 1 replicate per treatment per block, the main effect of treatment i (a i ) is fixed. For a given confidence level, the length of the confidence interval for one population mean will decrease if the sample variance is increased while the sample size remains the same. A Type II error is made when H 0 is rejected while, in fact, it is true . When testing H 0 against H 1 in the probability approach, H 0 will be rejected if the probability of significance is smaller than . Increasing the sample size and the significance level lower the power of a statistical test. When the population variance is known, the length of the confidence interval for the population mean is equal to 𝑧 1− 𝛼 2 𝜎 2 𝑛 . When testing H 0 : a = b against H 1 : a > b with known a 2 and b 2 , an effective number of degrees of freedom must always be calculated. In a left-hand one-tailed test, the critical value of the test statistic decreases when decreases. If 1 bus accident happens every 3 weeks on average in a given part of the world, then the number of bus accidents in a 9-week period there follows a Poisson distribution X, with E(X) = Var(X) = 9. An estimator of population parameter θ is said to be biased and imprecise if E( θ) 0 and Var( ) is large. If a 95% confidence interval for is calculated to be [5, 6.5], with X N( , 2 ), then the probability that is less than 5 is 0.05. If some H 0 is rejected against some H 1 at = 0.01, then the same H 0 is automatically rejected against the same H 1 at = 0.001. When testing H 0 : = 0 against H 1 : 0, H 0 is rejected at the significance level when the observed value of the test statistic, z obs , is smaller than z 1- /2 or greater than z 1- /2 . ˆ ˆ ˆ
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Final Examination Page <4> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (in three episodes ”) F IRST E PISODE (P ROBLEM 1): Community-supported agriculture (CSA) farms are often synonymous with higher biodiversity because of the large number of crops grown in such systems. Dr. Small wants to investigate if this higher biodiversity helps with the amount of pollinator insects present in a production system. Note: A “healthy” production site (where pollination is adequate) is considered to have 6 pollinators per 9 m 2 on average during the day under some weather conditions (met here). 8-a) For a given production system, the producer tells Dr. Small that 80% of the flower clusters are pollinated adequately. In such a system, what is the probability that at least 2 flower clusters are pollinated adequately among 5 flower clusters randomly sampled. /5 8-b) At a “healthy” production site, Dr. Small evaluates an area of 6 m 2 , and count the number of pollinators. What is the probability that she counts at most 3 pollinators? /5
Final Examination Page <5> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) F IRST E PISODE (P ROBLEM 2): Viruses causing problems in strawberry plants are mainly spread by one insect, the strawberry aphid. Dr. Small believes that as aphids feed on leaves, they transmit viruses that can cause a rapid decrease in plant growth. She decides to verify whether or not the number of strawberry aphids found on a plant could be linked to its growth. She therefore randomly chooses 5 strawberry plants at the same stage of development in a virus-infested garden. For each individual plant, she first collects all the aphids present on the plant and weighs them, and then weighs the entire plant to have a measure of plant growth. Dr. Small then runs some SAS code to apply some appropriate statistical method, and obtains the following results... The REG Procedure Model: MODEL1 Dependent Variable: Y Number of Observations Read 5 Number of Observations Used 5 Analysis of Variance Sum of Mean Source DF Squares Square F Value Pr > F Model 1 1140.08929 1140.08929 14.88 0.0308 Error 3 229.91071 76.63690 Corrected Total 4 1370.00000 Root MSE 8.75425 R-Square 0.8322 Dependent Mean 124.00000 Adj R-Sq 0.7762 Coeff Var 7.05988 Parameter Estimates Parameter Standard Variable DF Estimate Error t Value Pr > |t| Intercept 1 178.48214 14.65800 12.18 0.0012 x 1 -10.08929 2.61583 -3.86 0.0308 9) Identify the statistical method that has been applied. Is the postulated relationship statistically significant at = 0.01? Indicate the numerical information in the SAS output that supports the acceptance or the rejection of the null hypothesis. Briefly describe the relationship and what Dr. Small should conclude. /5
Final Examination Page <6> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) S ECOND E PISODE (P ROBLEM 1): As a follow-up, Dr. Long wants to investigate whether the profitability of CSA farms (in $) is related to the amount of time (in hours) that participants spend working on the farm. He believes that when participants are more involved (i.e., spend more time working on the farm), it increases the profit. To verify this, he randomly samples 5 farms of similar size (i.e., land area, similar crops, number of participants). For each farm, he calculates the total profit for one year and the number of hours worked by participants that year. He obtains the following data: Total profit (,000 $) x i Amount of time worked (hours) y i (x i − x ̅) 2 (y i − y ̅) 2 (x i − x ̅)(y i − y ̅) 20 100 108.16 529 239.2 25 110 29.16 169 70.2 32 120 2.56 9 - 4.8 35 135 21.16 144 55.2 40 150 92.16 729 259.2 Ʃ = 152 Ʃ = 615 Ʃ = 253.2 Ʃ = 1580 Ʃ = 619 Ʃ/5 = 30.4 Ʃ/5 = 123 Ʃ/4 = 63.3 Ʃ/4 = 395 Ʃ/4 = 154.75 10) Is Dr. Long correct in his belief? Perform the appropriate test with = 0.05. 9
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Final Examination Page <7> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) S ECOND E PISODE (P ROBLEM 2): Because CSA farms usually involve the production of valuable crops, Dr. Long continues to be interested in evaluating the economic aspects. In this problem, he wants to evaluate the profitability (in Canadian dollars, $) of 3 vegetable crops, in order to tell farmers which one(s) should be included in a CSA production system. To do this assessment, he enrolled 3 CSA farms (coded A, B, C), each with 6 similar plots allocated to vegetable production. Dr. Long randomly assigned 1 vegetable crop (leafy greens, tomato, carrots) to 2 plots at each location. At the end of the season, he calculates the profit (in $) for each plot, and obtains the data below: Profit ( ,000 $) Leafy greens Tomato Carrots 𝑋 ̅ .?. CSA farms A 6, 7 ( 𝑋 ̅ ??. = 6.5) 10, 13 ( 𝑋 ̅ ??. = 11.5) 4, 5 ( 𝑋 ̅ ??. = 4.5) 7.5 B 8, 10 ( 𝑋 ̅ ??. = 9) 9, 10 ( 𝑋 ̅ ??. = 9.5) 5, 6 ( 𝑋 ̅ ??. = 5.5) 8.0 C 7, 7 ( 𝑋 ̅ ??. = 7) 9, 9 ( 𝑋 ̅ ??. = 9) 4, 6 ( 𝑋 ̅ ??. = 5) 7.0 𝑋 ̅ ?.. 7.5 10.0 5 𝑿 ̅ ... = 7.5 11-a) Construct the ANOVA table from the data above and test whether there are main effects of the treatments ( = 0.10). Note: Block SS = 3.0 and Total SS = 100.5 ( There is room on the next page to continue your work. ) /10
Final Examination Page <8> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) S ECOND E PISODE (P ROBLEM 2, C ONTINUED ): 11-b) Perform a complementary statistical analysis to answer the question: Which crop(s) do you recommend being profitable for a CSA farm? Briefly explain your recommendation in words. /4
Final Examination Page <9> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) T HIRD E PISODE (P ROBLEM 1): Apple ice cider is a fermented beverage made from the frozen juice of apples. Cryo- concentration (i.e., the operation of freezing the juice of harvested apples, followed by a cold fermentation) and cryo-extraction (i.e., the extraction of juice from frozen apples left on the tree) are two approaches to producing ice cider. As the owner of an apple orchard, you want to produce ice cider with high sugar concentration at the time of fermentation, and you think that cryo-extraction would give better results. You therefore identify 5 trees of the same cultivar in your orchard. For each individual tree, you harvest half of the apples, extract the juice and freeze it. You later harvest the remaining apples once frozen on the trees and extract their juice. Before fermentation, you measure sugar content (in °Brix) of each juice. You obtain the following data and sample means: 1 2 3 4 5 Sample mean Initially harvested apples are used (cryo- concentration) 30 32 29 31 32 30.8 Apples left on the tree are used (cryo-extraction) 32 35 31 33 32 32.6 12) Based on the data above and the application of the appropriate test of significance, are you correct in thinking that cryo-extraction is a better way to produce ice cider ( = 0.05)? /9
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help
Final Examination Page <10> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) T HIRD E PISODE (P ROBLEM 2): You have been told that in the case of cryo-extraction, the temperature at which apples are harvested affects the quality of ice cider due to a linear effect on the sugar content. You want to know if this is the case, and you harvest 15 apples from the same tree on 5 different days specifically chosen for the air temperature at harvest (in °F). You then extract the juice for each harvest individually and measure the sugar content (in °Brix). You obtain the following results: Air temperature x i (°F) Sugar content Yi (°Brix) 0 36 8 34 16 31 24 28 32 29 Mean = 16.0 Mean = 31.6 13) Apply the appropriate statistical method to answer your research question ( = 0.05). /11
Final Examination Page <11> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (continued) T HIRD E PISODE (P ROBLEM 3): In your orchard, you mainly grow two apple tree cultivars: McIntosh and Gala. You want to know whether or not they provide the same quality of ice cider by cryo-extraction, so you randomly harvest a certain number of frozen apples from 5 individual trees of each cultivar, extract the juice from each set of apples, and measure the sugar content (in °Brix) of each juice. You obtain the following data and calculate preliminary descriptive statistics: 1 2 3 4 5 Sample mean Sample variance McIntosh 33 34 32 34 33 33.2 0.7 Gala 35 37 34 35 35 35.2 1.2 14) Using these data, apply the appropriate testing procedure to compare the suitability of the two apple tree cultivars to produce ice cider by cryo-extraction ( = 0.10). /11
Final Examination Page <12> of 12 Statistical Methods 1 (AEMA 310) December 14, 2022 Part II (last page) T HIRD E PISODE (P ROBLEM 4): To produce quality ice cider, it is recommended that the fermentation process be stopped when the concentration of residual sugar reaches 150 g/kg. If the concentration gets higher, the batch of fermented juice is thrown away because the taste of the ice cider would be affected otherwise You want to know how many portions of fermented juice need to be analyzed in order to determine, with a probability of 0.95, that the mean concentration of residual sugar is above the recommended level while it is, in fact, 5 g/kg above it. Use = 0.01, and you may assume that you somehow know the population variance is equal to 25 (g/kg) 2 . /5
Your preview ends here
Eager to read complete document? Join bartleby learn and gain access to the full version
  • Access to all documents
  • Unlimited textbook solutions
  • 24/7 expert homework help