Case Study III: Inferential Statistics
This part of the case study will explore the application of inferential statistics to the Zagat Survey sample data. In Part II, claims were made that attempted to extrapolate the sample data to the population, but they were statistically invalid. So here in Part III, I will properly project the sample data into the population and compare it to the previous methods. Earlier in the study, the univariate estimate used was simply the mean value of the sample taken for the variable Cost. But this does not account for the presence of variation in a sample that could affect the mean. We can use this value within a standard normal distribution of sample means to try and narrow down the true
…show more content…
Both of these values, along with the test statistic, are provided in the Excel regression output. An online F-value calculator1 produced a Critical F-Value of 2.704. As outlined in Appendix C, we reject the null hypothesis and conclude that the regression is in fact useful. Using the same null and alternative hypotheses, the same test can be performed for the regression between Décor and Cost. In this case, we reject the null hypothesis and confirm the alternative, which states that Décor (being the only independent variable) does in fact affect the Cost. So there is a direct relationship between the Décor and Cost. The multiple regression model does a good bob of predicting the Cost. As seen in Part II of the case study, the regression produced a relatively high Adjusted R Square value of .76406, which means that about 76% of the variation in Cost can be explained by the other variables (Appendix D). We can also see from this output that each variable is given a t-Stat value. All of these values are higher than the Critical T-Value (from T-distribution table) for 99 degrees of freedom all the way up to the significance level of 1% (t = 2.364). And, as shown before, the F-Value hypothesis testing for the regression as a whole proved that the regression was in fact useful. So all of these claims when used in concert provide strong evidence that this model has a high predictive value. The best estimation for Cost is going to be the
Inferential statistics helps us to analyze predictions, inferences, or samples about a specific population from the observations that they make. “With inferential statistics, you are trying to reach conclusions that extend beyond the immediate data alone” (Trochim, 2006). The goal for this type of data is to review the sample data to be able to infer what the test group may think. It does this by making judgment of the chance that a difference that is observed between the groups is indeed one that can be counted on that could have otherwise happened by coincidence. In order to help solve the issue of generalization, tests of significance are used. For example, a chi-square test or T-test provides a person with the probability that the analysis’ sample results may or may not represent the respective population. In other words, the tests of significance provides us the likelihood of how the analysis results might have happened by chance in a scenario that a relationship may not exist between the variables in regards to the population that is being studied.
Week Seven Homework Exercise Answer the following questions, covering material from Ch. 13 of Methods in Behavioral Research Define inferential statistics and how researchers use inferential statistics to draw conclusions from sample data. According to Cozby (2009) inferential statistics are used to determine whether we can in fact make statements that the results reflect what would happen if we were to conduct the experiment again and again with multiple samples Define probability and discuss how it relates to the concept of statistical significance. Probability is the possible that an outcome of an experience or an event will occur (Cozby 2009) Statistical significant and probability are one in the same. A researcher is studying the
Therefore, they are considering the public’s opinion, and gathering information based on these studies which is unbiased. The quality of the article is very organized. It has subtitles for each section, charts and tables to show statistics, and a conclusion of the studies.
Table 6.1.1 displays the matlab output of beta, standard error, t-statistic and p-value for the two independent variables during 10-year period. It is found that beta of X1 is 0.2750 which indicates there is a positive relationship between the utilities excess return and the healthcare excess return. This positive relationship is statistically significant as the p-value is close to 0 which is much less than the significance level of 5%. In addition, the standard error of X1 is 0.0300 which represents the average distance that the observed values fall from the regression line. This indicates that the model fits the data. In contrast, it is found that the material excess return is negatively
Chapter 4 explained the results obtained from analysis of the surveys. The results were analyzed and significant findings were discussed further.
When we look around us, we may not recognize that statistics is all around. Before I began to take this course “Statistics for Managers” I was not aware of how statistics actually worked. The first idea that came to my mind about statistics was probability. Not knowing statistics and probability are related because they both determine a possible outcome. Throughout this course I have learned what statistics is and how it works. In this paper, I will describe descriptive and inferential statistics, hypothesis developing and testing, the selection of statistical tests, and how to evaluate statistical results in analyzing data.
Some questions in Part A require that you access data from Statistics for People Who (Think They) Hate Statistics. This data is available through the Student Textbook Resources link.
Coverage error, the failure to give some persons in a target population a chance to be selected into the sample, was present in the survey because insecure areas were not represented. The survey attempted to mitigate coverage errors by conducting face to face interviews to eliminate the coverage bias introduced by phone interviews. The response rate was not reported in the methodology page. However, if the response rate was 25%, that would indicate that the 1,000-sample size was taken from 4,000 individuals. Whether response rate affects data quality is a matter of debate, yet, according to Hillygus, “multiple studies have found that lower response rates do not indicate the results are inaccurate” (Hillygus 42). The role response rates have on the accuracy of results is uncertain, but inclusion of it provides insight into the quality of the methods.
I believe that in the verge of realization, in everyday life’s undertaking we are facing and practicing statistics. I was being into a serious dedication of understanding the underlined concepts of statistics and the necessary usefulness in conducting a survey. In fact, throughout the course, I did learn many factors that became useful in contacting a survey research. according to Walker et al. (2011). However, my success could not be possible without the appreciation and support of my teachers, discussion groups, and class presentations.in addition, the misguided notion of statistics being a tough course was proven otherwise since I realized that statistics make a lot of sense not only in survey research in personal life, especially in budgeting and planning for individual routines. Furthermore, the following paper provides a vivid explanation of my reflection of the survey class, with more attention paid to the concepts of statistics.
The two statistics basically say the same thing and doesn’t add much to the paper. Keeping one of the facts and adding a college student survey would make the point clearer. Depending on Zielke’s credentials he could do a survey himself. If he’s a college student he could go around campus and do a survey of his peers. If he’s a professor he could survey his students or his fellow professors to add another perspective.
Sampling is a method by which one can gain an estimation of accuracy for population (Jaggia & Kelly, 2014). Surveying is a form of sampling which is often used to provide an estimate. A survey was conducted on a group of people living in New York City which consisted of 445 randomly selected people from multiple boroughs on a monthly basis for six months. The survey focused on the boroughs of Manhattan, Brooklyn, Queens, The Bronx, and Staten Island. Additionally, demographic information was collected, including level of education and marital status. This paper will provide the results of the survey as well as their accuracy of the estimates in order for the New York City governmental agency personnel.
On the first day of school, Mr. Serwach’s AP Statistics class participated in an anonymous survey. Nine students answered questions ranging from their age to the number of AP classes they have taken to the number of states they have visited. The purpose of the survey was to gather some data about the class so the students could demonstrate their writing and analysis skills. Based on the results, several conclusions can be drawn to describe the population (which is just the 3rd-period statistics class because the results do not apply to anyone other than the students in the class). For example, the class is likely composed of 11th and 12th graders since the ages of the class range from 16 to 17 years old. Another speculation is that the
What makes the middle aged mother to buy cloths in Zara while the daughter aged in mid 20s buys Zara clothing? Because it is fashion able and up to trend. By collecting data and focusing on shorter response times, the company ensures that its stores are able to carry clothes that the consumers want at that time. Zara can move from identifying a trend to having clothes in its stores within 30 days. That means Zara can quickly and catch a winning fashion trend, while its competitors are struggling to catch up. Catching fashion while its hot is a clear recipe for better margins with more sales happening at full prices and fewer discounts. In comparison, most retailers of comparable size
In order to determine whether two independent variables better explain cost than a single regression (See #1), the group ran a two separate multiple regression. The first multiple regression was run with the number of passenger cruise days
Here Zagat publishes restaurant guide with help of customer’s reviews. They initially published books and became the best seller but when they emerged on web; interested customers got easier access to it but Zagat urged visitor spend few bucks for it. When they were publishing books they had more competitors. Yelp emerged on the market online. Yelp the free service provider is substituted for Zagat in online world. In case of collecting visitors and review making customers, Zagat lagged to some extend on optimizing the online portal. Regarding value chain analysis, the review for Zagat is some hundred peoples but Yelp has been able to collect data from thousands of customers.