Ryan DeLuna
Independent Project
1. Frequency distribution of a variable and bar graph of the same variable
A frequency distribution table is used for arranging data values and counting the number of time each value appears in a dataset. They can be used for both qualitative and quantitative variables. For this data pool I decided to use the subjects’ marital status because it is a qualitative, nominal level variable. (Polit, 23)
Frequency table results for racethn:
Count = 972 racethn Frequency Relative Frequency Percent of Total
Black, not Hispanic 795 0.81790123 81.790123
Hispanic 123 0.12654321 12.654321
White, not Hispanic 54 0.055555556 5.5555556
The table above is a frequency table that shows the relative frequency and the percent
…show more content…
The summary includes variance, mean, median, mode and standard deviation. As shown in the histogram majority of people in the data pool have a height of 62-68 inches. This is a symmetrical distribution seeing how close the mean and median are to each other.
3. Cross tabulation of two variables
A cross tabulation is a two-dimensional frequency distribution of two nominal or ordinal variables that records the frequency of respondents that have the specific characteristics. These tables provide a wealth of information about the relationship between the variables. For an example I chose to use poverty levels and smoking to show how a contingency table can illustrate a frequency distribution. Poverty level is a nominal variable that will be the independent variable and the dependent variable will be smoking. The end table is a chi-square test and it is used to determine if the variables are unrelated.
Contingency table results:
Rows: smoker
Columns: poverty
Cell format
Count
(Row percent)
(Column percent)
(Percent of total)
(Expected count)
(Contributions to Chi-Square)
Above poverty Below poverty Total
No
Count
(Row percent)
(Column percent)
(Percent of total)
(Expected count)
(Contributions to Chi-Square) 127
(25.87%)
(58.26%)
(13.13%)
(110.69)
(2.4) 364
(74.13%)
(48.6%)
(37.64%)
(380.31)
(0.7) 491
(100%)
(50.78%)
(50.78%)
Yes
Count
(Row percent)
(Column percent)
(Percent of
5. In HANESS, the men age 18 and over had an average height of 69 inches and an SD of 3 inches. The histograms is show below, with a normal curve. The percentage of men with heights between 66 inches and 72 inches is exactly equal to the area between (a) and (b) under (c). This percentage is approximately equal to the area between (d) and (e) under the (f). Fill in the blanks.
The quantitative subjective data is collated and organised into forms of a line graph, table values, and the calculation of mean and median data to determine if there’s any form of deviation of data. This will prove if there’s a linear or non-linear relationship between the data and test if there’s any similarities between the data values and/or the overall skew of the graphs. The outliers will be also added onto these graphs and will be compared towards each other recognising if there is a similarity or contrast in data and in relationships between both of the data. This will determine if the hypothesis of “family size has an
According to the website http://womenissues.about.com/cs/abortionstats/a/aaabortionstats.htm there are approximately 126,000 abortions conducted each day throughout the world. This website includes the abortion statistics of the world and breaks the data down to the demographics of the United States. It also discusses the decisions to have an abortion and the use of contraceptives in the United States. This was an informative website and included detailed statistics conducted by the Alan Guttmacher Institute. According to the website http://www.bls.gov/cps in 2000, gon average there were roughly 135 million employed and 6 million unemployed people in the labor force in the United States.h (p. 3) The websites definition of
A distribution table can keep all of this information (numbers, row data) handy. A person can look, and say, oh, the participants in this survey were 75% male, or 25 % female. The managers can “see” the information and not just the raw data.
For the student height of girls, the data has a normal distribution because the data has a symmetrical spread around the mean. It has a bell shape.
Chapter25. When establishing the classes for a frequency table it is generally agreed that the more classes you use the better your
The variable age is the independent variable and is a ratio level of measurement (Loiselle et al., 2011). The measure of central tendency to describe age are in table 1.2 are the mean of 57.62 which is the average age, the median which is the middle score within the distribution when all scores are organized of 58.5 and the mode of 58 which is the most frequently occurring age (Loiselle et al., 2011). The measures of variability are the range of 69 with a minimum age of 22 and a maximum age of 91, standard deviation which is the average deviation from the sample mean which is a value of 16.26, and the sample variance which is the standard deviation square and the value is 263.46 (Salkind, 2013). The distribution for this sample is described as a negative skew and the value obtained from table 1.2 is -0.22511(Salkind, 2013). A negative skew occurs when the median and the mode value are larger than the mean, within this sample the median is 58.5 the mode is 58 which is greater than the mean of 57.62, the tail would be pointed toward the left (Salkind, 2013). The kurtosis value is -0.65102 and this describes how peak or flat the curve is from the normal distribution curve which is described as mesokurtic (Salkind, 2013). The kurtosis has a large negative value which is representative of a flatter curve also know as playkurtic (Salkind, 2013).
1The purpose of this paper is to perform a chi-square analysis after watching the assigned video. Data taken from a given table will be entered on the SPSS data set with intend to obtain an output. The cross-tabulation table will be highlighted in the output document. Then the highlighted output will be submitted to the instructor.
2. Which of the following values from Table 1 tells us about variability of the scores in a distribution?
The statistical descriptive strategy will be correlation, which is defined as “two different variables are observed to determine whether there is a relationship between them” (Gravetter & Wallnau, p. 11, 2009). The descriptive statistic will be presented as percentages, mode and mean. The inferential data will be conducted using a chi-square test for independence. According to Gravetter and Wallnau’s definition of the Chi-square test for independence states that it “uses the frequency data from a sample to evaluate the relationship between variables in the population” (2009, p.
TABLE 4 Source Frequency Percentage Internet 103 48.6% Family 57 26.9% Academic study 107 50.5% Newspaper 40 18.9% Others 113
This video is about frequency, the number of times something occurs in a category. Also this video shows how to make a frequency distribution chart using test scores. First, the scores are tallied into groups. The number of tallies in each group represent the frequency of each group. The relative frequency is the percentage of frequency in each group. Cumulative frequency is the adding the total of one category and previous categories. Relative cumulative frequency is turning cumulative frequency into percent. Also, a dot plot and histogram can be used to determine frequency.
In terms of party identification, 1500 total people were surveyed and all 1500 people responded with an answer to the survey. Before moving forward, it is vital to note that there were 191 more Democrats than Republicans in the survey results but since the study is focused on averages, it will not majorly effect the conclusions but it is still important to note. In addition, interestingly, 490 respondents identified as Independents which means they are not linked to a political party and for future studies, it would be an interesting group to look at in terms of their average importance of religion.
In terms of independent variables, by referring to the Appendix, figure 1 reveals the summary of the sample; it also
The instrument used is questionnaire and chi-square is used to test the relationship between the variables, which has proven that there is a