Homework: Exploratory Data Analysis
In Excel Exercises
In picture A, you can see proportion survived in female (72%) higher than male (19%). In column “นับจำนวน ของ age” value less than column “นับจำนวน ของ survived2” both female and male, because some case doesn’t have data about age that call “missing data” (female = 466-388 = 78, male = 843-658 = 185).
In R Exercises
Central tendency and Spread
In picture B show the result central tendency of age. Min=32.00, Q1=59.75, Median=71.00, Mean=68.25, Q3=80.25 and Max=92.00. When I interpret this result. I compare two values (median-Q1 and Q3-median). The value from Q3-median is more than value from median-Q1 that mean this graph is negative skew, correlate with picture C.
Any outliers or abnormal
To find the coefficient of skewness (a measure of the degree of skewness), the mean, mode and standard deviation was needed. Due to the large data size, a computer program was used to obtain the necessary information. The data set was inserted into the program (One Variable Analysis by Haese and Harris Publications) which then analysed it and produce the required result. The information collected is displayed below with the result for the mean rounded to 87 from 86.964 and the standard deviation to 8.2 from 8.2375. This was done for convenience however it did reduce the precision of the
Forecasting the Future Female Veteran Population and Their Increased Use of the VA Medical System - VPT2
The first variable was the number of total prior arrests. The mean was 10.54. The median was 5 and the mode was 0. The most appropriate measure of central tendency for this set of data is the mode. The mode is most appropriate because out of 962 people 290 people had 0 prior arrests. The other numbers of arrests were not even close with the
Thank you for the opportunity to assess your sales data in order to provide recommendations for increasing your sales. The analysis and recommendations below are based on the data you provided, which covers a period from May 2004 through June 2006. The analysis below is based on this data alone. Therefore, our recommendations should be tempered by your knowledge of business realities and your market. Please let us know if we can answer any questions concerning the analysis or the recommendations provided.
At a glance scatter plots show whether a relationship exists between two sets of data. This data will determine correlations between students taking the SAT and ACT. Because this scatter plot is falling from left to right it has a negative slope, so therefore there is a negative correlation between these two sets of data. Although these points are falling, it is not a clear negative relationship since the clustered points are not in a straight line. Therefore, this relationship is a weak, negative relationship.
2. The focus of the plot is Eliza, she is the subject of Higgins’ and Pickering’s
Evaluates a condition and returns one value if the condition is true and a different value if the condition is false
The variable age is the independent variable and is a ratio level of measurement (Loiselle et al., 2011). The measure of central tendency to describe age are in table 1.2 are the mean of 57.62 which is the average age, the median which is the middle score within the distribution when all scores are organized of 58.5 and the mode of 58 which is the most frequently occurring age (Loiselle et al., 2011). The measures of variability are the range of 69 with a minimum age of 22 and a maximum age of 91, standard deviation which is the average deviation from the sample mean which is a value of 16.26, and the sample variance which is the standard deviation square and the value is 263.46 (Salkind, 2013). The distribution for this sample is described as a negative skew and the value obtained from table 1.2 is -0.22511(Salkind, 2013). A negative skew occurs when the median and the mode value are larger than the mean, within this sample the median is 58.5 the mode is 58 which is greater than the mean of 57.62, the tail would be pointed toward the left (Salkind, 2013). The kurtosis value is -0.65102 and this describes how peak or flat the curve is from the normal distribution curve which is described as mesokurtic (Salkind, 2013). The kurtosis has a large negative value which is representative of a flatter curve also know as playkurtic (Salkind, 2013).
b. Freeze allows the user to stop on a portion of a workbook and scroll up and down so that the user will not lose its place or viewing content.
The Excel document contains the rubrics I used for preproduction, production and post-production of the final group project. The rubrics are based on the types of employee evaluations that students could see from an employer. They hit on many of the benchmarks used by a supervisor in the industry to evaluate the effectiveness of an employees work. Each phase of the production process had an artifact that students had to collectively or individually produce. Preproduction had the paperwork (including storyboards, script breakdowns, strip boards, schedules, budgets, etc.) and the production meeting to present the plan for the production. The production stage had the raw footage that was recorded, including both audio and visual recordings. Postproduction had the finished edit of the piece, or the edit that was turned in by the students for evaluation.
When looking at figure 1, which is the stem and leaf plot displayed above, we can determine whether or not these data sets are positively skewed or negatively skewed. Both of which have been classified as positively skewed. This is known as on both; the mean is higher than the median, which is important as it tells us that the middle most number is below the average set of records. In this data however, having positively skewed data represents the higher time taken to complete the concentration section of the Census. This can show us that there are areas of the students data students
The purpose of my excel project is to calculate my expenses and savings within my family budget. All of my data is very accurate as I gather them from my invoices and receipt. After reviewing my results, I found out that there are many expense that either I need to cut down or shop around for a better rate. I also need to focus more on the saving account because I would like to save a little bit more money for the rainy day. I always wonder where most of my money goes but thanks to this project, I can see in detail about my financial
Dropping out of high school is associated with multiple factors that gradually build onto an individual. In the “Income Inequality, Social Mobility and the Decision to Drop Out of High School” study, Kearney and Levine discussed that the socioeconomic perspective of a person is one that plays a critical role in his/her perspective in continuing an education. Regions with a greater difference in income inequality often come with less social mobility. High school students’ choice to drop out is often linked with a long-term exposure to low socioeconomic circumstance that demotivated their prospect toward advancement and failure to recognize the benefits
Through the use of the excel sheet we were able to see the relationship that temperature has on pressure. As a result, we were able to come to the conclusion that the New England Patriots in fact cheated during their 2015 AFC Championship game. We were able to determine that when the temperature decreases the pressure of the ball naturally decreases. According to physics the ideal gas law states that as temperature is reduced, the pressure will also be reduced.
For more than two digits figure leaving left digit ,what ever in right is concider as stem.