What is Central Tendency in Statistics?

Central tendency is a descriptive summary of a data set, expressed through one or more measures. These measures do not describe individual observations in the data set; they summarize the data set as a whole. A measure of central tendency is a central or typical value for a probability distribution.

A measure of central tendency is a statistical measure that represents the entire data set or distribution with a single value. The objective is to give an accurate description of all the data in the distribution.

Because the result is a single value, it describes a set of data by identifying the central position within that set. Numerical expressions that represent the characteristics of a group (a large collection of numerical data) are called measures of central tendency. They are also described as measures of central location.

The measures of central tendency are mean, median, and mode. However, in different conditions, some measures of central tendency become more appropriate to use than others.

Mean (Arithmetic)

The most widely known and accepted measure of central tendency is the mean, or average. It is most often used with continuous data. The mean is calculated by dividing the sum of all the values in the data set by the number of values in the data set. The mean is usually denoted as $\bar{x}$ (pronounced “x-bar”).

Example:

If a data set contains $n$ observations with values $x_{1}, x_{2}, ..., x_{n}$, then the mean is equal to:

$\bar{x} = \frac{x_{1} + x_{2} + ... + x_{n}}{n}$

The formula is also written as:

$\bar{x} = \frac{\sum_{i = 1}^{n} x_{i}}{n}$

Here $\sum$ is the Greek capital letter sigma, which means “sum of…”.

An important characteristic of the mean is that every value in the data set enters its calculation. The mean is also the only measure of central tendency for which the sum of the deviations of each value from it is always zero.
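
The zero-sum property follows directly from the definition of the mean. A short derivation, using the same symbols as the formula above and the fact that $\sum_{i = 1}^{n} x_{i} = n\bar{x}$:

$\sum_{i = 1}^{n} (x_{i} - \bar{x}) = \sum_{i = 1}^{n} x_{i} - n\bar{x} = n\bar{x} - n\bar{x} = 0$

As a quick check with illustrative numbers, the values 2, 4 and 9 have mean 5, and their deviations from the mean (-3, -1 and 4) sum to zero.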

When Not to Use the Mean?

The mean is particularly susceptible to the influence of outliers, which is its main disadvantage. Outliers are observations that are unusual compared to the rest of the data set because they are especially small or large in numerical value. For example, consider the salaries of staff at an organization below:

Employee	Salary ($)
1	13000
2	17000
3	15000
4	17500
5	15000
6	12000
7	18500
8	15500
9	86000
10	93000

The mean salary for the ten employees is $30,250. However, the raw data suggest that this value does not accurately reflect the typical salary of an employee, as most employees have salaries in the $12,000 to $18,500 range. The mean is pulled upward by the two large salaries ($86,000 and $93,000). In this situation, a different measure of central tendency is more appropriate than the mean.
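
The effect of the two large salaries can be checked numerically. The following is a minimal Python sketch using the salary table above; the variable names are illustrative, and dropping the two largest values is done only to show how far they move the mean, not as a recommended way of handling outliers.

```python
from statistics import mean

# Salaries of the ten employees from the table above.
salaries = [13000, 17000, 15000, 17500, 15000, 12000, 18500, 15500, 86000, 93000]

# Mean of all ten salaries.
mean_all = mean(salaries)

# Mean of the eight salaries that remain after dropping the two largest.
mean_without_outliers = mean(sorted(salaries)[:-2])

print(f"{mean_all:.2f}")               # 30250.00
print(f"{mean_without_outliers:.2f}")  # 15437.50
```

Without the two largest salaries the mean falls to about half its original value, which is why the mean of all ten salaries is a poor description of a typical employee here.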

Median

The middle value of an ordered data set is called the median of the data. The median divides the data set into two halves and is also called the 50th percentile. The median is much less affected by outliers and skewed data than the mean. If the number of elements in a data set is odd, the median is the middle element of the data arranged in ascending or descending order. If the number of elements is even, the median is the average of the two central elements of the arranged data.

Median with Even Data Set

When the data set contains an even number of values, the median is found by taking the mean of the two middle values. Consider the same example of the salaries of 10 employees, arranged in ascending order:

Salary ($)
12000
13000
15000
15000
15500
17000
17500
18500
86000
93000

The two middle values (the 5th and 6th) are 15500 and 17000, and their average gives the median: (15500 + 17000) / 2 = 16250.

Median with Odd Data Set

When the data set contains an odd number of values, the middle value of the ordered data set is the median. The table below shows nine salaries arranged in ascending order:

Salary ($)
12000
13000
15000
15000
15500
17500
18500
86000
93000

The middle value (the 5th value), 15500, is the median of the data set.
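
The odd and even cases can be combined into one small function. The sketch below is illustrative: `median_of` is a hypothetical name, not a library function. In practice, Python's standard `statistics.median` applies the same rule.

```python
def median_of(values):
    """Return the median using the odd/even rule described above."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        # Odd count: the single middle element.
        return ordered[mid]
    # Even count: the average of the two central elements.
    return (ordered[mid - 1] + ordered[mid]) / 2


# The ten salaries (even case) and the nine salaries (odd case) from the tables above.
ten_salaries = [13000, 17000, 15000, 17500, 15000, 12000, 18500, 15500, 86000, 93000]
nine_salaries = [12000, 13000, 15000, 15000, 15500, 17500, 18500, 86000, 93000]

print(median_of(ten_salaries))   # 16250.0
print(median_of(nine_salaries))  # 15500
```

Both results stay within the range of typical salaries, even though the two largest salaries are included in the calculation.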

Mode

The value that occurs most frequently in a data set is called the mode of the data. If no value in the data occurs more than once, the data set has no mode. A data set may have more than one mode if several values share the highest frequency. The mode is the only measure of central tendency that can be used for categorical variables.

Consider the given dataset 5, 4, 2, 3, 2, 1, 5, 4, 5

Value (sorted in descending order)
5
5
5
4
4
3
2
2
1

Since the mode is the most common value, the mode of the given data set is 5, which occurs three times.

On a histogram or bar chart, the highest bar represents the mode. For this reason, the mode is sometimes described as the most popular option.
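
Counting frequencies by hand becomes tedious for larger data sets. The following minimal Python sketch counts them with `collections.Counter`. The helper name `modes` is hypothetical. It follows the convention used in this article: a data set in which no value repeats has no mode.

```python
from collections import Counter


def modes(values):
    """Return every value with the highest frequency, or [] if no value repeats."""
    counts = Counter(values)
    if not counts:
        return []
    highest = max(counts.values())
    if highest == 1:
        # Every value occurs exactly once, so the data set has no mode.
        return []
    return sorted(value for value, count in counts.items() if count == highest)


data = [5, 4, 2, 3, 2, 1, 5, 4, 5]

print(modes(data))             # [5]
print(modes([1, 2, 3]))        # []
print(modes([1, 1, 2, 2, 3]))  # [1, 2]
```

The last two calls use made-up values to illustrate a data set with no mode and a data set with two modes.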

“The histogram representing the mode of a data”

Consider the example given below:

“The bar graph representing the preferred modes of transport”

In this particular data set, the preferred mode of transport is the bus.

Why is Mode Rarely Used with Continuous Data?

The mode is particularly problematic with continuous data because it is likely that no value occurs noticeably more often than the others.

For example, consider a data set consisting of the weights of 30 people. How likely is it that two or more people in the sample have exactly the same weight (e.g., 55.4 kg)? With such a small sample (30 people) and such a large range of possible weights, exact matches are rare even when weights are recorded to the nearest 0.1 kg, although many people might be close. The most frequent value may therefore occur only once or twice and says little about where the data are centered. This is why the mode is very rarely used with continuous data.

Other Limitations of Using Mode

One of the major limitations of the mode is that it is not unique. This causes problems when two or more values share the highest frequency, because no single value can then be reported as the mode.

Summary of When to Use Mean, Median and Mode

The table below helps in choosing the best measure of central tendency for each type of variable.

Type of Variable	The Best Measure of Central Tendency
Nominal	Mode
Ordinal	Median
Interval/Ratio (not skewed)	Mean
Interval/Ratio (skewed)	Median
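
The table can also be read as a simple decision rule. The sketch below is one possible encoding and rests on several assumptions: the function name `summarize`, the variable-type strings, and the `skewed` flag are all hypothetical, and the caller must decide whether the data are skewed. For ordinal data it uses `statistics.median_low`, so the result is always one of the observed categories rather than an average of two.

```python
from statistics import mean, median, median_low, multimode


def summarize(values, variable_type, skewed=False):
    """Pick a measure of central tendency following the summary table above."""
    if variable_type == "nominal":
        # Mode: returned as a list, since there may be more than one.
        return multimode(values)
    if variable_type == "ordinal":
        # Median: values must be orderable, e.g. integer-coded ranks.
        return median_low(values)
    if variable_type == "interval/ratio":
        return median(values) if skewed else mean(values)
    raise ValueError(f"unknown variable type: {variable_type!r}")


# Salaries from the earlier table (skewed by two large values).
salaries = [13000, 17000, 15000, 17500, 15000, 12000, 18500, 15500, 86000, 93000]

# Illustrative, made-up inputs for the other variable types.
transport = ["bus", "car", "bus", "walk"]
ratings = [1, 2, 2, 3, 5]

print(summarize(transport, "nominal"))                     # ['bus']
print(summarize(ratings, "ordinal"))                       # 2
print(summarize(salaries, "interval/ratio", skewed=True))  # 16250.0
```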

Formula

Arithmetic mean: $\bar{x} = \frac{x_{1} + x_{2} + ... + x_{n}}{n}$

Context and Applications

Measures of central tendency are useful for:
School and college-level education
Postgraduate courses in mathematics
Data analysis courses
Engineering courses
