Comparison between Measures of Central Tendency

By Pimath, 11 February 2025

Measures of central tendency are statistical tools that describe the central behavior of a dataset. The three most common measures are the mean, the median, and the mode. Each of these measures offers a different perspective on the data, and the choice between them depends on the nature of the data itself. In this comparison, we will explore each measure in detail, highlighting the differences and situations where each is most useful.

The Mean
The Median
The Mode
Visual Comparison

The Mean

The arithmetic mean is one of the most commonly used measures and provides a representation of the central value of a dataset. It is calculated by summing all values and dividing the result by the total number of values. The mean is sensitive to extreme values (outliers), which can significantly influence the result.

Mean Formula:

$$ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i $$

Practical Example:

Consider the following dataset: $ \{2, 4, 6, 8, 10\} $. The mean is calculated as:

$$ \bar{x} = \frac{2 + 4 + 6 + 8 + 10}{5} = \frac{30}{5} = 6 $$

The Median

The median is the value that separates the data into two equal halves. If the data is arranged in ascending or descending order, the median is the number in the middle. When the number of data points is even, the median is the average of the two middle numbers. Unlike the mean, the median is not influenced by extreme values.

Calculating the Median:

If the number of values is odd, the median is the central value.
If the number of values is even, the median is the average of the two central values.

Practical Example:

Consider the following dataset: $ \{1, 3, 3, 6, 7, 8, 9\} $. Since the number of values is odd, the median is the central value:

$$ \text{Median} = 6 $$

Now consider the following dataset: $ \{1, 2, 3, 4, 5, 6, 8, 9\} $. Since the number of values is even, the median is the average of the two central values:

$$ \text{Median} = \frac{4 + 5}{2} = 4.5 $$

The Mode

The mode is the value that appears most frequently in a dataset. Unlike the mean and median, which are quantitative measures, the mode can also be applied to qualitative data (categories). In some cases, a dataset may have more than one mode (if there are multiple values with the same maximum frequency), or no mode at all (if all values are unique).

Calculating the Mode:

The mode is the value that appears most frequently.
If two or more values have the same frequency, the set is multimodal.

Practical Example:

Consider the following dataset: $ \{2, 3, 4, 4, 5, 5, 6, 6, 6\} $. The mode is the value that appears most frequently:

$$ \text{Mode} = 6 $$

Now consider a multimodal dataset: $ \{2, 2, 3, 3, 4, 5, 6\} $. Since both 2 and 3 appear with the same maximum frequency, the dataset is multimodal:

$$ \text{Modes} = 2 \text{ and } 3 $$

**Comparison between Mean, Median, and Mode**
Measure	Formula	Definition	Advantages	Disadvantages
Mean	$$ \bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i $$	Sum of all values divided by the total number of values.	Easy to calculate, represents the center of the data in a balanced way.	Sensitive to outliers, can be distorted if there are extreme values.
Median	N/A	The central value that separates the data into two equal halves.	Not influenced by outliers, useful for asymmetric data.	Can be difficult to calculate for large datasets.
Mode	N/A	The value that appears most frequently in a dataset.	Used for qualitative data, useful when looking for the most common value.	Not always exists or unique. Doesn't always represent the data well.

Visual Comparison

The three measures of central tendency can be visualized usefully through the following scenarios:

Symmetric Data

In a symmetric dataset, such as a normal distribution, the mean, median, and mode all coincide at the same value, indicating that the center of the distribution is well-defined.

Asymmetric Data

In an asymmetric dataset, the mean might be influenced by extreme values and shift toward the long part of the distribution, while the median will remain stable, better representing the center of the data. The mode, meanwhile, will highlight the most common value, which might not correspond.

Feedback

Table of Contents

The Mean

The Median

The Mode

Visual Comparison

Symmetric Data

Asymmetric Data

Tags