Median
Median is a fundamental concept in statistics, offering a robust measure of central tendency that helps understand the typical value within a dataset. Understanding its definition and calculation is crucial for analyzing data effectively across various fields.

Key Takeaways
- The median represents the middle value in a dataset when all values are arranged in ascending or descending order.
- It is a measure of central tendency that is less affected by extreme outliers compared to the mean.
- Calculating the median involves ordering the data and identifying the central point, or the average of the two central points for an even number of observations.
- The median provides a different perspective on data distribution than the mean (average) or mode (most frequent value).
What is the Median? Definition & Examples
The Median is the middle number in a sorted, ascending or descending, list of numbers. It serves as a key indicator of central tendency, providing insight into the typical value of a dataset without being skewed by unusually high or low values.
More specifically, what is the median in statistics refers to the value separating the higher half from the lower half of a data sample, a population, or a probability distribution. For instance, if you have the numbers 1, 7, 3, 9, 5, you would first sort them: 1, 3, 5, 7, 9. The middle number, 5, is the median. This measure is particularly useful in skewed distributions, such as income data, where a few extremely high incomes might inflate the mean, but the median would still accurately reflect the income of the typical person.
How to Calculate the Median
To understand how to calculate the median, a systematic approach is required, which depends on whether the dataset contains an odd or even number of observations. The primary step always involves arranging the data points in sequential order.
Here are the steps to calculate the median:
- Step 1: Order the Data. Arrange all the numbers in your dataset from the smallest to the largest (or largest to smallest).
- Step 2: Count Data Points (n). Determine the total number of values in your dataset.
- Step 3: Find the Middle Value.
- If ‘n’ is an odd number, the median is the value exactly in the middle. You can find its position using the formula (n + 1) / 2.
- If ‘n’ is an even number, there are two middle values. The median is the average of these two middle numbers. Their positions are n / 2 and (n / 2) + 1.
For example, given the dataset {12, 15, 10, 18, 13}:
- Order: {10, 12, 13, 15, 18}
- n = 5 (odd)
- Median position: (5 + 1) / 2 = 3rd position. The median is 13.
For an even dataset, {20, 25, 15, 30}:
- Order: {15, 20, 25, 30}
- n = 4 (even)
- Middle positions: 4 / 2 = 2nd and (4 / 2) + 1 = 3rd. The values are 20 and 25.
- Median: (20 + 25) / 2 = 22.5.
Median vs. Mean and Mode
When analyzing data, the difference between mean median mode becomes apparent as each provides a unique perspective on the central tendency of a dataset. While all three aim to identify a “typical” value, they do so through different calculations and are affected differently by the distribution of data, especially by outliers.
The mean, often called the average, is calculated by summing all values and dividing by the count of values. It is sensitive to extreme values. The mode is the value that appears most frequently in a dataset and is useful for categorical data. The median, as discussed, is the middle value of an ordered dataset, making it robust against outliers.
The choice of which measure to use often depends on the nature of the data and the goal of the analysis. For instance, in financial data like house prices, the median is often preferred over the mean because a few very expensive homes can significantly inflate the average, making the mean less representative of the typical house price.
| Measure | Definition | Calculation | Sensitivity to Outliers |
|---|---|---|---|
| Mean | The arithmetic average of all values. | Sum of all values / Number of values | Highly sensitive |
| Median | The middle value in an ordered dataset. | Middle value (odd n) or average of two middle values (even n) after sorting. | Low sensitivity |
| Mode | The most frequently occurring value. | Value that appears most often. | Not applicable (primarily for frequency) |























