Statistics - Quartiles and Percentiles
Quartiles and percentiles are a measures of variation, which describes how spread out the data is.
Quartiles and percentiles are both types of quantiles.
Quartiles
Quartiles are values that separate the data into four equal parts.
Here is a histogram of the age of all 928 Nobel Prize winners up to the year 2020, showing the quartiles:
The quartiles (Q0,Q1,Q2,Q3,Q4) are the values that separate each quarter.
Between Q0 and Q1 are the 25% lowest values in the data. Between Q1 and Q2 are the next 25%. And so on.
- Q0 is the smallest value in the data.
- Q1 is the value separating the first quarter from the second quarter of the data.
- Q2 is the middle value (median), separating the bottom from the top half.
- Q3 is the value separating the third quarter from the fourth quarter
- Q4 is the largest value in the data.
Calculating Quartiles with Programming
Quartiles can easily be found with many programming languages.
Using software and programming to calculate statistics is more common for bigger sets of data, as finding it manually becomes difficult.
Example
With Python use the NumPy library quantile()
method to find the first, second and third quartiles of the values 13, 21, 21, 40, 42, 48, 55, 72:
import numpy
values = [13,21,21,40,42,48,55,72]
x = numpy.quantile(values, [0,0.25,0.5,0.75,1])
print(x)
Try it Yourself »
Example
Use the R quantile()
function to find the median of the values 13, 21, 21, 40, 42, 48, 55, 72:
values <- c(13,21,21,40,42,48,55,72)
quantile(values)
Try it Yourself »
Percentiles
Percentiles are values that separate the data into 100 equal parts.
For example, The 95th percentile separates the lowest 95% of the values from the top 5%
The 25th percentile (P25%) is the same as the first quartile (Q1).
The 50th percentile (P50%) is the same as the second quartile (Q2) and the median.
THe 75th percentile (P75%) is the same as the third quartile (Q3)
Calculating Percentiles with Programming
Percentiles can easily be found with many programming languages.
Using software and programming to calculate statistics is more common for bigger sets of data, as finding it manually becomes difficult.
Example
With Python use the NumPy library percentile()
method to find the 65
th percentile of the values 13, 21, 21, 40, 42, 48, 55, 72:
import numpy
values = [13,21,21,40,42,48,55,72]
x = numpy.percentile(values, 65)
print(x)
Try it Yourself »
Example
Use the R quantile()
function to find the 65th percentile (0.65
) of the values 13, 21, 21, 40, 42, 48, 55, 72:
values <- c(13,21,21,40,42,48,55,72)
quantile(values, 0.65)
Try it Yourself »