Unit 3: Measures of Dispersion, Skewness & Kurtosis

Table of Contents

1. Concept of Dispersion & Ideal Measure

Concept of Dispersion

Measures of central tendency (like the mean) tell us the "center" of the data, but they don't tell us about its spread or variability.

Example:
Set A: {49, 50, 51} => Mean = 50
Set B: {1, 50, 99} => Mean = 50
Both sets have the same mean, but Set B is much more "dispersed" (spread out) than Set A.

Dispersion: The measure of the variation of the items, or the degree to which data points are scattered around the central value.

Criteria for an Ideal Measure of Dispersion

Similar to an ideal average:


2. Absolute Measures of Dispersion

These measures are expressed in the same units as the original data (e.g., cm, kg, $).

1. Range

The simplest measure. The difference between the largest and smallest observation.

Range = Largest Value (L) - Smallest Value (S)

2. Quartile Deviation (QD)

Also known as the Semi-Interquartile Range. It measures the spread of the middle 50% of the data.

QD = (Q3 - Q1) / 2
(where Q1 = 1st Quartile, Q3 = 3rd Quartile)

3. Mean Deviation (MD)

The arithmetic mean of the absolute deviations of the observations from a measure of central tendency (usually the median or mean).

MD (from mean) = (Σ |x - x̄|) / n
MD (from median) = (Σ |x - Median|) / n

4. Variance and Standard Deviation (SD)

The most important and widely used measures of dispersion. The Standard Deviation is the "gold standard" of dispersion measures.

Variance (s²): The average of the squared deviations from the mean.
Sample Variance: s² = (Σ (x - x̄)²) / (n - 1)
Population Variance: σ² = (Σ (x - μ)²) / N
Standard Deviation (s or σ): The positive square root of the variance.
SD (s) = sqrt(Variance)

3. Relative Measures of Dispersion

Absolute measures (like SD) cannot be used to compare the variability of two different datasets if they have different units (e.g., heights in cm vs. weights in kg) or very different means.

We use relative measures, which are unit-free ratios or percentages.

1. Coefficient of Dispersion (General)

This is a general term for any relative measure of dispersion. Each absolute measure has a corresponding relative measure.

2. Coefficient of Variation (CV)

This is the most important and widely used relative measure. It corresponds to the Standard Deviation.

Coefficient of Variation (CV): The ratio of the standard deviation to the mean, expressed as a percentage.
CV = (Standard Deviation / Mean) * 100
CV = (s / x̄) * 100
Exam Tip: A common question is: "Team A has a mean score of 80 with SD=5. Team B has a mean score of 50 with SD=4. Which team is more consistent?"
- CV(A) = (5 / 80) * 100 = 6.25%
- CV(B) = (4 / 50) * 100 = 8%
- Answer: Team A is more consistent because its CV is lower.

4. Moments (Raw and Central)

Moments are a set of statistical parameters used to describe the characteristics (shape, center, spread) of a distribution.

1. Raw Moments (μ'r) - Moments about Origin (Zero)

The r-th raw moment is the arithmetic mean of the r-th power of the observations.

μ'r = (Σ xr) / n (Ungrouped)
μ'r = (Σ f * xr) / N (Grouped)

2. Central Moments (μr) - Moments about the Mean

The r-th central moment is the arithmetic mean of the r-th power of the deviations from the mean.

μr = (Σ (x - x̄)r) / n (Ungrouped)
μr = (Σ f * (x - x̄)r) / N (Grouped)

5. Measures of Skewness and Kurtosis

These measures describe the shape of the distribution.

Skewness (Asymmetry)

Measures the asymmetry or lack of symmetry of a distribution.

Moment-based Coefficient (β1):

β1 = (μ3)² / (μ2

(If β1 = 0, symmetrical. If β1 > 0, skewed.)

Kurtosis (Peakedness)

Measures the peakedness or flatness of a distribution compared to the standard Normal distribution.

Moment-based Coefficient (β2):

β2 = μ4 / (μ2