Unit 2: Measures of Central Tendency

Table of Contents

1. Criteria for an Ideal Measure of Central Tendency

An "ideal" average should have the following properties:


2. Arithmetic Mean (AM)

The "average" as most people know it. It is the sum of all observations divided by the number of observations.

Ungrouped Data: x̄ = (x1 + x2 + ... + xn) / n = (Σx) / n

Grouped Data: x̄ = (Σf * x) / (Σf) = (Σf * x) / N
(where x = midpoint of class, f = frequency, N = total frequency)

3. Median

The middle value of a dataset that has been arranged in order (ascending or descending). It divides the data into two equal halves.

Ungrouped Data:
1. Order the data.
2. Find the position: (n+1)/2.
- If n=7, position is (7+1)/2 = 4th item.
- If n=8, position is (8+1)/2 = 4.5th item (average of 4th and 5th items).

Grouped Data:
1. Find the Median Class (the class where the (N/2)-th item falls).
2. Apply formula: Median = L + [ ( (N/2) - cf ) / f ] * h
(L = lower boundary of median class, N = total frequency, cf = cumulative frequency *before* median class, f = frequency of median class, h = class width)

4. Mode (Ungrouped, Grouped, and Graphical)

The value that appears most frequently in a dataset.

1. Mode for Ungrouped Data

Simply find the value that repeats the most.

2. Mode for Grouped Data (Formula)

  1. Find the Modal Class (the class with the highest frequency).
  2. Apply the formula:
    Mode = L + [ (f1 - f0) / (2*f1 - f0 - f2) ] * h
    Where:
    • L = lower boundary of modal class
    • f1 = frequency of the modal class
    • f0 = frequency of the class *before* the modal class
    • f2 = frequency of the class *after* the modal class
    • h = class width

3. Mode by Graphical Method

The mode can be estimated graphically from a Histogram.

  1. Draw a histogram of the data.
  2. Identify the tallest bar (the modal class).
  3. Draw two lines:
    • From the top-right corner of the modal bar to the top-right corner of the *previous* bar.
    • From the top-left corner of the modal bar to the top-left corner of the *next* bar.
  4. Draw a vertical line from the intersection of these two lines down to the X-axis.
  5. The value on the X-axis is the estimated Mode.

5. Geometric Mean (GM)

The n-th root of the product of n (non-negative) observations. It is best used for averaging ratios, percentages, or growth rates.

Ungrouped Data: GM = (x1 * x2 * ... * xn)^(1/n)
Calculation Tip: The GM is difficult to calculate directly. Use logarithms:
log(GM) = (1/n) * [log(x1) + log(x2) + ... + log(xn)] = (Σ log(x)) / n
GM = Antilog [ (Σ log(x)) / n ]

6. Harmonic Mean (HM)

The reciprocal of the arithmetic mean of the reciprocals of the observations. It is best used for averaging rates and speeds.

Ungrouped Data: HM = n / ( (1/x1) + (1/x2) + ... + (1/xn) ) = n / (Σ (1/x))

7. Merits and Demerits (Summary)

Exam Tip: "Which is the 'best' average?" The Arithmetic Mean is generally the best as it satisfies most properties of an ideal average. However, if there are outliers, the Median is better. If dealing with rates/speeds, the Harmonic Mean is better. If dealing with growth rates, the Geometric Mean is better.
Measure Based on all data? Affected by Outliers? Best for...
Mean (AM) Yes Yes, very much General purpose, algebra
Median No (only position) No Data with outliers, open-ended classes
Mode No No Categorical (Nominal) data, finding most popular item
GM Yes Yes, but less than AM Growth rates, ratios
HM Yes Yes, but less than AM Speeds, rates