📊 Chapter 10: Statistics (Extended Content)
Cumulative Frequency Diagrams

Hello mathematicians! This chapter takes your understanding of data visualization to the next level. We've looked at frequency tables and bar charts, but sometimes we need to know how the data builds up over time or value. That’s where Cumulative Frequency Diagrams come in!

Don't worry if this sounds complicated – it's just a special type of running total plotted on a graph. Mastering this topic is essential for calculating important measures of spread and average (like the median) quickly and accurately for large sets of grouped data.


1. Understanding Cumulative Frequency (CF)

1.1 What is Cumulative Frequency?

Frequency tells you how many times a value or group of values occurs.
Cumulative Frequency (CF) is the running total of these frequencies. It tells you the total number of data points up to a specific value.

Analogy: Imagine you are tracking how many kilometers you cycle each day. The frequency is the distance for that day. The cumulative frequency is the total distance cycled since you started.

1.2 Creating the Cumulative Frequency Table

Cumulative frequency is almost always used with Grouped Continuous Data.

To build the table, you need two columns: the class intervals (the data groups) and the frequencies. Then, you calculate the CF.

Step-by-Step: Calculating CF
  1. Start with the frequency of the first class interval. This is your first cumulative frequency.
  2. For the second class interval, add its frequency to the previous cumulative frequency.
  3. Repeat this process until you reach the final class interval.

Quick Check: The last value in your Cumulative Frequency column must equal the Total Frequency (\(N\)) of the entire data set.


🔑 Key Concept: Plotting Points

When plotting a cumulative frequency diagram, you plot the CF against the Upper Class Boundary of the interval.

Why? The cumulative frequency value tells you how many pieces of data are LESS THAN OR EQUAL TO the upper limit of that group.

Example: If the class is \(10 < t \leq 20\) and the CF is 15, it means 15 data points are less than 20. So, we plot \((20, 15)\).


2. Drawing the Cumulative Frequency Diagram (Ogive)

The graph itself is often called an Ogive. Drawing it correctly is a crucial exam skill.

2.1 Setup and Axes

  • X-axis (Horizontal): This represents the data itself (e.g., height, time, score). Always label this axis clearly.
  • Y-axis (Vertical): This represents the Cumulative Frequency (CF). It must range from 0 up to your Total Frequency (\(N\)).

2.2 The Plotting Process

  1. The Starting Point: The curve must start at the lowest possible value, where the CF is 0. If your first class interval is \(10 < x \leq 20\), you must plot \((10, 0)\). This ensures the graph represents all data from the beginning.
  2. Plotting the Points: Plot the calculated cumulative frequency values against the Upper Class Boundaries (using small, neat crosses, \(x\)).
  3. The Curve: Join the plotted points with a smooth curve (a fluid line, not straight segments). This is essential because we are dealing with continuous data, implying values change smoothly.
⚠️ Common Mistake Alert!

Do NOT join the points with a ruler (a series of straight lines). Cumulative frequency represents a distribution that is smoothly building up. Using a ruler loses the marks for drawing a smooth curve!

Also, remember to plot against the Upper Boundary, not the midpoint or the lower boundary (except for the starting point \((0, 0)\) or the first lower boundary).


3. Interpreting the Diagram: Finding Averages and Spread

Once the smooth cumulative frequency curve is drawn, we use it to estimate key measures of location and spread. These estimates are always read off the diagram.

3.1 The Median (\(Q_2\))

The median is the middle value when all data is arranged in order.

To find the median:

  1. Calculate the position: Median position = \(\frac{N}{2}\), where \(N\) is the total frequency.
  2. Locate this position on the CF (vertical) axis.
  3. Draw a horizontal line across to the curve.
  4. Draw a vertical line down to the data (horizontal) axis.
  5. Read the value on the data axis. This is the estimated Median.

3.2 Quartiles (\(Q_1\) and \(Q_3\))

Quartiles divide the data into four equal parts (quarters).

  • Lower Quartile (\(Q_1\)): The value at the 25% mark.
    Position: \(\frac{1}{4} \times N\) or \(0.25 \times N\).
  • Upper Quartile (\(Q_3\)): The value at the 75% mark.
    Position: \(\frac{3}{4} \times N\) or \(0.75 \times N\).

Mnemonic: Q1, Q2 (Median), Q3 relate to quarters: 25%, 50%, 75% of the data.

3.3 Interquartile Range (IQR)

The IQR is a measure of spread. It measures the range of the middle 50% of the data.

Formula:
$$IQR = Q_3 - Q_1$$

A small IQR means the middle 50% of the data is tightly grouped; a large IQR means the middle 50% is very spread out.

3.4 Percentiles

Percentiles divide the data into 100 equal parts. These are useful if you want to find a specific benchmark, like the top 10% of scores.

To find the \(P^{\text{th}}\) percentile:

  1. Calculate the position: \(\frac{P}{100} \times N\).
  2. Read the corresponding value from the data axis.

Example: To find the 80th percentile, find the position at \(0.80 \times N\) on the CF axis.


✅ Quick Review: Interpreting the Diagram

  • What is the CF diagram used for? To estimate the median, quartiles, and percentiles for grouped continuous data.
  • How do you find the data value? Go from the CF axis, across to the curve, and then down to the data axis.
  • What if the question asks: "How many students scored less than 70 marks?" Find 70 on the data (x) axis, go up to the curve, and read across to the CF (y) axis.
  • Accuracy: Always read your final answer from the scale provided on the x-axis, typically to an appropriate degree of accuracy (usually 3 significant figures unless specified otherwise).

Did You Know?

While the CF diagram is great for finding the median and quartiles, it cannot be used to find the mode. For that, you would typically need a frequency polygon or a histogram.

Keep practising your plotting and reading skills—you've got this!