Comparing Data Sets
Year 8 students often struggle when asked to compare two football teams' scoring records or test results from different classes. Comparing data sets requires more than just looking at averages—students must examine both central tendency and spread to draw meaningful conclusions.
Try it right now
Click “Generate a problem” to see a fresh example of this technique.
Why it matters
Comparing data sets appears throughout GCSE Foundation and Higher papers, particularly in scatter graph questions where students analyse mathematical relationships between variables. In real life, this skill helps students evaluate which mobile phone tariff offers better value (comparing monthly costs and data allowances), or determine which revision method is more effective by comparing test scores before and after different study approaches. Sports coaches use these techniques to compare player performance across seasons—for example, comparing a striker's goals per match (mean 1.2 vs 0.8) alongside consistency (range 0-3 vs 0-5 goals). Medical researchers compare treatment effectiveness by examining both average recovery times and variation between patients. Understanding both measures of central tendency and spread ensures students can make informed decisions based on complete data analysis rather than incomplete comparisons.
How to solve comparing data sets
Comparing Data Sets
- Compare averages (mean, median) to see which set is 'higher'.
- Compare spread (range, IQR) to see which set is more consistent.
- Use the same type of average for a fair comparison.
- Back up comparisons with specific values.
Example: Set A: median 12, range 8. Set B: median 15, range 3 → B is higher and more consistent.
Worked examples
Set A has mean 8, Set B has mean 11. Which has a higher average?
Answer: Set B
- Compare the means → 11 > 8 — Set B's mean (11) is greater than Set A's mean (8).
Team A scores: 1, 1, 10, 8, 0 (mean=4). Team B scores: 4, 4, 4, 4, 4 (mean=4). Which is more consistent?
Answer: Team B
- Compare the spread → The second set has no variation (all values equal) — All values in the second set are the same, meaning zero spread.
- Conclusion → Team B is more consistent — Less spread means more consistency.
Compare ranges: Set A = {3, 6, 16, 17} range=14, Set B = {4, 5, 8, 10} range=6. Which is more spread out?
Answer: Set A
- Compare the ranges → Range A = 14, Range B = 6 — Range A (14) > Range B (6).
- Conclusion → Set A is more spread out — A larger range means more spread.
Common mistakes
- Students compare only the means without considering spread, concluding Set A (mean 12, range 15) is better than Set B (mean 10, range 2) when Set B is actually more consistent.
- Students incorrectly calculate range by subtracting smallest from largest backwards, writing range = 3 - 15 = -12 instead of 15 - 3 = 12.
- Students mix different measures when comparing, using mean for one set (mean = 8) and median for another (median = 9), making invalid comparisons.
- Students assume higher mean always indicates better performance, ignoring context where lower values might be preferred, such as comparing journey times where 25 minutes is better than 35 minutes.