Skip to content
MathAnvil
§ Statistics

Comparing Data Sets

CCSS.6.SP3 min read

Year 8 students often struggle when asked to compare two football teams' scoring records or test results from different classes. Comparing data sets requires more than just looking at averages—students must examine both central tendency and spread to draw meaningful conclusions.

Try it right now

Click “Generate a problem” to see a fresh example of this technique.

§ 01

Why it matters

Comparing data sets appears throughout GCSE Foundation and Higher papers, particularly in scatter graph questions where students analyse mathematical relationships between variables. In real life, this skill helps students evaluate which mobile phone tariff offers better value (comparing monthly costs and data allowances), or determine which revision method is more effective by comparing test scores before and after different study approaches. Sports coaches use these techniques to compare player performance across seasons—for example, comparing a striker's goals per match (mean 1.2 vs 0.8) alongside consistency (range 0-3 vs 0-5 goals). Medical researchers compare treatment effectiveness by examining both average recovery times and variation between patients. Understanding both measures of central tendency and spread ensures students can make informed decisions based on complete data analysis rather than incomplete comparisons.

§ 02

How to solve comparing data sets

Comparing Data Sets

  • Compare averages (mean, median) to see which set is 'higher'.
  • Compare spread (range, IQR) to see which set is more consistent.
  • Use the same type of average for a fair comparison.
  • Back up comparisons with specific values.

Example: Set A: median 12, range 8. Set B: median 15, range 3 → B is higher and more consistent.

§ 03

Worked examples

Beginner§ 01

Set A has mean 8, Set B has mean 11. Which has a higher average?

Answer: Set B

  1. Compare the means 11 > 8 Set B's mean (11) is greater than Set A's mean (8).
Easy§ 02

Team A scores: 1, 1, 10, 8, 0 (mean=4). Team B scores: 4, 4, 4, 4, 4 (mean=4). Which is more consistent?

Answer: Team B

  1. Compare the spread The second set has no variation (all values equal) All values in the second set are the same, meaning zero spread.
  2. Conclusion Team B is more consistent Less spread means more consistency.
Medium§ 03

Compare ranges: Set A = {3, 6, 16, 17} range=14, Set B = {4, 5, 8, 10} range=6. Which is more spread out?

Answer: Set A

  1. Compare the ranges Range A = 14, Range B = 6 Range A (14) > Range B (6).
  2. Conclusion Set A is more spread out A larger range means more spread.
§ 04

Common mistakes

  • Students compare only the means without considering spread, concluding Set A (mean 12, range 15) is better than Set B (mean 10, range 2) when Set B is actually more consistent.
  • Students incorrectly calculate range by subtracting smallest from largest backwards, writing range = 3 - 15 = -12 instead of 15 - 3 = 12.
  • Students mix different measures when comparing, using mean for one set (mean = 8) and median for another (median = 9), making invalid comparisons.
  • Students assume higher mean always indicates better performance, ignoring context where lower values might be preferred, such as comparing journey times where 25 minutes is better than 35 minutes.
Practice on your own
Generate unlimited comparing data sets worksheets with varying difficulty levels using MathAnvil's free worksheet generator.
Generate free worksheets
§ 05

Frequently asked questions

Should I always use mean when comparing data sets?
Not necessarily. Use the same measure for both sets for fair comparison. If data contains outliers (like one student scoring 95% when others score 60-70%), median often provides a better comparison than mean, as it's less affected by extreme values.
What if two sets have the same mean but different ranges?
The set with the smaller range is more consistent and often more reliable. For example, if both classes average 65% but Class A has range 20 and Class B has range 45, Class A shows more consistent performance across students.
How do I explain which data set is 'better'?
Always consider context. For test scores, higher mean with lower range usually indicates better performance. For manufacturing defects or journey times, lower means are preferable. Always justify your answer using specific numerical values from both measures.
Can I compare data sets with different numbers of values?
Yes, you can compare means and ranges regardless of sample size. However, be cautious as larger samples typically provide more reliable averages. A mean from 20 students is generally more trustworthy than one from 4 students.
What's the difference between range and interquartile range for comparison?
Range uses all data but can be affected by outliers. Interquartile range focuses on the middle 50% of data, providing better comparison when extreme values exist. For GCSE, range is more commonly required unless specifically asked for IQR.
§ 06

Related topics

Share this article