Skip to content
MathAnvil

Comparing Data Sets

CCSS.6.SPLK20.103 min read

When students compare test scores between 2 classes or analyze which basketball player scores more consistently, they're mastering data comparison skills. This fundamental concept in CCSS.6.SP and LK20.10 teaches students to look beyond single numbers and examine both central tendency and spread.

Try it right now

Why it matters

Data comparison skills appear everywhere in students' lives and future careers. When comparing smartphone battery life ratings of 8 hours versus 12 hours, students use mean comparison. Analyzing which weather station shows more consistent daily temperatures over a month requires understanding spread and variation. In sports analytics, comparing two players who both average 15 points per game but have different scoring consistency teaches reliability concepts. Students encounter these decisions when choosing between schools with average SAT scores of 1280 versus 1320, or determining which route to school has more predictable travel times. Financial literacy builds on these skills when comparing investment options with similar average returns but different risk levels. Research projects require students to evaluate which data source provides more reliable information based on sample sizes and variation patterns.

How to solve comparing data sets

Comparing Data Sets

  • Compare averages (mean, median) to see which set is 'higher'.
  • Compare spread (range, IQR) to see which set is more consistent.
  • Use the same type of average for a fair comparison.
  • Back up comparisons with specific values.

Example: Set A: median 12, range 8. Set B: median 15, range 3 β†’ B is higher and more consistent.

Worked examples

Beginner

Class A average score: 12. Class B average score: 16. Which class performed better?

Answer: Class B

  1. Compare the means β†’ 16 > 12 β€” Class B's average (16) is higher than Class A's average (12).
Easy

Two basketball players scored 10, 6, 10, 3, 1 and 6, 6, 6, 6, 6 points over 5 games. Both averaged 6. Who is more reliable?

Answer: Player 2

  1. Compare the spread β†’ The second set has no variation (all values equal) β€” All values in the second set are the same, meaning zero spread.
  2. Conclusion β†’ Player 2 is more reliable β€” Player 2 scores the same every game, meaning zero variation.
Medium

Compare ranges: Set A = {1, 2, 10, 14} range=13, Set B = {3, 6, 7, 11} range=8. Which is more spread out?

Answer: Set A

  1. Compare the ranges β†’ Range A = 13, Range B = 8 β€” Range A (13) > Range B (8).
  2. Conclusion β†’ Set A is more spread out β€” A larger range means more spread.

Common mistakes

  • βœ—Students often compare only means and ignore spread, concluding that two players averaging 12 points are equally reliable when one has scores of 10,11,13,14 (range 4) and another has 2,8,16,22 (range 20).
  • βœ—Students incorrectly calculate range as highest value only instead of highest minus lowest, writing range = 15 for data set {3,7,11,15} instead of range = 12.
  • βœ—Students assume larger numbers always mean better performance, choosing a data set with mean 25 over mean 18 without considering that lower scores might actually be better (like golf or time).
  • βœ—Students mix different measures inappropriately, comparing the mean of one set (14) to the median of another (16) and declaring the second set superior.

Practice on your own

Generate unlimited comparing data sets worksheets with customizable difficulty levels and answer keys at MathAnvil's free worksheet generator.

Generate free worksheets β†’

Frequently asked questions

How do I teach students when to use mean versus median for comparison?β–Ύ
Use mean when data has no extreme outliers and is roughly symmetric. For skewed data or when outliers exist (like household incomes ranging from $30,000 to $500,000), median provides more accurate comparison. Practice with real examples helps students recognize these patterns.
What's the best way to explain why spread matters in data comparison?β–Ύ
Use concrete examples like two pizza delivery services both averaging 30 minutes, but one delivers between 28-32 minutes (range 4) while another varies 15-45 minutes (range 30). Students quickly grasp why consistency matters for reliability.
How can students remember to compare the same measures across data sets?β–Ύ
Create a comparison checklist: same sample sizes, same units, same measures (mean with mean, not mean with median). Practice identifying unfair comparisons like comparing 50 students' math scores with 15 students' reading scores.
When should students use range versus other spread measures?β–Ύ
Range works well for smaller data sets (under 10 values) and quick comparisons. For larger sets or when outliers exist, introduce interquartile range. Start with range since it's most intuitive for middle school students.
How do I help students write proper comparison conclusions?β–Ύ
Teach the format: 'Set A has [specific measure] while Set B has [same measure], therefore [conclusion with reasoning].' For example: 'Team A averaged 45 points with range 12, while Team B averaged 45 points with range 28, therefore Team A is more consistent.'

Related topics

Share this article