Statistics - Sixth Grade
Complete Notes & Formulas
1. Statistical Questions
Definition
A statistical question is one that
expects VARIABILITY in the answers
• Can be answered by collecting DATA
• Answers will VARY from person to person
• More than ONE possible answer
Statistical vs Non-Statistical
Statistical ✓ | Non-Statistical ✗ |
---|---|
How tall are students in 6th grade? | How tall am I? |
What are favorite colors of students? | What is my favorite color? |
How many hours do people sleep? | How many hours did I sleep last night? |
2. Measures of Center and Range
Mean (Average)
Mean = Sum of all values ÷ Number of values
or
x̄ = (∑x) ÷ n
Median (Middle Value)
Step 1: Put data in ORDER (smallest to largest)
Step 2: Find the MIDDLE value
• If ODD number of values: middle one is median
• If EVEN number: average the two middle values
Mode (Most Frequent)
Mode = Value that appears MOST OFTEN
• Can have NO mode (all different)
• Can have MULTIPLE modes (tie)
Range (Spread)
Range = Maximum − Minimum
Largest value − Smallest value
Example
Data: 5, 8, 3, 8, 10, 6
Mean: (5+8+3+8+10+6) ÷ 6 = 40 ÷ 6 = 6.67
Median: Order: 3, 5, 6, 8, 8, 10 → (6+8) ÷ 2 = 7
Mode: 8 (appears twice)
Range: 10 − 3 = 7
3. Mean Absolute Deviation (MAD)
Definition
MAD measures how SPREAD OUT the data is
from the mean
Average distance of each value from the mean
Formula
MAD = (Sum of |each value − mean|) ÷ n
Steps to Calculate MAD
Step 1: Calculate the MEAN
Step 2: Find the ABSOLUTE DIFFERENCE between each value and mean
Step 3: ADD all the absolute differences
Step 4: DIVIDE by the number of values
Example
Data: 2, 4, 6, 8, 10
Step 1: Mean = (2+4+6+8+10) ÷ 5 = 6
Step 2: |2−6|=4, |4−6|=2, |6−6|=0, |8−6|=2, |10−6|=4
Step 3: 4+2+0+2+4 = 12
Step 4: MAD = 12 ÷ 5 = 2.4
Answer: MAD = 2.4
4. Quartiles and Interquartile Range (IQR)
What are Quartiles?
Quartiles divide data into FOUR equal parts
Q1 (First Quartile): 25th percentile - median of lower half
Q2 (Second Quartile): 50th percentile - the median
Q3 (Third Quartile): 75th percentile - median of upper half
IQR Formula
IQR = Q3 − Q1
IQR measures the middle 50% of data
Steps to Find IQR
Step 1: Put data in ORDER
Step 2: Find the MEDIAN (Q2)
Step 3: Find Q1 (median of lower half)
Step 4: Find Q3 (median of upper half)
Step 5: Calculate IQR = Q3 − Q1
Example
Data: 2, 3, 5, 7, 11, 13, 17, 19, 23, 29
Median (Q2): (11+13) ÷ 2 = 12
Lower half: 2, 3, 5, 7, 11 → Q1 = 5
Upper half: 13, 17, 19, 23, 29 → Q3 = 19
IQR: 19 − 5 = 14
Answer: IQR = 14
5. Outliers
Definition
An outlier is a value that is MUCH HIGHER
or MUCH LOWER than most other values
• Stands out from the rest of the data
• Significantly different from other observations
How to Identify Outliers (1.5 × IQR Rule)
Lower Boundary: Q1 − 1.5 × IQR
Upper Boundary: Q3 + 1.5 × IQR
Any value BELOW lower boundary or ABOVE upper boundary is an outlier
Effect of Removing Outliers
• Mean: Usually changes SIGNIFICANTLY (very sensitive to outliers)
• Median: Usually changes LITTLE (resistant to outliers)
• Mode: May not change at all
• Range: Often decreases significantly
Example
Data: 5, 6, 7, 8, 8, 9, 50
50 is an outlier (much larger than others)
With outlier: Mean = 13.29
Without outlier: Mean = 7.17
Removing outlier greatly changes the mean!
6. Measures of Center and Variability
Measures of Center
Mean: Average - use when data is symmetric, no outliers
Median: Middle value - use when data has outliers or is skewed
Mode: Most frequent - use for categorical data
Measures of Variability (Spread)
Range: Max − Min (affected by outliers)
IQR: Q3 − Q1 (resistant to outliers)
MAD: Average distance from mean
When to Use Each Measure
Situation | Best Center | Best Variability |
---|---|---|
No outliers, symmetric | Mean | MAD or Range |
Outliers present | Median | IQR |
Skewed data | Median | IQR |
7. Describing Distributions
Shape of Distribution
Symmetric: Both sides of center are mirror images
Skewed Right: Tail extends to the right (higher values)
Skewed Left: Tail extends to the left (lower values)
Center, Spread, and Shape
When describing a distribution, mention:
• Center: What is the typical value? (mean or median)
• Spread: How spread out is the data? (range, IQR, MAD)
• Shape: Is it symmetric or skewed?
• Outliers: Are there any unusual values?
8. Types of Samples
Random Sample
Every member of the population has an
EQUAL CHANCE of being selected
Example: Drawing names from a hat
Representative Sample
Sample that ACCURATELY REFLECTS
the characteristics of the whole population
Example: Same proportion of males/females as population
Biased Sample
Sample that does NOT represent the population
Results are UNFAIR or INACCURATE
Example: Only surveying people at a gym about exercise
Examples
Random ✓: Every 10th student entering school
Biased ✗: Only asking students in math club about favorite subject
Representative ✓: Sample with same grade proportions as whole school
Quick Reference: All Formulas
Measure | Formula |
---|---|
Mean | Sum ÷ n |
Median | Middle value (in order) |
Mode | Most frequent value |
Range | Max − Min |
MAD | Σ|x − mean| ÷ n |
IQR | Q3 − Q1 |
Outlier Boundaries | Q1 − 1.5×IQR and Q3 + 1.5×IQR |
💡 Important Tips to Remember
✓ Statistical questions expect VARIABILITY in answers
✓ Mean is sensitive to outliers, median is resistant
✓ Always put data in ORDER to find median and quartiles
✓ MAD uses ABSOLUTE VALUES (no negatives)
✓ IQR = Q3 − Q1 (middle 50% of data)
✓ Outliers: Use 1.5 × IQR rule to identify
✓ Use median and IQR when outliers are present
✓ Describe distributions with center, spread, and shape
✓ Random samples give every member equal chance
✓ Biased samples lead to inaccurate conclusions
🧠 Memory Tricks & Strategies
Mean, Median, Mode:
"Mean is average, Median is middle, Mode is most!"
Range:
"Biggest MINUS smallest - that's the range, oh so nicest!"
MAD:
"Mean Absolute Deviation - find each distance, no hesitation!"
IQR:
"Q3 minus Q1 - middle 50% when you're done!"
Quartiles:
"Quartiles divide in FOUR parts - Q1, Q2, Q3 from the start!"
Statistical Questions:
"If answers VARY, it's statistical - if just ONE answer, that's categorical!"
Master Statistics! 📊 📈 🎯
Remember: Choose appropriate measures based on your data!