Formula Sheets

Variance Formula Booklet

Variance Formula Booklet

Complete Reference Guide for All Mathematics Students

📐 What is Variance?

Definition:
Variance is a measure of how spread out a set of data is from its mean (average). It measures the average of the squared differences from the mean.

Symbols:
• Population variance: \(\sigma^2\) (sigma squared)
• Sample variance: \(s^2\)
• Also written as: \(\text{Var}(X)\) or \(V(X)\)

High variance: Data points are spread far from the mean
Low variance: Data points are clustered close to the mean
• Variance is always non-negative (\(\sigma^2 \geq 0\))

📊 Population Variance Formula

\[ \sigma^2 = \frac{\sum_{i=1}^{N}(x_i - \mu)^2}{N} \]

Where:
• \(\sigma^2\) = population variance
• \(x_i\) = each individual value in the population
• \(\mu\) = population mean (average)
• \(N\) = total number of values in the population
• \(\sum\) = sum of all values

Use when: You have data for the entire population (all possible values)

🎯 Sample Variance Formula

\[ s^2 = \frac{\sum_{i=1}^{n}(x_i - \bar{x})^2}{n-1} \]

Where:
• \(s^2\) = sample variance
• \(x_i\) = each individual value in the sample
• \(\bar{x}\) = sample mean (average)
• \(n\) = number of values in the sample
• \(n-1\) = degrees of freedom (Bessel's correction)

Use when: You have data from a sample (subset of the population)
Why n-1? Using \(n-1\) provides an unbiased estimate of the population variance

🔬 Alternative Variance Formulas

Computational Formula (easier for calculations):

\[ \text{Var}(X) = E[X^2] - (E[X])^2 \]

This means: "Expected value of squares minus square of expected value"

For Population (expanded form):

\[ \sigma^2 = \frac{\sum_{i=1}^{N}x_i^2}{N} - \mu^2 \]

For Sample (expanded form):

\[ s^2 = \frac{\sum_{i=1}^{n}x_i^2 - n\bar{x}^2}{n-1} \]

📝 Step-by-Step Calculation

Step 1: Calculate the Mean

Add all values and divide by the number of values:
\[ \bar{x} = \frac{\sum x_i}{n} \]

Step 2: Find Deviations from Mean

Subtract the mean from each value:
\[ (x_1 - \bar{x}), (x_2 - \bar{x}), ..., (x_n - \bar{x}) \]

Step 3: Square Each Deviation

Square each difference to make them positive:
\[ (x_1 - \bar{x})^2, (x_2 - \bar{x})^2, ..., (x_n - \bar{x})^2 \]

Step 4: Calculate Average of Squared Deviations

• For population: Divide by \(N\)
• For sample: Divide by \(n-1\)

💡 Worked Example

Problem: Find the variance of the sample: 4, 8, 6, 5, 3, 2

Step 1: Calculate Mean
\[ \bar{x} = \frac{4+8+6+5+3+2}{6} = \frac{28}{6} = 4.67 \]

Step 2 & 3: Find and Square Deviations
(4 - 4.67)² = 0.4489
(8 - 4.67)² = 11.0889
(6 - 4.67)² = 1.7689
(5 - 4.67)² = 0.1089
(3 - 4.67)² = 2.7889
(2 - 4.67)² = 7.1289

Step 4: Calculate Variance
Sum of squared deviations = 23.3334
\[ s^2 = \frac{23.3334}{6-1} = \frac{23.3334}{5} = 4.67 \]

Answer: Sample variance = 4.67

📈 Variance for Grouped Data

Population Variance (Grouped Data):

\[ \sigma^2 = \frac{\sum f_i(M_i - \mu)^2}{N} \]

Sample Variance (Grouped Data):

\[ s^2 = \frac{\sum f_i(M_i - \bar{x})^2}{n-1} \]

Where:
• \(f_i\) = frequency of each class
• \(M_i\) = midpoint of each class interval
• \(\bar{x} = \frac{\sum f_i M_i}{\sum f_i}\) (mean for grouped data)

⚡ Properties of Variance

Property 1: Adding a constant doesn't change variance

\[ \text{Var}(X + c) = \text{Var}(X) \]

Property 2: Multiplying by a constant

\[ \text{Var}(cX) = c^2 \cdot \text{Var}(X) \]

Property 3: Linear transformation

\[ \text{Var}(aX + b) = a^2 \cdot \text{Var}(X) \]

Property 4: Variance of independent variables

\[ \text{Var}(X_1 + X_2 + ... + X_n) = \text{Var}(X_1) + \text{Var}(X_2) + ... + \text{Var}(X_n) \]

Property 5: Variance is always non-negative

\[ \text{Var}(X) \geq 0 \]

🔄 Variance vs Standard Deviation

Relationship:
Standard deviation is the square root of variance

\[ \sigma = \sqrt{\sigma^2} \quad \text{or} \quad s = \sqrt{s^2} \]

Key Differences:
Variance: Measured in squared units
Standard Deviation: Measured in same units as original data
• Standard deviation is easier to interpret

Example: If data is in meters and variance = 16 m²
Then standard deviation = √16 = 4 meters

🎯 Applications of Variance

✓ Finance: Measuring investment risk and volatility

✓ Quality Control: Measuring consistency in manufacturing

✓ Research: Analyzing data spread in scientific studies

✓ Weather: Understanding temperature variability

✓ Education: Measuring consistency in test scores

🚫 Common Mistakes to Avoid

❌ Using \(n\) instead of \(n-1\) for sample variance
✅ Always use \(n-1\) for samples to get unbiased estimate

❌ Forgetting to square the deviations
✅ Must square each \((x_i - \bar{x})\) before summing

❌ Confusing variance with standard deviation
✅ Variance = squared units, Standard deviation = original units

❌ Calculating mean incorrectly
✅ Double-check mean calculation before finding variance

📋 Quick Reference Summary

Population Variance:

\[ \sigma^2 = \frac{\sum(x_i - \mu)^2}{N} \]

Sample Variance:

\[ s^2 = \frac{\sum(x_i - \bar{x})^2}{n-1} \]

Standard Deviation:

\[ \sigma = \sqrt{\sigma^2} \quad \text{or} \quad s = \sqrt{s^2} \]

Remember:
• Variance measures spread of data
• Always non-negative
• Use \(n-1\) for sample variance
• Square root gives standard deviation

📊 Master Variance for Statistics Success!

Variance is a fundamental concept in statistics and data analysis

💡 Pro Tips:
• Always calculate mean first, then proceed with variance
• Keep track of whether you're working with population or sample
• Use the computational formula for easier hand calculations
• Variance in squared units, standard deviation in original units
• Check your answer: variance should always be positive
• For large datasets, consider using statistical software or calculators

Shares: