IB

Statistics and Probability Formulae AA SL & AA HL

Master statistics and probability for IB Math AA SL & HL with our comprehensive foundation guide. Descriptive statistics, probability rules, binomial and normal distributions with examples and interactive calculator.
Statistics and Probability Formulae reference guide for IB Mathematics Analysis and Approaches SL and HL levels featuring mathematical notation and graphs

Statistics and Probability Formulae AA SL & AA HL: Complete Foundation Guide for IB Math

Welcome to the comprehensive foundation guide for Statistics and Probability Formulae in IB Mathematics Analysis and Approaches for both Standard Level and Higher Level students. This essential resource covers all core statistical and probability concepts that form the common foundation between AA SL and AA HL, including descriptive statistics measures, probability rules and laws, conditional probability and independence, discrete random variables and expected value, binomial distribution, normal distribution and standardization. Whether you're taking AA SL or preparing for the advanced topics in AA HL, mastering these foundational statistics and probability concepts is absolutely critical for exam success and provides essential quantitative reasoning skills for university studies in mathematics, sciences, economics, social sciences, and data analysis.

Understanding AA SL & AA HL Common Statistics Content

Both IB Math AA SL and AA HL students must master the same foundational statistics and probability content covered in this guide. While AA HL students will encounter additional advanced topics like Bayes' theorem, Poisson distribution, continuous probability density functions, hypothesis testing, and confidence intervals, the core concepts presented here are essential for all AA students regardless of level. These fundamental statistical techniques—descriptive statistics, basic probability rules, discrete random variables, binomial distribution, and normal distribution—form the foundation upon which all advanced statistical analysis is built and are tested extensively on both SL and HL examinations.

Descriptive Statistics

Measures of Central Tendency

Mean (Arithmetic Average)
\[ \bar{x} = \frac{\sum_{i=1}^{k} f_i x_i}{n} \]

where \( n = \sum_{i=1}^{k} f_i \) (total frequency)

For ungrouped data: \( \bar{x} = \frac{\sum x_i}{n} \)

Choosing the Right Measure of Central Tendency
  • Mean: Best for symmetric distributions without outliers—uses all data values
  • Median: Best for skewed distributions or data with outliers—resistant to extreme values
  • Mode: Best for categorical data or finding most common value—can be used with any data type

Measures of Dispersion

Range
\[ \text{Range} = \text{Maximum value} - \text{Minimum value} \]

Simple but affected by outliers

Interquartile Range (IQR)
\[ \text{IQR} = Q_3 - Q_1 \]

where \( Q_1 \) is first quartile (25th percentile)

and \( Q_3 \) is third quartile (75th percentile)

IQR measures spread of middle 50% of data—resistant to outliers

Variance
\[ \sigma^2 = \frac{\sum_{i=1}^{k} f_i (x_i - \mu)^2}{n} \]

Alternative formula (often easier to calculate):

\[ \sigma^2 = \frac{\sum_{i=1}^{k} f_i x_i^2}{n} - \mu^2 \]

Variance measures average squared deviation from mean

Standard Deviation
\[ \sigma = \sqrt{\sigma^2} = \sqrt{\frac{\sum_{i=1}^{k} f_i (x_i - \mu)^2}{n}} \]

Standard deviation is in same units as original data

Measures typical distance of data values from mean

Example: Calculating Mean and Standard Deviation

Test scores: 65, 70, 75, 80, 85. Find mean and standard deviation.

Solution:

Mean: \( \bar{x} = \frac{65 + 70 + 75 + 80 + 85}{5} = \frac{375}{5} = 75 \)

Variance: First calculate deviations squared:

\( (65-75)^2 = 100 \), \( (70-75)^2 = 25 \), \( (75-75)^2 = 0 \)

\( (80-75)^2 = 25 \), \( (85-75)^2 = 100 \)

\( \sigma^2 = \frac{100 + 25 + 0 + 25 + 100}{5} = \frac{250}{5} = 50 \)

Standard Deviation: \( \sigma = \sqrt{50} \approx 7.07 \)

Probability Fundamentals

Basic Probability

Probability of an Event
\[ P(A) = \frac{n(A)}{n(U)} \]

where \( n(A) \) = number of favorable outcomes

\( n(U) \) = total number of possible outcomes

Valid when all outcomes are equally likely

Complementary Events
\[ P(A) + P(A') = 1 \]

Or equivalently: \( P(A') = 1 - P(A) \)

\( A' \) is "not A" or complement of A

Very useful when P(A') is easier to calculate than P(A)

Combined Events

Addition Rule (General)
\[ P(A \cup B) = P(A) + P(B) - P(A \cap B) \]

Read as: "Probability of A or B"

Must subtract intersection to avoid double-counting

Addition Rule (Mutually Exclusive Events)
\[ P(A \cup B) = P(A) + P(B) \]

Only valid when \( P(A \cap B) = 0 \)

Events cannot happen simultaneously

Understanding Mutually Exclusive vs Independent

Mutually Exclusive: Events cannot both occur (if one happens, the other cannot)

Example: Rolling a 2 or a 5 on a single die roll

Independent: Occurrence of one event doesn't affect probability of the other

Example: Flipping a coin and rolling a die

Important: Events cannot be both mutually exclusive AND independent (except when one has probability 0)

Conditional Probability

Conditional Probability Formula
\[ P(A|B) = \frac{P(A \cap B)}{P(B)} \]

Read as: "Probability of A given B"

Valid when \( P(B) > 0 \)

Rearranged: \( P(A \cap B) = P(B) \times P(A|B) \)

Example: Conditional Probability

A class has 30 students: 18 girls and 12 boys. 20 students passed an exam: 14 girls and 6 boys. What's the probability a student is a girl given they passed?

Solution:

Let G = girl, P = passed

\( P(G|P) = \frac{P(G \cap P)}{P(P)} = \frac{14/30}{20/30} = \frac{14}{20} = 0.7 \)

Alternatively: Of the 20 who passed, 14 are girls, so \( P(G|P) = \frac{14}{20} = 0.7 \)

Independent Events

Multiplication Rule (Independent Events)
\[ P(A \cap B) = P(A) \times P(B) \]

Read as: "Probability of A and B"

Only valid for independent events

Test independence: Check if \( P(A|B) = P(A) \)

Multiplication Rule (General)
\[ P(A \cap B) = P(B) \times P(A|B) = P(A) \times P(B|A) \]

Works for any events, not just independent

Discrete Random Variables

Expected Value (Mean)

Expected Value of Discrete Random Variable
\[ E(X) = \mu = \sum x \cdot P(X = x) \]

Sum over all possible values x

Weighted average of outcomes by their probabilities

Also called mean or expectation

Variance of Discrete Random Variable
\[ \text{Var}(X) = \sum (x - \mu)^2 \cdot P(X = x) \]

Alternative formula:

\[ \text{Var}(X) = \sum x^2 \cdot P(X = x) - \mu^2 = E(X^2) - [E(X)]^2 \]
Example: Expected Value and Variance

A game costs $5 to play. You win $20 with probability 0.2, $10 with probability 0.3, and $0 with probability 0.5. Find expected net gain and variance.

Solution:

Net gains: $15, $5, -$5

E(X): \( 15(0.2) + 5(0.3) + (-5)(0.5) = 3 + 1.5 - 2.5 = 2 \)

Expected net gain: $2 per game

E(X²): \( 225(0.2) + 25(0.3) + 25(0.5) = 45 + 7.5 + 12.5 = 65 \)

Var(X): \( 65 - 2^2 = 65 - 4 = 61 \)

Binomial Distribution

Conditions for Binomial Distribution

Use binomial distribution when ALL four conditions are met:

  1. Fixed number of trials: n is predetermined
  2. Two outcomes per trial: Success or failure
  3. Constant probability: p stays same for all trials
  4. Independent trials: Outcome of one doesn't affect others
Binomial Distribution Notation and Parameters
\[ X \sim B(n, p) \]

where:

• n = number of trials

• p = probability of success on each trial

• X = number of successes in n trials

Binomial Probability Formula
\[ P(X = r) = \binom{n}{r} p^r (1-p)^{n-r} \]

where \( \binom{n}{r} = \frac{n!}{r!(n-r)!} \)

Use GDC: binompdf(n, p, r) for P(X = r)

Use GDC: binomcdf(n, p, r) for P(X ≤ r)

Binomial Mean and Variance
\[ E(X) = np \] \[ \text{Var}(X) = np(1-p) = npq \]

where \( q = 1 - p \)

Standard deviation: \( \sigma = \sqrt{np(1-p)} \)

Example: Binomial Distribution

A basketball player makes 70% of free throws. She takes 10 shots. Find:

(a) Probability of exactly 8 makes

(b) Expected number of makes

Solution:

\( X \sim B(10, 0.7) \)

(a) \( P(X = 8) = \binom{10}{8}(0.7)^8(0.3)^2 \approx 0.233 \)

Or use GDC: binompdf(10, 0.7, 8) ≈ 0.233

(b) \( E(X) = np = 10 \times 0.7 = 7 \) makes

Normal Distribution

Normal Distribution Notation
\[ X \sim N(\mu, \sigma^2) \]

where:

• μ = mean (center of distribution)

• σ² = variance

• σ = standard deviation (spread of distribution)

Standardization (Z-Score)
\[ z = \frac{x - \mu}{\sigma} \]

If \( X \sim N(\mu, \sigma^2) \), then \( Z \sim N(0, 1) \)

Z is the standard normal variable

z-score tells how many standard deviations x is from mean

Properties of Normal Distribution
  • Bell-shaped and symmetric about mean μ
  • Mean = Median = Mode (all at center)
  • Total area under curve = 1
  • Approximately 68% of data within μ ± σ
  • Approximately 95% of data within μ ± 2σ
  • Approximately 99.7% of data within μ ± 3σ
Example: Normal Distribution Problem

Heights of men are normally distributed with mean 175 cm and standard deviation 8 cm. Find:

(a) Probability a man is taller than 183 cm

(b) Height that 90% of men are shorter than

Solution:

\( X \sim N(175, 8^2) \)

(a) \( P(X > 183) = 1 - P(X \leq 183) \)

Use GDC: normalcdf(183, 1E99, 175, 8) ≈ 0.159

(b) Find x where P(X < x) = 0.90

Use GDC: invNorm(0.90, 175, 8) ≈ 185.3 cm

Interactive Statistics Calculator

Mean and Standard Deviation Calculator

Enter data values separated by commas

Study Strategies for Statistics and Probability Success

Mastering Descriptive Statistics

  1. Understand Context: Know when to use mean vs median, range vs IQR based on data characteristics
  2. Master GDC Functions: Know 1-Var Stats function thoroughly for quick calculations
  3. Interpret Results: Don't just calculate—explain what statistics tell you about the data
  4. Practice with Real Data: Use authentic datasets to build intuition

Mastering Probability

  1. Draw Diagrams: Use Venn diagrams for combined events, tree diagrams for sequences
  2. Identify Event Types: Determine if events are mutually exclusive, independent, or neither
  3. Use Complements: Calculate P(A') when it's easier than P(A) directly
  4. Check Your Work: Probabilities must be between 0 and 1; sum of all outcomes = 1

Mastering Distributions

  1. Identify Distribution Type: Check conditions for binomial; assume normal when stated or population is large
  2. Know GDC Commands: Practice binompdf, binomcdf, normalcdf, invNorm until automatic
  3. Understand Parameters: Know what n, p, μ, σ represent and how they affect distribution
  4. Sketch Distributions: Quick sketch helps visualize what you're calculating

Common Mistakes to Avoid

Common ErrorCorrect ApproachExample
Using P(A ∪ B) = P(A) + P(B) for non-mutually exclusive eventsSubtract intersection: P(A ∪ B) = P(A) + P(B) - P(A ∩ B)P(ace or heart) = P(ace) + P(heart) - P(ace of hearts)
Confusing P(A|B) with P(B|A)Use correct formula with correct conditional eventP(disease|positive test) ≠ P(positive test|disease)
Using binomial when p changes between trialsCheck all four binomial conditions before usingDrawing without replacement violates independence
Forgetting to standardize before using standard normalCalculate z = (x - μ)/σ firstCan't use z-table directly with non-standard normal
Confusing variance with standard deviationRemember σ = √(σ²)Var(X) = 25 means σ = 5, not 25

Applications in Real-World Contexts

Statistics in Science and Research

  • Experimental Design: Descriptive statistics summarize results, normal distribution models measurements
  • Quality Control: Use mean and standard deviation to set acceptable ranges
  • Clinical Trials: Binomial distribution for success/failure outcomes
  • Psychology: Normal distribution for IQ scores, reaction times

Probability in Decision Making

  • Risk Assessment: Probability quantifies likelihood of events
  • Insurance: Expected value for setting premiums
  • Game Theory: Probability informs optimal strategies
  • Weather Forecasting: Probability of precipitation

Business and Economics

  • Market Research: Use statistics to analyze consumer data
  • Financial Analysis: Normal distribution for stock returns
  • Operations Research: Binomial for defect rates
  • Forecasting: Use historical data and probability

Exam Preparation and Strategy

AA SL & HL Statistics Exam Checklist
  • ✓ Calculate mean, median, mode, quartiles, IQR quickly
  • ✓ Calculate variance and standard deviation using both formulas
  • ✓ Apply probability rules correctly (addition, multiplication)
  • ✓ Distinguish mutually exclusive from independent events
  • ✓ Calculate conditional probability P(A|B) accurately
  • ✓ Find expected value and variance of discrete random variables
  • ✓ Identify when to use binomial distribution
  • ✓ Calculate binomial probabilities using GDC
  • ✓ Standardize normal variables with z-scores
  • ✓ Use normalcdf and invNorm functions correctly
  • ✓ Draw and interpret Venn diagrams and tree diagrams
  • ✓ Work complete past papers under timed conditions

Additional RevisionTown Resources

Enhance your statistics and probability mastery with these comprehensive RevisionTown resources:

Technology and GDC Skills

Essential GDC Functions for Statistics & Probability
  • 1-Var Stats: Instant calculation of mean, median, quartiles, standard deviation
  • binompdf(n, p, r): P(X = r) for binomial distribution
  • binomcdf(n, p, r): P(X ≤ r) for binomial cumulative
  • normalcdf(lower, upper, μ, σ): Area under normal curve
  • invNorm(area, μ, σ): Find x-value for given probability
  • List Operations: Store data in lists for efficient calculation

Connecting to Other AA Topics

Statistics and probability connect with other AA curriculum areas:

  • Functions: Probability density functions, cumulative distribution functions
  • Algebra: Manipulating probability formulas, solving for unknown parameters
  • Calculus (HL): Continuous probability distributions involve integration
  • Sequences (HL): Binomial expansion relates to binomial distribution

Conclusion

Mastering statistics and probability is essential for success in IB Mathematics AA (both SL and HL) and provides powerful tools for understanding data, quantifying uncertainty, and making informed decisions in an increasingly data-driven world. The foundational statistical techniques covered in this guide—descriptive statistics, basic probability rules, discrete random variables, binomial distribution, and normal distribution—form the essential core that all AA students must master regardless of level.

Success in statistics and probability requires more than memorizing formulas—it demands conceptual understanding of when to apply each technique, ability to interpret results in context, skill in using technology efficiently, and recognition of how probability models real-world phenomena. Whether you're taking AA SL or continuing to advanced HL topics, these fundamental concepts provide the foundation for all further statistical learning.

Regular practice with past papers, systematic review of probability rules and distribution properties, consistent application of GDC functions, and development of problem-solving strategies will build the statistical competence necessary for exam success. Master both calculation procedures and conceptual interpretation to achieve complete understanding.

Continue building your AA mathematics expertise through RevisionTown's comprehensive collection of IB Mathematics resources, practice with interactive calculators, and connect statistical concepts to applications in science, business, medicine, and social research. Master these statistics and probability formulas and techniques, and you'll be well-prepared for IB examinations and the quantitative challenges that await in university studies and professional life.

Shares: