Unit 5: Theory of Probability

Table of Contents

1. Basic Concepts

Random Experiment

An experiment or process for which the outcome cannot be predicted with certainty, but all possible outcomes are known.

Sample Point

A single possible outcome of a random experiment.

Sample Space (S)

The set of all possible outcomes (sample points) of a random experiment.

Event (E)

A subset of the sample space. It is a collection of one or more sample points.


2. Algebra of Events

Events can be combined using set operations.


3. Approaches to Probability

How we define and calculate probability.

1. Classical (or 'a priori') Definition

This definition assumes all outcomes are equally likely.

If a random experiment has 'n' mutually exclusive, exhaustive, and equally likely outcomes, and 'm' of these outcomes are favorable to an event A:
P(A) = m / n = (Number of favorable outcomes) / (Total number of possible outcomes)

2. Statistical (or Empirical / 'a posteriori') Definition

This definition is based on relative frequency from actual experiments.

If an experiment is repeated 'n' times and event A occurs 'f' times, the probability of A is the limit of this relative frequency as 'n' becomes infinitely large.
P(A) = lim (n→∞) [ f / n ]

3. Axiomatic Definition

This is the modern, mathematical definition. It states the *rules* (axioms) that probability must follow, without saying how to calculate it.

Axiom 1 (Non-negativity): For any event A, P(A) ≥ 0.

Axiom 2 (Certainty): The probability of the entire sample space is 1. P(S) = 1.

Axiom 3 (Additivity): If A and B are mutually exclusive events, then P(A ∪ B) = P(A) + P(B).

4. Theorems of Probability (Addition & Multiplication)

1. Addition Law of Probability

Used to find the probability of (A or B).

General Rule (for *any* two events):

P(A ∪ B) = P(A) + P(B) - P(A ∩ B)

(We subtract P(A ∩ B) because it was counted twice).

Special Rule (for *Mutually Exclusive* events):

Since P(A ∩ B) = 0, the formula simplifies to:

P(A ∪ B) = P(A) + P(B)

2. Multiplication Law of Probability

Used to find the probability of (A and B). It is derived from the definition of conditional probability.

P(A ∩ B) = P(A) * P(B | A)
(Probability of A, times the probability of B *given that* A has already happened)

5. Conditional Probability

Conditional Probability, P(A | B): The probability of event A occurring, given that event B has already occurred.

It "updates" the probability of A based on new information. We are restricting our sample space from S down to just B.

Formula:

P(A | B) = P(A ∩ B) / P(B) (provided P(B) > 0)

Example:

Roll a die. S = {1, 2, 3, 4, 5, 6}.
Event A = "Get a 4". P(A) = 1/6.
Event B = "Get an even number" = {2, 4, 6}. P(B) = 3/6.
Event (A ∩ B) = "Get a 4 AND an even number" = {4}. P(A ∩ B) = 1/6.

P(A | B) = P(A ∩ B) / P(B) = (1/6) / (3/6) = 1/3.


6. Independent Events

Independent Events: Two events A and B are independent if the occurrence of one event does not affect the probability of the other event occurring.

Formal Definition:

A and B are independent if and only if:

P(A ∩ B) = P(A) * P(B)

This also means:

Common Mistake: Do NOT confuse "Mutually Exclusive" with "Independent".
- Mutually Exclusive: If A happens, B *cannot* happen. P(A ∩ B) = 0. They are strongly *dependent*.
- Independent: If A happens, it tells you *nothing* about B. P(A ∩ B) = P(A)P(B).

7. Bayes' Theorem and Its Applications

Bayes' Theorem allows us to "reverse" the conditional probability. If we know P(B | A), Bayes' Theorem helps us find P(A | B).

It is used to update our belief about a "cause" (A) given a new "effect" (B).

The Theorem

For two events A and B:

P(A | B) = [ P(B | A) * P(A) ] / P(B)

More generally, if the sample space is partitioned by events A1, A2, ..., Ak:

Bayes' Theorem (Full Form):
P(Ai | B) = [ P(B | Ai)P(Ai) ] / [ Σ P(B | Aj)P(Aj) ]

The denominator is just P(B), calculated using the Law of Total Probability.

Application (Example):

A factory has two machines, A1 and A2.
- A1 produces 60% of items (P(A1) = 0.6).
- A2 produces 40% of items (P(A2) = 0.4).
- 2% of items from A1 are defective (P(D | A1) = 0.02).
- 1% of items from A2 are defective (P(D | A2) = 0.01).

Question: An item is selected at random and found to be defective (D). What is the probability it came from machine A1?

We want to find P(A1 | D).

Step 1: Numerator: P(D | A1) * P(A1)
(0.02) * (0.6) = 0.012

Step 2: Denominator (Total Probability of D):
P(D) = P(D | A1)P(A1) + P(D | A2)P(A2)
P(D) = (0.02)(0.6) + (0.01)(0.4)
P(D) = 0.012 + 0.004 = 0.016

Step 3: Divide.
P(A1 | D) = 0.012 / 0.016 = 12 / 16 = 0.75

Result: There is a 75% chance the defective item came from machine A1.