Probability Cheatsheet v1.1.1

Counting

Multiplication Rule - Let’s say we have a compound experiment

(an experiment with multiple components). If the 1st component has

n1possible outcomes, the 2nd component has n2possible outcomes,

and the rth component has nrpossible outcomes, then overall there

are n1n2...nrpossibilities for the whole experiment.

Sampling Table - The sampling tables describes the different ways

to take a sample of size kout of a population of size n. The column

names denote whether order matters or not.

Matters Not Matter

With Replacement nkn+k−1

k

Without Replacement n!

(n−k)! n

k

Na¨ıve Definition of Probability -If the likelihood of each

outcome is equal, the probability of any event happening is:

P(Event) = number of favorable outcomes

number of outcomes

Probability and Thinking Conditionally

Independence

Independent Events -Aand Bare independent if knowing one

gives you no information about the other. Aand Bare independent if

and only if one of the following equivalent statements hold:

P(A∩B) = P(A)P(B)

P(A|B) = P(A)

Conditional Independence -Aand Bare conditionally

independent given Cif: P(A∩B|C) = P(A|C)P(B|C). Conditional

independence does not imply independence, and independence do es

not imply conditional independence.

Unions, Intersections, and Complements

De Morgan’s Laws - Gives a useful relation that can make

calculating probabilities of unions easier by relating them to

intersections, and vice versa. De Morgan’s Law says that the

complement is distributive as long as you flip the sign in the middle.

(A∪B)c≡Ac∩Bc

(A∩B)c≡Ac∪Bc

Joint, Marginal, and Conditional Probabilities

Joint Probability -P(A∩B) or P(A,B) - Probability of Aand B.

Marginal (Unconditional) Probability -P(A) - Probability of A

Conditional Probability -P(A|B) - Probability of Agiven B

occurred.

Conditional Probability is Probability -P(A|B) is a probability

as well, restricting the sample space to Binstead of Ω. Any theorem

that holds for probability also holds for conditional probability.

Simpson’s Paradox

P(A|B, C)< P (A|Bc, C) and P(A|B, C c)< P(A|Bc, Cc)

yet still, P(A|B)> P (A|Bc)

Bayes’ Rule and Law of Total Probability

Law of Total Probability with partitioning set B1,B2,B3, ...Bnand

with extra conditioning (just add C!)

P(A) = P(A|B1)P(B1) + P(A|B2)P(B2) + ...P (A|Bn)P(Bn)

P(A) = P(A∩B1) + P(A∩B2) + ...P (A∩Bn)

P(A|C) = P(A|B1,C)P(B1|C) + ...P (A|Bn,C)P(Bn|C)

P(A|C) = P(A∩B1|C) + P(A∩B2|C) + ...P (A∩Bn|C)

Law of Total Probability with Band Bc(special case of a partitioning

set), and with extra conditioning (just add C!)

P(A) = P(A|B)P(B) + P(A|Bc)P(Bc)

P(A) = P(A∩B) + P(A∩Bc)

P(A|C) = P(A|B,C)P(B|C) + P(A|Bc,C)P(Bc|C)

P(A|C) = P(A∩B|C) + P(A∩Bc|C)

Bayes’ Rule, and with extra conditioning (just add C!)

P(A|B) = P(A∩B)

P(B)=P(B|A)P(A)

P(B)

P(A|B,C) = P(A∩B|C)

P(B|C)=P(B|A,C)P(A|C)

P(B|C)

Odds Form of Bayes’ Rule, and with extra conditioning (just add C!)

P(A|B)

P(Ac|B)=P(B|A)

P(B|Ac)

P(A)

P(Ac)

P(A|B,C)

P(Ac|B,C)=P(B|A,C)

P(B|Ac,C)

P(A|C)

P(Ac|C)

Random Variables and their Distributions

PMF, CDF, and Independence

Probability Mass Function (PMF) (Discrete Only) gives the

probability that a random variable takes on the value X.

PX(x) = P(X=x)

Cumulative Distribution Function (CDF) gives the probability

that a random variable takes on the value x or less

FX(x0) = P(X≤x0)

Independence - Intuitively, two random variables are independent if

knowing one gives you no information about the other. X and Y are

independent if for ALL values of x and y:

P(X=x, Y =y) = P(X=x)P(Y=y)

Expected Value and Indicators

Distributions

Probability Mass Function (PMF) (Discrete Only) is a function

that takes in the value x, and gives the probability that a random

variable takes on the value x. The PMF is a positive-valued function,

and PxP(X=x)=1

PX(x) = P(X=x)

Cumulative Distribution Function (CDF) is a function that

takes in the value x, and gives the probability that a random variable

takes on the value at most x.

F(x) = P(X≤x)

Expected Value, Linearity, and Symmetry

Expected Value (aka mean,expectation, or average) can be thought

of as the “weighted average” of the possible outcomes of our random

variable. Mathematically, if x1, x2, x3,... are all of the possible values

that Xcan take, the expected value of Xcan be calculated as follows:

E(X) = P

xiP(X=xi)

Note that for any Xand Y,aand bscaling coefficients and cis our

constant, the following property of Linearity of Expectation holds:

E(aX +bY +c) = aE(X) + bE(Y) + c

If two Random Variables have the same distribution, even when they

are dependent by the property of Symmetry their expected values

are equal.

Conditional Expected Value is calculated like expectation, only

conditioned on any event A.

E(X|A) = P

xP (X=x|A)

Indicator Random Variables

Indicator Random Variables is random variable that takes on

either 1 or 0. The indicator is always an indicator of some event. If the

event occurs, the indicator is 1, otherwise it is 0. They are useful for

many problems that involve counting and expected value.

Distribution IA∼Bern(p) where p=P(A)

Fundamental Bridge The exp ectation of an indicator for Ais the

probability of the event. E(IA) = P(A). Notation:

IA=(1 A occurs

0 A does not occur

Variance

Var(X) = E(X2)−[E(X)]2

Expectation and Independence

If Xand Yare independent, then

E(XY ) = E(X)E(Y)

Continuous RVs, LotUS, and UoU

Continuous Random Variables

What’s the prob that a CRV is in an interval? Use the CDF (or

the PDF, see below). To find the probability that a CRV takes on a

value in the interval [a, b], subtract the respective CDFs.

P(a≤X≤b) = P(X≤b)−P(X≤a) = F(b)−F(a)

Note that for an r.v. with a normal distribution,

P(a≤X≤b) = P(X≤b)−P(X≤a)

= Φ b−µ

σ2−Φa−µ

σ2

What is the Cumulative Density Function (CDF)? It is the

following function of x.

F(x) = P(X≤x)

What is the Probability Density Function (PDF)? The PDF,

f(x), is the derivative of the CDF.

F0(x) = f(x)

Or alternatively,

F(x) = Zx

−∞

f(t)dt

Note that by the fundamental theorem of calculus,

F(b)−F(a) = Zb

f(x)dx

Thus to find the probability that a CRV takes on a value in an

interval, you can integrate the PDF, thus finding the area under the

density curve.

Complete Probability Cheatsheet, Cheat Sheet of Probability and Statistics

Related documents

Partial preview of the text

Download Complete Probability Cheatsheet and more Cheat Sheet Probability and Statistics in PDF only on Docsity!

Probability Cheatsheet v1.1.

Counting

Probability and Thinking Conditionally

Independence

Unions, Intersections, and Complements

Joint, Marginal, and Conditional Probabilities

Simpson’s Paradox

P (A | B, C) < P (A | B

Bayes’ Rule and Law of Total Probability

P (A|B) =

P (A ∩ B)

P (B)

P (B|A)P (A)

P (B)

P (A|B, C) =

P (A ∩ B|C)

P (B|C)

P (B|A, C)P (A|C)

P (B|C)

P (B|A)

P (A)

P (B|A, C)

P (A|C)

Random Variables and their Distributions

PMF, CDF, and Independence

Expected Value and Indicators

Distributions

Expected Value, Linearity, and Symmetry

Indicator Random Variables

IA =

Variance

Expectation and Independence

Continuous RVs, LotUS, and UoU

Continuous Random Variables

Law of the Unconscious Statistician (LotUS)

E(X) =

Universality of Uniform

Moments

Moment Generating Functions

∑^ ∞

∑^ ∞

M

M

Joint Distributions

Conditional Distributions

Marginal Distributions

Independence of Random Variables

Multivariate LotUS

Covariance and Correlation

Transition Matrix

Chain Properties

Stationary Distribution

Random Walk on Undirected Network

Uniform

Normal

∼ N (0, 1)

Exponential Distribution

Gamma Distribution

Beta Distribution

Distribution

Bernoulli

Binomial

Geometric

First Success

Negative Binomial

Hypergeometric

Poisson

Multinomial

Multivariate Uniform

Multivariate Normal (MVN)

Important CDFs

Poisson Properties (Chicken and Egg Results)

Convolutions of Random Variables

Special Cases of Random Variables

Reasoning by Representation

Geometric Series