Probability Distributions: Discrete Random Variables and Binomial Distribution, Lecture notes of Statistics

John F. Kennedy University (JFKU), Statistics

An introduction to probability distributions, focusing on discrete random variables. It explains the concept of a random variable, the difference between discrete and continuous random variables, and the concept of a probability distribution. The document also covers the binomial distribution, including how to calculate the mean and standard deviation using a TI-83/84 calculator. Examples are given throughout the document, including the probability distribution of the number of prior DWI sentences for jail inmates and the probability that Air America books too many passengers.

What you will learn

What is a random variable?
How do you calculate the mean and standard deviation of a binomial distribution using a TI-83/84 calculator?
What is the difference between discrete and continuous random variables?

Typology: Lecture notes

2021/2022

Uploaded on 09/12/2022

shekhar_hin 🇺🇸


Section 5.1 – Probability Distributions

A random variable is a variable (typically represented by x) that has a numeric value, determined by chance, for each possible outcome of an experiment.

Examples:

The number of students passing a certain class

The average height of the students in a class

The number of girls in a family of 5 children

The sum on the faces of two rolled dice

The number of defective parts in a sample of 20

The average daily temperature

A word about randomness

The word randomness suggests unpredictability.

Randomness and uncertainty are vague concepts that deal with variation.

A simple example of randomness involves a coin toss. The outcome of the toss is uncertain. Since the coin tossing experiment is unpredictable, the outcome is said to exhibit randomness.

Even though individual flips of a coin are unpredictable, if we flip the coin a large number of times, a pattern will emerge. Roughly half of the flips will be heads and half will be tails.

This long-run regularity of a random event is described with probability. Our discussions of randomness will be limited to phenomena that in the short run are not exactly predictable but do exhibit long-run regularity.

A discrete random variable has either a finite or a countable number of values. This chapter deals with discrete random variables.

A continuous random variable has infinitely many values, and those values can be associated with measurements on a continuous scale in such a way that there are no gaps or interruptions.

A probability distribution is a graph, table, or formula that gives the probability for each possible value of the random variable.

(Note: this is similar to relative frequency tables and histograms.)

A probability histogram is a way to graph a probability distribution.

The vertical scale shows probabilities instead of relative frequencies.

Note that the area of each rectangle equals the probability of the corresponding value.
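As a small illustration that is not part of the original notes, the probability distribution for one of the examples listed earlier (the sum on the faces of two rolled dice) can be tabulated in Python. The sketch assumes both dice are fair, so that all 36 outcomes are equally likely.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

# Count how many of the 36 equally likely outcomes give each sum
# (assumption: both dice are fair).
counts = Counter(a + b for a, b in product(range(1, 7), repeat=2))

# Probability distribution as a table: each value x paired with P(X = x).
distribution = {x: Fraction(c, 36) for x, c in sorted(counts.items())}

for x, p in distribution.items():
    print(f"x = {x:2d}   P(x) = {p}")
```

Each printed row corresponds to one bar of the probability histogram for this random variable.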


M116 – NOTES – CH 5


Requirements for a Probability Distribution

o $0 \le P(X = x) \le 1$ for each value x

o The sum of the probabilities of a discrete random variable is 1: $\sum P(X = x) = 1$

To evaluate the mean and standard deviation of a probability distribution using the calculator:

o Enter x into L1

o Enter the probabilities into L2

o Press STAT

o Arrow right to CALC

o Select 1: 1-Var Stats L1,L2

o Press ENTER
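For readers without a calculator, the same quantities can be computed directly from the table of values and probabilities. The following is a minimal sketch, assuming the usual definitions of the mean (sum of x·P(x)) and variance (sum of (x − mean)²·P(x)) of a discrete distribution. The function name and the sample data are illustrative and do not come from the notes.

```python
from math import sqrt

def mean_and_sd(xs, ps):
    """Return (mean, standard deviation) of a discrete distribution.

    xs plays the role of list L1 (values), ps the role of list L2 (probabilities).
    """
    # Check the two requirements for a probability distribution.
    if abs(sum(ps) - 1) > 1e-9 or any(p < 0 or p > 1 for p in ps):
        raise ValueError("not a valid probability distribution")
    mu = sum(x * p for x, p in zip(xs, ps))
    var = sum((x - mu) ** 2 * p for x, p in zip(xs, ps))
    return mu, sqrt(var)

# Illustrative data (assumed): number of heads in two tosses of a fair coin.
mu, sigma = mean_and_sd([0, 1, 2], [0.25, 0.5, 0.25])
print(mu, sigma)
```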
Identifying Unusual Results with the Range Rule of Thumb (section 6.2)

The range rule of thumb is based on the principle that for many data sets (symmetrical, bell shaped), the vast majority (such as 95%) of sample values lie within two standard deviations of the mean. Less common values are more than two standard deviations from the mean.

Minimum “usual” value ~ mean – 2 * standard deviation = $\mu - 2\sigma$

Maximum “usual” value ~ mean + 2 * standard deviation = $\mu + 2\sigma$

Identifying Unusual Results with Probabilities

o Unusually high: x successes among n trials is unusually high if P(x or more) is very small (such as less than 0.05)

o Unusually low: x successes among n trials is unusually low if P(x or fewer) is very small (such as less than 0.05)
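The range rule of thumb can be sketched as two small helper functions. The function names and the numbers in the example (mean 100, standard deviation 15) are hypothetical and chosen only to give round results.

```python
def usual_range(mean, sd, k=2.0):
    """Return (minimum usual value, maximum usual value) = (mean - k*sd, mean + k*sd)."""
    return mean - k * sd, mean + k * sd

def is_unusual(x, mean, sd, k=2.0):
    """True if x lies more than k standard deviations from the mean."""
    low, high = usual_range(mean, sd, k)
    return x < low or x > high

# Hypothetical numbers, for illustration only.
low, high = usual_range(100, 15)
print(low, high)                  # 70.0 130.0
print(is_unusual(135, 100, 15))   # True
print(is_unusual(110, 100, 15))   # False
```

The multiplier k is left as a parameter so that a different cutoff can be used without changing the functions.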

Section 5.2 & 5.3 – Binomial Experiments

Features of a binomial experiment (5.2)

1) The experiment has a fixed number of trials (n)

2) The trials must be independent

3) Each trial has 2 possible outcomes: success (S) and failure (F)

4) Probabilities remain constant for each trial. p is the probability of success, and q is the probability of failure

When sampling without replacement, the events can be treated as if they were independent if the sample size is no more than 5% of the population size. (That is, $n \le 0.05N$.)

Find binomial probabilities with a shortcut feature of the calculator

To find individual probabilities, use binompdf(n,p,x):

o Press 2nd VARS

o Select 0:binompdf(

o Type n,p,x)

o Press ENTER

To calculate cumulative probabilities from 0 to x, use binomcdf(n,p,x)
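The two calculator functions can be mirrored in Python using the standard binomial formula P(X = x) = C(n, x) · p^x · q^(n − x) with q = 1 − p. The sketch below reuses the calculator's names for readability. These are plain Python functions written for this illustration, and the example (girls in a family of 5 children with p = 0.5) is an assumed value.

```python
from math import comb

def binompdf(n, p, x):
    """P(X = x): probability of exactly x successes in n trials."""
    return comb(n, x) * p**x * (1 - p)**(n - x)

def binomcdf(n, p, x):
    """P(X <= x): cumulative probability of 0 through x successes."""
    return sum(binompdf(n, p, k) for k in range(x + 1))

# Assumed example: number of girls in a family of 5 children, with p = 0.5.
print(binompdf(5, 0.5, 3))   # 0.3125
print(binomcdf(5, 0.5, 3))   # 0.8125
```

A probability of "x or more" follows from the complement: 1 - binomcdf(n, p, x - 1).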
Mean, Variance, and Standard Deviation for the Binomial Distribution (5.3)

If we have the probability distribution in the editor of the calculator, we can use the calculator by doing STAT – CALC, 1-Var Stats L1, L2. Otherwise we can use these formulas for binomial distributions:

$\mu = np$, $\sigma = \sqrt{npq}$

Remember that the variance is the square of the standard deviation:

Variance $= \sigma^2 = (\sqrt{npq})^2 = npq$

Unusual values (5.3)

For a binomial distribution, it is unusual for the number of successes to be more than $2.5\sigma$ from $\mu$.

Minimum “usual” value ~ $\mu - 2.5\sigma$

Maximum “usual” value ~ $\mu + 2.5\sigma$
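The binomial formulas and the 2.5σ bounds above can be combined in a short sketch. The function name and the example values (n = 100, p = 0.5, as in 100 tosses of a fair coin) are assumptions made for illustration.

```python
from math import sqrt

def binomial_summary(n, p, k=2.5):
    """Return (mu, sigma, minimum usual value, maximum usual value) for a binomial distribution."""
    q = 1 - p                   # probability of failure
    mu = n * p                  # mean
    sigma = sqrt(n * p * q)     # standard deviation
    return mu, sigma, mu - k * sigma, mu + k * sigma

# Assumed example: 100 tosses of a fair coin.
mu, sigma, low, high = binomial_summary(100, 0.5)
print(mu, sigma, low, high)     # 50.0 5.0 37.5 62.5
```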

M116 – TI 83/84 CALCULATOR – CH 5

Binomial Distributions and Simulations (Chapter 5)

Example 2) – Booking tickets: Air America has a policy of booking as many as 15 persons on an airplane that can seat only 14. Past studies have revealed that only 85% of the booked passengers actually arrive for the flight. Find the probability that if Air America books 15 persons, not enough seats will be available.

a) Describe the random variable and success attribute. Give the possible values of the random variable. Give the number of trials and the probability of success.

b) Use the calculator to find the probability that if Air America books 15 persons, not enough seats will be available.

c) Is it unusual to find that there are not enough seats available? Should overbooking be a concern for passengers?

d) SIMULATION: Now we are going to simulate this situation by repeating the experiment 20 times. Use MATH PRB 7:randBin(n,p) and press ENTER 20 times. Record the results in a table, and then use your table to answer the question posed in the problem.

e) Use class results and answer the question again.

f) OPTIONAL (OYO): Here we have another simulation technique. Use the calculator to generate 50 numbers that come from a binomial distribution with n = 15 and p = 0.85. (We’ll clear List 1, generate the numbers and store them in List 1, sort the list, and then explore the editor.)

STAT 4:ClrList L1 : MATH PRB 7:randBin(n,p,50) STO L1 : STAT 3:SortA(L1)

Go to the editor, explore the list and count how many times we had 15 passengers showing up. Then determine the probability, and compare with the theoretical results from part (a). Comment on the law of large numbers.
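The booking example can also be explored off the calculator. The sketch below is one possible Python analogue of the randBin simulation, not the procedure from the notes: it computes the exact probability that all 15 booked passengers arrive and then estimates the same probability from simulated flights. The constant names, the seed, and the choice of 10,000 flights for the larger run are assumptions.

```python
import random

N_BOOKED = 15    # number of trials n (passengers booked)
P_SHOW = 0.85    # probability of success p (a booked passenger arrives)
SEATS = 14       # seats on the airplane

# Seats run out only if all 15 passengers arrive, so P(X = 15) = p**15.
p_exact = P_SHOW ** N_BOOKED

def simulate(trials, seed=0):
    """Estimate P(X > SEATS) from `trials` simulated flights (the seed is arbitrary)."""
    rng = random.Random(seed)
    overbooked = 0
    for _ in range(trials):
        # One flight: each booked passenger arrives independently with probability p.
        arrivals = sum(rng.random() < P_SHOW for _ in range(N_BOOKED))
        if arrivals > SEATS:
            overbooked += 1
    return overbooked / trials

p_small = simulate(20)
p_large = simulate(10_000)
print(f"exact:          {p_exact:.4f}")
print(f"20 flights:     {p_small:.4f}")
print(f"10,000 flights: {p_large:.4f}")
```

The exact value is about 0.0874. Comparing the 20-flight estimate with the 10,000-flight estimate gives a concrete view of the law of large numbers mentioned in part (f).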

The rate of Lyme disease cases in Clinton County is 2%. In groups of 1000, what is the usual range of the distribution of x, the number of people in the county who have Lyme disease out of 1000?

Here is the rest of the story: A new vaccine has been developed to avoid getting Lyme disease. We would like to know whether the vaccine is effective.

There are two conflicting hypotheses:

o The vaccine is not effective

o Claim: The vaccine is effective

Case 1: When 1000 people from that county are given the new vaccine, it is found that 19 of them contract Lyme disease.

o We support the claim that the vaccine is effective

o We don’t have enough evidence to support the claim that the vaccine is effective

Case 2: When 1000 people from that county are given the new vaccine, it is found that 7 of them contract Lyme disease.

o We support the claim that the vaccine is effective

o We don’t have enough evidence to support the claim that the vaccine is effective

Rare Event Rule for Inferential Statistics

If, under a given assumption, the probability of a particular observed event is exceptionally small (less than 0.05), we conclude that the assumption is probably not correct.
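One way to work through the Lyme disease cases numerically is sketched below, assuming the vaccine has no effect so that x is binomial with n = 1000 and p = 0.02. It computes the usual range with the 2.5σ bounds from section 5.3 and the probability of seeing each case's count or fewer. The variable names are illustrative.

```python
from math import comb, sqrt

n, p = 1000, 0.02            # assumption under test: the vaccine changes nothing
q = 1 - p
mu = n * p                   # expected number of cases
sigma = sqrt(n * p * q)
low, high = mu - 2.5 * sigma, mu + 2.5 * sigma

def binomcdf(n, p, x):
    """P(X <= x) for a binomial distribution."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(x + 1))

p_case1 = binomcdf(n, p, 19)   # P(19 or fewer cases)
p_case2 = binomcdf(n, p, 7)    # P(7 or fewer cases)

print(f"mean = {mu:.1f}, sd = {sigma:.2f}, usual range = {low:.2f} to {high:.2f}")
print(f"P(x <= 19) = {p_case1:.4f}")
print(f"P(x <= 7)  = {p_case2:.6f}")
```

Each probability can then be compared with the 0.05 cutoff of the rare event rule to choose between the two conclusions in each case.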

Section 5.2 and 5.3 – How does this material help us in inferential statistics?

3) There are two conflicting hypotheses:

o The coin is fair

o Claim: The coin is not fair

Case 1: Heads turns up 17 times in 30 tosses.

o We support the claim that the coin is NOT fair

o We don’t have enough evidence to support the claim that the coin is NOT fair

Case 2: Heads turns up 27 times in 30 tosses.

o We support the claim that the coin is NOT fair

o We don’t have enough evidence to support the claim that the coin is NOT fair
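The coin cases can be checked with the "unusually high" criterion, P(x or more), computed under the assumption that the coin is fair (p = 0.5). This is a minimal sketch, and the helper name is hypothetical.

```python
from math import comb

def prob_at_least(n, p, x):
    """P(X >= x) for a binomial distribution with n trials and success probability p."""
    return sum(comb(n, k) * p**k * (1 - p)**(n - k) for k in range(x, n + 1))

# Assumption under test: the coin is fair, so p = 0.5 on each of the 30 tosses.
p17 = prob_at_least(30, 0.5, 17)
p27 = prob_at_least(30, 0.5, 27)

print(f"P(17 or more heads) = {p17:.4f}")
print(f"P(27 or more heads) = {p27:.8f}")
print("Case 1 below 0.05:", p17 < 0.05)
print("Case 2 below 0.05:", p27 < 0.05)
```

Under the rare event rule, only a case whose probability falls below 0.05 gives grounds to reject the assumption of a fair coin.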
