1. Chapter 5
SPECIAL DISTRIBUTIONS
• Discrete distribution; Binomial, Poisson; point
probability and interval; Binomial and Poisson table;
applications.
• Continuous distribution; Normal distribution; interval
probability; table and application, emphasize on
P(z1<Z<z2), P(Z<z) and P(Z>z).
• Introduction to t-distribution.
1
2. 2
Learning Outcomes
• describe characteristics of Binomial distribution
• solve problems related to Binomial distribution
3. 3
Binomial Distribution
• The binomial distribution is the discrete probability distribution of the
number of successes in a sequence of n independent yes/no experiments,
each of which yields success with probability p.
• Characteristics of the Binomial Distribution:
1. The experiment consists of n identical trials.
2. Each trial has only one of the two possible mutually exclusive
outcomes, success or a failure.
3. The trials are independent, thus we must sample with replacement.
4. The probability of success is denoted by p and that of failure by q, and
p + q=1. The probabilities p and q remain constant for each trial.
• Examples:
The probability of a defective laptop manufactured at a firm is 0.05
in a random sample of ten.
p = 0.05, n = 10
The outcome to which the question refers
is called a success
4. 4
Binomial Distribution
• For a binomial experiment, the probability of exactly x
successes in n trials is given by the binomial formula:
=
• where
• n = the total number of trials
• p = probability of success
• q = 1-p = probability of failure
• x = number of successes in n trials
• n-x = number of failures in n trials
x
x x
5. 5
Example 1
• Compute the probabilities of X successes, using the
binomial formula.
• a. n= 6, x= 3, p=0.03
• b. n= 4, x= 2, p=0.18
• Solution:
= 6.5.4.3.2.1 (0.03)3(0.97)3
3.2.1 (3.2.1)
6. 6
Example 2
• A survey found that one out of five Malaysian says he or
she has visited a doctor in any given month. If 10 people
are selected at random, find the probability that exactly 3
will have visited a doctor last month.
• Solution:
• In this case, n = 10, x = 3, p = 1/5 and q = 4/5.
7. 7
Ex.1 Q.1
• 1. A burglar alarm system has 6 fail-safe components.
The probability of each failing is 0.05. Find these
probabilities: (p = 0.05, q= 1-0.05 =0.95, n = 6)
a) exactly 3 will fail
b) less than 2 will fail
c) none will fail
8. 8
Ex.1 Q.2
2. A survey from Teenage Research Unlimited found that
30% of teenage consumers receive their spending
money from part-time jobs. If 5 teenagers are selected at
random, find the probability that at least 3 of them will
have part-time jobs.
• Solution:
• p = 0.30, q = 1 – 0.30 = 0.70, n = 5
9. 9
Ex.1 Q.3
3. R. H Bruskin Associates Market Research found that
40% of Americans do not think that having a college
education is important to succeed in the business world.
If a random sample of five American is selected, find
these probabilities.
a) Exactly two people will agree with that statement.
p = 0.40 (disagree), q = 1 – 0.40 = 0.60 (agree), n = 5.
Parts a) to d) ask about those who will agree to the statement,
therefore p = 0.60 now.
P(X=2) = 5C2(0.6)2(0.4)5-2
= 0.2304
10. 10
Ex.1 Q.3
b) At most three people will agree with that statement
c) At least two people will agree with that statement
P(X ≤ 3) = P(X=0) + P(X=1) + P(X=2) + P(X=3)
= 1- {P(X = 4) + P(X = 5)}
= 1 – {5C4 (0.6)4(0.4)1 + 5C5 (0.6)5(0.4)0}
= 1 – {0.2592 +0.07776}
= 0.6630
P( X ≥ 2 ) = P(X=2) + P(X=3) + P(X=4) + P(X=5)
= 1 –{P(X=0) + P(X=1)}
= 1- { 5C0 (0.6)0(0.4)5 + 5C1 (0.6)1(0.4)4 }
= 1- {0.01024 + 0.0768 }
= 0.9130
11. 11
Ex.1 Q.3
d) Fewer than three people will agree with that statement.
P(X < 3) = P(X=0) + P(X=1) + P(X=2)
= 5C0 (0.6)0(0.4)5 + 5C1 (0.6)1(0.4)4 + 5C2 (0.6)2(0.4)3
= 0.01024 + 0.0768 + 0.2304
= 0.3174
12. 12
Ex.1 Q.4
4. It was found that 60% of American victims of health care
fraud are senior citizens. If 10 victims are randomly
selected, find the probability that exactly 3 are senior
citizens. p = 0.6, q = 0.4, n = 10
P(X=3) = 10C3.(0.6)3.(0.4)7
= 0.0425
13. 13
Binomial Table
• Refer to binomial table in your notes.
• For any number of trials n:
• The binomial probability distribution is symmetric if p =
0.5
• The binomial probability distribution is skewed to the
right if p is less than 0.5
• The binomial probability distribution is skewed to the left
if p is greater than 0.5
14. 14
Binomial Table Tips
Equally P(X = x), exactly that value, easy to use Poisson formula
At most
All lower values and up to that x value, i.e. P(X ≤ x) , so take
directly from the table
Less than P(X < x) , that value x (and larger) is NOT included
At least P(X ≥ x), that value of x onwards
Greater than P(X > x), that value x (and smaller) is NOT included
From x1 to x2 P(x1 ≤ X ≤ x2), include both x1 and x2
Between x1 and x2 P(x1<X<x2), do not include both x1 and x2
Between x1 to x2 P(x1< X ≤ x2), exclude x1 but include x2
15. 15
Example 3
• Compute the probability of X successes using the
Binomial Table.
• n=2, p=0.30, x=1
• n=4, p=0.45, x=3
• Solution:
• a. P(x=1) = P(x ≤ 1) - P(x ≤ 0)
• = 0.9100 – 0.4900
• = 0.4200
b. P(x =3) = P(x ≤ 3) - P(x ≤ 2)
= 0.9590 – 0.7585
= 0.2005
16. 16
Example 4
1. 25% of all VCR manufactured by a large electronics company are
defective. A quality control inspector randomly selects three VCRs
from the production line. What is the probability that,
• a) exactly one of these three VCRs is defective.
• b) at least two of these VCRs is defective.
• c) three of these VCRs is defective.
• Solution:
• p = 0.25, q = 1 – 0.25 = 0.75, n = 3
a) P(X=1) = P(X ≤ 1) - P(X ≤ 0)
• = 0.8438 – 0.4219
• = 0.4219
b) P(X ≥ 2) = 1 – P(X ≤ 1)
• = 1 – 0.8438
• = 0.1562
• c) P(X = 3) = P(X 3) – P(X 2)
• = 1.000 – 0.9844
• = 0.0156
17. 17
Ex. 2 Q.1
1. If 65% of the people in a community use the gym
facilities in one year, find these probabilities for a
sample of 10 people.
a) Exactly four people use the gym facilities.
b) At least six people do not use the gym facilities.
18. 18
Ex. 2 Q.2
2. In a poll of 12 to 18 year old females conducted by
Harris Interactive for the Gillete Company, 40% of the
young female said that they expected the US to have a
female president within 10 years. Suppose that a
random sample of 15 females from this age group was
selected. Find the probabilities that of young female in
this sample who expect a female president within 10
years is
a) at least 9
b) at most 5
c) 6 to 9 (include 6, 9)
d) in between 4 and 8 (exclude 4, 8)
e) less than 4
20. 20
Ex. 2 Q.3
3. In a Gallup Survey, 40% of the people interviewed were
unaware that maintaining a healthy weight could reduce
the risk of stroke. If 15 people are selected at random,
find the probability that at least 9 are unaware that
maintaining a proper weight could reduce the risk of
stroke.
•
p = 0.40, q = 1 – 0.40 = 0.60, n = 15
P(X 9) = 1 – P(X 8)
= 1 – 0.9050
= 0.095
21. 21
Mean & Variance of
the Binomial Distribution
• Find the mean, variance and standard deviation for
exercise 1 earlier.
22. 22
Example 5
• Find the mean, variance and standard deviation for each
of the values of n and p when the conditions for the
binomial distribution are met.
a) n=100, p=0.75
b) n=300, p=0.3
• Solution:
• a) = np = 100(0.75) = 75
• 2 = npq = 100(0.75)(0.25) = 18.75
• = npq = 18.75 = 4.33
• b) = np = 300(0.3) = 90
2 = npq = 100(0.3)(0.7) = 63
= npq = 63 = 7.94
23. 23
Extra Q.1
1. It has been reported that 83% of federal government
employees use e-mail. If a sample of 200 federal
government employees is selected, find the mean,
variance and standard deviation of the number of
employees who use e-mail.
= np = 200(0.83) = 166
2 = npq = 200(0.83)(0.17) = 28.22
= npq = 28.22 = 5.3122
24. 24
Extra Q.2
2. A survey found that 25% of Malaysian watch movies at
the cinema. Find the mean, variance and standard
deviation of the number of individuals who watch movie
at the cinema if a random sample of 1000 Malaysian is
selected at the Bintang Walk.
= np = 1000(0.25) = 250
2 = npq = 1000(0.25)(0.75) = 187.5
= npq = 187.5 = 13.6931
25. 25
Poisson Distribution
• The Poisson distribution depends only on the average
number of occurrences per unit time or space, (lambda)
• The Poisson probability distribution is useful when n is
large and p is small.
Conditions for Poisson:
•x is a discrete random variable
•The occurrences are random
•The occurrences are independent
26. 26
Poisson Distribution
Examples of application:
• The number of accidents that occur on a highway given
during a one-week period. (time)
• The number of customers entering a grocery store
during a one-hour interval.
• The number of television sets sold at a department store
during a given week.
• The number of typing errors per page. (space)
• A certain type of fabric made contains an average 0.5
defects per 500 yards.
27. 27
Poisson Distribution
• The probability of x occurrences in an interval is:
• Where;
• : mean number of occurrences in that interval (time or ?)
• e : the constant, approximately 2.7183
28. 28
Example 7
• Find each probability P(X; ), using the Poisson formula
– P(5;4)
– P(2;4)
• Solution:
• a. P(X=5) = e-4(45)/5!
• = 0.1563
• b. P(X=2) = e-4(42)/2!
= 0.1465
29. 29
Example 8
• If there are 200 typographical errors randomly distributed
in a 500-page manuscript, find the probability that a
given page contains exactly three errors.
• Solution:
• Find the number of errors:
• = 200/500 = 0.4 (error per page)
•
• P(X=3) = e-0.4(0.4)3
3!
= 0.0072
30. 30
Ex.3 Q.1
1. On average a household receives 2 telemarketing
phone calls per week. Using the Poisson distribution
formula, find the probability that a randomly selected
household receives:
a) exactly six telemarketing phone calls during a given week.
b) less than three telemarketing phone calls in one month.
= 2 telemarketing phone calls per week
a) P(X = 6) = = 0.0120
= 8 telemarketing phone calls in one month
P(X < 3) = P(X 2) = P(X = 0) + P(X = 1) + P(X = 2)
=
= 0.0003 + 0.0027 + 0.0107
= 0.0137
8 0 8 1 8 2
8 8 8
0! 1! 2!
e e e
2 6
2
6!
e
31. 31
Using Poisson Table
• Refer to your Poisson less than table.
Equally P(X = x), exactly that value, easy to use Poisson formula
At most
All lower values and up to that x value, i.e. P(X ≤ x) , so take
directly from the table
Less than P(X < x) , that value x (and larger) is NOT included
At least P(X ≥ x), that value of x onwards
Greater than P(X > x), that value x (and smaller) is NOT included
From x1 to x2 P(x1 ≤ X ≤ x2), include both x1 and x2
Between x1 and x2 P(x1<X<x2), do not include both x1 and x2
Between x1 to x2 P(x1< X ≤ x2), exclude x1 but include x2
33. 33
Example 10
• A sales firm receives on average three calls per hour on
its toll-free number. For any given hour, find the
probability that it will receive the following:
a. at most 3 calls
b. at least 3 calls
c. five or more calls
d. between 1 to 4 calls in 2 hours
= 3 calls per hour
P(X 3) = 0.6472
P(X 3) = 1 – P(X 2)
= 1 – 0.4232
= 0.5768
P(X 5) = 1 – P(X 4)
= 1 – 0.8153
= 0.1847
= 6 calls in 2 hours
P(1< X 4) = P(X 4) – P(X 1)
= 0.2851 – 0.0174
= 0.2677
35. 35
Example 11
• An auto sales person sells an average of 0.9 cars per
day. Find the mean, variance and standard deviation of
cars sold per day by this sales person.
• Solution:
• = = 0.9 2 = = 0.9
• = = 0.9 = 0.9487
36. 36
Example 12
• An insurance salesperson sells an average of 1.4 policies per day.
• Find the probability that this salesperson will sell no insurance policy
on a certain day.
• Find the mean, variance and standard deviation of the probability
this salesperson will sell the policies per day.
• Solution:
•
= 1.4 policies per day
• a) P(X = 0) = P(X 0)
• = 0.2466
•
•
• b) = = 1.4
• 2 = = 1.4
• = = 1.4 = 1.1832
37. 37
Example 13
• Assuming that the number of accidents that occur in a
certain company in a week has a Poisson distribution
with a standard deviation of 1.4. Determine:
• a) the mean number of accidents that occur in a week
• b) the probability that in a week, there are no accidents.
• Solution:
• = 1.4 accidents
• a) = = 2 = (1.4)2
• = 1.96
• b) P ( X = 0) = e-1.96 (1.96)0/0!
• = 0.1408
39. 39
Example 11
• An auto sales person sells an average of 0.9 cars per
day. Find the mean, variance and standard deviation of
cars sold per day by this sales person.
• Solution:
• = = 0.9 2 = = 0.9
• = = 0.9 = 0.9487
40. 40
Example 12
• An insurance salesperson sells an average of 1.4 policies per day.
• Find the probability that this salesperson will sell no insurance policy
on a certain day.
• Find the mean, variance and standard deviation of the probability
this salesperson will sell the policies per day.
• Solution:
•
= 1.4 policies per day
• a) P(X = 0) = P(X 0)
• = 0.2466
•
•
• b) = = 1.4
• 2 = = 1.4
• = = 1.4 = 1.1832
41. 41
Example 13
• Assuming that the number of accidents that occur in a
certain company in a week has a Poisson distribution
with a standard deviation of 1.4. Determine:
• a) the mean number of accidents that occur in a week
• b) the probability that in a week, there are no accidents.
• Solution:
• = 1.4 accidents
• a) = = 2 = (1.4)2
• = 1.96
• b) P ( X = 0) = e-1.96 (1.96)0/0!
• = 0.1408
42. 42
Recap Previous Lessons
Distribution of
random
variables
Discrete
distribution
Continuous
distribution
Binomial Poisson Normal
fixed number of n trials,
each trial has 2 outcomes
(success or failure),
outcomes are independent,
same probability of success
for each trial
average number of occurrences
per unit time or space, ,
the occurrences are random
& independent.
43. 43
Continuous Probability
Distribution
• A continuous random variable can assume any value
over an interval or intervals.
• The number of values contained in any interval is infinite,
hence the possible number of values that a continuous
random variable can assume is also infinite.
• Examples of continuous random variables:
– life of battery
– height of people
– time taken to complete an examination
– amount of milk in a bottle
– weight of babies
– price of houses
44. 44
Continuous Probability
Distribution
The total probability of all the (mutually exclusive) intervals within which
X can assume a value is 1.0.
The probability that X assumes a value in any interval lies in the range 0 to 1.
45. 45
Normal Distribution
• The normal distribution is the most important and most
widely used of all probability distributions.
• A large number of phenomena in the real world are
normally distributed either exactly or approximately.
• The normal probability distribution or the normal curve
is a bell-shaped (symmetric) curve.
– Its mean is denoted by and its standard deviation by .
– Also known as bell curve or Gaussian distribution.
47. 47
The Standard
Normal Distribution
• Is a normal distribution with = 0 and = 1.
• The units for the standard normal distribution curve are
denoted by z and are called the z values or z scores.
• The z value for a point on the horizontal axis gives the
distance between the mean and that point in terms of the
standard deviation.
• The value under the curve indicates the proportion of
area in each section.
• The area under a standard normal distribution curve is
used to solve practical application problems such as:
– finding the % of adult woman whose height is between 5 feet 4
inches and 5 feet 7 inches.
48. 48
The Standard
Normal Distribution
• The standard normal distribution table lists the areas under the
standard normal curve to the left of z-values from –3.9 to 3.9.
• Although the values of z on the left side of the mean are negative,
the area under the curve is always positive.
-3.9 3.9
49. 49
Using the Table (p16)
Find the area under the standard normal curve to the left of z = 1.95
P(z < 1.95) = 0.9744 or
P(z < 1.95) = P(z 1.95) = 0.9744
The probability that a continuous random
variable assumes a single value is zero.
Therefore, P(z = 1.95) = 0.
50. 50
Example 13 (p17)
• Find the area under the standard normal curve:
a) to the left of z = 1.56
b) to the left of z = -2.87
c) to the right of z = 2.45
d) to the right of z = -1.32
e) from z = 0.85 to z = 1.95
f) between z = -2.15 and z=1.67
Note:
b, c and f are in Exercise 4
53. 53
Converting x to z value
• For a normal variable X, a particular value x can be
converted to its corresponding z value by using the
formula:
z = x -
• where is the mean and is the standard deviation of
the normal distribution of x.
Remember!
• The z value for the mean of a normal distribution is
always zero.
54. 54
Example 14 (p19)
• Let x be a continuous random variable that has a
normal distribution with a mean of 50 and a standard
deviation of 10. Convert the following x values to z
values.
a) 55
= 50 and =10, x = 55
z = x - = 55 – 50 = 0.50
10
b) 35
z = x - = 35 – 50 = -1.5
10
55. 55
Example 15 (p19)
Let x be a continuous random variable that is normally distributed with a mean of
65 and a standard deviation of 15.
Find the probability that x can assume a value:
a) less than 43 b) greater than 74
c) between 56 and 71
P(56 < X < 71) = P(56– 65 < z < 71 – 65)
15 15
= P(-0.6 < z < 0.4)
= P(z < 0.4) – P(z < -0.6)
= 0.6554 – 0.2743
= 0.3811
P(X < 43) = P(z < X - )
= P(z < 43 – 65)
15
= P(z < -1.47)
= 0.0708
P(X > 74) = P(z > 74 – 65)
15
= P(z > 0.6)
= 1 – P(z < 0.6)
= 1 – 0.7257
= 0.2743
56. 56
Exercise 5 (p20)
1. Let X denote the time takes to run a road race. Suppose X is
approximately normally distributed with mean of 190 minutes and
standard deviation of 21 minutes. If one runner is selected at
random, what is the probability that this runner will complete this
road race:
a) in less than 150 minutes?
b) in 205 to 245 minutes?
P(X < 150) = P(z < 150 - 190)
21
= P(z < -1.9)
= 0.0287
P(205 < X < 245) = P[(205-190 )<z < 245 - 190)]
21 21
= P(0.71 <z < 2.62)
= P(z < 2.62) - P(z < 0.71)
= 0.9956 – 0.7611
= 0.2345
57. 57
Exercise 5
2. The mean number of hours a student spends on the computer is 3.1
hours per day. Assume the standard deviation is 0.5 hour. Find the
percentage of students who spend less than 3.5 hours on the
computer. Assume the variable is normally distributed.
P(X < 3.5) = P(z < 3.5 – 3.1)
0.5
= P(z < 0.8)
= 0.7881
Hence, 78.81% spend less than 3.5 hours.
58. 58
Exercise 5
3. The score of 6000 candidates in a certain examination are found to
be approximately normal distributed with a mean of 55 and a
standard deviation of 10:
a) If a score of 75 or more is required for passing the distinction,
estimate the number of graduates with distinction.
n = 6000, μ = 55, σ = 10
a) P(X > 75) = P(z > 75 – 55)
10
= P(z > 2)
= 1 - P(z < 2)
= 1 - 0.9772
= 0.0228
Thus, the number of graduates with distinction is
0.0228( 6000) = 136.8
≈ 137 graduates
59. 59
Exercise 5
b) Calculate the probability that a candidate selected at random has a
score between 45 and 65
P(45 < X < 65) = P[(45 – 55) <z < 65 – 55)]
10 10
= P( -1 < z < 1)
= P (z<1) – P(z<-1)
= 0.8413 – 0.1587
= 0.6826
= 10
µ = 55 65
45
z
µ = 0
- values + values
1
-1
60. 60
Introduction to t-distribution
• The t distribution is very similar to the standardized
normal distribution.
– Both distributions are bell-shaped and symmetrical.
• However, the t distribution has more area in the tails and
less in the centre than does the standard normal
distribution.
– This is because is unknown and s is used to estimate it.
• Because the value of is uncertain, the values of t that
are observed will be more variable than for z.
61. 61
t-distribution
• As the number of degrees of freedom increases, the t distribution gradually
approaches the standard normal distribution until the two are virtually
identical.
• This happens because s becomes a better estimate of as the sample size
gets larger.
• With a sample size of about 120 or more, s estimates precisely enough
that there is little difference between the t and z distributions.
• For this reason, most statisticians use z instead of t when the
sample size is greater than 120.
62. 62
Summary
You should now be able to:
• describe characteristics and parameters of normal distribution
• determine z value and probability using the standard normal
distribution table.
• solve problems related to normal distribution (complete the
exercises!)
Next Lesson W12 L23: Read notes!
• 6.0 SAMPLING DISTRIBUTION
• 6.1 Population data, sample data, population size, sample size,
probability sample.
• 6.2 Histogram of all the probability mean, polygon and frequency
curve.