Escolar Documentos
Profissional Documentos
Cultura Documentos
= =
(E.1)
Thus the cumulative probability under the curve between and a point u is equal to ) (u F .
Properties of a Density Function
1. ( ) 0
X
f x .
2. ( ) 1.
X
f x dx
3
Chapter E: Continuous Random Variables
The probability of an event { : } X a X b is given by
( ) ( ) .
b
a
P a X b f x dx =
(E.2)
where a and b are constants. Remember that ( ) P X a = is always zero in the continuous case.
The integral in property (2) above can be represented by ( ). F The probability in (E.2) above
can be represented by ( ) ( ) F b F a which is also ( ) P a X b < < in most cases. Be warned that
{ : }, X X a = or, { , , , } X a b c = are not probable events for a continuous variable, rather
typical events are { : }, X X a < { : }, X X a > { : }, X a X b < etc. whose probabilities are
always defined by (D.1).
Exampl e E.2.1 Is the following function a probability density function?
2
( ) 3(1 ) ,
X
f x x = 0 1; x < < ( ) 0,
X
f x = elsewhere.
Solution: The first condition ( ) 0
X
f x is obviously true. The second condition is 1 I =
where ( ) .
X
I f x dx
=
By decomposing the domain, we have
0 1
0 1
( ) ( ) ( ) ,
X X X
I f x dx f x dx f x dx
= +
or,
0 1
2
0 1
0 3(1 ) 0 , I dx x dx dx
= + +
or, 1. I =
Exampl e E.2.2 The content of magnesium in an alloy is a random variable, given by the
following pdf (probability density function)
( ) , 0 6,
18
x
f x x = and ( ) 0 f x = elsewhere.
a. Find the probability that the content of magnesium in alloy is at most 3.
b. Find the probability that the content of magnesium in alloy is at least 3.
c. Find the probability that the content of magnesium in alloy at most k where k is a fixed
value of , X and 0 6 k < < .
d. What content is exceeded 75% of the time?
e. What content exceeds 25% of the time?
f. Find
0.95 0.05
x .
Solution:
a.
3
3 2
0 0
1 1 1
( 3) (9 0) 0.25.
18 18 2 36 4
x x
P X dx
(
= = = = =
(
4
Chapter E: Continuous Random Variables
b.
6
6 2
3 3
1 1 3
( 3) (36 9) 0.75.
18 18 2 36 4
x x
P X dx
(
= = = = =
(
c.
2 2
2
0 0
1 1
( ) ( 0) , 0 6
18 18 2 36 36
k
k
x x k
P X k dx k k
(
= = = = < <
(
Note that this function called CDF (Cumulative Distribution Function), usually denoted by ( ) F x
where x is a fixed value of the variable X such that ( ). P X x The CDF ( ) F x can be used to
calculate both (a) and (b).
d. Let k be a fixed value of X such that ( ) 0.75. P X k > = Then we have
6
6 2
2 2
1 1
( ) (6 ) 0.75
18 18 2 36
k k
y y
P Y k dy k
(
> = = = =
(
and ( ) 0 f x = elsewhere.
The density function will be denoted by ~ ( , ). X U Let the subinterval be
) , ( d c c + contained entirely within ( , ) , then
1 ( )
( ) ( )
c d c d
c c
c d c d
P c X c d f x dy dx
+ +
+
+ = = = =
Note that this probability does not depend on the location c but only on the length of the
subinterval.
5
Chapter E: Continuous Random Variables
The cdf of this distribution is given by
( ) ( ) , .
k
F k P X k k
= =
The 100(1 )-th percentile of X at x is given by 1 ,
x
i.e., ( ). x =
It is usually denoted by
1
x
or simply . x
Exampl e E.3.1 Suppose that X is uniformly distributed between 0 and 1. Determine 95
th
percentile of . X
Solution: Hence the 95
th
percentile X at x is given by ( ) 0.95.
X
F x = The cumulative
distribution function of X at x is given by ( ) .
X
F x x = Hence 0.95.
Exampl e E.3.2 Suppose that X is uniformly distributed between 40 and 60. Determine 95
th
percentile of . X
Solution: Hence the 95
th
percentile X at x is given by ( ) 0.95.
X
F x = That is
100 0.05(100 40), x = or, 97. x =
Exampl e E.3.3 Subway trains on a certain line run every half hour between mid-night and 6 in the
morning. What is the probability that a man entering the station at random during this period will
have to wait at least twenty minutes.
Solution: Let X denote the waiting time (in minutes) for the next train. Then
1
( ) , 0 30,
30
f x x = < < and ( ) 0 f x = elsewhere.
The required probability is then
30 30
20 20
1
( 20) ( ) ,
30
P X f x dx dx = =
which equals
|
30
20
1 1 1
(30 20) .
30 30 3
x = =
Interested readers may go through Mendenhall, Wackerly and Scheaffer (1990, 156-159) for very
interesting discussions.
E.4 The Standard Normal Distribution
The normal distribution was proposed by Carl F. Gauss (1777-1855) as a model for the relative
frequency distribution of errors, such as errors of measurement. Amazingly, this bell-shaped
curve provides an adequate model for the relative frequency distribution of data collected from
many different areas of science, engineering and business. It has the following functional form:
6
Chapter E: Continuous Random Variables
< < =
z e z f - ,
2
1
) (
2 / z
2
The following graph of the distribution is from STATISTICA:
The random variable Z is called a standard normal variable. It has the expected value 0 ) ( = Z E
and variance 1 ) ( = Z V . It is obvious that 50 . 0 ) 0 ( ) 0 ( = > = < Z P Z P . It is also interesting to
note that though theoretically 1 ) ( = < < Z P but practically ( 4 4) 1 P Z < < , even
1 )] 1 ( 3 0 ) 1 ( 3 0 [ + < < Z P i.e. A Standard Normal Variable lies within 3 (times) standard
deviation (1) of its mean (0).
The cumulative distribution function of Z is given by
2
/2
1
( ) .
2
z
t
Z
F z e dt
The solution to z of the equation ( ) 1
Z
F z = is the 100(1 )-th percentile of Standard
Normal Distribution and is denoted by
z . For example, ( 1.959964) 0.975, P Z < = or,
equivalently 025 . 0 ) 959964 . 1 ( = > Z P . That is 96 . 1 959964 . 1
025 . 0
= z s the 97.5-th
percentile of the Standard Normal Distribution.
Since
25 . 0 1 75 . 0 ) 67449 . 0 (
50 . 0 1 50 . 0 ) 0 ( ) 0 (
75 . 0 1 25 . 0 ) 67449 . 0 ( ) 67449 . 0 (
= = <
= = > = <
= = > = <
Z P
Z P Z P
Z P Z P
Probability Density Function
y=normal(x,0,1)
0.00
0.15
0.30
0.45
0.60
-3.50 -1.75 0.00 1.75 3.50
7
Chapter E: Continuous Random Variables
the quartiles of the distribution are given by 67449 . 0 and 0 , 67449 . 0
25 . 0 50 . 0 75 . 0
= = = z z z .
See the Table of Normal Percentiles given below:
z
0.005 2.575829
0.010 2.326348
0.015 2.170090
0.020 2.053749
0.025 1.959964
0.030 1.880794
0.035 1.811911
0.040 1.750686
0.045 1.695398
0.050 1.644854
0.055 1.598193
0.060 1.554774
0.065 1.514102
0.070 1.475791
0.075 1.439531
0.080 1.405072
0.085 1.372204
0.090 1.340755
0.095 1.310579
0.100 1.281552
0.200 0.841621
0.250 0.674490
0.500 0.000000
Note that the pdf ( )
Z
f z is location symmetric around location 0, that is (0 ) (0 ).
Z Z
f z f z + =
Hence,
05 . 0 95 . 0
z z = , and
01 . 0 99 . 0
z z = . That is, in general, for normal distribution we have
=
1
z z where 1 0 .
The probability that Z is less than or equal to 2.0 is denoted by ( 2.0) P Z . It is 0.9772
tabulated in the intersection of the first column (at 2) and first row (at 0). We write
( 2) 0.9772. P Z = Since ( 2) 0 P Z = = , we have ( 2) 0.9772. P Z < = The probability that
Z is less than or equal to 2.05 is denoted by ( 2.05) P Z . It is 0.9798 tabulated in the
intersection of the first column (at 2) and first row (at 0.05). The probability that Z is more than
2.05 is given by ( 2.05) 1 ( 2.05) P Z P Z > = which equals
( 2.05) 1 0.9798 0.0202 P Z > = . See TUTORIAL for details.
Moreover ( 2.0) 0.9772 P Z = implies that 2 z = is the 97.72
nd
percentile. To find the 97.72
nd
percentile of standard normal distribution, we write ( ) 0.9772 P Z k = . Then in the body of the
table we look for 0.9772. By comparing with ( 2.0) 0.9772 P Z = , we write 2. k = Hence
2 z = is the 97.72
nd
percentile.
8
Chapter E: Continuous Random Variables
To find the 75
th
percentile of the distribution, we write ( ) 0.75 P Z k = . Then in the body of the
table we look for 0.75 or any value closest to it. We find 0.7486. By comparing with
( 0.67) 0.7486 P Z = , we write 0.67. k Hence 0.67 z is the 74.86
th
or 75
th
percentile.
The Normal Distribution
It is characterized by two natural parameters mean and variance usually denoted by
2
and .
distributions. The pdf is given by
2
1 1
( ) exp ,
2 2
X
x
f x
(
| |
=
(
|
\ .
(
, , . x < < < < < <
The parameter is real and
2
is nonnegative. They are the mean and variance of the
distribution. Though these symbols are also used to denote the mean and variance of other
distributions, they should not be confused. The distribution is usually denoted by ) , (
2
N .
The function ( )
X
f x is location symmetric around , i.e., ( ) ( ).
X X
f x f x + =
If X is a random variable with mean and variance
2
i.e.
2
~ ( , ) X N , then
2
~ (0, ) X N and ~ (0, 1).
X
Z N
=
If
2
~ ( , ) X N , then is the center or mean of the distribution. Fifty percent of the area
under the normal distribution is below the mean which implies that mean and median coincide. In
fact mean, median and mode coincide for the normal distribution.
Since the quartiles of the standard normal distribution (with 0 = and ) 1 = are given by
67449 . 0 and 0 , 67449 . 0
25 . 0 50 . 0 75 . 0
= = = z z z , it follows that
0.75 0.75 0.75
0.67449, 0 and 0.67449
x x x
= = = so that
0.75 0.50 0.25
0.67449 , , 0.67449 , x x x = = = +
are the quartiles of normal distribution with mean ) ( and standard deviation ) ( .
Exampl e E.4.1 Mid term grades of the students of Engineering Statistics have a mean of 20.
The variance of the grades is 4. What is the probability that a randomly selected student got more
than 16. Explain this probability.
( 16) ( 2) ( 2) 0.9772 P X P Z P Z > = > = < =
That is 97.72% students got more than 16 in the mid term.
9
Chapter E: Continuous Random Variables
Exampl e E.4.2 A manufacturing process has a machine that fills coke to 300 ml bottles. Over a
long period of time, the average amount dispensed into the bottles was 300 mls, but there is a
standard deviation of 5 mls in these measurement. If the amounts of fill per bottle can be
assumed to be normally distributed, find the probability that the machine will dispense more than
310 mls of liquid in any one bottle. (cf. Scheaffer and McClave, 1995, 216-217)
Solution: Let X be the amount of fill in a bottle. Then
2
~ (300, 5 ). X N
300 310 300
( 310) ( 2) 0.0228
5 5
X
P X P P Z
| |
> = > = > =
|
\ .
Exampl e E.4.2 Experience indicates that the development time ) (Y for a photographic printing
paper is distributed as normal with mean 30 = seconds and standard deviation 1 second. Find
the probability that
(a) the development time ( ) X differs from its expected value ) ( by more than 2
seconds.
(b) the development time ( ) X will be within 1.645 seconds of its expected value ) ( .
(c ) the development time will be within 1.645 seconds of true development time
Solution:
(a) [ 2, , X 2] [ 28, , X 32] P X or P X or < > = < >
Note that if somebody uses the left hand side of the above identity he can do the problem without
knowing the value of . In either case the required probability is
2 2
( 2) ( 2) ,
1 1 1 1
X X
p P X P X P P
| | | |
= < + > = < + >
| |
\ . \ .
which equals
( 2) ( 2) 2(0.02275). p P Z P Z = < + > =
b. ( 1.645 1.645) ( 1.645 1.645), p P X P X = < < + = < < which equals
( 1.645 1.645) 0.90. p P Z = < < =
c. Same as above. (cf Hines and Montgomery, 1990)
Exampl e E.4.3 The compressive strength of samples of cement can be modeled by a normal
distribution with a mean of 6000 kilograms per square centimeter and a standard deviation of 100
kilograms per square meter.
(a) What is the probability that the strength of a sample is less than 6164.5 kg/cm
2
?
(b) What compressive strength is exceeded by 95% of the time?
(c) What compressive strength exceeds 5% of the time?
Solution:
10
Chapter E: Continuous Random Variables
a.
6000 6164.5 6000
( 6164.5) ( 1.645) 0.95
100 100
X
P X P P Z
| |
< = < = <
|
\ .
b. Let k be a fixed value of the variable X . Then we have ( ) 0.95, P X k > = or equivalently
( ) 0.05, P X k < = which can be standardized as
0.05,
X k
P
| |
< =
|
\ .
or, 0.05.
k
P Z
| |
< =
|
\ .
From the Standard Normal Probability Table, we have ( 1.645) 0.05 P Z < = so that by
comparison we have
1.645,
k
so that
1
.
1.96
Therefore the largest value of the standard deviation would be 1/1.96.
E.5 The Exponential Distribution
Life lengths of batteries or light bulbs usually follow exponentially decreasing distribution known
as Negative Exponential Distribution or simply Exponential Distribution. Relative frequency
11
Chapter E: Continuous Random Variables
histogram of samples of lifelengths or light bulbs can be approximated by exponential
distribution. The probability density function of an exponential random variable is given by
/
1
( ) , (0 ,0 ),
x
f x e x
= < < < and ( ) 0 f x = elsewhere.
This distribution is characterized by the only one scale parameter and may be represented, for
short, by ) ( Exp . The mean and variance of exponential distribution is given by
2 2
and = = respectively. Thus it is a nonnegative distribution whose mean and
standard deviation are the same. Many authors prefer to use 1/ = as the parameter of the
distribution. Many interesting problems for sums of exponential variables are variable in Hines
and Montgomery (1990).
Note that if ( ) P X k > = , it can be proved that ln(1/ ) k = . The proof is outlined below:
( ) ,
k
f x dx
or,
/
1
.
x
k
e dx
By integration, we have
/
,
x
k
e
( =
or,
/
0 .
k
e
= Since ln 1, e = we have ln . k =
In particular, if
1
( )
2
P X k > = , then ln2 k = which is the median of the distribution. Since
ln2 0.69 , = > = it follows that the density function is skewed to the right.
Exampl e E.3 Lifetime of a particular type of battery follow exponential distribution with mean 2
hundred hours. Find the probability that
a. the lifetime of a particular battery of this type is less than 2 hundred hours.
b. the lifetime of a particular battery of this type is more than 4 hundred hours.
c. the lifetime of a particular battery of this type is less than 2 hundred hours or more than 4
hundred hours.
d. it will survive three hundred hours more if it has already survived more than 5 hundred hours.
e. it will survive between 5 hundred and 8 hundred hours?
f. What is the probability that a battery will survive more than 5 hundred hours or more than 8
hundred hours?
Solution Let X =lifetime of a battery. Then ~ ( ) (2) X Exp Exp = .
a. The event of interest is { : 2}. X X < The required probability is
2
0
( 2) ( ) , P X f x dx < =
or,
2
/
0
1
( 2) .
x
P X e dx
< =
Then by integration, we have
12
Chapter E: Continuous Random Variables
2
2
/ 2/
/
0
0
1
( 2
1
) 1
/
.
x
x
e
P X e e
(
( < = = =
(
Since 2, = finally, the probability is
1
1 0.6321. e
b.
4
( ) ( ) 4 , P X f x dx
> =
or,
/
4
1
( ) . 4
x
P X e dx
> =
By integration, we have
4/
( . 4) P X e
> =
Since 2, = finally, the probability is
2
0.1353. e
c.
1 2
[{ 2}, or , { 4}] ( 2) ( 4) 1 , P X X P X P X e e
< > = < + > = +
by (a) and (b).
d. Let { : 8} A X X = > and { : 5}. B X X = > Then the required probability is
( )
( | ) .
( )
P AB
P A B
P B
=
Since , A B it follows that , AB A = and then
( )
( | ) .
( )
P A
P A B
P B
=
Now
/ 8/
8
1
( ) { 8} ,
x
P A P X e dx e
= > = =
and similarly
5/
( ) { 5} . P B P X e
= > = Then,
we have
8/
3/
5/
( | ) ,
e
P A B e
e
= =
which is the same as ( 8 5). P X > This property is called the memoryless property of an
exponential distribution. Finally, the probability is
3/2
0.2231. e
e.
5/ 8/ 5/2 8/2
(5 8) ( 5) ( 8) 0.0737. P X P X P X e e e e
< < = > > = =
13
Chapter E: Continuous Random Variables
It is also the probability that a battery survives more than 5 hundred hours but fails by 8 hundred
hours.
f. The required probability is ( ). P A B Since , A B it follows that . A B B = Then
( ) ( ), P A B P B = or,
8/2
( ) , P A B e
= or, ( ) 0.0183. P A B
The above can also be done by ( ) ( ) ( ) ( ). P A B P A P B P AB = +
Exampl e E.4 An electronic component is known to have a useful life represented by an
exponential density with failure rate
5
10
per hour.
a. Determine the mean time to failure.
b. Determine the fraction of such components that would fail before the mean life.
c. What is the median life time?
Solution:
a. Since the failure rate is
5
10
= for 0 x > and
( ) 0 f x = for 0. x < The fraction of such components that would fail before the mean life is
given by
1 / 1
0 0
( ) ( ) 1 0.6321.
x
P X f x dx e dx e
< = = =
c. The median (0 1) k k < < is a fixed value such that ( ) 0.5 P X k < = so that
1 / /
0 0
( ) 1 0.5.
k k
x k
f x dx e dx e
= = =
Then
/
0.5
k
e
= which can be inversed as
/
2
k
e
= so that
5
5 10
ln2 10 ln2 ln2 . k = = =
The Waiting Time Distribution of a Poisson Process
Time between incidents is a continuous variable and it may be proved that it follows an
exponential distribution. If ) (t Y , the number of incidents per t unit of time, follows Poisson
Process with t as the expected number of incidents in t unit of time, then T , the interarrival
time follows an exponential distribution with ( ) 1/ E T = as the expected interarrival time.
Derivation of the density function
Consider a Poisson process
t
X with a mean of ( ) , 0.
t
E X t t = > Then the probability that
there will be no count in t unit of time is given by
0
( )
( 0)
0!
t
t
t
P X e
= = which
is ( 0) .
t
t
P X e
= = Now let W be the waiting time one must wait to see the next count. Them
the event { } W t > is equivalent to { 0}.
t
X = Hence ( ) ( 0)
t
P W t P X > = = so that
( ) .
r
P W t e
> = Then the cumulative distribution function is given by
14
Chapter E: Continuous Random Variables
( ) ( ) 1 , 0.
t
F t P W t e t
= = > The derivative of this function is the density function of the
waiting time given below:
( ) , 0 , 0 ,
t
W
f t e t
= = .
Then
0.3(1)
( 0) 0.7408.
t
P X e e
= = =
(This is an example of Poisson Variable with unit time)
b. Let X be the number of accidents per hour follow a Poisson distribution with mean 3 . 0 =
per hour. Then W , the time between consecutive accidents, measured in hours, follows an
exponential distribution with
( ) 1/ E W = i.e. ( ) , 0
w
f w e w
= > and consequently
4 4 1.2
0
4 4
( 4) ( ) [ ] 0.3012.
w w
P W f w dw e dw e e e
> = = = = =
( X is a Poisson Variable with unit time)
15
Chapter E: Continuous Random Variables
c. Let
t
X , the number of accidents in t unit of time, follow a Poisson distribution with mean t so
that
( )
( )
!
x
t
t
t
P X x e
x
= = .
Then
2 2(0.3)
2
( 0) 0.5488. P X e e
= = =
(This is an example of Poisson Variable with non-unit time)
Exampl e E.6 Accidents occur at an intersection at an average rate of 0.3 per 30 minutes.
a. What is the probability that no accident happens in an hour?
b. Find the probability that two consecutive accidents occur more than 4 hours apart.
Solution:
a. Let X , the number of accidents per hour, follow a Poisson distribution with mean
(0.3/ 30)(60) 0.6 = = per hour so that
( )
!
x
P X x e
x
= = .
Then
0.6
( 0) 0.5488. P X e e
= = =
(By scaling the average, we solved it by Poisson Variable with unit time)
Al ternative Solution:
a. Let
t
X , the number of accidents in t unit of time, which is 30 minutes, follow a Poisson
distribution with mean t so that
( )
( )
!
x
t
t
t
P X x e
x
= = . In terms of the unit time (30 minutes),
(30minute
an hour an hour 2
30minutes
s)
(30minutes) t = = = so that
0.3(2)
2
( 0) 0.5488. P X e
= =
b. Let X be the number of accidents in an hour so that it follows a Poisson distribution with
mean 0.3(2) 0.6. = = Then W , the time between consecutive accidents, measured in the unit
of an hour, follows an exponential distribution with ( ) 1/ E W = i.e.,
( ) , 0 ,
t
W
f t e t
= < and ( ) 0
W
f t = elsewhere. It follows that
16
Chapter E: Continuous Random Variables
4 4(0.6)
4
4 4
( 4) ( ) 0.0907.
t t
W
P W f t dw e dt e e e
( > = = = = =
Al ternativel y Let
t
X , the number of accidents in t unit of time, which is 30 minutes, follow a
Poisson distribution with mean t so that
( )
( )
!
x
t
t
t
P X x e
x
= =
Then W , the time between consecutive accidents, measured in the unit of half an hour, follows
an exponential distribution with ( ) 1/ E W = i.e., ( ) , 0 ,
t
W
f t e t
= < and
( ) 0
W
f t = elsewhere. In terms of the unit time (30 minutes), 4 hours is
( )
4 hours 8 ( )
half
half an hour
half a
an
n
h
hour
our =
so that the event of interest is { : 8} W W > and consequently
8 8(0.3)
8
8 8
( 8) ( ) 0.0907.
t t
W
P W f t dw e dt e e e
( > = = = = =
E.6 The Gamma Probabili ty Distribution
The probability density function for gamma distribution is given by
1 /
( ) , 0 ,0 ,0
( )
x
x e
f x x
= < < < < <
where
1
0
( )
x
x e dx
=
=
=
= =
=
Since the above is a survival function by differentiating ( ) P T t with respect to t we have the
density function
1
for 0
( ) ( 1)!
0 for 0
r r t
t e
t
f t r
t
<
(Bean, 2001, 232). We will denote this distribution by ( , ) G r
Exampl e E.6.1 Instantaneous surges of electrical current occur randomly and independently on
a particular line at an expected rate of 0.1 per hour. The electrical system will fail after four
surges. Determine the probability that the system is functioning in 10 hours. (bean, 2001, 232)
Solution: Let
j
T be the time in hours between the ( 1) j th and the j th surge. It has
exponential distribution with mean rate 0.1 = . Then
1 2 3 4
T T T T T = + + + , the future lifetime of
the system measured in hours, has a gamma distribution (4, ) G where expected surging rate is
with mean and the desired probability is
10 3
0
(10 )
( 10) ,
!
n
n
e
p P T
n
=
= > =
which equals
10 0.1 3
0
(10 0.1)
,
!
n
n
e
p
n
=
which equals
1 3
0
,
!
n
e
p
n
=
=
and hence
a.
2 4 3
1 0
(2 )
2 0.1512
!
n
i
i n
e
P T
n
= =
| |
> =
|
\ .
b.
2 5 5 4
1 1 0
(2 )
2 1 2 1 0.7149
!
n
i i
i i n
e
P T P T
n
= = =
| | | |
= =
| |
\ . \ .
Note that
1
T represents the remaining service time, not the total service time , for the customer
currently being served. However since the total service time is exponentially distributed, the
remaining service time must also be exponentially distributed by the memoryless property.
The probability density function of a Gamma random variable X with two parameters, 0 > and
0 > is sometimes written as
1
( )
( ) ,
( )
x
X
x e
f x
0 x < <
where
1
0
( ) ,
y
y e dy
0. >
a. Verify that the PDF integrates to 1.
Solution:
1
0 0
( )
( ) ,
( )
x
X
x e
I f x dx dx
= =
Substituting x y = we have
1 1
0 0
(
( ) ( )
, )
x y
I
dy
x e dx y e
= =
1
0
1
,
( )
y
I y e dy
The integral
1
0
,
y
y e dy
= =
+
and variance
2
( )( 1)
=
+ + + +
.
There are excellent examples in Bean (2001, 263-265).
E.8 Cauchy Distributi on
The Cauchy random variable X assumes value over the entire real line and has the pdf:
2 2
1
( ) ,
X
f x
x
=
+
, x < < and 0 > is any constant.
Let us verify that the pdf integrates to 1.
2 2
1
( ) .
X
I f x dx dx
x
= =
+
Substituting , x y = we have
2
1
.
1
I dy
y
=
+
That is 1. I =
Let us derive the cdf of X and plot it. For , x < < the CPF (Cumulative Probability
Function) ( ) ( )
X
F x P X x = of X is given by
20
Chapter E: Continuous Random Variables
0
( ) ( ) ,
t x
X X
t
F x f t dt
=
=
=
or,
2 2
0
1
( ) .
t x
X
t
F x dt
t
=
=
=
+
Substituting , t y = we have
/
2
1
( ) .
1
y x
X
y
F x dy
y
=
=
=
+
Finally the CDF is
1
1 1
( ) tan ,
2
X
x
F x
| |
= +
|
\ .
0. >
E.9 The Weibull Probability Distribution
The probability density function for Weibull distribution is given by
1 /
( ) x , 0 ,0 ,0
y
f x e x
= < < < < <
The expected value and variance of the gamma distribution are, respectively,
1/
( ) (1 1/ ) E Y
= = + and
( )
2 2/ 2
(1 2/ ) (1 1/ )
= + + .
E.10 The Lognormal Probability Distribution
The probability density function for lognormal distribution is given by
2 2
(ln ) /(2 )
1
( ) , 0 ,0 ,0 .
2
x
f x e x
x
= < < < < <
Note that
2
ln ( , ). Y X N =
21
Chapter E: Continuous Random Variables
Tutorial
E.1 The total time, measured in units of 100 hours , that a student runs his computer over a
period of one year is a continuous random variable X that has the density function
, 0 1
( ) 2 , 1 2
0, elsewhere
x x
f x x x
< <
a. What is the probability that a student runs the computer between 75 hours to 150 hours per
year.
b. What is the first quartile of the total time a student runs his computer?
c. How much time, on an average, a student runs his computer?
d. What is the median amount of time a student runs his computer?
e. What is the third quartile?
Solution:
a.
1 1.5
0.75 1
(0.75 1.5) ( ) ( ) , P X f x dx f x dx < < = +
or,
1 1.5
0.75 1
(0.75 1.5) (2 ) , P X xdx x dx < < = +
or, (0.75 1.5) 0.594. P X < <
b. Since
1 1
0 0
( ) 0.5 0.25 f x dx xdx = = >
, the first quartile is a fixed value (0 1) k k < < such
that
0
( ) 0.25.
k
f x dx =
This yields
0
0.25,
k
xdx =
or,
2
0
1
0.25,
2
k
x ( =
or,
2
1/ 2, k = or,
1
.
2
k = Hence the first quartile is given by 0.7071. k
c. The mean is given by
2 1 2
0 0 1
( ) ( ) ( ) ( ) , E f x dx xf x dx xf x dx X x = = +
or,
1 2
0 1
( ) { } {2 } 1. E X x x dx x x dx = + =
d. Since
1 1
0 0
( ) 0.5, f x dx xdx = =
the median is given by 1.
e. Since
1 1
0 0
( ) 0.5 0.75 f x dx xdx = = <
, the third quartile is a fixed value (1 2) k k < < such
that
1
( ) 0.25.
k
f x dx =
This yields
1
(2 ) 0.25,
k
x dx =
or,
2
1
[2 0.5 ] 0.25
k
x x = or,
2
(2 0.5 ) (2 0.5) 0.25, k k =
2
0.5 2 1.75 0, k k + =
2
2 ( 2) 4(0.5)(1.75)
,
2(0.5)
k
=
22
Chapter E: Continuous Random Variables
1.2929, 2.7071. k Hence the third quartile is 1.2929.
Alternatively, since the density function is linear, the third quartile is approximately
2 0.7071 1.2929. = Is it true in general?
E.2 A box is to be constructed so that its height is 5 inches and its bases Y inches by Y inches ,
where Y is a random variable described by
( ) 6 (1 ), 0 1. f y y y y = < <
a. Find the expected value of
2
5 Y .
b. Explain the quantity you found in part (a). (Larsen, Richard and Marx, Morris L. , 2001, 213).
c. Derive 95
th
percentile of the above probability density function.
Parti al Solution:
a.
1
2
0
(5 ) {6 (1 )} 1.5. E Y y y y dy = =
= <
a. Calculate the mean of the random variable Y .
b. What is the probability that the weight of an article is more than 8.25 ounces?
24
Chapter E: Continuous Random Variables
c. The manufacture sells the article for a fixed price of $2.00. He guarantees to refund the
purchase money to any customer who finds the weight of his article to be less than 8.25 ounces.
His cost of production is related to the weight of the article by the relation ( /15) 0.35 Y + . Find
the expected profit per article.
Parti al Solution:
a. 9, b. 0.969 approx. c. The sale price of the article is given by 2 for 8.25 X and
0 for 8.25 X < . The profit (sale price cost) is given by
0 [( /15) 0.35] if 8.25
2 [( /15) 0.35] if 8.25
Y X
Z
Y X
+ <
=
+
E.18 The pH of a reactant is measured by the random variable X whose density function is
given by
25( 3.8) if 3.8 4
( ) 25( 4.2) if 4 4.2
0 elsewhere
x x
f x x x
a. What is the median pH of the reactant
b. What is the probability that the pH of the reactant exceeds 3.9?
Hints: (a) ( 4) 1/ 2 P X = (b) ( 3.9) 1 ( 3.9) 0.875 P X P X > = =
E.21 An engineer designed a ball whose radius r changes from 10 inches to 20 inches with pdf
given by
1 2
1500 (225 (3 45) ),10 20
( )
0, otherwise
r r
f r
a. Sketch the function ( ) f r
b. What proportion of balls have radius more than 15 inches?
c. What is the expected radius of the balls?
d. Find the expected surface area ( ) S of the balls.
e. Find the expected volume ( ) V of the balls?
Parti al Solution.
2
6 9 3
( )
5 50 500
f r r r = + ,
( )
3
45000 11600 63000 3600 E R = + = ,
2
4 S r = ,
3 3
4
( ) 4800 in
3
V E R = =
E.45 Two instructors J and K have been teaching statistics course for 10 years. Final exam
marks in the section taught by J usually varies around 75 with a standard deviation 9 while they
are 70 and 10 respectively for the section taught by K. What is the probability that a randomly
25
Chapter E: Continuous Random Variables
selected student of J will have score more than the average score of a student of K?
E.46 Let the diameter (in millimeters) of a hole drilled in a sheet metal can be modeled by the
probability density function
20( 12.5)
2 , 12.5
( )
0 otherwise
x
e x
f x
=
Note that the target diameter is 12.5 millimeters. If a part with a diameter larger than 12.6
millimeters is scrapped, what proportion of parts is scrapped? Find the median diameter of the
sheet metal.
E.48 At a certain location on high-way I-10, the number of cars exceeding the speed limit by more
than 20 miles per hour is a random variable having a Poisson distribution with an average of 8.4
cars per an half hour. What is the probability of a waiting time of less than 5 minutes between
cars exceeding the speed limit by more than 20 miles per hour?
(Miller, Irwin and Miller, Meryless, 1999, 207)
Parti al Solution: ( 5/30) P T < where T , time between arrivals, is an exponential distribution
with mean number of cars arriving 8.4 = per 30 minutes. Then
1/6
/6 1.4
0
( 1/6) 1 1 0.7534.
t
P T e dt e e
< = = =
Or , ( 5) P T < where T , time between arrivals, is an exponential distribution with mean number
of cars arriving 8.4 = /30 per minute, or, 0.28 per minute. Then
5
5 1.4
0
( 5) 1 1 0.7534.
t
P T e dt e e
< = = =
E.49 The number of planes per day at a small private airport is a random variable having a
Poisson distribution with a mean of 28.8. What is the probability that the time between two
consecutive arrivals is at least 1 hour? (Miller, Irwin and Miller, Meryless, 1999, 215)
(Ans: 0.3012 approx.)
Solution: The mean number of planes arriving per hour is 28.8/ 24 = per hour, i.e. 1.2 =
per hour. Let T be the time between consecutive arrivals. Then
1.2
1
( 1) 0.3012.
t
P T e dt e e
= = =
Mathematical Problems
E.54 Find the expected value of exponential random variable with mean 1.
Solution: The density function is given by < =
y e y f
y
0 , ) ( . The expected value is,
therefore,
26
Chapter E: Continuous Random Variables
E.55 Consider the gamma probability function
1
( )
( ) .
( )
x
X
x e
f x
a. Prove that
2 2
( ) ( 1) / E X = + and
2
( ) / . Var X =
E.56 Let ~ ( ). X Expo
a. Find the CDF ( )
X
F x of the distribution. Also derive the conditional CDF ( | ).
X
F x X t >
Solution: The CDF of the exponential random variable is
0 0 0
( ) ( ) ( ) 1 .
t x t x x
t t
X X
x
t t
F P X x f t dt e dt e e x
= =
= =
( = = = = =
The conditional CDF is given by
( )
( | ) ,
1 ( )
X
X
X
f x
F x X t
F t
> =
. x t This is given by
( )
( | ) ,
1 (1 )
x
x t x t
X t
e
F x X t e e e
e
> = = =
. x t
Choose 3 = say, and 0.10 t = say. Then choose
0.10,0.20,0.30,0.40,0.40,0.50, x = and calculate values of
( )
( | ) .
x t
X
F x X t e
> =
Then plot
( )
( | )
x t
X
F x X t e
> = in the vertical axis against x in the horizontal axis.
b. Prove that ( | ) ( ). P X t x X t P X x > + > = >
Solution: For 0, t > consider the events { } A X t x = > + and { }. B X t = > Then the
given problem reduces to
( ) ( )
( | ) ,
( ) ( )
P AB P A
P A B
P B P B
= =
Where we have used the fact AB A = . Note that if one set is a subset of the other, then
the intersection is the smaller set and union is the bigger set. Notice that A B so that
. AB A =
( )
( )
( | ) ( ),
( )
x t
x
t
P A e
P A B e P X x
P B e
= = = = >
That is ( | ) ( ). P X t x X t P X x > + > = >
27
Chapter E: Continuous Random Variables
Since the left hand side does not depend on t , this property is called memory less
property of exponential distribution.
E.57 Prove that the cdf (Cumulative Distribution Function), denoted by ( ), F u of the exponential
distribution with mean is given by
/
( ) 1 , 0 .
u
P Y u e u
= <
E.58 The sdf (survival distribution function) of the exponential distribution with mean is given
by is given by < = = > =
u e u Y P u Y P u S
u
0 , ) ( 1 ) ( ) (
/
.
The probability of an event can be found by direct integration or by the use of cdf or sdf.
E.58 Let Y have an Exponential Distribution with mean .
a. Solve the equation for ) ( ) ( c Y P c Y P > = for c .
b. Also solve the equation 1 ) ( 2 = c Y P for c
c. Interpret your results in (a) and (b).
E.59 Accidents occur at an intersection at an average rate of t per t unit of time.
a. What is the probability that no accident happens in t unit of time?
b. Find the probability that two consecutive accidents occur more than kt unit of time apart .
Solution:
a. ( 0)
t
P X e
= =
b. ( ) [ ( 0)]
k
t
P W kt P X > = =
E.60 Let
2
~ ( , ). X N Prove that x z
= + where x
< < = =
E.2 ( ) ( ) . E X x f x dx
=
E.3
2 2 2 2 2
( ) ( ) , ( ) ( ) . E X x f x d x V X E X
= =
E.4 The Normal Distribution
2
( , ): N ( ) , E X =
2
( ) . V X =
E.5 The Exponential Distribution:
/
1
( ) , 0,
x
f x e x
= ( ) , E X =
2
( ) . V X =
E.6 Waiting Time Distribution: ( ) , 0,
t
f t e t
= ( ) 1/ , E T =
2
( ) 1/ , V T =
E.7 The Gamma Distribution:
1 /
1
( ) x , 0 , 0 , 0
( )
x
f x e x
= < < <
( ) , E X =
2
( ) . V X =
29
Chapter E: Continuous Random Variables
30
Chapter E: Continuous Random Variables
31
Chapter E: Continuous Random Variables