Escolar Documentos
Profissional Documentos
Cultura Documentos
1, 2004, 99-108
Abstract
We derive the probability density of the ratio of components of the
bivariate normal distribution with arbitrary parameters. The density is a
product of two factors, the first is a Cauchy density, the second a very
complicated function. We show that the distribution under study does not
possess an expected value or other moments of higher order. Our particular
interest is focused on the shape of the density. We introduce a shape
parameter and show that according to its sign the densities are classified
into three main groups. As an example, we derive the distribution of the
ratio Z = Bm1 /(mBm ) for a polynomial regression of order m. For m = 1 ,
Introduction
1 and whose distributions and dependence are known from regression theory.
100
hence U does not possess an expected value or moments of higher order (Johnson
et al., 1994).
Some results about the ratio from the literature are:
(a) The ratio Z of two centred normal variables is a Cauchy variable (Jamnik,
1971: 149):
X
Y : N ( X = Y = 0, X , Y , 1)
X
: C a = X , b = X 1 2
Y
Y
Y
The case considered in (b) is not general and the result in the cited article is
presented in a very implicit way. Marsaglia dealt with the ratio of two independent
normal variables, having shown previously, however that any case could be
transformed into this setting.
The objective of our work is to derive the probability density for the ratio of
components of the bivariate normal distribution for a general setting. Let the
vector W = [ X Y ]T : N ( X , Y , X > 0, Y > 0, ) be distributed normally, with the
density (for 1 ):
( x X ) 2 2 ( x X )( y Y ) ( y Y ) 2
1
+
exp
2
2
2
2
2
(
1
)
Y
2 X Y 1
X Y
X
p W ( x, y) =
101
X2
X Y
X
E (W) = , var(W) =
.
Y2
Y
X Y
Our aim is to express the density function of the ratio Z = X / Y explicitly, in
terms of the parameters of the bivariate normal distribution. We shall also discuss
the degenerate situation, = 1 .
The following theorem is the basis for our derivation of the probability density for
the ratio (Jamnik, 1971: 148).
Theorem 1. Let W = [ X Y ]T be a continuously distributed random vector
with a probability density function p W ( x, y ) . Then Z = X Y is a continuously
distributed random variable with the probability density function
pZ ( z) =
y p W ( zy, y )dy =
yp
0
( zy, y )dy .
(2.1)
The
probability
density
for
Z=X Y,
where
1
X Y 1 2
R ( R)
=
exp sup R 2 1 +
2 2
2
( Y z 2 X Y z + X ) 2
( R)
XY 1 2
1
exp supR2 + 2 R (R) exp [supR2 R2 ]
2 2
2
(Y z 2XY z +X ) 2
(2.2)
where:
X Y X
X
Y
Y
(Y2X XY Y )z XY X + X2 Y X
X Y Y (2.2a)
R = R(z) =
=
2
2 2
2
2
XY 1 Y z 2XY z + X
X
X
2
2
1 z 2
z +
Y
Y
2 2
2
2
Y X 2 X Y X Y + X Y X
2
sup R =
=
X2 Y2 (1 2 )
2 X Y + Y
X Y Y
,
1 2
2
(2.2b)
102
( X Y z ) 2
=
sup R R = 2 2
Y z 2 X Y z + X2
2
X X Y
X Y Y
z 2 X
Y
2
z + X
Y
2
(2.2c)
The first factor in (2.2), the standard part, is the density for a non-centred
X Y
,
and X , to fully describe the distribution. It is strictly
X Y
Y
positive and asymptotically constant it has the same positive value for both
X Y
z = , due to the fact that R() = Y X
. Therefore, the asymptotic
X Y 1 2
behaviour of pZ (z ) is the same as that of the Cauchy density, so E (Z ) and other
moments do not exist.
We wrote the deviant part in (2.2) in two forms. The first form is nicer and can
also be found in Marsaglia (1965), but the second form is better for numerical
purposes.
A more detailed analysis of pZ (z ) led us to the definition of the shape
parameter :
=
based on R() and
Y
Y
Y
Y
X
(2.3)
1 2 ( X Y z )
dR
= X Y
. The sign of separates
3/ 2
dz
Y2 z 2 2 X Y z + X2
103
The derivative of the deviant part led us to the definitions of two quantities for
types I and II: u = X = X X
Y Y
Y
Y
X Y
X Y X
and d =
. u is the abscissa of
Y Y
X
Y
X
the local maximum and d the abscissa of local minimum of the deviant part. For
type I: d < a < u , and for type II: u < a < d ; as previously, a = X , the centre
Y
of the standard part (see Figure 1).
Type I
0.2
0.1
0.0
Density
0.3
(X,Y) ~ N( 2 , 1 , 1 , 1 , 0 )
0
-4
-2
-4
-2
Type II
0.10
0.05
0.0
Density
0.15
0
-10
-5
10
-10
-5
10
Figure 1: A case with a positive shape parameter (Type I) and with a negative shape
parameter (Type II). On the left, the standard Cauchy part (thick line) and the deviant
part (thin line) are presented; the functions are on different scales in order to depict the
shapes of both functions on one plot. The vertical dashed lines indicate the abscissas of
the local extremes of the deviant part, the horizontal dashed line is its asymptote. The
right plot presents the graph of the density pZ (z ) .
104
Type IIIa
0.15
0.10
0.0
0.05
Density
0.20
(X,Y) ~ N( 1 , 1 , 4 , 2 , 0.5 )
0
-4
-2
-4
-2
Type IIIb
0.08
0.04
0.0
Density
0.12
(X,Y) ~ N( 2 , 0 , 1 , 1 , 0.5 )
0
-10
-5
10
-10
-5
10
Type IIIc
0.10
0.0
0.05
Density
0.15
(X,Y) ~ N( 0 , 0 , 2 , 1 , 0.5 )
0
-10
-5
10
-10
-5
10
Figure 2: Three cases having zero value of the shape parameter (Type III). On the left,
the standard Cauchy part (thick line) and the deviant part (thin line) are presented; the
functions are on different scales in order to depict the shapes of both functions on one
plot. The vertical dashed line indicates the abscissa of the local extreme of the deviant
part, the horizontal dashed line is its asymptote. The right plot presents the graph of the
density pZ (z ) .
105
Type III describes the marginal case, not likely to occur in practice. In variant
IIIa (resp. IIIb), the deviant part has only a maximum (resp. a minimum) at z = a .
In variant IIIc, the deviant part is equal to constant 1 (see Figure 2).
The median M ( Z ) and mode(s) can not be obtained analytically for the
general case; further numerical calculations have to be done for each particular
case. But we have derived some partial results. For type I : M ( Z ) > X , for
Y
type II:
M (Z ) <
X
, for type III: pZ (z ) is symmetric and
Y
M (Z ) =
X
.
Y
Variants IIIa and IIIc are unimodal; generally, pZ (z ) may be uni- or bimodal.
The distribution function and quantiles require numerical integration.
Degenerate situation
Now, let us consider the case = 1 , but still with X > 0 , Y > 0 . Then, the
distribution of W = [ X Y ]T is degenerate, and with probability 1, it holds
Y Y
X X
; hence: Z =
= X +
Y
Y
X
Y
Y
Theorem 3.
given by
pZ ( z ) =
If Y : N ( Y , Y ) and Z = a +
1
( z a) exp
2
2
2 Y
2
c
, c 0 , then Z has the density
Y
z a Y
The function pZ (z ) from this theorem is much simpler than (2) and it is rather
easy to find its characteristics, including quantiles and distribution function. Also,
there are two modes that can be found explicitly, and between them there is a
removable singularity pZ (a) = 0 . The expected value, as in non-degenerate cases,
does not exist.
It is worth noting that in the degenerate case the shape parameter (3) is zero
precisely when pZ (z ) is symmetric, as in the non-degenerate case. According to
the sign of the shape parameter, the relations between the median M (Z ) and the
106
Examples
Now, let us discuss the two problems presented in the Introduction. First, we will
consider a linear regression E (Y | x) = 0 + 1 x . We shall be interested in the x-axis
intercept: X 0 = B0 / B1 , where B0 and B1 denote the estimators for 0 and 1 .
Under the assumption that Y | x : N ( 0+ 1 x , reg ) , the variable X 0 is expressed
as the ratio of two normally distributed and dependent random variables B0 and
B1 . Given the data
w=
1
2
xi
and
{( xi , yi ) , i = 1,..., n}
q=
reg
n( w 2 x 2 )
( x1 < xn ), we denote:
x=
1
xi ,
n
B0
x
B : N 0 , 1 , qw, q, w
. Then:
B0
x
1
(2) on making the substitution: X 0 , Y 1 , X q s , Y q ,
x w .
Now we shall be concerned with a general polynomial regression
E (Y | x) = 0 + 1 x + K + m x m , m 1 . Let us define Z = Bm1 /(mBm ) . For m = 1 ,
Z = X 0 from the first example, the estimator for the zero of a linear regression.
For m = 2 , Z is an estimator for the abscissa of the extreme of a quadratic
regression, and for m = 3 , Z is an estimator for the abscissa of the inflection point
of a cubic regression.
v = xik
i = 1,K, n
, the
k = 0,K, m n( m +1)
there are at least m + 1 distinct x-s, implies that the rank of v is precisely m + 1 .
Hence, v T v is invertible and d = ( v T v ) 1 = d jk ( j , k = 0,K, m ) ( m +1) ( m +1) . Let
(4.1)
107
2
vector Y is normally distributed with E (Y ) = v , var(Y) = reg
I . According to
(4),
is
also
normally
E (B) = (d v T ) E (Y) = ,
distributed,
2
var(B) = (d v T ) var(Y) (d v T ) T = reg
d.
B
0 L 0 1 0
Introduce two matrices: u =
and W = u B = m1 .
0 L 0 0 m 2( m+1)
mBm 21
E ( W) = u E (B) = u = m 1 and
m m
d m 1,m 1 md m1, m
2
2
var(W) = u var(B) u T = reg
u d u T = reg
md
. Therefore, the
2
m
d
m
1
,
m
m
,
m
is
also
normal
variable
with
d m1,m
.
distribution of W is N m1 , m m , reg d m1,m1 , m reg d m,m ,
d m1,m1d m,m
Hence, Z has a distribution with density function (2) with the exchange:
X m1 ,
Y m m
d m1,m
d m1,m1d m,m
X reg d m 1,m1
Y m reg d m,m
References
[1] Jamnik, R. (1971): Verjetnostni raun. Mladinska knjiga, Ljubljana.
[2] Johnson N.L, Kotz, S., and Balakrishnan, N. (1994): Continuous Univariate
Distributions. 1. John Wiley and Sons.
[3] Kamerud D. (1978): The random variable X/Y, X, Y normal. The American
Mathematical Monthly, 85, 207.
[4] Marsaglia, G. (1965): Ratios of normal variables and ratios of sums of
uniforms variables. JASA, 60, 163-204.
108
Appendix
a>0
t exp(at
+ bt + c)dt =
b2 b
ec
e
b
1 exp(bm am 2 ) + b
exp
+ m 2a
a 2a
2a
2a
4a 2a
0
t exp(at + bt + c)dt = +
b
where: r =
2a
2
t exp(at + bt + c)dt =
(r ) =
r2
1 2
e
2
ec
a
(r )
1 + r (r ) ,
(r ) = ( x) dx =
r
1 r
erf
.
2 2