Escolar Documentos
Profissional Documentos
Cultura Documentos
Contents
1 Introduction
3 Affine Processes
3.1 Examples of Affine Processes . . . . . . . . . . . . . . . . . . .
7
9
11
13
15
18
8 Correlated Default
19
I am extremely grateful for the wonderful work done by Maurizio Pratelli in arranging
my visit to Pisa and for organizing this course, for the hospitality of the Scuola Normale
Superiore, and for the support for the course offered by the Associazione Amici della
Scuola Normale Superiore, who were generously represented by Mr. Carlo Gulminelli.
These notes are currently in the form of a rough draft. Future improvements can be
obtained at www.stanford.edu/duffie/.
9 Credit Derivatives
9.1 Default Swaps . . . . . . . . .
9.2 Credit Guarantees . . . . . .
9.3 Spread Options . . . . . . . .
9.4 Irrevocable Lines of Credit . .
9.5 Ratings-Based Step-Up Bonds
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
21
21
22
24
26
28
40
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
50
50
51
52
53
54
Introduction
These are lecture notes for a two-week course at Scuola Normale Superiore,
Pisa, during April-May, 2002, on credit-risk modeling, emphasizing the valuation of corporate debt and credit derivatives, using the analytical tractability
and richness of affine state processes.
Appendix A containts a brief overview of structural models that are based
on default caused by an insufficiency of assets relative to liabilities, including
the classic Black-Scholes-Merton model of corporate debt pricing as well as a
standard structural model, proposed by Fisher, Heinkel, and Zechner [1989]
and solved by Leland [1994], for which default occurs when the issuers assets
reach a level so small that the issuer finds it optimal to declare bankruptcy.
The main part of the course, however, treats default at an exogenously
specified intensity process, exploiting as a special case affine Markov state
processes.
Section 2 begins with the notion of default intensity, and the related calculation of survival probabilities in doubly stochastic settings. The underlying
mathematical foundations are found in Appendix E. Section 3 introduces the
notion of affine processes, the main source of example calculations for the remainder of the course. Technical foundations for affine processes are found in
Appendix C. Section 4 explains the notion of risk-neutral probabilities, and
provides the change of probability measure associated with a given change
of default intensity (a version of Girsanovs Theorem). Technical details for
this are found in Appendix E.
By Section 5, we see the basic model for pricing defaultable debt in a setting with stochastic interest rates and stochastic risk-neutral default intensities, but assuming no recovery at default. The following two sections extend
the pricing models to handle default recovery under alternative parameterizations. Section 8 introduces multi-entity default modeling with correlation.
Section 9 turns to applications such as default swaps, credit guarantees, irrevocable lines of credit, and ratings-based step-up bonds. Exercises address
additional applications. Some notes provide direction to further reading.
This section introduces a model for a default time as a stopping time with
a given intensity process, as defined below. From the joint behavior of the
3
default time, interest-rates, the promised payment of the security, and the
model of recovery at default, as well as risk premia, one can characterize the
stochastic behavior of the term structure of yields on defaultable bonds.
In applications, default intensities may be allowed to depend on observable variables that are linked with the likelihood of default, such as debt-toequity ratios, volatility measures, other accounting measures of indebtedness,
market equity prices, bond yield spreads, industry performance measures,
and macroeconomic variables related to the business cycle. For details, see
Duffie and Singleton [2002]. This dependence could, but in practice does not
usually, arise endogenously from a model of the ability or incentives of the
firm to make payments on its debt. Because the approach presented here
does not depend on the specific setting of a firm, it has also been applied
to the valuation of defaultable sovereign debt, as in Duffie, Pedersen, and
Singleton [2000] and Pag`es [2000]. (For more on sovereign debt valuation,
see Gibson and Sundaresan [1999] and Merrick [1999].)
We fix a complete probability space (, F, P) and a filtration {Gt : t 0}
of sub--algebras of F satisfying the usual conditions, which are listed in
Appendix B. Also defined in Appendix B is the notion of a predictable
process, which is, intuitively speaking, a process whose value at any time t
depends only on the information in the underlying filtration {Gt : t 0} that
is available up to, but not including, time t.
Appendix E defines a nonexplosive counting process. Such a counting
process K records by time t the number Kt of occurences of events of concern. A basic example of a counting process is a Poisson process. A counting
process RK has an intensity if is a predictable non-negative process satt
isfying 0 s ds < almost surely for all t, with the property that a local
martingale M , the compensated counting process, is given by
Z t
Mt = Kt
s ds.
(2.1)
0
Elliott, Jeanblanc, and Yor [1999] give a new proof of this intensity result, which is
generalized by Song [1998] to the multi-dimensional case. Kusuoka [1999] provides an
example of this intensity result that is based on unobservable drift of assets.
(2.3)
h Rs
i
(2.5)
where, under the usual regularity for the Feynman-Kac approach, f ( ) solves
the partial differential equation (PDE)
Af (x, t) ft (x, t) (x)f (x, t) = 0,
(2.6)
X
1 X 2
f (x, t)i (x) +
f (x, t)ij (x),
x
2
x
x
i
i
j
i
i,j
(2.7)
= E e t (u) du Gt ,
(2.8)
consistent with (2.2). Appendix E connects the intensity of with its probability density function and its hazard rate.
Affine Processes
(3.2)
(3.4)
(3.5)
where
1
u(z) = 0 (z) + (z)b + (z)2 c2 1
2
v(z) = 0 (z) + (z)a 0 .
(3.6)
(3.7)
Because (3.5) must hold for all x, it must be the case that u(z) = v(z) =
0. (This is known as separation of variables.) This leaves the Riccati
equations:
1
0 (z) = (z)b + (z)2 c2 1
2
0 (z) = (z)a 0 ,
(3.8)
(3.9)
with the boundary conditions (0) = 0 and (0) = w, from the fact that
f (x, s) = w for all x. The explicit solutions for (z) and (z), developed
by Cox, Ingersoll, and Ross [1985] for purposes of bond pricing (that is, for
w = 0), is repeated in Appendix D, in the context a slightly more general
model with jumps.
The calculation (3.3) arises in many financial applications, some of which
will be reviewed momentarily. An obvious example is discounted expected
cash flow (with discount rate (Xt )), as well as the survival-probability calcuation (2.2) for an affine state process X and a default intensity (Xt ),
taking w = 0 in (3.3).
3.1
The solution X of (3.10) will never reach zero from a strictly positive initial condition
if
x > c2 /2, which is sometimes called the Feller condition.
4
A compound Poisson process has jumps at iid exponential event times, with iid jump
sizes.
10
p
1
dYt = 0 + 1 Vt
(3.14)
dt + Vt dB1t ,
2
which implies that the 2-dimensional process X = (V, Y ) is affine, with state
space D = R+ R. This leads to a simple method for pricing options, as
explained in Section 9. Extensions allowing for jumps have also useful for
the statistical analysis of stock returns from time-series data on underlying
asset returns and of option prices, as in Bates [1996] and Pan [1999].5
Basic to the theory of the market valuation of financial securities are riskneutral probabilities, artificially chosen probabilities under which the price
of any security is the expectation of the discounted cash flow of the security,
as will be made more precise shortly.
5
Among many other papers examining option prices for the case of affine state variables
are Bates [1997], Bakshi, Cao, and Chen [1997], Bakshi and Madan [2000], Duffie, Pan,
and Singleton [2000], and Scott [1997].
11
Risk-neutral probabilities are assigned by an equivalent martingale measure, a probability measure Q equivalent to P, in the sense that P and Q
assign zero probabilities to the same events in Gt , for each t. Harrison and
Kreps [1979] showed that the existence of an equivalent martingale measure
is equivalent (up to technical conditions) to the absence of arbitrage. Delbaen
and Schachermayer [1999] gave definitive technical conditions for this result.
There may be more than one equivalent martingale measure, however, and
for modeling purposes, one would work under one such measure. Common
devices for estimating an equivalent martingale measure include statistical
analysis of historical price data, or the modeling of market equilibrium. When
markets are complete, meaning that any contingent cash flow can be replicated by trading the available securities, the equivalent martingale measure
is unique (in a certain technical sense), and can be deduced from the price
processes of the available securities. For further treatment,see, for example,
Duffie [2001].
We will assume the existence of a short-rate
process, a progressively meaRt
surable process r with the property that 0 |r(u)| du < for all t, and such
that, for any times s and t > s, an investment of one unit of account (say,
one Euro) at any time s, reinvested continually
in short-term lending until
Rt
r(u)
du
. When we say under
any time t after s, will yield a market value of e s
Q, for some equivalent probability measure Q, we mean with respect to the
probability space (, F, Q) and the same given filtration {Gt : t 0}. For
the purpose of market valuation, we fix some equivalent martingale measure
Q, based on discounting at the short rate r. This means that, as of any time
t, a security paying at any bounded stopping time T an amount given by
a bounded GT -measurable
random variable F has a modeled price, on the
RT
Q
r(u) du
t
event {T > t}, of Et [e
F ], where, for convenience, we write EtQ for
expectation under Q, given Gt .
A risk-neutral intensity process for a default time is an intensity process
Q for the default time , under Q. We also call Q the Q-intensity of .
Artzner and Delbaen [1995] gave us the following convenient result.
Proposition. Suppose that a nonexplosive counting process K has a Pintensity process, and that Q is any probability measure equivalent to P. Then
K has a Q-intensity process.
The ratio Q / (for strictly positive) represents a risk premium for
uncertainty associated with the timing of default, in the sense of the following version of Girsanovs Theorem, which provides conditions suitable for
12
{i:T (i)t}
13
h Rs
St = E Q E Q e t r(u) du 1{ >s} F Fs Gt Gt
h
Rs
i
Q
Q
t r(u) du
F E 1{ >s} Fs Gt Gt .
= E e
The result then follows from the implication of double
stochasticity that, on
Rs
Q
(u)
du
.
the event { > t}, we have Q( > s | Fs Gt ) = e t
As a special case, suppose the driving filtration {Ft : t 0} is that
generated by a process X that is affine under Q, with state space D. It is
then natural to allow dependence of Q , r, and F on the state process X in
the sense that
Q
t = (Xt ),
rt = (Xt ),
F = euX(s) ,
(5.3)
R
R
however, this makes no difference, because (Xt ) dt = (Xt ) dt, given
that X(, t) = X(, t) for almost every t.
With (5.2) and (5.3), we can apply the basic property (3.3) of affine
processes, so that for t < , under mild regularity,
St = e(st)+(st)X(t) ,
for coefficients ( ) and ( ) satisfying the integro-differential associated
with (5.2), namely
Ae(st)+(st)x
0 (s t) 0 (s t) x (x) (x) = 0,
(5.4)
(st)+(st)x
e
where A is the generator (under Q) of X, and the boundary condition
e(0)+(0)x = 1. This in turn implies, by the same separation-of-variables
argument used in the simple example of Section 3, an associated generalized Riccati ODE for ( ) and ( ), with boundary condition (0) = 0
and (0) = 0 Rd . Solution of the ODE is explicit in certain cases, and
otherwise a routine numerical computation, say by a Runge-Kutta method.
A sufficient regularity condition for this solution is that X is a regular
affine process and that the short-rate process r is non-negative. (See Duffie,
Filipovic, and Schachermayer [2001] for details.)
The next step is to consider the recovery of some random payoff W at the
default time , if default occurs before the maturity date s of the security. We
adopt the assumptions of Theorem 1, and add the assumption that W = w ,
where w is a bounded predictable process that is also adapted to the driving
filtration {Ft : t 0} of the doubly stochastic assumption of the theorem
statement.
The market value at any time t before default of any default recovery is,
by the definition of the equivalent martingale measure Q,
i
h R
Q
t r(u) du
Jt = E e
(6.1)
1{ s} w Gt .
The assumption that is doubly-stochastic implies that it has a probability density under Q, at any time u in [t, s], conditional on Gt Fs , on the
event that > t, of
Ru Q
q(t, u) = e t (z) dz Q
u.
15
(For details, see Appendix E.) Thus, using the same iterated-expectations
argument of the proof of Theorem 5, we have, on the event that > t,
h R
i
Q
Q
t r(z) dz
Jt = E E e
1{ s} w Fs Gt Gt
Z s R
Q
tu r(z) dz
q(t, u)wu du Gt
= E
e
Z s t
=
(t, u) du,
t
(6.2)
The definition of GT is also given in Dellacherie and Meyer [1978]. Please note
that there is a typographical error in Dellacherie and Meyer [1978], Theorem IV.67(b), in
that the second sentence should read: Conversely, if Y is an FT0 -measurable. . . rather
than Conversely, if Y is an FT0 -measurable. . ., as can be verified from the proof, or, for
example, from their Remark 68(b).
16
RT
= EtQ e t r(u) du W
RT
Q
t r(u) du
= Et e
w(T )
Z s R
Q
tu [r(z)+Q (z)] dz Q
= Et
e
(u)w(u) du ,
(6.5)
(6.6)
(6.7)
and we are back in the setting of the previous Theorem. Intuitively speaking,
w(u) is the risk-neutral expected recovery given the information available in
the filtration {Gt : t 0} up until, but not including, time u, and given that
default will occur in the next instant, at time u.
In the affine state-space setting described at the end of the previous section, (t, u) can be computed by our usual affine methods, provided that
w(t) is of the form ea(t)+b(t)X(t) for deterministic a(t) and b(t). (In applications, a common assumption is that w(t) is deterministic, but the available
data appear to suggest otherwise for loan and bond recovery; see Duffie and
Singleton [2002].) In this case, it is an exercise to show that, under technical
regularity,
(t, u) = e(t,u)+(t,u)X(t) [c(t, u) + C(t, u) X(t)],
(6.8)
for readily computed deterministic coefficients , , c, and C. (For this extended affine calculation, see Duffie, Pan, and Singleton [2000].) This still
leaves
the simple but typically numerical task of computing of the integral
Rs
(t,
u) du, say by quadrature.
t
For the price of a typical defaultable bond promising periodic coupons
followed by its principal at maturity, one may sum the prices of the coupons
and of the principal, treating each of these payments as though it were a
separate zero-coupon bond. An often-used assumption, although one that
need not apply in practice, is that there is no default recovery for coupons,
and that all bonds of the same seniority (priority in default) have the same
recovery of principal, regardless of maturity. In any case, convenient parametric assumptions, based for example on an affine driving process X, lead
to straightforward computation of a term structure of defaultable bond yields
that may be applied in practical situations.
For the case of defaultable bonds with embedded American options, the
most typical cases being callable bonds or convertible bonds, the usual resort
is valuation by some numerical implementation of the associated dynamic
programming problems for optimal exercise timing. Acharya and Carpenter
[1999] and Berndt [2002] treat callable defaultable bonds. On the related
17
problem of convertible bond valuation, see Davis and Lischka [1999], Loshak
[1996], Nyborg [1996], and Tsiveriotis and Fernandes [1998]. On the timing
of call and conversion options on convertible bonds, see Ederington, Caton,
and Campbell [1997].
(7.1)
(7.3)
(7.4)
For empirical work on default-adjusted short rates, see Duffee [1999] for
an application to corporate bonds.
Correlated Default
19
doubly stochastic assumption, for any current time t < t(1), on the event
that i > t for all i (no defaults yet), we have
R t(k)
P (1 t1 , . . . , k tk | Gt ) = Et e t (s) ds ,
(8.1)
where
(t) =
i (t).
(8.2)
Now, in order to place the joint survivorship calculation into a computationally tractable setting, we suppose that the driving filtration {Ft : t 0}
is that generated by an affine process X, and that i (t) = i (Xt ), for an
affine function i ( ) on the state space D of X. The state Xt could include
industry or economy-wide business-cycle variables, or market yield-spread
information, as well as firm-specific data. In this case, (t) = M (Xt , t),
where,
X
i (x).
(8.3)
M (x, t) =
i: t(i) > t
(8.5)
Related calculations are explored in Duffie [1998b]. Applications include portfolio credit risk calculations and collateralized debt obligations (see
Duffie and Garleanu [2001]), credit-linked notes based on the first to default,
loans guaranteed by a defaultable guarantor, and default swaps signed by a
defaultable counterparty.
7
One could also solve in one step with a generalized Riccato equation having timedependent coefficients, but in practice one might prefer to use a time-homogeneous affine
model for which the solution coefficients (i) and (i) are known explicitly, arguing for
the recursive calculation of the joint survival probability.
20
Credit Derivatives
We end the course with for some examples of credit-derivative pricing, beginning with the most basic and popular, the default swap. We then turn
to credit guarantees, spread options on defaultable bonds, irrevocable lines
of credit, and ratings-based step-up bonds. For more examples and analysis
of credit derivatives, see Chen and Sopranzetti [1999], Cooper and Martin [1996], Davis and Mavroidis [1997], Duffie [1998b], Duffie and Singleton
[2002], Longstaff and Schwartz [1995b], and Pierides [1997].
9.1
Default Swaps
The simplest form of credit derivative is a default swap, which pays the buyer
of protection, at the default time of a stipulated loan or bond, the difference
between its face value and its recovery value, provided the default occurs
before a stated expiration date T . The buyer of protection makes periodic
coupon payments of some amount U each, until default or T , whichever is
first. This is, in effect, an insurance contract for the event of default.
The initial pricing problem is to determine the credit default swap rate
(CDS rate) U (which is normally expressed at an annualized rate, so that
semi-annual payments at a rate U = 0.03 per unit of face value means a CDS
rate of 6%). Some discussion and contractual details are provided in Duffie
and Singleton [2002].
Given a short-term interest rate process r and fixing an equivalent martingale measure Q, the total market value of the protection offered, per unit
of face value, is
R min(T, )
r(u) du
(1 W )1{ <T } ,
(9.1)
B = E Q e 0
where W is the recovery per unit of face value.
The market value of the coupon payments made by the buyer of protection
is
A=U
n
X
V (ti )
(9.2)
i=1
where t1 , . . . , tn = T are the coupon dates for the default swap and
Rt
(9.3)
U = Pn
(9.4)
For example, suppose that we are in the setting of Theorem 2, and that
the underlying bond has a risk-neutral default intensity process Q . Then
Rt
Q
0 [r(s)+Q (s)] ds
V (t) = E e
(9.5)
and, based on the same calculations used in Section 6,
Z T
B=
q(t) dt,
(9.6)
where
h Rt Q
i
q(t) = E Q e 0 [ (s)+r(s)] ds Q (t)(1 w(t)) ,
(9.7)
9.2
Credit Guarantees
22
borrower defaults. Our modeled price of the of the guaranteed loan is therefore conservative, that is, lowered by the assumption that the guarantor may
delay paying the obligation until the original maturity date of the loan.
For any given date t, the event that at least one of the borrower and
the guarantor survive until t is At = {B > t} {G > t} that at least one
survives to t. We have
Q(At ) = Q({B > t}) + Q({G > t}) Q({B > t} {G > t}). (9.8)
We will assume, in order to obtain concrete calculations, a doubly stochastic model for B and G , with respective risk-neutral intensities B and
G . From (9.8),
Rt B
Rt G
Q
0 [ (s)+G (s)] ds
E e
,
(9.9)
each term of which can be easily calculated if both intensities, B and G ,
are affine with respect to a state process X that is affine under Q.
From this, depending on the recovery model and the probabilistic relationship between interest rates and default times, one has relatively straightforward pricing of the guaranteed loan. For example, suppose that interest
rates are independent under Q of the default times B and G , and assume
constant recovery
of a fraction w of the face value of the loan. We let
Rt
(t) = E Q (e 0 r(u) du ) denote the price of a default-free zero-coupon bond
of maturity t, of unit face value. Then the market value of the guaranteed
loan, per unit of face value, is
Z T
V = (T )q(T ) w
(t)q 0 (t) dt,
(9.10)
0
where q(t) = Q(At ) is the risk-neutral probability that at least one of the
borrower and the guarantor survive to t, so that q 0 (t) is the risk-neutral
density of the time of default of the guaranteed loan. In an affine setting,
q 0 (t) is easily computed, so the computation of (9.10) is straightforward.
Many variations are possible, including a recovery from guarantor default
that differs from that of the original borrower.
23
9.3
Spread Options
The yield of a zero-coupon bond of unit face value, of price V , and of maturity
t is ( log V )/t. The yield spread of one bond relative to another of a lower
yield is simply the difference in their yields. The prices of defaultable bonds
are often quoted in terms of their yield spreads relative to some benchmark,
such as government bonds. (Swap rates are often used as a benchmark, but
we defer that issue to Duffie and Singleton [2002].)
We will consider the price of an option to put (that is, to sell) at a
given time t a defaultable zero-coupon bond at a given spread s relative to a
benchmark yield Y (t). The remaining maturity of the bond at time t is m.
This option therefore has a payoff at time t of
+
Z = e(Y (t)+s)m e(Y (t)+S(t))m ,
(9.11)
where S(t) is the spread of the defaultable bond at time t. For simplicity,
we will suppose that a contractual provision prevents exercise of the option
in the event that default occurs before the exercise date t. A slightly more
complicated result arises if the option can be exercised even after the default
of the underlying bond. We will also suppose for convenience that the benchmark yield Y (t) is the default-free yield of the same time m to maturity. The
market value of this spread option is therefore
i
h Rt
(9.12)
V = E Q e 0 r(u) du 1{ > t} Z .
We will consider a setting with a constant fractional loss ` of market value
at default, as in Section 7, so that the defaultable bond has a price at time
t on the event that > t, of
R t+m
(9.13)
e(Y (t)+S(t))m = EtQ e t R(u) du ,
where R(u) = r(u) + `Q (u) is the default-adjusted short rate.
In order to simplify the calculations, we will suppose that is doubly
stochastic driven by a state process X that is affine under Q, with a riskneutral intensity Q
t = a0 + a1 Xt . We also suppose that the default free
short-rate process r is affine with respect to X, so that rt = 0 + 1 Xt ,
and the default-free reference yield to maturity m is of thus of the form
Y (t) = 0 + 1 X(t).
The default adjusted short rate is Rt = 0 + 1 Xt , for 0 = 0 + `a0 and
1 = 1 + `a1 , so we have a defaultable yield spread, in the event of survival
24
=
=
=
=
=
=
(0 + 0 )m
(1 + 1 )m
(0 + s)m
1 m
f1 c 1
c 0 f0 .
(9.16)
(9.17)
(9.18)
where 0 = 0 + a0 , 1 = 1 + a1 , and
i
h Rt
Gt,,c,d (y) = E Q e 0 0 +1 X(t) ec0 +c1 X(t) 1{dX(t) y} .
(9.19)
Using the usual approach for option pricing in an affine setting, Gt,,c,d ( ) can
t,,c,d ( ), which is defined
be calculated by inverting its Fourier Transform G
by
Z
Gt,,c,d (z) =
eizy dGt,,c,d (y)
R h R
i
t
= E Q e 0 0 +1 X(t) ec0 +(c1 +izd)X(t) ,
using Fubinis Theorem.
That is, under regularity, we can apply the Levy Inversion Formula,
h
i
izy
Z
Im
G
(z)e
t,,c,d (0) 1
t,,c,d
G
Gt,,c,d (y) =
dz,
2
0
z
25
(t;z)X(0)
t,,c,d (z) = e(t;z)+
G
,
(9.20)
where we have indicated explicitly the dependence on z of the boundary
z).
condition of the generalized Riccati equations for
(t; z) and (t,
In practice, one might chooses a parameterization of X for which
and
are explicit (for example multivariate versions of the basic affine model),
or one may solve the generalized Riccati equations by a numerical method,
such as Runge-Kutta.
9.4
Banks often provide a credit facility under which a borrower is offered the
option to enter into short-term loans for up to a given notional amount N ,
until a given time T , all at a contractually fixed spread. The short-term loans
are of some term m. That is, at any time t among the potential borrowing
dates m, 2m, 3m, . . . , T , the borrower may enter into a loan maturing at time
t+m, of maximum size N , at a fixed spread s over the current reference yield
Y (t) for m-period loans.
Such a credit facility may be viewed as a portfolio of defaultable put
options sold to the borrower on the borrowers own debt. The borrower is
usually charged a fee, however, for the unused portion of the credit facility,
say F per unit of unused notional.
Our objective is to calculate the market value to the borrower of the
credit facility, including the effect of any fees paid. We will ignore the fact
that use of the credit facility may either signal or affect the borrowers credit
quality. This endogeneity is not easily treated directly by the option pricing
method that we will use. Instead, we will assume the same doubly-stochstic
affine model of default given in the previous application to defaultable bond
options. We will also assume that the bank is default free. (Otherwise, access
to the facility would be somewhat less valuable.)
For each unit of notional, the option to use the facility at time t actually
means the option to sell, at a price equal to 1, a bond promising to pay the
bank e(Y (t)+s)m at time t + m. The market value of this obligation at time t,
assuming that the borrower has survived to time t, is therefore
e(Y (t)+S(t))m e(Y (t)+s)m = e(sS(t))m ,
26
+
H(t) = 1 + F e(sS(t))m
h0 +h1 X(t)
Q
0 [0 +1 X(s)] ds
1+F e
1{gX(t) v}
= E e
= (1 + F )Gt,,0,g (v) Gt,,h,g (v).
We can therefore calculate U (t) by the Fourier inversion method of the previous application to defaultable debt options.
The fee F is paid at time t only when the borrower has not defaulted by
t, and only when the borrower is not drawing on the facility at t. We have
already accounted in our calculation of U (t) for the benefit of not paying the
fee when the facility is used, so the total market value of the facilty to the
borrower is
T /m
V (F ) =
U (mi) F e(mi)+(mi)X(0) ,
(9.22)
i=1
Rt
0 [0 +1 X(s)] ds
Q
= e(t)+(t)X(0) .
(9.23)
E e
If the credit facility is priced on a stand-alone zero-profit basis, the associated fee F would be set to solve the equation V (F ) = 0. In practice, the fee
27
has not necessarily set in this fashion. On May 3, 2002, The Financial Times
indicated concern by banks over their pricing policy on credit facilities, and
indicated fees on undrawn lines have recently been ranging from roughly 9
basis points for A-rated firms to roughly 20 basis points for BBB-rated firms.
Banks are also, apparently, begining to build some protection against the optionality in the exercise of these lines, by using fees (F ) or spreads (s) that
depend on the fraction of the line that is drawn.
9.5
Most publicly traded debt, and much privately issued debt, is assigned a
credit rating, essentially a credit quality score, by one or more of the major
credit rating agencies. For example, the standard letter credit ratings for
Moodys are Aaa, Aa, A, Baa, Ba, B, and D (for default). There are 3 refined
ratings for each letter rating, as in Ba1, Ba2, and Ba3. The term investment
grade means rated Aaa, Aa, A, or Baa. For Moodys, speculative-grade
ratings are those below Baa. For Standard and Poors, whose letter ratings
are AAA, AA, A, BBB, BB, B, C, and D, speculative-grade means below
BBB.
It has become increasingly common for bond issuers to link the size of the
coupon rate on their debt with their credit rating, offering a higher coupon
rate at lower ratings, perhaps in an attempt to appeal to investors based on
some degree of hedging against a decline in credit quality. This embedded
derivative is called a ratings-based step-up. For example, The Financial
Times reported on April 9, 2002, that a notional amount of approximately
120 billion Euros of such ratings-based step-up bonds had been issued by
telecommunications firms, one of which, a 25-billion-Euro Deutsche Telekom
bond, would begin paying at extra 50 basis points (0.5%) in interest per year
with the downgrade by Standard and Poors of Deutsche Telekom debt from
A to BBB+ on April 8, 2002, following8 a similar downgrade by Moodys.
There is a potentially adverse effect of such a step-up feature, however, for
a downgrade brings with it an additional interest expense, which, depending
on the capital structure and cash flow of the issuer, may actually reduce the
total market value of the debt, and even bring on further ratings downgrades,
8
Most ratings-based step-ups occur with a stipulated reduction in rating by any of the
major ratings agencies, but this particular bond required a downgrade by both Moodys
and Standard and Poors before the step-up provision could occur.
28
higher coupon rates, and so on. We will ignore this feedback effect in our
calculation of the pricing of ratings-based step-up bonds.
We will simplify the pricing problem by considering the common case in
which the only ratings changes that cause a change in coupon rate are into
and out of the investment-grade ratings categories. That is, we assume that
the coupon rate is cA whenever the issuers rating is investment grade, and
that the speculative-grade coupon rate is cB > cA .
Our pricing model is the same doubly-stochastic model used in earlier
applications. In order to treat ratings transitions risk, we will assume that
that the risk-neutral default intensity Q
t is higher for speculative grade ratings than for investment grade ratings. This is natural. In practice, however,
the maximum yield spread associated with investment grade fluctuates with
uncertainty over time. In order to maintain tractability, we will suppose the
issuer has an investment-grade rating whenever Q
t t , and is otherwise of
speculative grade, where (t) = 0 + 1 X(t). It would be equivalent for
purposes of tractability, since yield spreads are affine in X(t) in this setting,
to suppose that the maximal level of investment-grade straight-debt yield
spreads at a given maturity is affine with respect to X(t).
We will use, for simplicity, a model with zero recovery of coupons at
default, and recovery of a given risk-neutral expected fraction w of principal
at default.
The coupon paid at coupon date t, in the event of survival to that date,
is
c(t) = cA + (cB cA )1{Q (t) (t)}
= cA + (cB cA )1{hX(t) u} ,
(9.24)
i=1
where (0, t) is the market-value density for the recovery of principal, calculated in an affine setting as in (6.8).
30
Appendices
A
This appendix reviews the most basic classes of structural models of default
risk, which are built on a direct model of survival based on the sufficiency of
assets to meet liabilities.
For this appendix, we let B be a standard Brownian motion in Rd on
a complete probability space (, F, P ), and we fix the standard filtration
{Ft : t 0} of B.
A.1
For the Black-Scholes-Merton model, based on Black and Scholes [1973] and
Merton [1974], we may think of equity and debt as derivatives with respect to
the total market value of the firm, and priced accordingly. In the literature,
considerable attention has been paid to market imperfections and to control
that may be exercised by holders of equity and debt, as well as managers.
With these market imperfections, the theory becomes more complex and less
like a derivative valuation model.
With the classic Black-Scholes-Merton model of corporate debt and equity valuation, one supposes that the firms future cash flows have a total
market value at time t given by At , where A is a geometric Brownian motion,
satisfying
dAt = At dt + At dBt ,
for constants and > 0, and where we have taken d = 1 as the dimension of
the underlying Brownian motion B. One sometimes refers to At as the assets
of the firm. We will suppose for simplicity that the firm produces no cash
flows before a given time T . In order to justify this valuation of the firm,
one could assume there are other securities available for trade that create
the effect of complete markets, namely that, within the technical limitations
of the theory, any future cash flows can be generated as the dividends of
a trading strategy with respect to the available securities. There is then
a unique price at which those cash flows would trade without allowing an
arbitrage.
31
We take it that the original owners of the firm have chosen a capital
structure consisting of pure equity and of debt in the form of a single zerocoupon bond maturing at time T , of face value L. In the event that the
total value AT of the firm at maturity is less than the contractual payment
L due on the debt, the firm defaults, giving its future cash flows, worth AT ,
to debtholders. That is, debtholders receive min(L, AT ) at T . Equityholders
receive the residual max(AT L, 0). We suppose for simplicity that there are
no other distributions (such as dividends) to debt or equity. We will shortly
confirm the natural conjecture that the market value of equity is given by
the Black-Scholes option-pricing formula, treating the firms asset value as
the price of the underlying security.
Bond and equity investors have already paid the original owners of the
firm for their respective securities. The absence of well-behaved arbitrage
implies that at, any time t < T , the total of the market values St of equity
and Yt of debt must be the market value At of the assets. This is one of the
main points made by Modigliani and Miller [1958], in their demonstration of
the irrelevance of capital structure in perfect markets.
Markets are complete given riskless borrowing or lending at a constant
rate r and given access to a self-financing trading strategy whose value process
is A. (See, for example, Duffie [2001], Chapter 6.) This implies that there is
at most one equivalent martingale measure.
Letting BtQ = Bt + t, where = ( r)/, we have
dAt = rAt dt + At dBtQ .
A.2
A first-passage model of default is one for which the default time is the
first time that the market value of the assets of the issuer have reached
a sufficiently low level. Black and Cox [1976] developed the idea of firstpassage-based default timing, but used an exogenous default boundary. We
shift now to a slightly more elaborate setting for the valuation of debt and
equity, and consider the endogenous timing of default, using an approach
formulated by Fisher, Heinkel, and Zechner [1989] and solved and extended
by Leland [1994], and subsequently, by others.
33
Q
r(st)
At = Et
e
s ds .
(A.1)
t
We assume that At is well defined and finite for all t. The martingale representation theorem, which also applies under Q for the Brownian motion B Q ,
then implies that
dAt = (rAt t ) dt + t dBtQ ,
(A.2)
RT
where is an adapted Rd -valued process such that 0 t t dt < for all
T [0, ), and where B Q is the standard Brownian motion in Rd under Q
obtained from B and Girsanovs Theorem.
We suppose that the original owners of the firm chose its capital structure
to consist of a single bond as its debt, and pure equity, defined in detail below.
The bond and equity investors have already paid the original owners for these
securities. Before we consider the effects of market imperfections, the total
of the market values of equity and debt must be the market value A of the
assets, which is a given process, so the design of the capital structure is again
irrelevant from the viewpoint of maximizing the total value received by the
original owners of the firm.
For simplicity, we suppose that the bond promises to pay coupons at a
constant total rate c, continually in time, until default. This sort of bond
is sometimes called a consol. Equityholders receive the residual cash flow in
the form of dividends at the rate t c at time t, until default. At default,
the firms future cash flows are assigned to debtholders.
The equityholders dividend rate, t c, may have negative outcomes. It
is commonly stipulated, however, that equity claimants have limited liability, meaning that they should not experience negative cash flows. One can
arrange for limited liability by dilution of equity. That is, so long as the
market value of equity remains strictly positive, newly issued equity can be
sold into the market so as to continually finance the negative portion (ct )+
of the residual cash flow. (Alternatively, the firm could issue debt, or other
34
forms of securities, to finance itself.) When the price of equity reaches zero,
and the financing of the firm through equity dilution is no longer possible,
the firm is in any case in default, as we shall see. While dilution increases
the quantity of shares outstanding, it does not alter the total market value
of all shares, and so is a relatively simple modeling device. Moreover, dilution is irrelevant to individual shareholders, who would in any case be in a
position to avoid negative cash flows by selling their own shares as necessary
to finance the negative portion of their dividends, with the same effect as
if the firm had diluted their shares for this purpose. We are ignoring here
any frictional costs of equity issuance or trading. This is another aspect of
the Modigliani-Miller theory, that part of it dealing with the irrelevance of
dividend policy.
Equityholders are assumed to have the contractual right to declare default
at any stopping time T , at which time equityholders give up to debtholders
the rights to all future cash flows, a contractual arrangement termed strict
priority, or sometimes absolute priority. We assume that equityholders are
not permitted to delay liquidation after the value A of the firm reaches 0,
so we ignore the possibility that AT < 0. We could also consider the option
of equityholders to change the firms production technology, or to call in the
debt for some price.
The bond contract conveys to debtholders, under a protective covenant,
the right to force liquidation at any stopping time at which the asset value
A is as low or lower than some stipulated level, which we take for now to be
the face value L of the debt. Debtholders would receive A at such a time ;
equityholders would receive nothing.
Assuming that A0 > L, we first consider the total coupon payment rate
c that would be chosen at time 0 in order that the initial market value of
the bond is its face value L. Such a bond is said to be at par, and the
corresponding coupon rate per unit of face value, c/L, is the par yield. If
bondholders rationally enforce their protective covenant, we claim that the
par yield must be the riskless rate r. We also claim that, until default,
the bond paying coupons at the total rate c = rL is always priced at its
face value L, and that equity is always priced at the residual value, A L.
Finally, equityholders have no strict preference to declare default on a parcoupon bond before (L) = inf{t : At L}, which is the first time allowed
for in the protective covenant, and bondholders rationally force liquidation
at (L).
If the total coupon rate c is strictly less than the par rate rL, then eq35
uityholders never gain by exercising the right to declare default (or, if they
have it, the right to call the debt at its face value) at any stopping time T
with AT L, because the market value at time T of the future cash flows
to the bond is strictly less than L if liquidation occurs at a stopping time
U > T with AU L. Avoiding liquidation at T would therefore leave a
market value for equity that is strictly greater than AT L. With c < rL,
bondholders would liquidate at the first time (L) allowed for in their protective covenant, for by doing so they receive L at (L) for a bond that, if
left alive, would be worth less than L. In summary, with c < rL, the bond
is liquidated at (L), and trades at a discount price at any time t before
liquidation, given by
"Z
#
(L)
Yt = EtQ
er(st) c ds + er( (L)t) L
(A.3)
t
r( (L)t)
c
c
Q
< L.
=
+ Et e
L
r
r
A.3
(A.4)
m+
m2 + 2r 2
,
2
(A.6)
36
and where m = 2 /2. One can verify (A.5) as an exercise, applying Itos
Formula.
Let us consider for simplicity the case in which bondholders have no
protective covenant. Then equityholders declare default at a stopping time
that solves the maximum equity valuation problem
Z T
Q
rt
w(A0 ) sup E
e (t c) dt ,
(A.7)
T T
x > AB ,
(A.8)
where
1
Aw(x) = w0 (x)x + w00 (x) 2 x2 ,
2
(A.9)
x AB .
(A.10)
(A.11)
c
x
x
.
(A.12)
w(x) = x AB
1
AB
r
AB
37
This conjectured value of equity is merely the market value x of the total
future cash flows of the firm, less a deduction equal to the market value of
the debtholders claim to AB at the default time (AB ) using (A.5), less
another deduction equal to the market value of coupon payments to bondholders before default. The market value of those coupon payments is easily
computed as the present value c/r of coupons paid at the rate c from time
0 to time +, less the present value of coupons paid at the rate c from
the default time (AB ) until +, again using (A.5). In order to complete
our conjecture, we apply the smooth-pasting condition w0 (AB ) = 0 to this
functional form (A.12), and by calculation obtain the conjectured default
triggering asset level as
AB = c,
(A.13)
where
=
.
r(1 + )
(A.14)
x c,
and
W (x, c) = x c
x
c
(A.15)
"
#
x
c
1
,
r
c
x c.
(A.16)
38
where
1
AW (x, c) = Wx (x, c)x + Wxx (x, c) 2 x2 ,
2
(A.18)
(A.19)
where
f (x) = AW (x, c) rW (x, c) + (r )x c.
Because Wx is bounded, the last term of (A.19) defines a Q-martingale. For
x c, we have both W (x, c) = 0 and (r )x c 0, so f (x) 0.
For x > c, we have (A.8), and therefore f (x) = 0. The drift of q is
therefore never positive, and for any stopping time T we have q0 E Q (qT ),
or equivalently,
Z T
Q
rs
rT
W (A0 , c) E
e (s c) ds + e W (AT , c) .
(A.20)
0
W (A0 , c) = E Q
ers (s c) ds ,
(A.21)
0
9
We use a version of Itos Formula that can be applied to a real-valued function that is
C and is C 2 except at a point, as, for example, in Karatzas and Shreve [1988], page 219.
1
39
using the boundary condition (A.15) and the fact that f (x) = 0 for x > c.
So, for any stopping time T ,
"Z
#
(c)
W (A0 , c) = E Q
ers (s c) ds
(A.22)
EQ
EQ
0
T
rs
rT
e
Z
0
T
(s c) ds + e
rs
e (s c) ds ,
W (AT , c)
using the nonnegativity of W for the last inequality. This implies the optimality of the stopping time (c) and verification of the proposed solution
W (A0 , c) of (A.7).
This model was further elaborated to treat taxes in ?], coupon debt of
finite maturity in Leland and Toft [1996], endogenous calling of debt and recapitalization in Leland [1998] and Uhrig-Homburg [1998], incomplete observation by bond investors, with default intensity, in Duffie and Lando [2001],
and alternative approaches to default recovery in Anderson and Sundaresan
[1996], Anderson, Pan, and Sundaresan [1995], Fan and Sundaresan [1997],
Mella-Barral [1999], and Mella-Barral and Perraudin [1997].
Longstaff and Schwartz [1995a] developed a similar first-passage defaultable bond pricing model with stochastic default-free interest rates. (See also
Nielsen, Saa-Requejo, and Santa-Clara [1993] and Collin-Dufresne and Goldstein [1999].) Zhou [2000] bases pricing on first passage of a jump-diffusion.
Itos Formula
This appendix states Itos Formula, allowing for jumps, and including some
background properties of semimartingales. A standard source is Protter
[1990].
We first establish some preliminary definitions. We fix a complete probability space (, F, P ) and a filtration {Gt : t 0} satisfying the usual
conditions:
For all t, Gt contains all of the null sets of F.
For all t, Gt = s>t Gs , a property called right-continuity.
40
0<st
Like our next version of Itos Formula, this can be found, for example, in
Protter [1990], page 71.
A semimartingale is a process of the form V + M , where V is a finitevariation process and M is a local martingale.
Lemma 3. Suppose X and Y are semimartingales and at least one of them
is a finite-variation process. Let Z = XY . Then Z is a semimartingale and
dZt = Xt dYt + Yt dXt + Xt Yt .
(B.1)
We now extend the last two lemmas. From this point, B denotes a standard Brownian motion in Rd .
41
Lemma 4. Suppose
X = M + A, where A is a finite-variation process in
Rt
d
R and Mt = 0 u dBu is in Rd , where B is a standard Brownian moin Rd and is an Rdd progressively-measurable adapted process with
Rtion
t
ks k2 ds < almost surely for all t. Suppose f : Rd R is twice contin0
uously differentiable. Then
Z t
Z
1X t 2
f (Xt ) = f (X0 ) +
f (Xs ) dXs +
ij f (Xs ) (s s> )ij ds
2
0+
0
i,j
X
+
[f (Xs ) f (Xs ) f (Xs )Xs ],
0<st
f (x)
,
xi
ij2 f (x) =
2 f (x)
.
xi xj
Lemma 5. Suppose dXt = dAt + t dBt and dYt = dCt + vt dBt , where B is
a standard Brownian motion in Rd , and where A and C are finite-variation
d
processes,
and and
Rt
R t v are progressively measurable processes in R such that
s ds and 0 vs vs ds are finite almost surely for all t. Let Z = XY .
0 s
Then Z is a semimartingale and
dZt = Xt dYt + Yt dXt + Xt Yt + t vt dt.
(B.2)
This appendix, based on Duffie, Filipovic, and Schachermayer [2001], characterizes regular affine processes, a class of time-homogeneous Markov processes
that has arisen from a large and growing range of useful applications in fin
nance. Given a state space of the form D = Rm
+ R for integers m 0
and n 0, the key affine property, to be defined precisely in what follows,
is roughly that the logarithm of the characteristic function of the transition
distribution pt (x, ) of such a process is affine with respect to the initial state
x D. The coefficients defining this affine relationship are the solutions of a
family of ordinary differential equations (ODEs) that are the essence of the
tractability of regular affine processes. We review these ODEs, generalized
Riccati equations, and state the precise set of admissible parameters for
which there exists a unique associated regular affine process.
42
The class of regular affine processes include the entire class of continuousstate branching processes with immigration (CBI) (for example, Kawazu and
Watanabe [1971]), as well as the class of processes of the Ornstein-Uhlenbeck
(OU) type (for example, Sato [1999]). Roughly speaking, the regular affine
n
processes with state space Rm
+ are CBI, and those with state space R are
n
of OU type. For any regular affine process X = (Y, Z) in Rm
+ R , The
first component Y is necessarily a CBI process. Any CBI or OU type
process is infinitely decomposable, as was well known and apparent from
the exponential-affine form of the characteristic function of its transition distribution. A regular (to be defined below) Markov process with state space D
is infinitely decomposable if and only if it is a regular affine process. Regular
affine processes are also semimartingales, a crucial property in most financial
applications because the standard model of the financial gain generated by
trading a security is a stochastic integral with respect to the underlying price
process.
We will restrict our attention to the case of time-homogeneous conservative processes (no killing) throughout. For the case that allows for for
killing, see Duffie, Filipovic, and Schachermayer [2001]. For the case of timeinhomogeneous affine processes, see Filipovic [2001].
The remainder of the appendix is organized as follows. In Section C.2
we provide the definition of a regular affine process X (Definitions C.1 and
C.3) and the main characterization result of affine processes. Three other
equivalent characterizations of regular affine processes are reviewed: (i) in
terms of the generator (Theorem C.5), (ii) in terms of the the semimartingale
characteristics (Theorem C.8), and (iii) in terms of the property of infinite
decomposability (Theorem C.10). Proofs of these results are found in Duffie,
Filipovic, and Schachermayer [2001].
C.1
Basic Notation
For background and notation we refer to Jacod and Shiryaev [1987b] and
Revuz and Yor [1994]. Let k N. For and in Ck , we write h, i :=
1 1 + + k k (notice that this is not the scalar product on Ck ). We let
Semk be the convex cone of symmetric positive semi-definite k k matrices.
If U is an open set or the closure of an open set in Ck , we write U for the
closure, U 0 for the interior, and U = U \ U 0 for the boundary.
We use the following notation for function spaces:
43
C.2
Px a.s.,
(C.1)
x = (y, z) D.
n
Notice that fu Cb (D) if and only if u U := Cm
+ R , and this is why
the parameter set U plays a distinguished role. By dominated convergence,
Pt fu (x) is continuous in u U, for every (t, x) R+ D.
44
Pt fu (x) = e(t,u)+h(t,u),xi
(C.2)
= e
x = (y, z) D.
(C.3)
(C.4)
46
d
X
k,l=1
+
+
2 f (x)
+ hb + x, f (x)i
xk xl
(C.5)
D\{0}
where
Gf0 (x, ) = f (x + ) f (x) hJ f (x), J ()i
Z
t Y (t, u) = RY Y (t, u), e t w , Y (0, u) = v
(C.7)
Z
Z (t, u) = e t w
(C.8)
with
F (u) = ha
u, ui hb, ui
Z
hu,i
e
1 h
uJ , J ()i m(d),
D\{0}
Y
Ri (u) = hi u, ui iY , u
Z
hu,i
e
1 uJ (i) , J (i) () i (d),
D\{0}
47
(C.9)
(C.10)
for i I, and
iY =
Z =
i{1,...,d}
JJ
Rd ,
i I,
Rnn .
(C.11)
(C.12)
xD
48
(x)
Then the filtrations (Ft ) and (Ft ) are right-continuous, and X is still a
Markov process with respect to (Ft ). That is, (C.1) holds for Ft0 replaced by
Ft , for all x D.
By convention, we call X a semimartingale if X is a semimartingale on
(, F, (Ft ), Px ), for every x D. For the definition of the characteristics
of a semimartingale with refer to (Jacod and Shiryaev 1987b, Section II.2).
We emphasize that the characteristics below are associated to the truncation
function , defined in (C.4).
Let X 0 be a D-valued stochastic process defined on some probability space
0
( , F 0 , P0 ). Then P0 X 0 1 denotes the law of X 0 , that is, the image of P0 by
the mapping 0 7 X0 ( 0 ) : (0 , F 0 ) (, F 0 ).
The following is a characterization result for regular affine processes in
the class of semimartingales.
Theorem C.8 Let X be regular affine and (a, , b, , m, ) the related admissible parameters. Then X is a semimartingale and admits the characteristics (B, C, ), where
Z t
b + X
s ds
Bt =
(C.13)
0
!
Z t
m
X
Ct = 2
a+
i Ysi ds
(C.14)
(dt, d) =
i=1
m(d) +
m
X
!
Yti i (d)
dt
(C.15)
i=1
kl = kl + (1 kl )
k () l (d), if l I,
(C.16)
(C.17)
D\{0}
= kl , if l J ,
for 1 k d.
(C.18)
(k)
(C.19)
Theorem C.10 The Markov process (X, (Px )xD ) is regular affine if and
only if (Px )xD is infinitely decomposable.
Without going much into detail, we remark that in (C.5) we can distinguish the three building blocks of any jump-diffusion process, the diffusion
matrix A(x) = a + y1 1 + + ym m , the drift B(x) = b + x, and the Levy
measure (the compensator of the jumps) M (x, d) = m(d) + y1 1 (d) +
+ ym m (d). An informal definition of an affine process could consist of
the requirement that A(x), B(x), and M (x, d) have affine dependence on x.
(See, for example, Duffie, Pan, and Singleton [2000].) The particular kind of
this affine dependence in the present setup is implied by the geometry of the
state space D.
D.1
(D.1)
(u, Xt , t, T ) = E eiuXT |Xt ,
for real u in Rd . Because knowledge of is equivalent to knowledge of the
joint conditional transition distribution function of X, this result is useful
50
E eiuXt+1 (u, Xt , t, t + 1) | Xt = 0,
(D.3)
so any measurable function of Xt is orthogonal to the error (eiuXt+1
(u, Xt , t, t + 1)). Singleton [2001] uses this fact, together with the known
functional form of , to construct generalized method-of-moments estimators of the parameters governing affine processes and, more generally, the
parameters of asset pricing models in which the state process is affine. These
estimators are computationally tractable and, in some cases, achieve the
same asymptotic efficiency as the maximum likelihood estimator. ?] and
?] propose related, characteristic-function based estimators of the stochastic
volatility model of asset returns with volatility a square-root diffusion.
D.2
(u) = Et euX(s) .
(D.4)
We sometimes call ( ) the moment-generating function of X(s), for it has
the convenient property that, if its successive derivatives
0 (0), 00 (0), 000 (0), . . . , (m) (0)
up to some order m are well defined, then they provide us with the respective
moments
(k) (0) = Et [X(s)k ].
51
From (3.3), we know that (u) = e(t)+(t)X(t) , for coefficients (t) and
(t) obtained from the generalized Riccati equation for X. We can calculate
the dependence of (t) and (t) on the boundary condition (s) = u, writing
(t, u) and (t, u) to show this dependence explicitly. Then we have, by the
chain rule for differentiation,
0 (0) = e(t,0)+(t,0)X(0) [u (t, 0) + u (t, 0)X(0)],
where u denotes the partial derivative of with respect to its boundary
condition u. Successively higher-order derivatives can be computed by repeated differentiation. Pan [1999] provides an efficient recursive algorithm
for higher-order moments, even in certain multivariate cases.
For the multivariate case, the transform at u Rd is defined by
D.3
du ,
(D.6)
2
0
u
where Im(c) denotes the imaginary part of any
R complex number c. It is
sufficient, for example, that ( ) is integrable ( |(u)| du < +).
The integral in (D.6) is typically calculated by a numerical method such
as quadrature, which is rapid.
For affine processes, in a few cases, such as the Ornstein-Uhlenbeck
(Gaussian) model and the Feller diffusion (non-central 2 ), the probability
transition distribution is known explicitly.
52
D.4
This section summarizes some results from Duffie and Garleanu [2001] for
the solutions and for the basic affine model X of (3.11).
These generalized Riccati equations reduce in this special case to the form
d(t)
1
= n(t) + p(t)2 + q,
dt
2
(t)
d(t)
= m(t) + `
,
dt
1
(t)
(D.7)
(D.8)
b2 s
`
c2 + d2 e
`(a2 c2 d2 )
log
+
` s,
+
b2 c2 d2
c2 + d2
c2
(s) =
where
d1
a1
b1
n2 2pq
2q
p
n + pu + (n + pu)2 p(pu2 + 2nu + 2q)
= (1 c1 u)
2nu + pu2 + 2q
= (d1 + c1 )u 1
d1 (n + 2qc1 ) + a1 (nc1 + p)
=
a1 c1 d1
c1 =
n +
53
(D.10)
(D.11)
d1
c1
= b1
a2 =
b2
c1
d1
a1
=
.
c1
c2 = 1
d2
t [Tn , Tn+1 ),
(E.1)
Rt
nonnegative predictable process such that, for all t, we have 0 s ds <
almost surely. Then
R t a nonexplosive adapted counting process N has as its
intensity if {Nt 0 s ds : t 0} is a local martingale.
From Bremauds Theorem T12, without an important loss of generality
for our purposes, we can require an intensity to be predictable, as above, and
are both
we can treat an intensity as essentially unique, in that: If and
intensities for N , as defined above, then
Z
s |s ds = 0 a.s.
|s
(E.2)
0
almost
We note that if is strictly positive, then (E.2) implies that =
everywhere.
We can get rid of the annoying localness of the above local-martingale
characterization of intensity under the following technical condition, which
can be verified from Theorems T8 and T9 of Bremaud [1981].
Proposition 1. Suppose N is an adapted countingR process and is a nont
negative predictable process such that, for all t, E( 0 s ds) < . Then the
following are equivalent:
(i) N is nonexplosive and is the intensity of N .
Rt
(ii) {Nt 0 s ds : t 0} is a martingale.
Proposition 2. Suppose
N is a nonexplosive adapted counting process with
Rt
intensity , with 0 s ds < almost surely for all t. Let M be defined by
Rt
Rt
Mt = Nt 0 s ds. Then, for any predictable process H such that 0 |Hs |s ds
is finite almost surely for all t, a local martingale Y is well defined by
Z t
Z t
Z t
Yt =
Hs dMs =
Hs dNs
Hs s ds.
0
If, moreover, E
hR
t
0
i
|Hs |s ds < , then Y is a martingale.
Z t
Tn = inf t Tn1 :
u du = Zn .
(E.3)
Tn1
Now, we can define N by (E.1). (This construction can also be used for Monte
Carlo simulation of the jump times of N .) Finally, for each t, we let Gt be
the -algebra generated by Ft and {Ns : 0 s t}. By this construction,
56
(N (1) , . . . , N (k) ), where N (i) is a nonthe intensity (i) relative to Rthe augt (i)
(i)
(i)
of N . Let Mt = Nt 0 s ds.
martingale representation property for
RT
process such that, for some fixed time horizon T , 0 s s ds is finite almost
surely. A local martingale is then well defined by
Z t
Y
t = exp
(1 s )s ds
T (i) , t T,
(E.4)
0
{i:T (i)t}
= Q(Ns Nt = k | Ht ),
using the definition of Gt , and the fact that Ft Fs .
Finally, we calculate, under some regularity conditions, the density and
hazard rate of a doubly stochastic stopping time with intensity , driven
by some filtration.
Letting p(t) = P ( > t) define the survival function p : [0, ) [0, 1],
the density of the stopping time is (t) = p0 (t), if it exists, and the hazard
function h : [0, ) [0, ) is defined by
h(t) =
(t)
d
= log p(t),
p(t)
dt
Rt
0
h(u) du
R
0
0t (u) du
p (t) = E e
(t) ,
(E.5)
from which (t) and h(t) would be defined as above. Grandell [1976], pages
106107, has shown that (E.5) is correct provided that:
i) There is a constant C such that, for all t, E(2t ) < C.
ii) For any > 0 and almost every time t,
lim P (|(t + ) (t)| ) = 0.
Exercises
The following is a selection of exercises from Duffie [2001].
59
(L)
(3)
Verify (A.5). Hint: Let Mt = ert (At /K) . Show that M is a Q-martingale.
Exercise 3 (Subordinated Debt) Suppose equity and debt investors are informed according to the standard filtration of a standard Brownian motion B. Riskless borrowing
is available at constant rate r > 0. The capital structure of a given firm, Zinc, is made
up of three claims on the assets: equity and two different zero-coupon bonds. The first of
the two bonds to mature is a promise to pay F1 at time T1 > 0, subject to available assets
and prioritization. The second to mature pays F2 at T2 > T1 , subject to available assets.
Assuming there are any assets remaining, Zinc is liquidated at T3 > T2 , with all proceeds
of liquidation going to its equity owners. All bond payments are made out of current
assets. The production technology of Zinc is such that an asset stock of any amount x
at time t is translated into an asset stock of x exp[m(T t) + (BT Bt )] at any T > t,
where m > r and > 0, provided there were no bond payments between t and T . The
initial level of assets is x > 0. No investment in assets is made after time 0. Let At denote
the level of assets at any time t > 0. At the first maturity date T1 , the assets are reduced
so as to pay down as much as possible of the face value F1 of the bond maturing at T1 .
That is, if A(T1 ) F1 , then the asset level at T1 is A(T1 ) = A(T1 ) F1 . These assets
are then reinvested in the same technology until T2 , and the second bond is likewise paid
down, if possible. If the second bond is fully paid down at T2 , then the remaining assets
are reinvested until T3 and then paid out to equity. If, however, there is an insufficient
amount of assets to pay down the first bond at T1 , then the issuer defaults. Priority in
60
default is given to the second bond, that maturing at T2 . That is, if A(T1 ) < F1 , then
Zinc is immediately liquidated at time T1 , with min(F2 , A(T1 )) paid immediately to the
second bond (that maturing at T2 ), and with any remainder [A(T1 ) F2 ]+ going to
the first bond, that maturing at T1 . Thus, the first bond to mature is junior, as it is
subordinated to the second bond to mature. The second bond to mature is senior. Equity
owners get nothing in this scenario. Likewise, in the event at T2 that A(T2 ) < F2 , the
firm is immediately liquidated, with A(T2 ) going to the second bondholder, and nothing
to equity.
A separate firm, Zonk, with an identical technology (that is, with the same coefficients
(m, ) and driven by the same Brownian motion B), is fully equity funded (no bonds),
pays no dividends, receives no new investment, and is liquidated at T3 . The market value
of Zonks equity at any time t before T3 is Zonks level of assets. All securities, equities
and bonds, can be traded, long or short, and there are no transactions costs, taxes, or
other market imperfections. We suppose that there are no arbitrages whose market value
is bounded below, in the usual sense.
(A) Suppose that Zincs equity owners can purchase or sell units of additional assets at
time 0 (only), at a price of 1 unit of account per unit of assets. Any purchase is funded
by issuing additional equity. (This is called dilution.) The purchasers of new equity
pay a fair market value for their shares. For example, suppose the total initial assets of
the firm are increased by 100 units of account. The new initial level of assets, x + 100,
is invested in the given (m, ) technology, and, after paying down the bonds at T1 and
T2 , any residual assets, say Z, at T3 are split, with some fraction (0, 1) going to the
original equity shares, and the remaining fraction 1 going to the new equity shares
purchased at time zero. It must be the case that the market value of the claim to Z(1 )
at time T3 is worth 100 at time 0. In this scenario, the total market value of the bonds
would increase by some amount, and the total market value of the equity, new combined
with old, would increase by some amount. Any proceeds from a sale of assets at time 0
is paid immediately to initial equity owners as a dividend. This decision, to purchase or
sell some amount of assets at time 0, is the only decision ever made by equity owners.
The bond contracts have a protective covenant, however, stipulating that Zinc cannot sell
assets so as to reduce current assets below the total face value F1 + F2 of the two bonds.
(Assets may, however, go below F1 + F2 after time 0 if the production technology has
a sufficiently low outcome. If x < F1 + F2 , assets may be purchased in a nonnegative
amount, even if the total assets after the purchase is less than F1 + F2 .) State, and
rigorously justify, an optimal (equity-market-value maximizing) level of recapitalization.
That is, calculate the amount C of assets to be purchased (or sold, for the case C < 0)
so as to maximize the market value of the shares held by the original equity owners,
as it depends on (x, F1 , T1 , F2 , T2 , T3 , r, m, ), subject to x + C F1 + F2 for the case
x > F 1 + F2 .
(B) Suppose the protective covenant described in part (A) is modified so that the owners
of the senior bonds (those maturing at T2 ) can reset the minimum level to which assets
may be reduced through an equity dividend at time zero (only), from F1 + F2 to a new
minimum level F < F1 + F2 , selected by senior bondholders. Assume that the objective of
senior bondholders in resetting the protective covenant is to maximize the initial market
value of their own bonds. Could there be a situation in which senior bondholders would
61
choose to exercise their option to allow equity owners to reduce assets (by sale and equity
dividend) as far as some level F < F1 + F2 ? In one sentence, only, provide a reason why it
could be, or could not be, optimal for senior bondholders to do so. If it could be, construct
such an example, giving explicit numbers for x, r, m, , (F1 , T1 ), (F2 , T2 ), and T3 . If you
wish, you may take = 0. Provide, for your example, an optimal lower bound F on initial
assets that would be chosen by senior bondholders in the revised protective covenant.
Assume common knowledge of rationality by both bondholders and equity owners. In
particular, bondholders assume that, if they reset the protective covenant, then equity
owners will optimally buy or sell assets subject to the new covenant.
(C) Now, fixing the initial level x of assets, suppose that Zincs equity owners can
substitute the current technology with an alternative technology that has the same meangrowth parameter m, but has a higher volatility parameter
> . The same Brownian
motion B drives asset growth. Would Zincs equity owners choose to do so in all cases?
Please bear in mind that, while the all-equity firm Zonc remains as described above, there
may not be another firm that uses the riskier technology characterized by (m,
). Please
justify your answer theoretically. Please be precise, by giving a proof or counterexample.
Exercise 4 (Floating-Rate Notes) floating-rate notes Fix a probability space and a
filtration. Consider an arbitrage-free market in which, for each t and s > t, there is
available for purchase or sale a unit zero-coupon default-free bond maturing at s with
a bounded price at t of t,s with outcomes in (0, 1). A floating-rate note, with a given
maturity T , a face value of 1, and a spread KT , is defined as a claim to payment of the
face value 1 at the maturity date T , and payment at each integer date i {1, . . . , T }
of the spread KT plus the one-period default-free floating rate Li1 established at the
previous integer date. The one-period default-free floating rate Li1 established at date
i 1 is 1
i1,i 1, the simple interest rate on one-period default-free loans. (In practice,
the calendar length of the interpayment time interval associated with a given note may
vary from case to case. You may assume for concreteness that one unit of time is one
calendar year, and that bounded positions in zero-coupon bonds and floating-rate notes
are permitted. All prices and trading strategies are adapted.)
(A) A par floating-rate note is a floating-rate note whose market value at the issue
date is equal to its face value. Prove, without the benefit of assuming the existence of
a state-price deflator, short-rate process, or equivalent martingale measure, that for any
integer maturity T , the unique arbitrage-free default-free par floating-rate spread KT is
zero. (You may assume that the issue date is 0.)
(B) Now, suppose there is a complete probability space (, F, P ), and a filtration {Gt }
satisfying the usual conditions. There is an adapted nonnegative
R short-rate
process r, and
t
an equivalent martingale measure Q after deflation by exp 0 ru du . A corporation,
Zank, defaults at a stopping time that, under Q, is doubly stochastic with an intensity
process Q .
For simplicity, we suppose that any floating-rate note issued by Zank has no cash flows
at or after default. This is sometimes called a zero-recovery assumption. One can price
a defaultable floating-rate note of maturity T at any given spread KT by pricing each
of the promised payments at times 1 through T , and then adding up the prices. For an
62
explicit model, we now suppose that there is an Rn -valued process X that is affine
under Q, in the sense that, for any
c = (a, a
, b, b) R R Rn Rn ,
and for any times t and s > t, there exist a scalar (t, s, c) and some (t, s, c) in Rn such
that
h Rs
i
Rt
measure Q after deflation by exp 0 rs ds , with drt = a(t) dt + b(t) dBtQ , for continuous
functions a and b of time alone, where B Q is a standard Brownian motion in R2 under Q,
and b is therefore valued in R2 .
(A) For a given maturity T1 , show that the price process U of a zero-coupon bond
maturing at T1 satisfies dUt = rt Ut dt + U (t)v(t) dBtQ , t < T1 , and provide a formula for
63
RT
v(t) in terms of a and b. Hint: Use the fact that 0 1 rt dt is normally distributed under
Q. Alternatively, use the fact that this is an affine term-structure model.
(B)
Consider a firm whose total market value process A satisfies dAt = rt At dt +
At A dBtQ , A0 > 0, where A R2 . There are no dividends, and the drift, rt At , is
therefore dictated by the definition of an equivalent martingale measure. The firm has
issued L bonds maturing at T1 , each promising to pay 1 unit of account at maturity unless
the value AT of the firm is not large enough to cover the debt, in which case the bonds
share the value of the firm on a pro rata basis, meaning a default payment of AT /L to
each bondholder. Letting Z0 denote the price of a bond at time zero, compute explicitly
the default risk premium U0 Z0 , showing it to be of the form of a Black-Scholes put
option price with explicitly stated coefficients. Hint: Take U as a deflator.
Exercise 7 Suppose that is a stopping time with intensity , in the sense of Section
2, and let Nt = 1{ t} . Show that N is a nonexplosive counting process with intensity
{t 1{ t} : t 0}. Hint: Use the local-martingale characterization of intensity.
Exercise 8 Under the conditions of Theorem 6 and the assumption that w = (1
` )(S ), obtain the representation (7.2) of defaultable bond prices based on the defaultadjusted short rate. The following exercise asks for a bit more, taking ` as a primitive
rather than w.
Exercise 9 Suppose that ` is a given predictable process valued in [0, 1], to be treated
as a fractional loss in market value at default. Suppose, in the setting of Section 4, that
the promised payoff F , the short-rate process r, and the intensity Q of the default time
are bounded, and let
Z s
Q
Q
Yt = Et exp
(ru + `u u ) du F , t s.
(E.6)
t
Suppose that Y does not jump at , almost surely. (Conditions for this are analogous to
those of Theorem 4.) Show that there is a unique bounded adapted process S with the
property that S is the price process for a security that pays (1 ` )S at if s and
pays F at s if > s. Show, moreover, that for t < , we have St = Yt , and that for t ,
we have St = 0.
Exercise 10 Certain credit derivatives are based on the first to default, meaning that
they are based on the first min(1 , . . . , n ) of n default (stopping) times 1 , . . . , n . In
the setting of Section 2, suppose that, for each i, the stopping time i has intensity i ,
and that there is no simultaneous default, meaning that for any i and j 6= i, we have
P(i = j ) = 0. Show that min(1 , . . . , n ) has intensity 1 + + n .
Exercise 11 On a given complete probability space (, F, P ), let N be a Poisson process
with constant intensity . Let W be a random variable, independent of N , with outcomes
1 and 0, with respective probabilities p and 1 p. Let K = N W , and let Gt be the
64
completion of the -algebra generated by {(Ks , Ns ) : 0 s t}. Let be the first jump
time of K. Show that K is a nonexplosive counting process, and calculate its intensity
process . Calculate, for an arbitrary given time s, the survival probability P( > s), and
show that the convenient formula (2.2) does not apply. In particular, K is not doubly
stochastic.
References
Acharya, V. and J. Carpenter (1999). Corporate Bonds: Valuation, Hedging, and Optimal Exercise Boundaries. Working Paper, Department of
Finance, Stern School of Business.
Anderson, R., Y. Pan, and S. Sundaresan (1995). Corporate Bond Yield
Spreads and the Term Structure. Working Paper, CORE, Belgium.
65
67
Filipovic, D. (2001). Time Inhomogeneous Affine Processes. Working Paper, Department of Operations Research and Financial Engineering,
Princeton University.
Finger, C. (2000). A Comparison of Stochastic Default Rate Models. Working Paper, The Risk Metrics Group.
Fisher, E., R. Heinkel, and J. Zechner (1989). Dynamic Capital Strucutre
Choice: Theory and Tests. Journal of Finance 44, 1940.
Geske, R. (1977). The Valuation of Corporate Liabilities as Compound
Options. Journal of Financial Economics 7, 6381.
Gibson, R. and S. Sundaresan (1999). A Model of Sovereign Borrowing and
Sovereign Yield Spreads. Working Paper, School of HEC, University of
Lausanne.
Grandell, J. (1976). Doubly Stochastic Poisson Processes. Lecture Notes
in Mathematics, Number 529, New York: Springer-Verlag.
Harrison, M. and D. Kreps (1979). Martingales and Arbitrage in Multiperiod Securities Markets. Journal of Economic Theory 20, 381408.
Heston, S. (1993). A Closed-Form Solution for Options with Stochastic
Volatility with Applications to Bond and Currency Options. Review of
Financial Studies 6, 327344.
Hull, J. and A. White (1992). The Price of Default. Risk 5, 101103.
Hull, J. and A. White (1995). The Impact of Default Risk on the Prices
of Options and Other Derivative Securities. Journal of Banking and
Finance 19, 299322.
Jacod, J. and A. Shiryaev (1987a). Limit Theorems for Stochastic
Processes. New York: Springer-Verlag.
Jacod, J. and A.N. Shiryaev (1987b). Limit Theorems for Stochastic Processes, Volume 288 of Grundlehren der mathematischen Wissenschaften. Berlin-Heidelberg-New York: Springer-Verlag.
Jarrow, R., D. Lando, and S. Turnbull (1997). A Markov Model for the
Term Structure of Credit Risk Spreads. Review of Financial Studies
10, 481523.
Jarrow, R., D. Lando, and F. Yu (1999). Diversification and Default Risk:
An Equivalence Theorem for Martingale and Empirical Default Intensities. Working Paper, Cornell University.
70
Jarrow, R. and S. Turnbull (1995). Pricing Derivatives on Financial Securities Subject to Credit Risk. Journal of Finance 50, 5385.
Jarrow, R. and F. Yu (1999). Counterparty Risk and the Pricing of Defaultable Securities. Working Paper, Cornell University.
Jeanblanc, M. and M. Rutkowski (1999). Modelling of Default Risk: An
Overview. Working Paper, University of Evry Val of Essonne and Technical University of Warsaw.
Karatzas, I. and S. Shreve (1988). Brownian Motion and Stochastic Calculus. New York: Springer-Verlag.
Karr, A. (1991). Point Processes and Their Statistical Inference, 2d ed.
New York: Marcel Dekker, Inc.
Kawazu, K. and S. Watanabe (1971). Branching Processes with Immigration and Related Limit Theorems. Theory Probab. Appl. 16, 3654.
Kijima, M. (1998). Monotonicities in a Markov Chain Model for Valuing
Corporate Bonds Subject to Credit Risk. Mathematical Finance 8,
229247.
Kijima, M. and K. Komoribayashi (1998). A Markov Chain Model for
Valuing Credit Risk Derivatives. Journal of Derivatives 6 (Fall), 97
108.
Kusuoka, S. (1999). A Remark on Default Risk Models. Advances in Mathematical Economics 1, 6982.
Lando, D. (1994). Three Essays on Contingent Claims Pricing. Working
Paper, Ph.D. Dissertation, Statistics Center, Cornell University.
Lando, D. (1998). On Cox Processes and Credit Risky Securities. Review
of Derivatives Research 2, 99120.
Leland, H. (1994). Corporate Debt Value, Bond Covenants, and Optimal
Capital Structure. Journal of Finance 49, 12131252.
Leland, H. (1998). Agency Costs, Risk Management, and Capital Structure. Journal of Finance 53, 12131242.
Leland, H. and K. Toft (1996). Optimal Capital Structure, Endogenous
Bankruptcy, and the Term Structure of Credit Spreads. Journal of
Finance 51, 9871019.
71
73
74