Escolar Documentos
Profissional Documentos
Cultura Documentos
Published by
Blackwell Publishing, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA.
Young individuals, taking the locational choices made by their altruistic parents as given, decide
whether or not to acquire skills. The use of location-specific word-of-mouth communication in the
transmission of information about (skilled) job opportunities implies that the local social environ-
ment partly determines an individuals expected returns to education. Stratified equilibria, when
they exist, are characterised by low intergenerational social mobility and inefficient use of talent. In
addition, the equilibrium responses to factors that generally encourage education may, in stratified
outcomes, be highly asymmetric across socio-economic groups. Non-stratified equilibria are likely to
be destabilised by measures that encourage education.
* We thank the Editor Marianne Bertrand and two anonymous referees for excellent comments and
suggestions. The usual disclaimer applies.
1
The same phenomenom is reflected in the intergenerational social mobility literature where it has been
noted that, at least for the UK, intergenerational correlation in earnings seems to have increased over time
(Blanden et al., 2004). For US evidence, see Mayer and Lopoo (2005).
2
See also Loury (1981).
[ 782 ]
[APRIL 2007 ] STRATIFICATION, SOCIAL NETWORKS AND MOBILITY 783
role through its long-run effects on childrens development including cognitive skills,
not through short-term liquidity constraints.
Relatedly, there is a growing literature that seeks to identify the effect of family
income on childrens educational attainment. This literature uses a variety of methods
in order to give the results a causal interpretation, including siblings approaches,
(randomised or natural) experiments, and IV methods.3 A frequent finding in this
literature is that, once more sophisticated methods are used in order to obtain a causal
interpretation, the estimated effect of family income on childrens educational
attainments is significantly reduced relative to standard OLS estimates.
In contrast the results from the Moving To Opportunity (MTO) programme indicate
significant improvements in child outcomes (Katz et al., 2001). In these programmes a
random selection of families receive financial support to meet higher housing costs
associated with moving to more affluent areas. What is interesting about these pro-
grammes is that their observed effect should be due purely to neighbourhood change,
hence pinning down the local social environment as a key channel through which
family income affects childrens outcomes.4
One upshot from these findings is that there is a potential usefulness of economic
models that explicitly link parental income to childrens educational outcomes
through the channel of local social environments. By explicitly modelling local social
interactions and equilibrium locational choices such models may well produce quite
different predictions from the predominantly atomistic approach used in the Becker-
Tomes model. This insight has spawned a growing theoretical literature that looks at
the causes and consequences of endogenous segregation when the social environment
(i.e. the neighbourhood) affects the human capital accumulation process. Models in
this literature can be broadly partitioned into three groups:
1. Models that follow the Tiebout tradition by modelling local public finance
of education (Fernandez and Rogerson, 1996; Benabou, 1996; Durlauf, 1996).
2. Models with direct human capital externalities or peer group effects (de
Bartolome, 1990; Benabou, 1993, 1996; Durlauf, 1996).
3. Models where neighbourhoods act as social networks (Streufert, 2000;
Moizeau et al., 2004).
The main methodological difference between the second and third class of models is
that while the last class tries to provide microfoundations for neighbourhood effects
the second class treats the externalities as primitives. The current article falls into the
third group, but stresses a mechanism that has hitherto not been explored in the
context of inequality of educational outcomes, namely the role played by the local
social environment in job search activities.5
3
Blow et al. (2005) provide a survey of methods and findings. See Shea (2000) for an example of an IV
approach, Levy and Duncan (2001) and Duncan et al. (1998) for siblings analyses, and Clark-Kauffman et al.
(2003) for an example of randomised experiments approach. See also Blanden and Gregg (2004) for a
discussion of these methods and applications to UK data.
4
There is also an empirical literature that seeks to determine the role of neighbourhood characteristics
(e.g. average local income) on a variety of childrens outcome variables. Examples include Case and Katz
(1991), Brooks-Gunn et al. (1993), and Ginther et al. (2000).
5
An alternative role of neighbourhoods is in providing information through which youngsters form beliefs
about the likely return to educational investments. Models of this type are provided by Streufert (2000) and
Moizeau et al. (2004).
Empirically it is clear that social contacts play an important role in the job search
process. Montgomery (1991) notes, for instance, that about between 24% and 74% of
workers in the US, depending on industry, report having found the jobs through
friends and relatives.6 From a cross-country point of view, the structure of the flow of
information in labour markets can be expected to differ; public employment agencies,
for example, appear to play a smaller role in the US than in many European countries.7
Legislation also varies across countries; an extreme case is Sweden where employers
are, in order to facilitate job-search, required to notify the National Labour-Market
Board about any vacancies created, suggesting that comparatively few jobs may be
allocated through social networks (Korpi, 2001). The theoretical literature on the role
and implications of social contacts in the job search process has, in contrast, only
recently taken off on a wider scale and now examines a wide range of issues such as
wage structure, mismatch etc. (Montgomery, 1991; Mortensen and Vishwanath, 1994;
Calvo-Armengol and Zenou, 2005; Bentolila et al., 2004).
In this article we present a simple stylised overlapping generations model in which
labour-market institutions in particular, the mode of dissemination of information
about scarce job opportunities constitute a driving force creating endogenous stra-
tification and income correlation across generations. In the model, young individuals
decide whether or not to acquire skills; however, due to imperfections in the labour
market, skilled jobs are effectively being rationed. Information about job opportunities
is spread by word-of-mouth communication, implying that an individual is more likely
to hear about an opening in a skilled job if he has many skilled workers in his local
environment. This generates a lower expected return to skill-acquisition for individuals
from adverse social environments, making them less likely to invest in education. Based
on the income obtained during working life, an individual then chooses a residential
area, which defines the social environment of the individuals offspring. Using this
setup we find that, while there always exists an equilibrium in which stratification does
not occur, there may also exist stable equilibria with asymmetric or stratified
communities.
The model allows us to consider the effects of various factors on the stability of non-
stratified equilibria as well as on the existence of stratified equilibria and on behaviour
starting from such equilibria. We find for example that factors that generally encourage
participation in education (e.g. a higher wage return to education and/or lower costs
of education) can be inherently destabilising, making stratified equilibria more likely to
result. Similarly, if the economy starts from a stratified equilibrium, the externality
generated by the information transmission mechanism can generate perverse equilib-
rium effects: in particular, the model highlights how enhanced fundamental incentives
for human capital investment can lead to highly asymmetric responses and to more
equilibrium inequality in education.
The models in the literature that are closest to our are Benabou (1993, 1996) and
Durlauf (1996). Benabou (1993) focuses on the simultaneous choice of education and
6
Further evidence from the US are provided by Corcoran et al. (1980), Granovetter (1995) and Holzer
(1988), and for the UK by Boheim and Taylor (2001).
7
Indeed, the research to date seems to have generated a widespread pessimism concerning the efficacy of
US public employment services (Holzer, 1988) while the British findings give a more favourable picture of the
public employment service (Gregg and Wadsworth, 1996).
1. The Model
We start by outlining the basics of the model. Time is continuous and the horizon is
infinite. Each individual belongs to a family characterised by overlapping generations.
Formally, an individuals lifetime has three economic phases: a youth-phase of length
T/2 during which education may be acquired, a working-life phase of length T, and a
retirement phase of length T/2. For simplicity we make the overlap short: each indi-
vidual parents a single child only at the very end of her working life.8 This implies that,
half the time the family has only one family member while the other half of the time it
has two. Each family occupies one house at any point in time.
The timing of economic activities over an individuals lifecycle is as follows:
A young individual must decide whether or not to acquire skills. Acquiring
education is, however, costly. Moreover, the individuals vary in aptitude or
talent, making the cost vary in the population. We take the cost to be entirely
financial (representing costs such as tuition fees that vary with ability thanks to
education providers being able to price discriminate by means of e.g. scholar-
ships and grants) but we also assume that there is a perfectly functioning credit
8
The short overlap ensures consistency of our assumptions that an individual makes a locational choice
based on her realised adult earnings and that the choice is without knowning the childs cognitive aptitude.
market on which the individuals can borrow. Hence there are no credit con-
straints. Let h be an individuals idiosyncratic cost of education. We assume that
h is independent across individuals in the population, drawn from a distribution
with support h; h. Let G(h) and g(h) be the CDF and PDF of h respectively; g(h)
is assumed to be positive on the entire support and the distribution is assumed
not to have any atoms.
During her working life an individual works and saves her earnings. There are
two types of jobs in the economy: good jobs and bad jobs. While good jobs can
only be filled by skilled workers, bad jobs can be filled by any worker. Bad jobs
are immediately available to anyone who wants them; good jobs, on the other
hand, are hard to find (see below).
When exiting the labour market the individual makes a decision about location
and consumption. Indeed, consumption only occurs in retirement. Individuals
who acquire education, thus, find themselves in the social environment chosen
by their parents; while it is important that education is acquired locally prior to
any possibility to move conditional on the childs ability, the assumption that
individuals cannot move during their working lives is for tractability.9
Our focus will be on steady states, and in order to ensure stationarity of the
endogenous variables in continuous time we assume that a frequent succession of
cohorts of individuals are born into the economy, each cohort being a continuum of
unit size. Formally, a new cohort is born every T/(2N ) units of time where N is a
(large) natural number. There are then, at any point in time, 4N individuals alive in the
economy. The number of families (which is the same as the number of houses), in
contrast, is 8N/3.10 We can then let the frequency of cohorts grow by letting N ! 1.
There is a single numeraire consumption good in the economy, produced by the
working individuals.
There are two exogenously given intrinsically identical locations of equal size; in
steady state each cohort will be equally split between the two locations. While the two
locations are intrinsically identical they can, in equilibrium, differ in terms of skill-
composition, referred to below as the local social environment. Since a parents
location choice occurs when her offspring is entering the youth period this choice
determines the social environment that the offspring will have during her youth and
working-life. Parents are assumed to be altruistic towards the children and may
therefore be willing to pay for a house in an area with a more favourable social
environment. We assume however that parents make their location choices before they
learn their offsprings cost of education.
Since the focus of the study is on social environments as a channel for shaping the
transmission of economic success across generations we adopt the modelling strategy of
making this the only channel. In particular ability is not transmitted across generations
9
It seems clear that our qualitative results would stand as long as an individuals initial social environment
has an effect on their work-life prospects.
10
The number of individuals alive at any given point in time is the length of an individuals life, which is
2T, divided by the time elapsed between the birth of successive cohorts, which is T/(2N ). The number of
families is smaller due to the fact that half of all family have, at any point in time, two current members.
fraction of workers (i.e. individuals of working age) in location j who are skilled.12 Since
we have assumed that there is a frequent succession of cohorts and since we are
focusing on steady states we treat c1 and c2 as constant over time, both over an indi-
viduals lifetime and across generations.
We assume that information about (a fraction of) the good jobs created is spread
through word-of-mouth communication among skilled workers. Some simplifying
assumptions allow us to determine the implications of the social environment:
Suppose that all families have the same number of immediate neighbours and
that those neighbours are representative of the area.13
Information about good jobs can only be passed on between immediate
neighbours.
There is no relay of information: information can only be passed on once.
Each skilled worker in the economy (whether currently in a good job or not) is
equally likely to hear about any good job that is created.
It is worth stressing at this point that the model also allows for some or all bad jobs to
be allocated through social networks; since there is no waiting time for a bad job, this
has no further consequence for the analysis.
Returning to the idea of a screening cost underlying the job-finding mechanism, it
seems quite reasonable that information reaches skilled workers only through their
school networks, say and that it is possible for the original receiver of the information
to credibly recommend another skilled worker. Montgomery (1991) develops an
adverse-selection model of referrals focusing directly on the hiring decision of firms,
thereby providing a foundation for the importance of social networks quite distinct
from ours. He also synthesises some empirical evidence on the importance of referrals
and a salient observation in that context is that referred workers earn higher wages (in
particular in the short run) and have lower quit rates; this arguably corroborates the
notion that referrals are associated with better matches.
Under the above assumptions the relative local job-finding rates (for skilled jobs)
reflect the relative frequencies of skilled workers in the two areas, c1 and c2.14 Specif-
ically, let aj(t) be the job-finding hazard of a skilled worker of age t in area j. We then
have that the local hazards stand in constant proportion to each other: for all t 0,
a1 t c1
: 1
a2 t c2
The skill frequency cj among the local working adults is thus indeed an appropriate
definition of the local social environment; moreover, what matters for the job-alloca-
tion process is the relative social environments c1/c2.
12
As noted below, this will not generally be the same as the skill composition of the retirees living in the
area. Retirees on the other hand do not transmit information about job openings.
13
Our model of informal dissemination of information about job openings can be viewed as a highly
simplified version of the model in Calvo-Armengol and Zenou (2005).
14
Formally, this rests on the assumption that skilled jobs that are created are cohort-specific and that there
is a frequent succession of cohorts. This implies that the probability that a skilled worker who is first to hear
about a specific vacancy will need it for herself is negligible.
X cj X cj
Fj t F t ; 4
j1;2
2 j1;2
2
since both sides of (4) measure the total number of skilled workers who have found
good jobs after t units of time.
15
Calmfors et al. (2002) cite evidence on the effectiveness of the efforts of the Swedish Employment
Service concluding, in particular, that intensifying the efforts in a way that is targeted at some clients is
effective in the sense that the future labour-market prosects of those clients are improved relative to the
control groups; while there may be some crowding out involved, the notion that employment services are
effective is corroborated.
16
To see this relation
Rt recall that aj Fj0 t=1 Fj t is the hazard rate associated with Fj(); hence,
ln1 Fj t 0 aj sds. Using (2) the result is then immediate.
It is easy to verify that (3) together with (4) imply a unique solution (F1(t), F2(t))
(both in the unit interval) for any value of c1,c2 and F(t) 2 [0, 1]. Equations (2) to (4)
also immediately imply the following stochastic-dominance result:
cj > ck implies Fj t > F t > Fk t for all t 2 0; T : 5
as the expected waiting time for a skilled worker in area j, i.e. the average time that the
skilled worker must spend in a bad job before finding a good job.17 As a corollary of the
stochastic-dominance result (5) we then have that:
cj > ck implies tj cj ; ck < t0 < tk ck ; cj ; 7
where we recall that t0 is the expected waiting time associated with the distribution F().
This elasticity is zero at cj 0 and thus increasing over some range by construction.
Moreover, it is readily seen that the elasticity is everywhere increasing unless p() is too
convex (a sufficient condition being that (p00 p) (p0 )2).20 In economic terms, this is
17
RT
If Fj(t) is differentiable then tj cj ; ck 0 tfj tdt where fj t Fj0 t is the local density of job finding
times.
18
This follows a sizable body of literature on neighbourhood effects and peer-group effects; contributions
in economics that relate to this article include de Bartolome (1990), focusing on externalities in education,
and Benabou (1996) employing more general neighbourhood effects.
19
The complementarities could have a richer timing structure but this simple structure is in line with our
emphasis on global spillovers, and obviously in the interest of tractability.
20
The exact condition, dep/dc [1 c(|p0 |/p p00 /|p0 |)]/|p/p0 | 0, is significantly weaker.
Using that p() is non-increasing, along with (7), (9) and (10), immediately yields:
cj > ck implies hj > hk and zj > zk : 11
y pj 1g
; 12
1g
where pj is the price of a house in area j, and where g measures the individuals
(relative) risk aversion. Let a 2 (0, 1) be the weight placed by an individual on the
The Author(s). Journal compilation Royal Economic Society 2007
792 THE ECONOMIC JOURNAL [APRIL
expected welfare of her child. The overall utility obtained by a parent with end-of-life
wealth y, denoted V(y), then satisfies the following Bellman equation
( )
y pj 1g
V y max aEh V ~
z pj jj ; 13
j1;2 1g
where z~ is the random earnings (net of education costs) of the child, and where the
expectation is taken conditional on the child growing up in area j. Note that the parent
takes into account that the value of the house will be passed on to the child. Decreasing
marginal utility ensures that there is equilibrium sorting: if the areas differ in
attractiveness, then for any g > 0 the parents with above median end-of-working-life
wealth y will outbid the poorer parents for the houses in the better area.21 While any g > 0
ensures sorting, we choose to focus on the limiting case where risk aversion is vanishingly
small, g ! 0, since this greatly simplifies the analysis of prices and welfare.22
From (13) we see that, in the limiting case of risk neutrality, the indirect utility V() is
linear, V (y) y C for some C not depending on y. Using this to replace V() in (13)
we obtain
y C max y pj azj pj C; 14
j1;2
Substituting for the value of the house using (15) the expected utility is simply
zj/(1 a). Moreover, since half of the cohort is born into each area, the total expected
welfare of the cohort is
1 X zj
W : 17
1 a j1;2 2
2. Steady-state Equilibrium
So far we have taken the fractions of skilled workers in the two areas as given and shown
how c1 and c2 determine local job-finding rates and rational education choices. We
have not required that c1 and c2 be consistent with those education choices, however.
When we impose this consistency requirements we obtain a characterisation of a steady-
state equilibrium.
2.1. Characterisation
A worker in community j will only invest in education if he is talented enough so that
h < hj , defined in (9). The fraction of agents in community j acquiring skills will then
be cj G(hj). Rationality and consistency can therefore succinctly be summarised in
the following two equilibrium conditions:
( )
Dw
cj G T tj cj ; ck ; j; k 1; 2; j 6 k: 18
pcj
In words, (18) says that the educational choices of the agents in each area should be
rational given the social environments in both communities that those educational
choices generate. Equation (18) implicitly defines cj as a function of ck 2 [0, 1] which
we can denote C(). We will refer to C() as the reaction function and to a particular value
of that function, cj C(ck), as a within-area equilibrium in area j (conditional on ck). A
steady-state equilibrium (c1, c2) is then simply two mutually consistent within-area equil-
ibria: a pair (c1, c2) satisfying c1 C(c2) and c2 C(c1).
Uniqueness of the within-area equilibrium follows from a within-area stability con-
dition. In order to develop this condition define
jk @T tj ck
et ; j; k 1; 2: 19
@ck T tj
jj
et is the elasticity of expected time in a good job for a skilled worker in area j with
jk
respect to the skill rate cj in the own area; when k 6 j, et is the elasticity of expected time
in a good job for a skilled worker in area j with respect to the skill rate ck in the opposite
24
The condition guarantees that the derivative of the right-hand side of (18) does not exceed one, which
corresponds to a local expansion of education not being self-reinforcing, in the abscence of adjustment in the
other community. The condition involves local and global spillover effects, and rules out the local comple-
mentarity being too strong.
25
Multiplicity stemming from local spillovers have been ruled out by the stability condition (20) which
guarantees uniqueness of the within-area equilibrium.
0
0 1
1
The number of steady-state equilibria will always be odd: if (c1, c2) is an asymmetric
equilibrium, its mirror image is also an equilibrium. Hence, we can without loss of
generality focus on the case where c1 c2. Even though there can, in principle, be any
odd number of equilibria we focus on the more plausible cases where
(i) the symmetric equilibrium is the only equilibrium, or
(ii) there is, in addition to the symmetric equilibrium, a single asymmetric equi-
librium, c1 > c2.26
2.3. Stability
Our notion of stability is the standard one based on the reaction function C(), viz. the
one requiring that a myopic adjustment process with the cs being adjusted alternately
converge to the equilibrium. As is well known, an equilibrium (c1, c2) is stable
0 0
according to this definition if and only if the slopes satisfy: |C (c1)||C (c2)| 1. An
expression for the slope of the reaction function can be developed from its definition
(18). Differentiating through this equation with respect to ck and making use of the
elasticity definitions (8) and (19) we obtain that
jk
g hj hj =ck et
C0 ck j jj
: 21
1 g hj hj =cj ep et
We show below (Lemma 2) that the slope is negative at any symmetric equilibrium. The
symmetric equilibrium is hence stable if and only if C0 (c ) 1 and unstable if
C0 (c ) < 1.
26
As noted above, the mirror image is also an equilibrium; we count qualitatively distinct equilibria only
however.
Given our assumption that there is either a single symmetric equilibrium or one
symmetric and one asymmetric equilibrium the following observation can easily be
verified: if the symmetric equilibrium is the unique equilibrium, it is also stable. If, in
addition to the symmetric equilibrium, there is an asymmetric equilibrium, then the
latter is stable while the symmetric equilibrium is unstable.
27
We should stress that the argument is presented here in intuitive terms. In reality, u does not fix the
relative value of the local waiting times but rather the relative size of the log of the survival function, i.e.
ln [1 Fj(t)], for all values of t (as can be seen from (3)). However, since the waiting time is an integral of the
survival function, the result goes through.
The Lemma highlights how the slope of the reaction function at a symmetric equi-
librium depends only on three factors. First, it depends on the fraction v of good jobs
The Author(s). Journal compilation Royal Economic Society 2007
798 THE ECONOMIC JOURNAL [APRIL
28
Note that the uniformity requirement is sufficient but not necessary.
Note that we focus on the case where there are some strictly positive local com-
plementarities since we know that, in their absence, only v and K influence the stability
of the symmetric equilibrium.
We can now proceed to explore how various factors affect the stability of the sym-
metric equilibrium. Since the symmetric equilibrium is stable if and only if
C0 (c ) > 1 we will say that a factor destabilises (stabilises) the symmetric equilibrium if
it decreases (increases) C0 (c ), either directly or indirectly through c . The factors that
we will consider are:
(i) A reduction in education cost, represented by an increase of the (constant)
density g.
(ii) An increase in the skill wage premium, represented by an increase in Dw.
(iii) An increase in the fraction of jobs that are allocated through social networks,
represented by an increase in v.
The first change corresponds to the education cost of each individual being reduced
by the same proportional factor. Since the wage gap reflects the underlying technol-
ogies, an increase in Dw may be interpreted as a skill-biased technological change.
Hence the first two factors both tend to generally encourage educational investments.
We can summarise the unambiguous effects as follows.
Proposition 4 Given the existence of local complementarities in education, b > 0, all factors
(i)(iii) destabilise the symmetric equilibrium.
The Proposition has two parts. The property that (iii) destabilises the symmetric
equilibrium is an important but unsurprising fact that, moreover, can easily be shown
to hold under less restrictive assumptions.29 The property that the symmetric, equal
opportunity shocks (i) and (ii) destabilise the symmetric equilibrium, on the other
hand, is more noteworthy; as we note elsewhere, this shows that measures intended to
promote equal opportunities by promoting education may have the unintended effect
of facilitating stratification. The mechanism by which these shocks destabilise the non-
stratified equilibrium is permeated by the scale effect: whenever the elasticity of the
local complementarity increases with c (as it does under Assumption 2), any expansion
of education including a completely symmetric expansion at a symmetric equilibrium
reduces the marginal cost of a further expansion, and this marginal-cost reduction
undermines the stability of the symmetric equilibrium.
Remark 1 While it would be tempting to conclude from (23) that an increase in the labour-
market distortions as measured by the statistic K will be destabilising in the same way that v is, this
29
An increase in b is also easily shown to destabilise the symmetric equilibrium.
is not generally true. The key difference is that, while changes in v do not affect the scale of the
symmetric equilibrium, an increase in K generally reduces c and can hence be stabilising.
As it turns out, however, this is the only unambiguous response from an asymmetric
equilibrium that can be derived. Importantly, any factor that has an unambiguous direct
30
The ambiguity for area 1 stems from the fact that t1 increases as a result of fewer individuals acquiring
education in the disadvantaged area 2.
5. Efficiency
Is stratification bad for welfare? The interpretation of this question is not as straight-
forward as it may first seem since it is not obvious what is the relevant comparison. In
order to analyse the efficiency of the equilibria in the model it useful to start by
disentangling aggregate welfare W (defined in (17)) by separating production from
education costs.
31
This positive direct effect of these two parameters are shown in the proof of Proposition 5. Note also that
if the generic parameter f has a positive direct effect, then both total effects cannot be negative since this
would require that [C0 (c0 1)]1 < (oc1/of)/(oc2/of) < C0 (c2) which contradicts that the equilibrium is stable
(since this implies that C (c1)C0 (c2) 1).
Consider first education costs. Denote the cost of educating a fraction c of a local
communitys cohort (measured per capita) by
Z hc
Cc pchg hdh; 27
h
If there were no local complementarities in education (i.e. p(c) 1 for all c) then the
cost C(c) would be guaranteed to be strictly convex: the cost in this case increases more
than proportionally with the number of students since the marginal student will always
have higher idiosyncratic cost than the infra-marginal students. Local complementa-
rities, p0 (c) < 0, counteract this effect by reducing each individuals effective idiosyn-
cratic cost of education as the number of students increase. However, since our main
focus is not on local complementarities we assume that C(c) is strictly convex
throughout. Hence we assume that the first mechanism dominates.
Note that in this model an individuals location during the youth (and adult) phase is
determined by her parents locational choice, and, moreover, since cognitive abilities
are i.i.d. the distributions of the idiosyncratic education costs h are always identical in
the two areas. Convexity of C () then has straightforward implications for the efficient
allocation of education. In particular, it implies that symmetry is efficient: the cost of
educating a fraction c of the individuals of a given cohort is minimised by educating a
fraction c of the youth in each area.
Consider next aggregate output of a given cohort. Equation (22) shows that the
expected waiting time of a randomly chosen skilled worker in the economy is always t0 ,
i.e. the waiting time associated with the exogenously given job-creation process. It then
follows that the aggregate output of a cohort is linear in the aggregate skill rate
c c1 c2 =2; in particular, aggregate output is TwL DwT t0 c. Hence we can
rewrite our welfare measure (ignoring the constant 1/(1 a)) as
z1 z2 1X
W TwL Dw T t0 c Ccj : 29
2 2 j
We can now start to approach the question of whether stratification is bad for wel-
fare. A first observation is that a stratified outcome is never desirable in the sense that
any asymmetric allocation is welfare dominated by some symmetric allocation. This
follows trivially from convexity of the cost function C(): any asymmetric allocation is
immediately dominated by the symmetric allocation with the same aggregate level of
education.32 It is a rather weak result however since the dominating symmetric
allocation need not be an equilibrium (not even an unstable one).
32
This result actually continues to hold (with weak dominance) even if C() fails to be convex. To see this
note that additive separability of W in c1 and c2 implies that there is always a symmetric efficient allocation: if
bc 2 arg maxc DwT t0 c Cc then the symmetric allocation c1 c2 bc is an efficient allocation.
The result suggests that when there is a stratified equilibrium welfare would have
been higher in the (unstable) symmetric equilibrium. However, it should be noted
that Proposition 6 is only a local result. Why does it not extend to the global case?
The reason is that aggregate education may be higher in the asymmetric equilib-
rium than in the symmetric equilibrium. With positive local spillovers there is
generally too little education in any equilibrium; hence the asymmetric equilibrium
may welfare dominate the symmetric one only if it involves a higher aggregate skill
rate.
6. Mobility
Intergenerational mobility tends to be measured in two different ways in the lit-
erature. The perhaps most common approach is to look at the correlation in out-
comes between generations. A second approach is to use a transition-matrix
characterisation of mobility, e.g. between income deciles. In the current model
successive generations within a family are linked via the locational choices made by
the adults at the end of their working lives. Indeed, this is the only link
between generations since there is, by assumption, no transmission of human cap-
ital and no other transfers. In this Section we outline two measures of intergener-
ational mobility and explore the relation between asymmetric outcomes and
mobility.
Consider a given family. It is easy to see that the family history will be a Markov
process in which the locational choice at time t serves as a state variable. Note e.g. that
the exact earnings, wealth, or education of the parent are not correlated with the
childs outcomes once we condition on the area into which the child is born.33 There is
a large number of families in the economy each characterised by the same Markov
process and there is no correlation across families. Hence, assuming that there is
mobility between areas in equilibrium, the steady state cross-sectional distributions of
locations and wealths in the population will correspond to the limiting distributions
associated with the Markov process.
Intergenerational social mobility is frequently measured in the literature as the
intergenerational correlation in earnings (Solon, 1999). Here we focus instead on end-
of-working-life wealth y for two reasons. First, wealth directly corresponds to utility and
hence the correlation in wealth measures the correlation in well-being. Second, end-
of-working-life wealth is directly related to the geographical mobility since it determines
the locational choice. Hence let yt be the end-of-working-life wealth accumulated by the
adult in generation t of an infinitely lived family. We then define:
Definition 1. The intergenerational wealth correlation is q Corr(yt, yt1).
Perfect mobility in this case is l 1/2 implying that a person growing up in area j is
equally likely to choose area j and k at the end of her life. If l 0 there is no transition
mobility.
The two measures of mobility are clearly related since the locational choices depend
on wealth but they not exactly the same: the correlation q takes into account the whole
wealth distribution while the transition mobility only considers the probability of
switching from above to below the median (or vice versa). In a symmetric equilibrium
there is perfect mobility according to either measure.34 But the converse is also true: if
an equilibrium is not symmetric mobility is not perfect. We prove this for the case of the
intergenerational wealth correlation.
Proposition 7. Let c1, c2 be a steady state equilibrium with c1 > c2. Then q > 0.
A curious feature of the model is that the exact degree of altruism a does not
play any role in determining the set of steady-state equilibria and hence does not
affect the aggregate distributions of neither education nor earnings. Altruism
does however impact on the degree of intergenerational social mobility in an
33
We could, on the other hand, equally well have used end-of-life wealth y as a state variable.
34
Strictly speaking, the transition mobility is not well-defined since the locations are de facto identical
making the locational choice indeterminate. Assuming that, given this indifference between areas, parents
flip a coin when choosing location, perfect transition mobility follows.
7. Discussion
Empirical evidence suggests the increased return to education that has been observed,
not least in the UK and the US, has been associated with a strengthening of the link
between parental income and childrens educational attainments. Determining the
mechanisms underlying the positive education-parental income relationship is of great
importance in order to identify appropriate policies to promote human-capital
investments and equality of opportunities. For example, policies designed to alleviate
financial constraints such as education subsidies and income transfer policies may
have unintended consequences if the link between parental income and education
decision is, in fact, not due to credit constraints. This point has recently been stressed
by several authors; e.g. Cameron and Heckman (1998), after finding empirically that
long-run factors appear to be of primary importance, simulate the effects of a general
increase in family income on college attendance. Their conclusion is that, while
increasing resources available to parents is likely to increase enrolment, the students
that are attracted into college have considerably lower ability than those already opting
to attend. If parental income matters through less direct but longer-run channels such
as e.g. the choice of neighbourhood, then a very different policy approach may be
required in order to promote equality in education; in particular, it may be necessary to
try to conceive of policies that more directly seek to minimising the effects of socio-
economic stratification.
In this article we have developed a simple model of the link between parental income
and childrens educational investments where the link obtains directly through
neighbourhood effects. The model formalises one particular role played by social
environments, namely the role played by social networks in job-finding processes.
While the model is still a version of a parental investment model in the sense that rich
parents can invest in good neighbours in order to improve the lifetime opportunities
of their children, credit constraints do not play a role. The model provides one
example of how, with this type of causal link, intuitions about the effects of policy or
other shifts in the environment may be very different from those obtained in more
atomistic models based on credit constraints. This was highlighted for example in the
finding that a general reduction in the cost of education or an increase in the returns
to skill, could lead to highly asymmetric educational responses across socioeconomic
groups.
The Author(s). Journal compilation Royal Economic Society 2007
806 THE ECONOMIC JOURNAL [APRIL
Appendix A: Proofs
Proof of Lemma 1. The adding up condition (4) and the relation between the local job-finding
CDFs (3) define F1(t) and F2(t). Differentiating through this equation system and using (2) we
obtain that
@F2 t F t F1 t 1 F2 tu tc1 =c2 ln1 F2 t
; A1
@c1 c2 c1 u1 F2 tu1
and
Proof of Lemma 2. Evaluating (A1) and (A2) at symmetry and using cj c implies that Fj(t)
F(t) and u 1 we obtain that
@Fj t @Fj t t1 F t ln1 F t
: A3
@cj @ck c 2c
c
RT
From (6), T tj 0 Fj tdt; differentiating this identity and substituting using (A3) yields
Z T
@T tj @T tj t
1 F t ln1 F tdt > 0: A4
@cj @ck c 2c 0
c
Then use (18) to substitute for Dw/p(cj), evaluate at symmetry, and use Assumption 1 (which
implies that c gh ); this yields
@T tj c
@ck c T t0 tK
C0 ck
A6
@T tj c 1 ep tK;
1 ep
@cj T t0
c
where
Z T
1
K 1 F t ln1 F tdt > 0; A7
2T t0 0
and where we made use of (A4). Finally the denominator is positive by the within-area stability
condition (20).
This is obviously satisfied if F(t) > 1 1/e 0.63 sufficiently early, verifying part (2). To see
part (1) make a change of variables x F(t) (note that this maps [0, T] to [0, 1] monotonically)
and thus dt/dx h(x) 1/f(t) > 0 pointwise:
Z T Z 1
fln1 F t 1gdt ln1 x 1hxdx: A11
0 0
R1
Since 0 ln1 x 1dx 0 the integral is negative if h(x) is increasing,that is if f(t) is
decreasing.
Proof of Proposition 4. Given Assumption 2 ep bc > 0, which we use to0 substitute for ep in
(23). Consider first the effect of scale c ; using the substitution it follows oC (c )/oc < 0; hence
an increase in c is destabilising given the existence of local spillovers. Of the three factors in the
proposition only v has a direct effect (i.e. not via the scale). Differentiating (23) yields the
following direct effect
@C0 c
K1 bcd ; A12
@t
with d 1/(1 bc vK)2 > 0. The direct effect is thus negative. Consider then the effects via
the scale c . Both g and Dw (but not v) impact on the scale c . To see this, note that the symmetric
equilibrium satisfies c pc gDwT t0 . Differentiating this identity gives
@c @c
DwT t0 d > 0; gT t0 d > 0; A13
@g @Dw
where d 1=pc 1 ep > 0 where the inequality follows from within-group stability (20)
jj
(using that gh /c 1 and et > 0 at the symmetric equilibrium).
Proof of Proposition 5. The direct effects of the various parameters in the model can be obtained
by differentiating (18) holding ck constant and making use of Assumption 1 and 2. This yields
@cj @cj @ tj
cj 2 pcj d > 0; gDw d ; A14
@b @t @t
@cj @cj
gT tj d > 0; DwT tj d > 0; A15
@Dw @g
j jj
with d 1=pcj 1 ep et > 0, where the inequality follows from within-group stability
(20). Note that since b, Dw and g have the unambiguous direct effects, their total equilibrium
effect on c1 and c2 will generally be ambiguous (see (26)): only v can have unambiguous effect.
The impact of v on the waiting time tj is via the hazard ratio u defined in (2). Given that c1 > c2
an increase in v increases u. Using the same technique as in the proof of Lemma 1 an increase in
u can easily be shown to increase F1(t) and reduce F2(t) for all t 2 (0, T). From (6) this implies
that @ t1 =@t < 0 while @ t1 =@t > 0. Hence oc1/ov > 0 while oc2/ov < 0. The result then follows
0
from (26) using that C (c1) < 0 (see Section 2).
Proof of Proposition 6. We prove the result for a marginal reduction in 0 education costs
represented by dg > 0. (The other two cases follow the same lines.) Initially C (c ) 1 since
the symmetric equilibrium is just stable. The increase in g increases c (see (A13)) and hence, via
ep , makes the symmetric equilibrium unstable. dg > 0 hence causes the emergence of a stable
asymmetric equilibrium.
We first show that the increase dg causes the same response in the aggregate education rate
along either equilibrium. The asymmetric equilibrium is characterised by c1 C(c2) and c2
C(c1). Differentiating this equation system and evaluating at c1 c2 c yields
dcj
C0 c dck cg c ; j; k 1; 2; j 6 k; A16
dg c dg c
0
where cg(c ) oc/og > 0 (obtained from (18)). Using C (c ) 1 (15) shows that the impact
on the aggregate education rate c is
!
dc 1 dc1 dc2 cg c
: A17
dg c 2 dg c dg c 2
where the inequality follows from C() being strictly convex and from Jensens inequality (noting
that the square function is convex).
Proof of Proposition 7. The random wealth of an individual born into area j is y~j z~j pj where
z~j is her location-specific random earnings (net of education costs). Recall that zj E~ zj and
that c1 > c2 implies z1 > z2 and p1 > p2 (see (11) and (15)). Hence we have that c1 > c2 implies
that y1 > y2 where yj E~ yj . Also let ym and y be the median- and mean wealth in the population
respectively. Then, given that c1 > c2, due to sorting, we have the following locational choice
rule: the adult at t chooses to buy a house in area 1 if and only if yt ym. Consider then the
covariance between yt and yt1; Cov(yt, yt1) E yt yyt1 y where we used Eyt y for
all t. It is convenient to take the expectation conditional on the location chosen by the adult at t.
Suppose first that this location is area 1. By the locational rule this is equivalent to yt ym. Note
that under this condition yt1 is an independent draw from the random variable y~1 . Hence we
have that
Eyt yyt1 yjyt y m y1 yEyt yjyt ym : A19
Proof of Proposition 8. The steady state (c1, c2) induces two local earnings distribution with
CDFs which we can denote Uj(z), j 1, 2. Since end-of-working-life wealth y is earnings plus the
value of the inherited house the local CDFs of wealth can be written as Pr(y < y0|j) Uj(y0 pj).
Since half the population is born into each area, the median wealth ym is characterised by
P m
j 1,2Uj(y pj) 1. There are then two area-specific critical earnings: an individual born into
area j will, at the end of her working life, buy a house in area 1 if and only if her earnings
exceed zjcr ym pj . House prices depend on altruism: from (15) we have pj(a) [a/(1 a)]zj
where z1 > z2 (see (11)). From the characterisation of the median we obtain
U01
ym0 a p10 a U02
ym0 a p10 a 0: A22
It follows thatp10 a m0
> y a > p20 a, whereby z1cr a is decreasing in a and z2cr a is increasing.
The result follows immediately.
Appendix B: A Microfoundation
In this Appendix we present a simple microfoundation for assumption that the creation of good
jobs is spread out over the working life of a cohort. The model, which is a simple efficiency wage
model and draws heavily on Saint-Paul (2002), implies the existence of an aggregate CDF
function F() for the age at which skilled workers find good jobs. In particular, it implies that the
number of good jobs created at any age t is proportional to the total number of skilled workers in
the cohort (and, in particular, does not depend on social composition). The model also implies
that F(t) 1 which says that all skilled workers will have found good jobs by the time they retire.
Consider the following version of an efficiency wage model. A worker who is in a good job can
try to access a stealing technology. She might be caught trying, but if she is successful she can
steal an amount h (at no risk) from the firm at every moment until retirement. We want to find
the job-creation rate at each age t 2 [0, T] that is compatible with a worker in a good job not
misbehaving; under the assumption of free entry by firms, this will be the equilibrium job-
creation rate (along with the competitive wage wH).
It is useful to derive value functions measuring expected discounted future earnings. Let V N(s)
denote the value of being employed in a good job not trying to steal with s T t time left in the
labour market. Trivially, for all s 2 [0, T],
wH
V N s 1 er s ; A23
r
where r is the interest rate. Let VT(s) denote the value of trying to access the stealing technology
with s time left in the labour market. VT() satisfies the asset equation
rV T s wH qV S s V T s pU s V T s V_ T s; A24
We can then show that the number of good jobs created for skilled workers, at any age t, is
proportional to the total number of skilled workers in the cohort and hence does not depend
on their composition in terms of social background. To see this note first that
References
Acemoglu, D. and Pischke, J.S. (2001). Changes in wage structure, family income, and childrens education,
European Economic Review, vol. 45(2) (May), pp. 890904.
Becker, G. and Tomes, N. (1986). Human capital and the rise and fall of families, Journal of Labor Economics,
vol. 4(3) (July), pp. S139.
Bedard, K. (2001). Human capital versus signaling models: university access and high school dropouts,
Journal of Political Economy, vol. 109(4) (August), pp. 74975.
Benabou, R. (1993). Workings of a city: location, education, and production, Quarterly Journal of Economics,
vol. 108(3) (August), pp. 61952.
Benabou, R. (1996). Equity and efficiency in human capital investment: the local connection, Review of
Economic Studies, vol. 63(2) (April), pp. 23764.
Bentolila, S., Michelacci, C. and Suarez, J. (2004). Social networks and occupational choice, mimeo, CEMFI,
Madrid.
Blanden, J., Goodman, A., Gregg, P. and Machin, S. (2004). Changes in intergenerational mobility in Britain,
in (M. Corak, ed.), Generational Income Mobility in North America and Europe. Cambridge: Cambridge
University Press.
Blanden, J. and Gregg, P. (2004). Family income and educational attainment: a review of approaches and
evidence for Britain, mimeo, Centre for the Economics of Education, London.
Blanden, J. and Machin, S. (2004). Educational inequality and the expansion of UK higher education,
Scottish Journal of Political Economy, vol. 51(2) (May), pp. 23049.
Blow, L., Goodman, A., Kaplan, G., Walker, I. and Windmeijer, F. (2005), How important is income in
determining childrens outcomes? A methodology review of econometric approaches, mimeo, Institute
for Fiscal Studies, London.
Boheim, R. and Taylor, M. (2001). Job search methods, intensity and success in Britain in the 1990s, mimeo,
CEPR, London.
Brooks-Gunn, J., Duncan, G.J., Klebanov, P.K. and Sealand, N. (1993). Do neighborhoods influence child
and adolescent development?, American Journal of Sociology, vol. 99(2) (September), pp. 35395.
Calmfors, L., Forslund, A. and Hemstrom, M. (2002). Vad vet vi om den svenska arbetsmarknadspolitikens
sysselsattningseffekter? (What do we know about the employment effects of Swedish labour market
policy?), Report 2002:8. IFAU, The Institute for Labour Market Policy Evaluation, Uppsala.
o-Armengol, A. and Zenou, Y. (2005). Job matching, social network and word-of-mouth communication,
Calv
Journal of Urban Economics, vol. 57(3) (May), pp. 50022.
Cameron, S. and Heckman, J.J. (1998). Life cycle schooling and dynamic selection bias: models and evidence
for five cohorts of American males, Journal of Political Economy, vol. 106(2) (April), pp. 262333.
Cameron, S. and Heckman, J.J. (2001). The dynamics of educational attainment for black, hispanic, and
white males, Journal of Political Economy, vol. 109(3) (June), pp. 45599.