Escolar Documentos
Profissional Documentos
Cultura Documentos
1. Introduction
2. Empirical Natural Increase Model
3. Data information
4. Empirical results: Model estimations and Hypotheses testing
5. Summary and Conclusions
6. References
1. Introduction
In this paper we are interested in creating an empirical model, which is able to explain natural population
increase for 105 countries worldwide and in identifying the various responsible factors and discovering
their relative impact. Besides we want to check, whether it is meaningful to create separate models for rich
and poor countries to get better results and to find out, if the same variables are important in explaining
nat. population increase, or whether there are different significant variables.
We do not use a formulated model based on a theoretical framework but we create a regression model by
ourselves based on considerations after dealing with literature about this problem.
We generate our regression model with Eviews using the ordinary least squares (OLS) procedure which
produces the most efficient estimations.
We can classify the countries after their population growth in 4 groups ( Table 1)
Basic information:
Population change is composed of 3 important variables: births, deaths and migration. If we subtract
deaths from births, we get the natural increase of a population, which generally accounts for the greatest
amount of population growth. Population growth itself arises from the natural increase rate added to the
net migration rate.
1
Pay attention: for some introducing parts of this paper we are dealing with Population growth, the model itself works with Natural growth.
2
http://www.prb.org/Content/NavigationMenu/PRB/Educators/Human_Population/Population_Growth/Population_Growth.htm
3
See Globale Trends 2002. Fakten Analysen Prognosen. Chapter: Weltbevölkerung und Verstädterung
Page 2 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
4
See Globale Trends 2002. Fakten Analysen Prognosen. Chapter: Weltbevölkerung und Verstädterung
5
selected definitions from PRB:
Total Fertility Rate (TFR)
The average number of children a woman would have assuming that current age-specific birth rates remain constant throughout her childbearing years (usually
considered to be ages 15 to 49)
Life Expectancy at Birth (LIFE)
The average number of years a newborn infant can expect to live under current mortality levels.
Percent Urban (URB)
Percentage of the total population living in areas termed “urban” by that country. The population living in towns of 2,000 or more or in national and provincial
capitals is classified “urban.”
Rate of Natural Increase (RNI)
The birth rate minus the death rate, implying the annual rate of population growth without regard for migration. Expressed as a percentage.
Page 3 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
This empirical model focuses on explaining the natural increase rate for 105 countries worldwide by
cross-section analyses with 11 different explanatory variables. 2 variables, these are POP15 and TFR are
direct factors for RNI. All the others can be classified into the indirect factors.
Economic developments are assumed to be captured by the average growth rate of real GDP per capita
between 1975 – 2002 and by the real GDP for the year 2002. Higher economic production leads to an
increase in the GDP and to higher economic development standards.
The level of Education of the population is measured by a special education index of the UNDP, which
‘measures a country's relative achievement in both adult literacy and combined primary, secondary and
tertiary gross enrolment’6, which in fact, delivers very similar results as the alphabet rate.
Population structure flows into this model by the percentage of population younger than 15, which can also
be seen as an indicator for the amount of natural increase in the last 15 years, by the percentage of
population older than 65, by the urban percentage and by the TFR.
Health conditions can be explained by the life expectancy and the amount of people who suffer from HIV.
Women: Furthermore, the UNDP created the Gender Empowerment Measure index to describe the degree
of emancipation of women especially in rich countries (even today it is not calculable for many poor
countries)
All these factors are assumed to be responsible for annual natural increase. But we have to be aware of the
fact, that we used 2002 data for these factors, whereas the dependent variable consists of data from period
2003 on account of failing to find data for 2002. So we have to assume that the explanatory variables for
2002 are not significant different from the ones for 2003. Nearly all of our variables contain values, which
do not change much within one year (e.g. GDP or literacy rate). As a result of missing exact data for the
HIV variable, we have partly estimates. The time period does not correspond exactly to the dependent
variable, and data are changing pretty fast, so we should be careful when we interpret the influence of this
variable on the natural increase.
1. Our first principle hypothesis is that the existence of a high economic standard leads to a smaller
natural increase.
Especially in rural areas of poor countries children do offer a variety of material benefits for their parents.
In rich countries women represent a larger part of the labour force and have more influence on political,
social and economical aspects in rich countries. Therefore they have less time and interest in childcare and
giving birth.
2. Our second principle hypothesis is that better social indicators, like health and education should
be indicators for smaller natural increase.
In particular for women it is enormously important, to have the opportunity to access education.
‘The number of children that a couple will have is determined by many factors, including health, religion,
culture, economic status and the ability to decide on the number of children – many of them are related to
the status of women.’7 This status of women can be measured by the GEM of the UNDP. Countries with
high social security will have smaller families, because people are less in the need of getting many children
for economic benefit. Life expectancy in many poor countries is currently decreasing on account of a high
mortality rate, caused by HIV. These hypotheses correspond to the first one of course, for the reason that
higher living standards generally correlate with higher economic standards.
6
http://www.undplao.org/HD%20measurement.htm
7
http://www.prb.org/Content/NavigationMenu/PRB/Educators/Human_Population/Women/The_Status_of_Women1.htm
Page 4 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
“Education, urbanization and labour force participation […] have a strong correlation with levels of
fertility.”8 But we have to be careful not to make the mistake, confusing correlation with causation.
3. Data Information
Some parts of the data9 used are obtained from UNDP Statistics, a database which offers many social and
economic statistics and from the ‘UNDP Human Development Report 2004’. Another reference is the
PRB 2004 World Population Data Sheet, and the ‘datafinder’ of the PRB, which are both available on the
website of the PRB. For some data, which were missing, MS Encarta Professional Edition 2005 was used.
The data set, which is indexed after the GDP, consists of 105 countries, which are selected by availability
of the data and by the size of the country’s total population.
At first view, the ranges of the different values of the explanatory variables are very large, which is
apparent from Table 2.
We become aware of the idea, that the varieties of these countries may perhaps make the estimation of a
single model more difficult. So it could be useful to compare rich countries with poor ones and to separate
them in later calculations. Nonetheless we have to keep in mind that this could be an inadequate
arrangement too. ( Table 1)
Table 3: Comparison: rich – poor
Poorest 30 Richest 30
LIT in % 63,9 96,9
LIFE in years 52,4 78.1
POP15 in % 41,4 18.1
RNI in % 2,21 0,35
HIV in % 5,21 0,35
TFR in % 4,7 1,7
GDP in $ per capita 1541 26951
8
See 4
9
UNDP Human Development Report 2004 / UNDP Statistics: GDP, GROWTH, LIT, EDU, URB, POP15, POP65, TFR, LIFE, GEM
PRB 2004 World Population Data Sheet / Datafinder of the PRB: RNI, HIV
10
Adult literacy rate (>15 years, 2002 UNDP)
Page 5 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
Austria itself is a typical rich country with a high life expectancy of 78.5 years, a Natural Increase of 0 %,
a TFR of 1.3 % and a GDP of 29220 USD. The population under the age of 15 amounts to 16.2 %.
The most important outliers are the United Arab Emirates, which have a high GDP of 22420 USD, but
only 77% of the inhabitants are able to read and the natural increase and the TFR are unusually high.
Botswana has a RNI of 0.1 %, population growth itself (with migration) amounts to -0.8 %, but has a
LIFE value of only 39.7 years and belongs to the poorest developing countries in the world.
As you can see in Table 4 RNI correlates strongly with POP15, POP65 and TFR, but very low with
GROWTH and moderately with all other variables.
The explanatory variables are also correlated among each other. Relatively high correlations can naturally
be seen between the variables that can be integrated in one umbrella term (cp. Empirical natural growth
model). For example LIFE and HIV show a high negative correlation (r=-0.75) as well as POP15, POP65
and TFR (r= - 0.81, r= 0.87, -0.74). However we have to keep in mind that a high correlation between two
independent variables doesn’t have to be desirable, because it might cause Multicollinearity.
Scatterplots11: EDU, GDP, GEM, HIV, LIFE, POP15, POP65, TFR, URB
4 4 4
3 3 3
2 2 2
RNI
RNI
RNI
1 1 1
0 0 0
-1 -1 -1
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 0 20000 40000 60000 80000 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
4 4 4
3 3 3
2 2 2
RNI
RNI
RNI
1 1 1
0 0 0
-1 -1 -1
0 10 20 30 40 30 40 50 60 70 80 90 10 20 30 40 50 60
11
without Cuba, Maldives, Nigeria
Page 6 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
4 4 2.0
1.6
3 3
1.2
2 2 0.8
RNI
RNI
RNI
0.4
1 1
0.0
0 0
-0.4
-1 -1 -0.8
0 4 8 12 16 20 0 1 2 3 4 5 6 7 8 9 50 60 70 80 90 100
Table 5 shows us the results of Equation 1 for the basic model which takes all the variables for all
countries.
The very low number of included observations in Equation 1, which is even lower than the excluded ones,
is mainly caused by the variable GEM, which is not available for many countries. Besides this, we notice,
that primarily direct and population factors are significant in this model. Adjusted R-squared is extremely
high, but we have to take into account, that this unrestricted model leads to a lost of grades of freedom and
so to a reduction of the efficiency of the estimations. As we saw in Table 3, growth does not correlate with
any of the variables. This is represented in the equation. GROWTH seems not to be of significant
relevance for natural increase. Also URB and GDP are clearly not significant.
Page 7 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
After removing redundant variables (see Table 6), these are URB, EDU, GEM, GDP and GROWTH, we
use Eviews for looking at the residuals from the regression in tabular and graphical form (Actual, Fitted,
Residual Table), in order to find outliers, which appear not explainable by our model and which could
lead to Heteroskedasticity. After removing these countries (Cuba, Maldives, Nigeria) from our data set we
obtain Equation 2.
Homoskedasticity
As can be seen in Table 8 our data unfortunately feature Heteroskedasticity, which is shown by the
probability value of 0.000002. This value is smaller than 0.05, which marks the critical value for
Homoskedasticity. Heteroskedasticity means the variance of the residuals in the linear regression model is
different across the sample. Referring to our sample it could mean that either the poor or the rich countries
differ more strongly in the Rate of Annual Natural Population Increase.
As we use the OLS procedure to estimate the parameters, the estimators and forecasts are still unbiased
and consistent but on the other hand inefficient and no longer BLUE. Tests of hypotheses and
interpretation of the results must be handled with care.
Page 8 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
Table 8
White Heteroskedasticity Test: with cross terms
F-statistic 7.122745 Probability 0.000000
Obs*R-squared 63.25390 Probability 0.000002
2 Jarque-Bera 4.698306
Probability 0.095450
0
-0.75 -0.50 -0.25 0.00 0.25 0.50 0.75
12
With more explanatory variables, but not with more observations. In this case also the values of Akaike and Schwarz are not mmeaningful fir comparison.
Page 9 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
corresponds very little with RNI, and t-statistics verify its unimportance for our model. Surprisingly, EDU
and GDP are also not significant, although for both variables in the scatter plots as well as in the corr.
matrix (-0.67, -0.53) a negative correlation is discernible. It seems that either these correlations are not
plain enough (outlier) or they are not linear and so not valid for our model. The corr. matrix shows for
URB and RNI correlation with a value of only 0.36, and the scatter plot proves that its influence seems to
be of no importance. GEM is not available for many countries and probably for the same reasons as for
EDU and GDP it’s not significant in our model.
Now we desire to check, if separate models for the 30 richest / 35 poorest countries of our data give us a
better insight into some correlations and causalities. It is intelligible, that (alleged) more homogenous
countries can be compared with each other with less external effects, which distortion our results.
After sorting out the non significant explanatory variables for the 30 richest countries, which are HIV,
GDP, GROWTH, LIT, EDU and POP15, the model in Table 10 remains.
13
Listed according to the size of the GDP
Page 10 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
Now we analyse the 35 “poor countries” in our sample separately. After removing the non significant
explanatory variables, which are TFR, URB, POP65, EDU, GROWTH and GDP, the model in Table 12
remains.
because in the scatter plot14 a tight correlation with RNI can be noticed. Also in this case it might be that
POP65 and also TFR are captured by POP15.
In our models, explaining Natural Population Increase, we could not find an influence of the economic
standard on RNI, which is surprising because in literature often a positive correlation between economic
standard or wealth and RNI is assumed.16 So our first hypothesis that the existence of a high economic
standard leads to a smaller natural increase cannot be fully affirmed by our outcomes. Nevertheless the
correlation matrix and its scatter plot indicate a positive correlation, which could imply that a high GDP
leads to a smaller RNI, but for other reasons GDP is not important in our model.
We also couldn’t find an influence of Education on RNI in any of the models, and of GEM in the general
model. So our assumptions that high education leads to a positive effect in our model cannot be affirmed,
either. However we see in the scatter plots, that a high education and a high GEM often correspond with a
smaller natural increase. The same as for GDP applies to GEM and EDU.
Regarding the “health variables” we could find an influence of HIV and LIFE. As expected, HIV
correlates negatively with RNI in the model for “poor countries” and in the general model. The influence
of life expectancy on RNI is in the opposite direction as expected. As life expectancy is often low in
countries with high population growth, and high in countries with little population growth, we thought that
high life expectancy would indicate little population growth.17 This can be due largely to some countries
in East-Europe, which distort our correlation. Our hypothesis regarding health can be affirmed partially,
namely the assumption that HIV influences the population increase in our model negatively.
Our hypothesis concerning population structure could be confirmed for all variables. A high amount of
young people and a high fertility rate lead to a high population increase, whereas a high amount of old
people and urbanisation is correlated with a population decrease.
Our study shows some limitations, which may diminish its explanatory power and could lead to poor
results for future data. It should be taken into consideration, that …
14
only with the data from the 35 poorest countries
15
see Globale Trends 2002. Fakten Analysen Prognosen. Chapter: Weltbevölkerung und Verstädterung
16
http://www.bmz.de/de/themen/armut/hintergrund/index.html#Die%20Ursachen%20von%20Armut
17
http://www.weltpolitik.net/Sachgebiete/Weltwirtschaft%20und%Globalisierung/Grundlagen/Grundlagen/Akteure%20der%20Weltwirtschaft.html
Page 12 of 13
Annual Natural Population Increase – An Empirical Study Eigner, Sagerschnig, Tirnitz SS05
In specifying the model, we implicitly assumed that X causes Y. “Although R^2 measures the goodness of
fit, it cannot be used to identify causality. In other words, the fact that X and Y are highly correlated does
not indicate whether changes in X cause changes in Y or vice versa. “The situation can also go both ways, a
situation known as feedback.” For instance: Inadequacies in social services lead to rapid population
growth. However in most poor countries we see that overpopulation can reduce food production gains
resulting from modernizing.
we haven’t included all countries of the world in our study
we possibly should have used some other explanatory variables, for example the use of contraception18, we
could have used an econometrical model as basis for our estimations (macro/microeconomic19).
it perhaps would have been better to use mean values of the explanatory variables (for instance 1975-2003)
instead of numbers from one year to obtain a more robust model.
we should have weighted variables after population size to avoid distortions caused by smaller countries.
it would have been meaningful to line up the models to 4 groups, as explained in Table 1 for getting
superior results. Because the biggest problem was certainly the heterogeneity of and within the countries.
By the proposed enhanced differentiation robustness and explanatory power of the model should raise.
6. References
Data and Statistics:
or http://cfapp2.undp.org/hdr/statistics/data/rc_select.cfm Zugriff: 04.06.2005
http://hdr.undp.org/statistics/
UNDP - Human Development Report 2004. Statistics.
General Information:
Stiftung Entwicklung und Frieden: Globale Trends 2002. Fakten Analysen Prognosen. Hg. von Hauchler, Messner,
Nuscheler. - Frankfurt a. M.: Fischer Taschenbuch Verlag 2001.
http://www.prb.org/Content/NavigationMenu/PRB/Educators/Human_Population/Human_Population__Fundamentals_of_Growth_and_
Change.htm Erstellungsdatum: November 2000, Zugriff: 04.06.2005
Page 13 of 13