Escolar Documentos
Profissional Documentos
Cultura Documentos
1
Universidade de São Paulo, Núcleo de Pesquisas Epidemiológicas em Nutrição e Saúde, São Paulo-SP, Brasil
2
Universidade do Sul de Santa Catarina, Programa de Pós-Graduação em Ciências da Saúde, Tubarão- SC, Brasil
3
Universidade Federal de Minas Gerais, Departamento Materno-Infantil e de Saúde Pública, Belo Horizonte-MG, Brasil
4
Universidade Federal de Minas Gerais, Departamento de Nutrição, Belo Horizonte-MG, Brasil
Abstract
Objective: to introduce the methodology used to calculate post-stratification weights of the 2012 Surveillance System for
Risk and Protective Factors for Chronic Diseases by Telephone Survey (Vigitel) and to compare the trends of indicators estimated
by cell-by-cell weighting and raking methods. Methods: in this panel of cross-sectional studies, the prevalences of smokers,
overweight, and intake of fruits and vegetables from 2006 to 2012 were estimated using the cell-by-cell weighting and raking
methods. Results: there were no differences in time trends of the indicators estimated by both methods, but the prevalence of
smokers estimated by the raking method was lower than the estimated by cell-by-cell weighting, whilst the prevalence of fruit
and vegetable intake was higher; for overweight, there was no difference between the methods. Conclusion: raking method
presented higher accuracy of the estimates when compared to cell-by-cell weighting method, proving to be most convenient,
although it presents register coverage bias.
Keywords: Health Surveys; Telephone; Bias; Surveillance; Serial Cross-Sectional Studies.
Correspondence:
Regina Tomie Ivata Bernal – Universidade de São Paulo, Núcleo de Pesquisas Epidemiológicas em Nutrição e Saúde, Av. Dr.
Arnaldo, No. 715, São Paulo-SP, Brasil. CEP: 01246-904
E-mail: rbernal@usp.br
weighting method is the easiness to calculate weights, but The estimates of the adult population of each state
its disadvantage is the need of databases to generate the capital, during the inter-census period (t), were
contingency table used to calculate weights.11 obtained from the geometric mean rate of annual growth
The objective of this study was to present the estimated for each one of the 14 population groups (six
methodology used to create post-stratification weights of age groups for the total population and four education
the 2012 Vigitel and to compare the trend of indicators levels for each sex). Moreover, the total estimate of
estimated by the cell-by-cell weighting and raking methods. Vigitel population by age was adjusted using the total
population from IBGE, provided by the Department of
Methods the Brazilian National Health System (Datasus)19 for
each surveyed year. This adjustment resulted from the
In this panel of cross-sectional studies, the prevalences multiplication of the total frequency per age, stratified
of smokers, overweight, and regular intake of fruits and by sex, by the total number of adults aged 18 or over
vegetables for five or more days a week were estimated in the population. The same procedure was used for
using Vigitel data from 2006 to 2012, using cell-by-cell education level, stratified by sex. In both cases, the
weighting and raking methods. total population, estimated from the adults’ sample was
The raking method11,18 was used as an alternative adjusted to the total size of the population.
to calculate post-stratification weighting to increase For each capital, the population estimates were
Vigitel sample. This method uses the annual frequency calculated per age groups (six categories) and per
distribution of the variables ‘age group’, ‘sex’ and education level, stratified by sex (eight categories), in
‘education level’ of the population. Due to the absence the period from 2006 to 2011 – except for 2010. This
of annual projections of the population’s education information was used to calculate Vigitel post-stratification
level, we chose to estimate the education level, sex and weights by raking method.11,18 This method works with
age group using data of the 2000 and 2010 Censuses. one variable at a time, leveling the distribution of the
The procedures used to calculate the projections and total variable in the sampling weights by the sample
to calculate weights using the raking method for each weights and in the population, through itineration
capital are presented below. Initially, we considered procedure. This process is repeated to each one of the
the microdata of the 200018 and 201019 Demographic variables used to calculate the weights, so the sample
Censuses for each state capital (i), in the calculation distribution becomes identical to the population for
of the geometric mean rate (rg(i)) of the annual growth these variables. The new post-stratification weights
of the adult population used for population projections for each capital were calculated using the statistical
during the inter-census period – Expression 1: software SAS (macro function Rakinge.sas21).
Vigitel sample weights, in each capital, are calculated
(√
rg(i)= 10 Population
___________ (i)
2010
( -1 by the ratio between the number of adults and the
Population(i)2000 number of landlines, due to the random sampling, in
For each capital, we calculated the population growth two stages: first, the registers of household telephones
rate, stratified by sex, for each of the six age groups (18 are used so the household samples can be drawn;
to 24; 25 to 34; 35 to 44; 45 to 54; 55 to 64; and 65 or second, one adult resident of the picked household is
over), and for four education levels (no schooling or also randomly selected.
incomplete Primary School; complete Primary School The analyses of Vigitel data require the survey sampling
or incomplete High School; complete High School complex plan (SCP) to be considered. The measure
or incomplete Higher Education; Higher Education used to assess the impact of weights in the prevalence
degree). Due to changes in the questions of 2000 and accuracy is the design effect (deff),9 which expresses
2010 Censuses, IBGE provided the combinations of the the ratio between the variance of estimates calculated
classification variables of education level for each census. with weights and the simple casual sample. A deff value
The use of the data imputation technique was necessary equals 1 indicates that the weights did not change the
for the category ‘Not mentioned’ of the variable ‘education estimate accuracy, whilst deff values over 1 indicate
level’, by adopting the most frequent value within each loss of accuracy due to the use of post-stratification
age group – stratified by sex – for the missing cases. weighting. Deff was calculated for each indicator using
Table 1 – Annual geometric mean rate of proportional variation (%) of population size according to age group,
education level and sex, in the municipality of Belém, Pará, 2000-2010
Sex
Variables Total
Male Female
Age group (in years)
18-24 -0.797 -0.983 -0.895
25-34 1.520 1.525 1.522
35-44 2.083 2.173 2.131
45-54 3.474 3.492 3.483
55-64 4.193 4.143 4.165
≥65 3.942 3.816 3.864
Education level
No schooling/Incomplete Primary School -1.477 -1.952 -1.727
Complete Primary School/Incomplete High School 0.396 0.476 0.438
Complete Primary School/Incomplete High School 5.645 5.319 5.461
Higher Education degree 6.275 7.565 6.980
Total 1.193 1.200 1.197
Figure 1 – Rakinge.sas for the calculation of post-stratification weights in the municipality of Belém, Pará, 2011
44, and more adults aged 45 or over (6). The second makes the sample size equal to 2,000 correspond to
variable is education level stratified by sex (7), with the effective sample size, equivalent to 391 and 787
eight categories: interviews, respectively (Table 2).
i) men with no schooling or incomplete Primary School; When we compare the historical series of adult
ii) men with complete Primary School or incomplete smokers prevalences, for all the state capitals and the
High School; FD, estimated by cell and raking methods, we observe a
iii) men with complete High School or incomplete decreasing trend for both methods; however, in raking
Higher Education; method, this trend was more marked, with prevalences
iv) men with Higher Education degree; lower than those estimated by the cell-by-cell weighting.
v) women with no schooling or incomplete Primary School; For the overweight estimates, we did not observe any
vi) women with complete Primary School or incomplete difference between the methods. The prevalence
High School; estimates of regular intake of fruits and vegetables for
vii) women with complete High School or incomplete five or more days a week presented increasing trend,
Higher Education; and the prevalences estimated by the raking method
viii) women with Higher Education degree. were higher than those in the cell method (Figure 2).
In this stage the total expanded sample of Vigitel (8)
is already equal to the total population, according to Discussion
IBGE (9); however, when the relative distribution of
the sample and education level are compared in Vigitel This is the first study that compared two post-
we observe a difference of -8.8% of men and -5.9% stratification methods: cell-by-cell weighting and raking.
of women, both with no schooling or with incomplete Raking method was first adopted in Vigitel in 2012 and
Primary School. In turn, there are more adults with presents advantages over cell-by-cell weighting, because
Higher Education degree. In the second iteration, the it uses annual population data, which reflect the changes
weighted sample of the variable ‘age group’ is the same occurred in the Brazilian population, whilst cell-by-cell
as the population; however, the difference between the weighting uses the same population data for different
relative distributions of the population and the sample years of research. Although raking method does not
differs from zero. This procedure is repeated until the eliminate the register coverage bias, it produces estimates
relative difference between Vigitel and the population is that express more accurately the trends of risk factor
equal to zero. In this example, to meet the convergence indicators in the population, when comparing with
criterion, seven iterations were necessary until the age cell-by-cell weighting. This new weighting method has
groups (10) and education level stratified by sex (11) already been used to produce new estimates based on
were equal to zero. At the end of the process, a file with Vigitel data, for the period from 2006 to 2011.
post-stratification weighting is generated. There are several methods to calculate post-
The evaluation of weighting impact in the accuracy stratification weighting. Among them, the cell-by-cell
of smokers prevalence, presented by the measure of weighting is widely used due to its simple weight
design effect (deff) showed deff values >1; this means calculation, which uses the cells of the population
there was accuracy loss in the use of post-stratification and sample contingency tables.11 This method has
weighting. Deff values produced by cell-by-cell weighting a limitation regarding the need of available data to
were higher than those produced by raking method, create the contingency table of the variables, since this
for all capitals and years. The higher the deff value is, information is available only every ten years (IBGE
the lower the estimate accuracy will be (Table 2). The Demographic Census). The Brazilian population has
accuracy loss happens due to the change in the sample been going through a demographic and socioeconomic
size, expressed by the following formula: transition,20 and this large period of one decade
ndeff= n/deff constitutes a limitation for this method. In turn,
For example, in 2006, the state capital Palmas-TO raking method uses only the simple frequencies of
presents a deff of 5.12 and 2.54 – which corresponds each variable of population and sample, which enables
to the effective sample size of 391 (2,000/5.12) and the use of population information from different
787 (2,000/2.54), respectively. That is, the weighting external sources, as well as interpolation of the
population variables during the inter-census period, were similar; however, the values of the prevalences
covering the changes in the population occurred in were different comparing the raking and cell-by-cell
the period. The limitation of the raking method is the weighting. We noticed differences in the prevalences
higher complexity of weighting calculation, because it of smokers and fruits and vegetable intake, whereas
needs an algorithm which is available in the statistics overweight prevalence did not show any difference.
package for post-stratification weighting. The differences observed may be explained by the
In the comparison between the estimates of both association between the indicators, regarding having
weighting methods, the trends of the three indicators a landline.21,22
Table 2 – Distribution of the design effect (deff) of weights in the Surveillance System of Risk and Protective
Factors for Chronic Diseases by Telephone Survey (Vigitel) estimated through the cell-by-cell and
ranking metods, in the prevalence of smokers, per state capital and year, in the 26 Brazilian state
capitals and the Federal District, Brazil, 2006-2015
20
15
Smoking prevalence
10
0
2006 2007 2008 2009 2010 2011 2012
Year
60
50
Overweight prevalence
40
30
20
10
0
2006 2007 2008 2009 2010 2011 2012
Year
Raking Cell (2000 Census)
40
35
30
Frequence of fruits and
vegetables intake
25
20
15
10
5
0
2006 2007 2008 2009 2010 2011 2012
Year
Figure 2 – Time series of prevalences of smokers, overweight and regular intake of fruits and vegetables in five
or more days a week, obtained from Vigitel data, according to weighting method, in the 26 Brazilian
state capitals and the Federal District, Brazil, 2006-2015
In the case of indicators positively associated to in the BRFSS.25 The CDC/USA included in its survey
education level, the increase in schooling by the population sample the population that uses only mobile phones
in the inter-census decade, as well as its aging, might (added to the population that uses landline) and
have led to the higher decrease of those indicators, as changed the weighting method. The system started
shown by this study. Other studies point that the smoking using the raking method to calculate post-stratification
prevalence is associated to low schooling.23 The increase weighting, due to the double register (landline and
in the education level of the Brazilians during the inter- mobile phones), the reduction of non response
census period justifies the reduction in the prevalence bias and the improve in the estimates accuracy. The
of smokers, which could be more consistently noticed methodology changing process adopted by BRFSS
by the raking weighting, since this method updates the was recommended by experts, with the release of
population estimates of sex, age and education level. Thus, studies that evaluated different variables used in the
the raking method could reveal smaller prevalences; construction of post-stratification weights, in order to
and a trend of sharper decrease of smoking among the reduce the bias generated by the low response rate.26,27
Brazilian population, confirmed by the National Health Regarding Vigitel, the post-stratification methods
Survey (PNS), conducted by the Ministry of Health in a used aim to reduce possible distortions in the estimates
partnership with IBGE, which pointed to a 20% reduction associated to the method of participants’ selection,
in smoking prevalence in the Brazilian population from through landline, which generates a selection bias.
2008 to 2013.24 Therefore, the raking method could To eliminate this bias, Vigitel should introduce in its
obtain more accurate estimates using Vigitel data. research individuals without landline.
Similarly, in the United States, Battaglia et al.13 evaluated While Vigitel sampling process does not change
the post-stratification weighting of 2003 data provided by we recommend the use o raking method. It enables
the Behavioral Risk Factor Surveillance System (BRFSS), the collection of more accurate prevalence estimates
whose response rate was lower than 50%. The estimates – when comparing with the cell-by-cell weighting –,
obtained by the weighting and raking method were making them closer to the reality presented in the
compared, both using population data from the Current 2010 Census. Thereby, it is possible to support the
Population Survey (CPS) to calculate post-stratification monitoring of risk and protective factors for chronic
weighting, in order to minimize non response bias. The non-communicable diseases, besides subsidizing,
results of the North American study showed that the BRFSS more adequately, public policies for health promotion.
tended to underestimate the indicators of risk factors
of the population due to the low response rate, fewer Authors’ contributions
than 50%, especially among the low income population,
who presented higher prevalence of risk factors. And Malta DC contributed to the study design and article’s
the estimates of prevalences obtained from the raking revision. Bernal RTI contributed to the study design,
method were higher, when compared to those provided conducted the statistical analysis and wrote the article
by the cell-by-cell weighting, given the association between text. Iser BPM and Claro RM contributed to data analysis,
the variables used to calculate weights and risk factors. writing and article revision. All the authors approved
In 2011, the Centers for Disease Control and its final version and declared to be responsible for all
Prevention (USA/CDC) implemented many changes aspects of the study, ensuring its accuracy and integrity.
References
1. Ministério da Saúde (BR). Secretaria de Vigilância em 2. Ministério da Saúde (BR). Secretaria de Vigilância em
Saúde. Secretaria de Gestão Estratégica e Participativa. Saúde. Secretaria de Gestão Estratégica e Participativa.
Vigitel Brasil 2006: vigilância de fatores de risco Vigitel Brasil 2007: vigilância de fatores de risco
e proteção para doenças crônicas por inquérito e proteção para doenças crônicas por inquérito
telefônico [Internet]. Brasília: Ministério da Saúde, telefônico [Internet]. Brasília: Ministério da Saúde,
2007 [citado 2017 ago 23]. 297 p. Disponível em: 2008 [citado 2017 ago 23]. 136 p. Disponível em:
http://bvsms.saude.gov.br/bvs/publicacoes/vigitel_ http://bvsms.saude.gov.br/bvs/publicacoes/vigitel_
brasil_2006.pdf brasil_2007.pdf
3. Ministério da Saúde (BR). Secretaria de Vigilância em 11. Kalton G. Compensating for missing survey data.
Saúde. Secretaria de Gestão Estratégica e Participativa. Michigan: Institute for Social Research, The University
Vigitel Brasil 2008: vigilância de fatores de risco e of Michigan; 1983.
proteção para doenças crônicas por inquérito telefônico
12. Bernal R, Silva NN. Cobertura de linhas
[Internet]. Brasília: Ministério da Saúde, 2009 [citado
telefônicas residenciais e vícios potenciais em
2017 ago 23]. 112 p. Disponível em: http://bvsms.saude.
estudos epidemiológicos. Rev Saúde Pública.
gov.br/bvs/publicacoes/vigitel_brasil_2008.pdf.
2009;43(3):421-6.
4. Ministério da Saúde (BR). Secretaria de Vigilância
13. Battaglia MP, Frankel MR, Link MW. Improving
em Saúde. Secretaria de Gestão Estratégica e
standard poststratification techniques for random-
Participativa. Vigitel Brasil 2009: vigilância de fatores
digit-dialing telephone surveys. Surv Res Methods
de risco e proteção para doenças crônicas por
[Internet]. 2008 [cited 2013 Nov 24];2(1):11-9.
inquérito telefônico [Internet]. Brasília: Ministério
Available from: https://ojs.ub.uni-konstanz.de/srm/
da Saúde, 2010 [citado 2017 ago 23]. 150 p.
article/view/597.
Disponível em: http://www.abeso.org.br/uploads/
downloads/75/553a24c01eb80.pdf. 14. Lepkowsky JM, Tucker NC, Brick JM, Leeuw E,
Japec L, Lavraskas PJ, et al. Advances in telephone
5. Ministério da Saúde (BR). Secretaria de Vigilância em
survey methodology. Hoboken (NJ): John Wiley &
Saúde. Secretaria de Gestão Estratégica e Participativa.
Sons; 2008.
Vigitel Brasil 2010: vigilância de fatores de risco
e proteção para doenças crônicas por inquérito 15. Flores-Cervantes I, Brick JM, Jones ME, Westat
telefônico [Internet]. Brasília: Ministério da Saúde, J, DiSogra C, Yen W. Weighting for nontelephone
2011 [citado 2017 ago 23]. 152 p. Disponível household in the 2001 California Health Interview
em: http://bvsms.saude.gov.br/bvs/publicacoes/ Survey [Internet]. In: Proceedings of the Joint
vigitel_2010.pdf. Statistical Meetings Section on Survey Research
Methods; 2002 Aug 11-15; New York. p.1002-7 [cited
6. Ministério da Saúde (BR). Secretaria de Vigilância em
2013 May 14]. Available from: http://www.amstat.org/
Saúde. Secretaria de Gestão Estratégica e Participativa.
sections/srms/Proceedings/y2002/Files/JSM2002-
Vigitel Brasil 2011: vigilância de fatores de risco
000661.pdf.
e proteção para doenças crônicas por inquérito
telefônico [Internet]. Brasília: Ministério da Saúde, 16. Bernal RTI, Malta DC, Araújo TS, Silva NN. Inquérito
2012 [citado 2017 ago 23]. 132 p. Disponível em: por telefone: pesos de pós-estratificação para corrigir
http://bvsms.saude.gov.br/bvs/publicacoes/vigitel_ vícios de baixa cobertura em Rio Branco, AC. Rev
brasil_2011_fatores_risco_doencas_cronicas.pdf Saúde Pública. 2013 abr;47(2):316-25.
7. Ministério da Saúde (BR). Secretaria de Vigilância em 17. Moura EC, Morais Neto OL, Malta DC, Moura L, Silva
Saúde. Departamento de Análise de Situação de Saúde. NN, Bernal R, et al. Vigilância de fatores de risco para
Plano de ações estratégicas para o enfrentamento das doenças crônicas por inquérito telefônico nas capitais
doenças crônicas não transmissíveis (DCNT) no Brasil dos 26 Estados brasileiros e no Distrito Federal
2011-2022 [Internet]. Brasília: Ministério da Saúde, (2006). Rev Bras Epidemiol. 2008;11 (supl 1):20-37.
2011 [citado 2017 ago 23]. 148 p. Disponível em: 18. Izrael D, Hoaglin DC, Battaglia MP. A SAS macro
http://bvsms.saude.gov.br/bvs/publicacoes/plano_ for balancing a weighted sample [Internet]. In:
acoes_enfrent_dcnt_2011.pdf. Proceedings of the Twenty-Fifth Annual SAS Users
8. Instituto Brasileiro de Geografia e Estatística. Group International Conference, Paper 275, 2000
Microdados da pesquisa de orçamentos familiares [cited 2010 Dec 12]. Available from: http://www2.sas.
[Internet]. 2014 [citado 2013 maio 14]. Disponível com/ proceedings/sugi29/207-29.pdf.
em: http://downloads.ibge.gov.br/downloads_ 19. Ministério da Saúde (BR). Portal da saúde. Datasus.
estatisticas.htm Demográficas e socioeconômicas [Internet]. 2012
9. Kish L. Survey sampling. New York: John Wiley & [citado 2013 dez 12]. Disponível em: http://www2.
Sons; 1965. datasus.gov.br/DATASUS/ index.php?area=0206.
10. Cochran WG. Sampling techniques. 3rd ed. New York: 20. Alves JED, Cavenaghi S. Tendências demográficas, dos
John Wiley & Sons; 1977. domicílios e das famílias no Brasil [Internet]. Aparte:
inclusão social em debate. 2012 [citado 2013 out 25. Center for Disease Control and Prevention. Behavioral
11]. Disponível em: http://www.ie.ufrj.br/aparte/pdfs/ risk factor surveillance system: BRFSS 2011 survey
tendencias_ demograficas_e_de_familia_24ago12.pdf. data and documentation [Internet]. Atlanta: Center
for Disease Control and Prevention; 2011 [cited 2017
21. Bernal RTI. Inquéritos por telefone: inferências
May 17]. Available from: http://www.cdc.gov/brfss/
válidas em regiões com baixa taxa de cobertura
annual_data/ annual_2011.htm
de linhas residenciais [tese]. São Paulo (SP):
Universidade de São Paulo, 2011. 26. Brennan N, Kenney G, Rajan S, Scheuren F, Wang
K, Zuckerman S, et al. 1997 NASF technical papers:
22. Malta DC, Iser BPM, Sá NNB, Yokota RTC, Moura
report no 16: assessing the new federalism an urban
L, Claro RM, et al. Tendências temporais no
institute program to assess changing social policies.
consumo de tabaco nas capitais brasileiras,
Washington: Urban Institute; 2001.
segundo dados do Vigitel, 2006 a 2011. Cad Saúde
Pública [Internet]. 2013 abr [citado 2013 set 27. Link MW et al. Reaching the U.S. cell phone
01];29(4):812-22. Disponível em: http://www. generation: comparison of cell phone survey
scielo.br/scielo.php?script=sci_ arttext&pid=S0102- results with an ongoing landline telephone survey.
311X2013000400018&lng=en &nrm=iso Public Opin Q [Internet]. 2007 [cited 2013 Nov
24];71(5):814-39. Available from: http://www.bebr.
23. Giovino GA, Mirza SA, Samet JM, Gupta PC, Jarvis MJ,
ufl.edu/files/ Reaching%20the%20US%20Cell%20
Bhala N, et al. Tobacco use in 3 billion individuals
Phone%20 Generation.pdf
from 16 countries: an analysis of nationally
representative cross-sectional household surveys.
Lancet. 2012 Aug;380(9842):668-79. Received on 30/11/2016
Approved on 24/04/2017
24. Malta DC, Vieira ML, Szwarcwald CL, Caixeta R, Brito
SMF, Reis AAC. Tendência de fumantes na população
brasileira segundo a pesquisa nacional de amostra de
domicílios 2008 e a pesquisa nacional de saúde 2013.
Rev Bras Epidemiol. 2015 dez;18(suppl 2);45-56.