Escolar Documentos
Profissional Documentos
Cultura Documentos
Text edited at the Institute for Transport Studies, University of Leeds, Leeds
LS2 9JT
Selection of a sample
In the context of a national statistical agency like Statistics Canada, the following
steps are needed to select a sample and ensure that this sample will fulfill its goals.
Estimation techniques: how the results from the sample will be extended to
the whole population.
Determine what the survey population will be (e.g., students, men aged 20
to 35, newborn babies, etc.).
2.
3.
4.
5.
For instance, suppose that you were surveying a rural town in Quebec to see what
percentage of residents are farmers. If you were provided with an area frame, then
you would be able to locate which roads to visit, but you would still have to find out
the names and addresses of the residents on each road.
When there is no single frame that is appropriate, multiple frames can be used.
Some sampling techniques using both types of frames will be discussed later.
A good frame should be complete and up-to-date; no member of the survey
population should be excluded from the frame or duplicated on the frame
(represented more than once); and no unit that is not part of the population (e.g.,
deceased persons) should be on the frame. The frame chosen will impact the
selected survey population. For instance, if a list of telephone numbers is used to
select a sample of households, then all households without telephones are excluded
from the survey population.
Top of Page
The sampling unit is part of the frame and therefore subject to being
selected.
The respondent unit or reporting unit provides the information needed by the
survey.
For example, in a survey about newborns in Edmonton, the sampling unit might be
a household, the reporting unit one of the parents or a legal guardian, and the unit
of reference the baby.
The sampling units may differ depending on the frame used. This is why the survey
population, survey frame and survey units are defined in conjunction with one
another.
Top of Page
The level of precision needed for the survey estimates will impact the sample size.
However, it is not as easy to determine the sample size as one may think.
Generally, the actual sample size of a survey is a compromise between the level of
precision to be achieved, the survey budget and any other operational constraints,
such as budget and time. In order to achieve a certain level of precision, the
sample size will depend, among other things, on the following factors:
The population size: To a certain extent, the bigger the population, the bigger
the sample needed. But once you reach a certain level, an increase in
population no longer affects the sample size. For instance, the necessary
sample size to achieve a certain level of precision will be about the same for
a population of one million as for a population twice that size.
The sampling and estimation methods: Not all sampling and estimation
methods have the same level of efficiency. You will need a bigger sample if
your method is not the most efficient. But because of operational constraints
and the unavailability of an adequate frame, you cannot always use the most
efficient technique.
Top of Page
Populations
Definition - a complete set of elements (persons or objects) that possess some
common characteristic defined by the sampling criteria established by the
researcher
Examples
Accessible population
Examples
Samples
Terminology used to describe samples and sampling methods
Sampling frame = a list of all the elements in the population from which
the sample is drawn
Examples
Simple random
Proportional
15% seniors
25% juniors
25% sophomores
35% freshmen
Disproportional
Example
Class
Population
Sample
Seniors
15%
25%
Juniors
25%
25%
Sophomores
25%
25%
Freshmen
35%
25%
Procedure
Systematic
Example
Characteristics
Not every element of the population has the opportunity for selection in
the sample
No sampling frame
Non-random selection
Restricts generalization
Quota
E.g. 50 males & 50 females; recruit the first 50 men and first 50
women that meet inclusion criteria
Handpicked subjects
Snowball
Longitudinal
Cross-sectional
Sample Size
Descriptive studies need large samples; e.g. 10 subjects for each item on the
questionnaire or interview guide
As the number of variables studied increases, the sample size also needs to
increase in order to detect significant relationships or differences
A minimum of 30 subjects is needed for use of the central limit theorem (statistics
based on the mean)
Power Analysis
Power analysis = a procedure for estimating either the likelihood of committing a Type II
error or a procedure for estimating sample size requirements
Type I error
Type II error
True
False
Gamma measures how wrong the null hypothesis is; it measures how strong the
effect of the IV is on the DV; and it is used in performing a power analysis
The textbook discusses 4 ways to estimate gamma (population effect size) based
upon:
Testing the difference between 3> means (ANOVA) - eta squared 2 for
small effects 2 = .01; medium effects 2 = .06; large effects 2 = .14
Level of significance criterion = alpha , use .05 for most nursing studies and your
calculations
Power = 1 - (beta); if beta is not known standard power is .80, so use this when
you are determining sample size
Population size effect = gamma or its equivalent, e.g. eta squared 2; use
recommended values for small, medium, or large effect for the statistical test you
plan to use to answer research questions or test hypothesis
Mathematical formulas and computer programs can also be used for calculation of sample
size
Sampling error = The difference between the sample statistic (e.g. sample mean)
and the population parameter (e.g. population mean) that is due to the random
fluctuations in data that occur when the sample is selected
Sampling bias
The difference between sample data and population data that can be
attributed to faulty sampling of the population
Random selection = from all people who meet the inclusion criteria, a sample is
randomly chosen
Random assignment
See Polit & Hungler, pg. 160-162 for random assignment to groups and
group random assignment to tx. using a random numbers table
Return to calendar/assignments
Sampling (statistics)
From Wikipedia, the free encyclopedia
This article is about statistically selecting a random (or "representative") subset of a population. For computer
simulation, see pseudo-random number sampling.
In statistics, quality assurance, and survey methodology, sampling is concerned with the selection of a subset
of individuals from within a statistical population to estimate characteristics of the whole population. Acceptance
sampling is used to determine if a production lot of material meets the governing specifications. Two
advantages of sampling are that the cost is lower and data collection is faster than measuring the entire
population.
Each observation measures one or more properties (such as weight, location, color) of observable bodies
distinguished as independent objects or individuals. In survey sampling, weights can be applied to the data to
adjust for the sample design, particularly stratified sampling (blocking). Results from probability
theory and statistical theory are employed to guide practice. In business and medical research, sampling is
widely used for gathering information about a population.
The sampling process comprises several stages:
Specifying a sampling method for selecting items or events from the frame
Development of Research
Tools and Modeling
Thomas G. Hinton
Savannah River Ecology Laboratory
P O Drawer E, Aiken, SC 29802
(803) 725-7454 office
(803) 725-3309 fax
thinton(at)uga.edu
The most powerful research tool developed by
Dr. Hinton is the Low Dose Rate Irradiation
Facility. The unique facility is ideally suited to the
study of chronic exposures to environmentally
relevant concentrations of pollutants.
The facility was originally designed to use turtles
as a model organism for studying how effects
are manifest along increasing levels of biological
complexity (i.e. from molecules to individuals
and populations) but now we are conducting the
majority of our research with Medaka fish.
Several technique papers emerged from this line
of research, including the development of a
molecular probe (Fig. 6) that facilitates the
quantification of a specific type of chromosome
aberration: reciprocal translocations. The
method, developed in collaboration with Dr. Joel
Bedford and Brant Ulsh of Colorado State
University, was the first application of
Fluorescence In Situ Hybridization (FISH) in an
ecological setting and holds great promise as a
biomarker that can couple molecular effects to
population-level impacts.