Você está na página 1de 6

Available online at http://www.journalcra.

com
INTERNATIONAL JOURNAL
OF CURRENT RESEARCH
International Journal of Current Research
Vol.11, pp.222-227, December, 2010
ISSN: 0975-833X
REVIEW ARTICLE
DETERMINANT SAMPLING SCHEME FOR RATIO AND REGRESSION
METHODS OF ESTIMATION
Subramani, J
Department of Statistics, Pondicherry University, R V Nagar, Kalapet,
Puducherry – 605 014, India

ARTICLE INFO ABSTRACT

Article History: In this paper, determinant sampling scheme for estimation of a finite
Received 12th September, 2010 population mean is extended for the ratio and regression methods of
Received in revised form estimation. The relative performance of determinant sampling along
15th October, 2010 with those of the simple random and systematic sampling are assessed
Accepted 13th November, 2010
Published online 28th December, 2010
for certain natural populations for estimation of the finite population
means through the methods of ratio and regression estimation.
Key words:
Determinant Sampling,
Simple Random sampling,
Systematic Sampling,
Natural population,
Linear trend, Trend free sampling. © Copy Right, IJCR, 2010 Academic Journals. All rights reserved.

INTRODUCTION In several sampling schemes including, simple


As stated by Subramani (2009), it is of interests for random sampling, systematic sampling, trend free
many research workers in their statistical Sampling, diagonal systematic sampling,
investigations to estimate the finite population generalized diagonal systematic sampling,
mean Y  1 N Y on the basis of a random sample determinant sampling schemes the resulting sample
N 
i1
i
mean y turns out to be an unbiased estimate of the
selected from the population U , where U is a finite population mean Y , where as the other sampling
population S  {u1 , u 2 , ..., u 3 } of N distinct and schemes such as balanced systematic sampling,
identifiable units. Let Y be a real variable with centered systematic sampling, truncated systematic
value Yi measured on U i , i  1, 2, 3, ..., N giving a sampling, etc, the resulting estimators are not
unbiased unless there is a perfect liner trend among
vector Y  (Y1, Y2 ,..., YN ) . Any ordered sequence the population values. For a detailed discussion
S  {u1 , u 2 , ...,u 3 }  {Ui1 , U i2 , ..., Uin } , 1  i l  N and 1  l  n about the above sampling schemes and their
is called a random sample of size n to be used to advantageous, the readers are referred to Cochran
get an estimate for the population mean Y . (1997), Murthy (1967), Fountain and Pathak
(1989), Mukerjee and Sengupta (1990), Rao
(1969), Subramani (2000; 2009 and 2010).
*Corresponding author: drjsubramani@yahoo.co.in Subramani and Tracy (1999) and the references
223 International Journal of Current Research, Vol.11, pp.222-227, December, 2010

cited there in. It is often used the auxiliary sampling scheme for selecting a random sample of
information in estimation techniques of sample size n are given below:
surveys to increase the precision of the sample
mean. The two important and widely used methods 1. Arrange the N population units
are the ratio and regression methods of estimation, U1 , U 2 , ..., U N in an n  k matrix
which provide better estimators than the usual
M (say).
estimators based on simple random sampling.
2. Select n random numbers
These methods involve an auxiliary variable X
correlated with the study variable Y , and the pair
t1 , t 2 , ..., t n one by one such that
of values ( xi , yi ) is measured for each and every t i  t j whenever i  j , t i being
unit in the sample. Further it is assumed that the the random number obtained at the
population mean X of the auxiliary variable X is i th draw, with 1  ti  k and
known in advance. This has motivated the present
study and the determinant sampling scheme has 1 i  n .
been extended for estimating finite population 3. The selected sampling units are
means through the methods of ratio and regression U t1 , U k  t 2 , ..., U (n -1)k  t n ,
estimators. The explicit expressions for the
located respectively in the positions
variance of ratio and regression estimators based
on determinant sampling scheme are derived. (1, t1 ), (2, t 2 )...(n, t n ) of the
Further the relative performance of determinant matrix M .
sampling, simple random sampling and systematic It is to be noted that the first order and second
sampling schemes are assessed for certain natural order inclusion probabilities are obtained as given
populations considered by Rao (1969). As a result below:
it has been observed that the determinant sampling 1
  , for i  1,2,3...N and
i
scheme performs better than the simple random k
sampling and the linear systematic sampling means  1
if i and j are fromdifferentrowsand fromdifferentcolumns
for estimating the finite population mean using  ij   k (k 1)

0 Otherwise
ratio and regression methods of estimation.
The first order inclusion probabilities are the same
2. Determinant Sampling Scheme for both the systematic sampling and the
determinant sampling schemes but the difference is
For the sake of simplicity and for the benefit of the on the second order inclusion probabilities. The
readers, the steps involved in selecting a two units in the same column will get the same
determinant sample of size n from a population of probability 1 in the case of systematic sampling
size N  kn are reproduced here. Let N  kn k
where as the two units from different rows and
where n  k , be the population size. The
from different columns will get the same
population units U 1 , U 2 , ..., U N are arranged in a probability 1 in the case of determinant
n  k matrix M (say) and the j-th row of M is k (k  1)
denoted by Rj , j  1, 2, ..., n . The elements of R j sampling and zero for other pair of units.
Let Yij be the observation corresponding to the unit
are {U(j - 1)k  i, i  1, 2, ..., k}. The determinant
sampling scheme consists of drawing n units from in the i th row and jth column, which is
the matrix M randomly such that the selected n corresponding to the unit U (i 1)k  j , then the
units are from different rows and from different sample observations are denoted by
columns of the matrix M . Hence no two selected
units are not from the same row or from the same S  {Yt1 , Yk  t 2 , ..., Y(n -1)k  t n } . The mean of a
column. The steps involved in the determinant determinant sample is denoted by yd and its
variance is given by
Subramani, Determinant sampling scheme for ratio and regression methods of estimation 224

Theorem 2.1: If xi , yi are a pair of values


V ( yd ) 
1
Nn( k  1)

k ( N  1) S 2  NnSc2  NkS r2 where observed on the i th unit of the population of size
k
1 k n 2 , S 2  1  (Y  Y ) 2 , N and xd , yd are the corresponding means of a
S2    (Yij  Y ) c i
( N  1) i 1 j 1 k i 1
determinant sample of size n , then the covariance
1 n is given by
S r2   (Y j  Y )
2
n i 1 1
Cov ( xd , y d )  
Nn ( k  1)
k ( N  1) S xy  NnS cxy  NkS rxy 
1 n , 1 k and Y  1 k n .
Yi   Yij Yj   Yij   Yij
n j 1 k i 1 N i 1 j 1 1 k n ,
S xy    ( X ij  X )(Yij  Y )
N i 1 j 1
It has also been proved by Subramani and Tracy
1 k
(1999) that the determinant sampling scheme S cxy   ( X i  X )(Yi  Y ) ,
performs better than the simple random sampling, k i 1
1 n 1 n
systematic sampling and stratified random S rxy   ( X j  X )(Y j  Y ) , Yi   Yij ,
sampling with unit per stratum schemes for the n i 1 n j 1
populations with a perfect linear trend among the 1 k 1 k n
population values. Consider the hypothetical Yj   Yij and Y    Yij .
population with a perfect linear trend among the
k i 1 N i 1 j 1
population values. Then the values of N
population units are in arithmetic progression. That The above theorem can be easily proved by
is, Yi  a  ib, i  1, 2, ..., N , where a and b following the steps similar to the theorem 2.3 of
are constants. For the above population with a Cochran (1977, page 25). When each X ij  Yij then
linear trend, the variances of the simple random the expression given above turns out to be the
sample mean V ( y r ) , systematic sample mean V ( y sy ) ,
expression given for V ( y d ) .
determinant sample man V ( y d ) and stratified
3. Ratio Methods of Estimation based on Determinant
random sample mean with unit per stratum Sampling Scheme
V ( yst ) are obtained as given below: ˆ
In general the ratio estimator YR of the population
(k  1)( N  1)b 2 , (k  1)(k  1)b 2 ,
V ( yr )  V ( y sy )  ˆ
mean Y and its variance V (YR ) are respectively
12 12
( k  n)[k  1]b 2 andV ( y )  (k  n)(k  1)b
2 given by
V ( yd )  st
12n 12n
By comparing the various variance expressions y
YˆR  X
(3.1)
given above, one can easily find that determinant x
sampling is more efficient than the other sampling and
schemes. In fact V ( y d )  V ( y st )  V ( y sy )  V ( y r )
V (YˆR )  V ( y )  R 2V ( x )  2 RCov( x , y ) (3.2)
.The equality sign attains only when n  1 . The
following theorem gives the expression for where x and y are the sample means of the
covariance between xd and y d , which will be auxiliary variable X and the study variable Y . By
useful for deriving the variances of the Ratio substituting the sample means of the variables X
estimator y Rd and linear regression estimator yld and Y obtained through various sampling schemes
of the population mean Y based on determinant in the above equation (3.1), one may get the
sampling. Here suffices Rd and ld are used to corresponding ratio estimators for the population
represent ratio and linear regression estimators mean Y . Similarly by substituting the
based on determinant sampling. corresponding expressions of V ( y ) , V (x ) and
225 International Journal of Current Research, Vol.11, pp.222-227, December, 2010

Cov ( x , y ) obtained through various sampling shown in literature (Cochran, 1977) that V ( ylr )
schemes in the equation (3.2) given above, one attains minimum when b  Cov( x, y ) . For assessing
may get the appropriate expressions for the V ( x)
variance of the ratio estimators. The derivations of the relative performance of determinant sampling
the above expressions are straightforward and with that of simple random sampling and
hence they are omitted here and only the final systematic sampling schemes, we have considered
results are given. For the sake of convenience of the minimum value of the respective variances.
the readers, the variances of the ratio estimators of The derivations of the above expressions are
the population mean based on simple random straightforward and hence they are omitted here
sampling, systematic sampling and determinant and only the final results are given. For the sake of
sampling schemes are given below: convenience of the readers, the variances of the
linear regression estimators of the population mean
V (YˆRr ) 
( N  n) 2
Nn

S y  R 2 S x2  2 RS xy  (3.3)
based on simple random sampling, systematic
sampling and determinant sampling schemes are
V (YˆRsy )  S cy
2
 R 2 S cx
2
 2 RS cxy (3.4) given below:
( N  n)  2
V (Yˆlr )  S y  b 2r S x2  2br S xy  (4.3)
1
k ( N 1)(S 2  R 2 S 2  2 RS )
y x xy
 Nn  
V (YˆRd )   
Nn(k 1)  Nn(S 2  R 2 S 2  2 RS )  Nk (S 2  R 2 S 2  2 RS )
V (Yˆlrsy )  S cy
 cy cx cxy ry rx rxy 
2 2 2 (4.4)
(3.5)  bsy S cx  2bsy S cxy
4. Regression Methods of Estimation based on  2 2 2 
1 k(N 1)(Sy  bd Sx  2bd Sxy)
Determinant Sampling Scheme V(Yˆlrd)  
Nn(k 1)  Nn(S  b S  2b S )  Nk(S  b S  2b S )
 2 2 2 2 2 2
 cy d cx d cxy ry d rx d rxy 

ˆ (4.5)
In general the linear regression estimator Ylr of the
Where b  Cov( xr , yr ) , b  Cov( xsy , y sy ) and
population mean Y and its variance V (Yˆlr ) are r sy
V ( xr ) V ( xsy )
respectively given by
Cov( xd , yd ) . That is, b , b and b are
bd  r sy d
V ( xd )
Yˆlr  y  b( X  x ) (4.1)
respectively the regression coefficients obtained
and V (Yˆlr )  V ( y )  b 2V ( x )  2bCov ( x , y ) (4.2) through simple random sampling, systematic
sampling and determinant sampling schemes.
where b is the known constants; x and y are the
5. Relative Performance of Determinant Sampling for
sample means of the auxiliary variable X and the a Certain Natural Populations
study variable Y . By substituting the sample
To evaluate the relative performance of ratio and
means of the variables X and Y obtained through
linear regression estimators based on determinant
various sampling schemes in the above equation sampling over simple random and systematic
(4.1), one may get the corresponding linear
sampling schemes, we have selected 9 natural
regression estimators for the population mean Y . populations. The populations numbered from 1 to
Similarly by substituting the corresponding 7 are taken from Cochran (1977).These populations
expressions of V ( y ) , V (x ) and Cov ( x , y ) are also considered by Rao (1969) to assess the
obtained through various sampling schemes in the relative performance of several ratio and regression
equation (4.2) given above, one may get the estimators. The populations numbered 8 and 9 are
appropriate expressions for the variance of the taken from Murthy (1967). Table 5.1 provides the
linear regression estimators. If the value of b is source, nature of the study variable Y and the
unknown then it can be replaced by the estimate of
the regression coefficient. However it has been
Subramani, Determinant sampling scheme for ratio and regression methods of estimation 226

Table 5.1: Description of the Populations considered for the


Empirical Study
Auxiliary
Population Study Variable Population
Source Variable
Number Y Size N
X
1 Cochran (1977) Size of the City Size of the City 49
Page 152 in USA 1930 in USA 1920
Cities 1-49
2 Cochran (1977) Size of the City Size of the City 25
Page 152 in USA 1930 in USA 1920
Cities 25-49
3 Cochran (1977) Size of the City Size of the City 24
Page 152 in USA 1930 in USA 1920
Cities 1-24
4 Cochran (1977) No. of Polio cases No. of not 34
Page 182 in the non inoculated inoculated group
group
5 Cochran (1977) No. of Polio cases No. of Placebo 34
Page 182 in the Placebo group children
6 Cochran (1977) Actual weight of Eye estimated 10
Page 203 peaches weight of peaches
7 Cochran (1977) No. of Persons No. of rooms 10
Page 325 in a block in a block
8 Murthy (1967) Area under wheat Cultivated area 34
Page 399 in 1964 in 1961
9 Murthy (1967) Area under wheat Cultivated area 34
Page 399 in 1964 in 1963

Table 5.2: Comparison of Ratio Estimators based on simple random,


systematic and determinant sampling for the population

Population n
Number
V ( y Rr ) V ( y Rsy ) V ( y Rd )
1 7 75.9807 45.8327 80.2353
2 5 139.4397 104.6067 135.2278
3 2 200.6973 322.8438 177.7578
3 127.7119 84.1308 135.1133
4 91.2265 136.7700 77.5781
4 2 2.6903 2.8459 2.7224
5 2 2.2649 2.4256 2.3298
6 2 2.9484 2.9913 2.4026
7 2 60.0763 73.4718 55.9298
8 2 2078.2930 1819.6810 2160.1780
9 2 424.5352 510.3750 411.7852

Table 5.3: Comparison of Linear Regression Estimators based on simple


random, systematic and determinant sampling for the population
Population n
Number
V ( ylr ) V ( ylrsy ) V ( ylrd )
1 7 67.1616 43.1821 69.4816
2 5 89.1116 13.6676 107.3137
3 2 200.2793 293.3779 177.3828
3 127.4473 78.9668 134.9287
4 91.0371 133.7642 77.2539
4 2 2.5292 2.8441 2.5769
5 2 2.2198 2.3759 2.2531
6 2 1.9652 2.1747 1.4815
7 2 54.9156 56.7016 48.5703
8 2 1935.8830 1553.2380 2021.5720
9 2 418.7012 501.1426 408.8789
227 International Journal of Current Research, Vol.11, pp.222-227, December, 2010

auxiliary variable X and the population size N . based on simple random sampling, systematic
We have considered all possible cases of k and n sampling and determinant sampling schemes
have minimum (maximum) variances
and hence we have totally 11 cases. The variances
of ratio estimators and the linear regression respectively in 2(0), 4(7) and 5(4) cases.
estimators obtained through determinant sampling,
simple random sampling and systematic sampling Hence we conclude that for practical purposes
one may use the determinant sampling scheme for
schemes are respectively presented in Table 5.2
estimating the population mean through the
and Table 5.3.
methods of ratio and linear regression estimation
The following strategies are used to evaluate the compared to simple random sampling and
relative performance of the ratio and linear systematic sampling schemes.
regression estimators based on these three different
sampling schemes. REFERENCES
1. An estimator is said to be the best it is has Cochran, W.G. 1977. Sampling Techniques, 3rd
the minimum variance among the Edition, John Wiley and Sons, New York
estimators considered here. Fountain, R.L and Pathak, P.L.1989. Systematic
2. An estimator is said to be the worst it is and non-random sampling in the presence of
has the maximum variance among the Linear Trends Communications in statistics.
estimators considered here. Theory and Methods, 18: 2511-2526.
3. An estimator is said to be a favourable Mukerjee, R and Sengupta, S 1990. Optimal
estimator if it has minimum variance more Estimation of a Finite Population Means in the
frequently and maximum variance rarely. Presence of Linear Trend, Biometrika., 77:
625-630.
The following are the tentative conclusions we Murthy, M.N. 1967. Sampling theory and Methods,
have arrived at the following conclusions based on Statistical Publishing House, Calcutta, India.
the results of the empirical study conducted and the Rao, J N K. 1969. Ratio and Regression Estimators
results presented in Tables 5.2 and 5.3 respectively – in New Developments in Survey Sampling,
for the cases of ratio estimators and the linear Ed. By N L Johnson and H.Smith, Jr., 213-234.
regression estimators. The observations made on Subramani, J. 2000. Diagonal Systematic Sampling
the relative performance of these different Scheme for Finite Populations, Jour. Ind. Soc.
sampling schemes are as follows: Ag. Statistics 53(2): 187-195.
Subramani, J. 2009 Further Results on Diagonal
 Among the 9 populations with 11 possible Systematic Sampling Scheme for Finite
cases considered, the variances of the ratio Populations, Journal of Indian Society of
estimators presented in Table 5.2, the simple Agricultural Statistics.63 (3): 277-282.
random sampling, systematic sampling and Subramani, J. 2010 Generalization of Diagonal
determinant sampling have minimum Systematic Sampling Scheme for Finite
variances respectively in 2, 4 and 5 cases. On Populations. Model Assisted Statistics and
the other hand these estimators have Applications, 5: 117-128.
maximum variances respectively in 1, 7, 3 Subramani, J. and Tracy, D. S. 1999. Determinant
cases. Sampling Scheme for Finite Populations.
 Similar results have been obtained from Internl. J. Math. and Statist. Sci., 8(1): 27-41.
Table 5.3 for the case of linear regression
estimators. That is, the regression estimators

Você também pode gostar