Interval Estimation of Population Proportion in a Double Sampling Scheme

Lee, Seung-Chun;Choi, Byong-Su;

doi:10.5351/KJAS.2009.22.6.1289

The Korean Journal of Applied Statistics (응용통계연구)

Volume 22 Issue 6
/
Pages.1289-1300
/
2009
/
1225-066X(pISSN)
/
2383-5818(eISSN)

The Korean Statistical Society (한국통계학회)

DOI QR Code

Interval Estimation of Population Proportion in a Double Sampling Scheme

이중표본에서 모비율의 구간추정

Lee, Seung-Chun (Department of Statistics, Hashin University) ;
Choi, Byong-Su (Department of Multimedia Engineering, Hansung University)

이승천 (한신대학교 정보통계학과) ;
최병수 (한성대학교 멀티미디어학과)

Received : 20090800
Accepted : 20091000
Published : 2009.12.31

https://doi.org/10.5351/KJAS.2009.22.6.1289 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

The double sampling scheme is effective in reducing the sampling cost. However, the doubly sampled data is contaminated by two types of error, namely false-positive and false-negative errors. These would make the statistical analysis more difficult, and it would require more sophisticate analysis tools. For instance, the Wald method for the interval estimation of a proportion would not work well. In fact, it is well known that the Wald confidence interval behaves very poorly in many sampling schemes. In this note, the property of the Wald interval is investigated in terms of the coverage probability and the expected width. An alternative confidence interval based on the Agresti-Coull's approach is recommended.

표본추출 비용의 절감을 위해 흔히 사용되는 이중표본추출방법은 대부분의 표본들이 2종류의 오류에 의해 오염이 되어 있어 통계적 분석이 상대적으로 용이하지 않다. 특히, 비율의 추론을 위한 중요한 분석 도구인 구간추정은 현재까지 우도추정량의 정규근사에 의존하는 Wald 방법만이 알려져 있으나 Wald 신뢰구간은 포함확률의 근사성 등에서 많은 문제가 있다는 것이 여러 연구에서 확인되고 있다. 본 연구에서는 이중표본추출에서 Wald 신뢰구간의 문제점을 파악하고 이에 대한 대안으로 Agresti-Coull 유형의 신뢰구간을 제시한다.

Keywords

References

이승천 (2006). 독립표본에서 두 모비율 차이에 대한 가중 Polya 사후분포 신뢰구간, <응용통계연구>, 19, 171–181
이승천 (2007). 베이지안 접근에 의한 모비율 선형함수의 신뢰구간, <응용통계연구>, 20, 257–266
Agresti, A. and Coull, B. A. (1998). Approximation is better than 'exact' for interval estimation of binomial proportions, American Statistician, 52, 119–126
Agresti, A. and Caffo, B. (2000). Simple and effective confidence intervals for proportions and differences of proportions result from adding two successes and two failures, American Statistician, 54, 280–288
Agresti, A. and Min, Y. (2005). Simple improved confidence intervals for comparing matched proportions, Statistics in Medicine, 24, 729–740 https://doi.org/10.1002/sim.1781
Boese, D. H., Young, D. M. and Stamey, J. D. (2006). Confidence intervals for a binomial parameter based on binary data subject to false-positive misclassification, Computational Statistics and Data Analysis, 50, 3369–3385 https://doi.org/10.1016/j.csda.2005.08.007
Braunstein, G. (2002). False-positive serum human chronic gonadotropin results: causes, characteristics, and recognition, American Journal of Obstetrics & Genecology, 187, 217–224 https://doi.org/10.1067/mob.2002.124284
Bross, I. (1954). Misclassification in tables, Biomometrics, 10, 478–486
Brown, L. D., Cai, T. T. and DasGupta, A. (2001). Interval estimation for a binomial proportion, Statistical Science, 16, 101–133
Kazemi, N., Dennien, B. and Dan, A. (2001). Mistaken identity: A case of false positive on CT angiography, Journal of Clinical Neuroscience, 9, 464–466 https://doi.org/10.1054/jocn.2001.0984
Lee, S.-C. (2006). Interval estimation of binomial proportions based on weighted Polya posterior, Computational Statistics and Data Analysis, 51, 1012–1021 https://doi.org/10.1016/j.csda.2005.10.008
Lee, S.-C. (2007). An improved confidence interval for the population proportion in a double sampling scheme subject to false-positive misclassification, Journal of the Korean Statistical Society, 36, 275–284
Price, R. M. and Bonett, D. G. (2004). An improved confidence interval for a linear function of binomial proportions, Computational Statistics and Data Analysis, 45, 449–456 https://doi.org/10.1016/S0167-9473(03)00007-0
Raats, V. M. and Moors, J. J. A. (2003). Double-checking auditors: A Bayesian approach, Statistician, 52, 351–365 https://doi.org/10.1111/1467-9884.00364
Swaen, V. M., Teggerler, O. and Amelsvoort, L. (2001). False positive outcomes and design characteristics in occupational cancer epidemiology studies, International Journal of Epidemiology, 30, 948–955
Tenenbein, A. (1970). A double sampling scheme for estimating from binomial data with misclassifications, Journal of the American Statistical Association, 65, 1350–1361
Tenenbein, A. (1971). A double sampling scheme for estimating from binomial data with misclassifications: sample size determination, Biometrics, 27, 935–944
Tenenbein, A. (1972). A double sampling scheme for estimating from multinomial data with application to sampling inspection, Technometrics, 14, 187–202
York, J., Madigan, D., Heuch, I. and Lie, R. T. (1995). Birth defects registered by double sampling: a Bayesian approach incorporating covariates and model uncertainty, Applied. Statistics, 44, 227–242

Cited by

Theoretical Considerations for the Agresti-Coull Type Confidence Interval in Misclassified Binary Data vol.18, pp.4, 2011, https://doi.org/10.5351/CKSS.2011.18.4.445
Bayesian confidence intervals of proportion with misclassified binary data vol.42, pp.3, 2013, https://doi.org/10.1016/j.jkss.2012.09.001

The Korean Journal of Applied Statistics (응용통계연구)

Interval Estimation of Population Proportion in a Double Sampling Scheme

이중표본에서 모비율의 구간추정

Abstract

Keywords

References

Cited by

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)