Proposition of polytomous discrimination index and test statistics

Choi, Jin Soo;Hong, Chong Sun;

doi:10.7465/jkdi.2016.27.2.337

Journal of the Korean Data and Information Science Society

제27권2호
/
Pages.337-351
/
2016
/
1598-9402(pISSN)

한국데이터정보과학회 (The Korean Data and Information Science Society)

DOI QR Code

다항판별지수와 검정통계량 제안

Proposition of polytomous discrimination index and test statistics

최진수 (성균관대학교 통계학과) ;
홍종선 (성균관대학교 통계학과)

Choi, Jin Soo (Department of Statistics, Sungkyunkwan University) ;
Hong, Chong Sun (Department of Statistics, Sungkyunkwan University)

투고 : 2016.02.12
심사 : 2016.03.07
발행 : 2016.03.31

https://doi.org/10.7465/jkdi.2016.27.2.337 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

현실세계의 예측 문제에서 세 범주 이상의 결과로 예측되는 경우가 많다. 이러한 경우에 대한 기존의 문헌연구에서는 부합성을 짝 접근방법으로 활용한 통계량은 범주의 뚜렷한 구분 없이 표현되었다. 최근 새롭게 표현한 평가자료와 이를 바탕으로 부합성을 재표현하여 통계량들을 새롭게 정의함으로써 직관적으로 의미 파악이 가능해졌지만 통계량들의 판단기준이 구체적이지 않은 문제점을 갖고 있다. 또한 이 통계량들은 가능한 부합성의 짝으로 구성되었지만 실제범주들간에서 예측범주들의 부합성을 추가적으로 고려할 수 있기에 이를 포함한 두 가지 통계량을 제안하였다. 제안한 통계량은 선택된 두 범주로부터 모든 가능한 경우들 사이를 판별하는 장점이 있다. 본 연구에서 제안한 두 가지 통계량은 지시함수로 표현되므로 비모수적 통계량으로 변환할 수 있다. 그러므로 부합성 통계량을 가설검정 방법으로 사용할 수 있음을 제안한다.

There exist many real situations that statistical decision problems are classified into more than two categories. In these cases, the concordance statistics by the pair approach are mostly used. However, the expression of the classification of categories are ambiguous. Recently, the standardized evaluation data and re-expressed concordance statistics are defined and could be explained their meanings. They have still some non-specific problems for standard criteria of the statistics. Since these can be considered between result and truth categories additionally, two alternative concordance statistics might be proposed in this paper. Some advantages are founded that the proposed statistics could be discriminated all possible cases for two randomly selected categories. Moreover since the proposed statistics are represented with indicator functions, these could be transformed non-parametrically, so that these concordances are used for hypothesis testing.

키워드

참고문헌

Choi, J. S. and Hong, C. S. (2016). Standardized polytomous discrimination index using concordance. Journal of the Korean Data & Information Science Society, 27, 33-44. https://doi.org/10.7465/jkdi.2016.27.1.33
Fawcett, T. (2003). ROC graphs: Notes and practical considerations for data mining researchers. HP Laboratories, Palo Alto, CA 94304.
Hand, D. J. and Till, R. J. (2001). A simple generalisation of the area under the ROC curve for multiple class classification problem. Machine Learning, 45, 171-186. https://doi.org/10.1023/A:1010920819831
Hanley, J. A. and McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29-36. https://doi.org/10.1148/radiology.143.1.7063747
Hong, C. S. and Cho, M. H. (2015a). VUS and HUM represented with Mann-Whitney statistic. Communications for Statistical Applications and Methods, 22, 223-232. https://doi.org/10.5351/CSAM.2015.22.3.223
Hong, C. S. and Cho, M. H. (2015b). Test statistics for volume under the ROC surface and hypervolume under the ROC manifold. Communications for Statistical Applications and Methods, 22, 377-387. https://doi.org/10.5351/CSAM.2015.22.4.377
Hong, C. S., Joo, J. S. and Choi, J. S. (2010). Optimal thresholds from mixture distributions. Korean Journal of Applied Statistics, 23, 13-28. https://doi.org/10.5351/KJAS.2010.23.1.013
Hong, C. S. and Jung, D. G. (2014). Standard criterion of hypervolume under the ROC manifold. Journal of the Korean Data & Information Science Society, 25, 473-483. https://doi.org/10.7465/jkdi.2014.25.3.473
Hong, C. S., Jung, E. S. and Jung, D. G. (2013). Standard criterion of VUS for ROC surface. Korean Journal of Applied Statistics, 26, 977-985. https://doi.org/10.5351/KJAS.2013.26.6.977
Joseph, M. P. (2005). A PD validation framework for Basel II internal ratings-based systems. Creadit Scoring and Credit Control, IX.
Obuchowski, N. A., Goske, M. J. and Applegate, K. E. (2001). Assessing physicians' accuracy in diagnosing paediatric patients with acute abdominal pain: Measuring accuracy for multiple diseases. Statistics In Medicine, 20, 3261-3278. https://doi.org/10.1002/sim.944
Obuchowski, N. A. (2005). Estimating and comparing diagnostic tests' accuracy when the gold standard is not binary. Academic radiology, 12, 1198-1204. https://doi.org/10.1016/j.acra.2005.05.013
Swets, J. A. (1988). Measuring the accuracy of diagnostic systems. Science, 240, 1285-1293. https://doi.org/10.1126/science.3287615
Van Calster, B., Van Belle, V., Vergouwe, Y., Timmerman, D., Van Huffel, S. and Steyerberg, E. W. (2012). Extending the c-statistic to nominal polytomous outcomes: The polytomous discrimination index. Statistics In Medicine, 31, 2610-2626. https://doi.org/10.1002/sim.5321
Yan, L., Dodier, R., Mozer, M. C. and Wolniewicz, R. (2003). Optimizing classifier performance via the Wilcoxon-Mann-Whitney statistics. Proceedings of the 20th International Conference on Machine Learning, Washington D.C., 848-855.
Zou, K. H., O'Malley, A. J. and Mauri, L. (2007). Receiver operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation, 115, 654-657. https://doi.org/10.1161/CIRCULATIONAHA.105.594929

Journal of the Korean Data and Information Science Society

다항판별지수와 검정통계량 제안

Proposition of polytomous discrimination index and test statistics

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)