References
- Choi, J. S. and Hong, C. S. (2016). Standardized polytomous discrimination index using concordance. Journal of the Korean Data & Information Science Society, 27, 33-44. https://doi.org/10.7465/jkdi.2016.27.1.33
- Fawcett, T. (2003). ROC graphs: Notes and practical considerations for data mining researchers. HP Laboratories, Palo Alto, CA 94304.
- Hand, D. J. and Till, R. J. (2001). A simple generalisation of the area under the ROC curve for multiple class classification problem. Machine Learning, 45, 171-186. https://doi.org/10.1023/A:1010920819831
- Hanley, J. A. and McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29-36. https://doi.org/10.1148/radiology.143.1.7063747
- Hong, C. S. and Cho, M. H. (2015a). VUS and HUM represented with Mann-Whitney statistic. Communications for Statistical Applications and Methods, 22, 223-232. https://doi.org/10.5351/CSAM.2015.22.3.223
- Hong, C. S. and Cho, M. H. (2015b). Test statistics for volume under the ROC surface and hypervolume under the ROC manifold. Communications for Statistical Applications and Methods, 22, 377-387. https://doi.org/10.5351/CSAM.2015.22.4.377
- Hong, C. S., Joo, J. S. and Choi, J. S. (2010). Optimal thresholds from mixture distributions. Korean Journal of Applied Statistics, 23, 13-28. https://doi.org/10.5351/KJAS.2010.23.1.013
- Hong, C. S. and Jung, D. G. (2014). Standard criterion of hypervolume under the ROC manifold. Journal of the Korean Data & Information Science Society, 25, 473-483. https://doi.org/10.7465/jkdi.2014.25.3.473
- Hong, C. S., Jung, E. S. and Jung, D. G. (2013). Standard criterion of VUS for ROC surface. Korean Journal of Applied Statistics, 26, 977-985. https://doi.org/10.5351/KJAS.2013.26.6.977
- Joseph, M. P. (2005). A PD validation framework for Basel II internal ratings-based systems. Creadit Scoring and Credit Control, IX.
- Obuchowski, N. A., Goske, M. J. and Applegate, K. E. (2001). Assessing physicians' accuracy in diagnosing paediatric patients with acute abdominal pain: Measuring accuracy for multiple diseases. Statistics In Medicine, 20, 3261-3278. https://doi.org/10.1002/sim.944
- Obuchowski, N. A. (2005). Estimating and comparing diagnostic tests' accuracy when the gold standard is not binary. Academic radiology, 12, 1198-1204. https://doi.org/10.1016/j.acra.2005.05.013
- Swets, J. A. (1988). Measuring the accuracy of diagnostic systems. Science, 240, 1285-1293. https://doi.org/10.1126/science.3287615
- Van Calster, B., Van Belle, V., Vergouwe, Y., Timmerman, D., Van Huffel, S. and Steyerberg, E. W. (2012). Extending the c-statistic to nominal polytomous outcomes: The polytomous discrimination index. Statistics In Medicine, 31, 2610-2626. https://doi.org/10.1002/sim.5321
- Yan, L., Dodier, R., Mozer, M. C. and Wolniewicz, R. (2003). Optimizing classifier performance via the Wilcoxon-Mann-Whitney statistics. Proceedings of the 20th International Conference on Machine Learning, Washington D.C., 848-855.
- Zou, K. H., O'Malley, A. J. and Mauri, L. (2007). Receiver operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation, 115, 654-657. https://doi.org/10.1161/CIRCULATIONAHA.105.594929