DOI QR코드

DOI QR Code

Comparison Studies of Classification Methods based on L1-Distance and L1-Data Depth

L1-거리와 L1-데이터뎁스를 이용한 분류방법의 비교연구

  • Baek Soo-Jin (Division of Bacterial Respiratory Infections, Center for Infectious Disease Research, Korea Center for Disease Control & Prevention, Korean National Institute of Health) ;
  • Hwang Jin-Soo (Department of Statistics, Inha University) ;
  • Kim Jean-Kyung (Department of Statistics, Inha University)
  • 백수진 (국립보건연구원 질병관리본부 호흡기세균팀) ;
  • 황진수 (인하대학교 자연과학대학 수학통계학부) ;
  • 김진경 (인하대학교 자연과학대학 수학통계학부)
  • Published : 2006.03.01

Abstract

We consider a new classification method(DnDclass) combining two classification rules based on $L_1$-distance(L1DISTclass) and $L_1$-data depth(L1DDclass). To investigate characteristics and to evaluate the performance of these classification methods, we use simulation data in various settings. Through this simulation study, we can confirm that the new method, DnDclass, performs relatively well in many cases.

$L_1$-데이터뎁스를 이용한 분류방법(L1DDclass)과 관측치들 사이의 $L_1$-거리를 이용한 분류방법(L1DISTclass)의 특징을 살펴보고, 이 두 방법을 결합한 새로운 분류방법 (DnDclass: Distance and Data-depth based classification)의 효용성을 소개하고자 한다. 모의실험을 통해 세가지 분류방법의 결과를 비교하고 제안된 분류방법이 다양한 경우에 더 효과적일 수 있다는 사실을 확인한다.

Keywords

References

  1. Alon, U., Barkai, N., Notterdam, D. A., Gish, K., Ybarra, S., Mack, D. and Levine, A. J. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceeding of the National Academy of Sciences, 96, 6745-6750
  2. Christmann, A. (2000). Classification Based on the SVM and on Regression Depth. Statistical data analysis based on the Llnorm and related methods. Birkhauser, Statistics for industry and technology. Y. Dodge editor
  3. Ghosh, A. K. and Chaudhuri, P. (2005). On Maximum Depth and Related Classifiers. The Scandinavian Journal of Statistics, 32, 327-350 https://doi.org/10.1111/j.1467-9469.2005.00423.x
  4. Jornsten, R. (2004). Clustering and Classification Based on the L1 data depth. Journal of Multivariate Analysis, 90, 67-89 https://doi.org/10.1016/j.jmva.2004.02.013
  5. Jornsten, R., Vardi, Y. and Zhang, C-H. (2002). A Robust Clustering Method and Visualization Tool Based on Data Depth. Statistical data analysis based on the Llnorm and related methods. Birkhauser, Statistics for industry and technology. Y. Dodge editor
  6. Liu, R., Parelius, J. and Singh, K. (1999). Multivariate analysis by data depth: descriptive statistics, graphics and inference (with discussion). The Annals of Statistics, 27, 783-858 https://doi.org/10.1214/aos/1018031260
  7. Tibshirani, R., Walther, G., Botstein, D. and Brown, P. (2001). Cluster validation by prediction strength. Technical report, Stanford University, Department of Biostatistics
  8. Vardi, Y. and Zhnag, C.-H. (2000). The multivariate L-1 median and associated data depth. Proceeding of the National Academy of Sciences, 97, 1423-1426