DOI QR코드

DOI QR Code

Applications of Chatterjee correlation coefficient

Chatterjee 상관계수의 응용

  • Sojin Ahn (Division of Data and Information Sciences, Pukyong National University) ;
  • Dae-Heung Jang (Division of Data and Information Sciences, Pukyong National University)
  • 안소진 (부경대학교 데이터정보과학부 통계.데이터사이언스전공) ;
  • 장대흥 (부경대학교 데이터정보과학부 통계.데이터사이언스전공)
  • Received : 2023.02.15
  • Accepted : 2023.02.24
  • Published : 2023.06.30

Abstract

Chatterjee (2021) proposed a new correlation coefficient ξ as an alternative to overcome the disadvantages of the existing Pearson's correlation coefficient. Since this correlation coefficient is rank-based, it is robust to outliers, and the simple formula makes the concept easy to understand and the measure calculation speed is very fast. Through this paper, the application of this correlation coefficient was examined in two aspects (1. Bivariate distribution, 2. High-dimensional data).

Chatterjee (2021)는 기존의 피어슨 상관계수의 단점을 극복하기 위한 하나의 대안으로서 새로운 상관계수 ξ를 제안하였다. 이 상관계수는 순위를 기반으로 하기 때문에 이상점에 강건하고, 간단한 공식 때문에 개념을 이해하기 쉽고 측도 계산 속도가 아주 빠르다. 본 논문을 통하여 이 상관계수의 응용을 두 가지 측면(1.이변량 분포, 2. 고차원자료)에서 살펴보았다.

Keywords

Acknowledgement

이 논문은 부경대학교 자율창의학술연구비(2021년)에 의하여 연구되었음.

References

  1. Alon U, Barkai N, Notterman DA, Gish K, Ybarra S, Mack D, and Levine AJ (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays, Proceedings of the National Academy of Sciences USA, 96, 6745-6750. https://doi.org/10.1073/pnas.96.12.6745
  2. Auddy A, Deb N, and Nandy S (2021). Exact detection thresholds for Chatterjee's correlation, Unpublished paper, Available from: https://arxiv.org/abs/2104.15140(https://arxiv.org/pdf/2104.15140.pdf)
  3. Chatterjee S (2021). A new coefficient of correlation, Journal of American Statistical Association, 116, 2009-2022. https://doi.org/10.1080/01621459.2020.1758115
  4. Chin K, DeVries S, Fridlyand J et al. (2006). Genomic and transcriptional aberrations linked to breast cancer pathophysiologies, Cancer Cell, 10, 529-541. https://doi.org/10.1016/j.ccr.2006.10.009
  5. Golub TR, Slonim DK, Tamayo P et al. (1999). Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring, Science, 286, 531-537. https://doi.org/10.1126/science.286.5439.531
  6. Lin Z and Han F (2021). On boosting the power of Chatterjee's rank correlation, Unpublished paper, Available from: https://arxiv.org/abs/2108.06828(https://arxiv.org/pdf/2108.06828.pdf)
  7. Sadeghi B (2022). Chatterjee correlation coefficient: A robust alternative for classic correlation methods in geochemical studies- (including "TripleCpy" Python package), Ore Geology Reviews, 146, 104954.
  8. Shi H, Drton M, and Han F (2022). On the power of Chatterjee's rank correlation, Biometrika, 109, 317-333. https://doi.org/10.1093/biomet/asab028
  9. Singh D, Febbo PG, Ross K et al. (2002). Gene expression correlates of clinical prostate cancer behavior, Cancer Cell, 1, 203-209. https://doi.org/10.1016/S1535-6108(02)00030-2