DOI QR코드

DOI QR Code

Analysis of Symptoms-Herbs Relationships in Shanghanlun Using Text Mining Approach

텍스트마이닝 기법을 이용한 『상한론』 내의 증상-본초 조합의 탐색적 분석

  • Jang, Dongyeop (Department of Physiology, College of Korean Medicine, Gachon University) ;
  • Ha, Yoonsu (College of Korean Medicine, Gachon University) ;
  • Lee, Choong-Yeol (Department of Physiology, College of Korean Medicine, Gachon University) ;
  • Kim, Chang-Eop (Department of Physiology, College of Korean Medicine, Gachon University)
  • 장동엽 (가천대학교 한의과대학 생리학교실) ;
  • 하윤수 (가천대학교 한의과대학) ;
  • 이충열 (가천대학교 한의과대학 생리학교실) ;
  • 김창업 (가천대학교 한의과대학 생리학교실)
  • Received : 2019.10.29
  • Accepted : 2020.07.28
  • Published : 2020.08.25

Abstract

Shanghanlun (Treatise on Cold Damage Diseases) is the oldest document in the literature on clinical records of Traditional Asian medicine (TAM), on which TAM theories about symptoms-herbs relationships are based. In this study, we aim to quantitatively explore the relationships between symptoms and herbs in Shanghanlun. The text in Shanghanlun was converted into structured data. Using the structured data, Term Frequency - Inverse Document Frequency (TF-IDF) scores of symptoms and herbs were calculated from each chapter to derive the major symptoms and herbs in each chapter. To understand the structure of the entire document, principal component analysis (PCA) was performed for the 6-dimensional chapter space. Bipartite network analysis was conducted focusing on Jaccard scores between symptoms and herbs and eigenvector centralities of nodes. TF-IDF scores showed the characteristics of each chapter through major symptoms and herbs. Principal components drawn by PCA suggested the entire structure of Shanghanlun. The network analysis revealed a 'multi herbs - multi symptoms' relationship. Common symptoms and herbs were drawn from high eigenvector centralities of their nodes, while specific symptoms and herbs were drawn from low centralities. Symptoms expected to be treated by herbs were derived, respectively. Using measurable metrics, we conducted a computational study on patterns of Shanghanlun. Quantitative researches on TAM theories will contribute to improving the clarity of TAM theories.

Keywords

References

  1. Shin SW, Kim JB. Diagnosis system in Sanghanlun. Journal of Pathology in Korean Medicine. 1998;12(1):1-18.
  2. Shin HM. A Study on the Dose of Prescription in Shanghanlun. Journal of Korean Medicine. 1999;20(3):3-8.
  3. Hong SM, Hur IH, Byun HS, Sim SY, Kim KJ. A Case Study on Atopic Dermatitis with the Treatise on Febrile Diseases. The Journal of Korean Oriental Medical Ophthalmology & Otolaryngology & Dermatology. 2007;20(2):230-9.
  4. Oh MS, Jeon TD. A Study on the Significance of Sanghanron Prescription in Traffic Accident Patient. Journal of Oriental Rehabilitation Medicine. 2010;20(1):153-66.
  5. Hearst MA. Untangling text data mining. Proceedings of the 37th annual meeting of the Association for Computational Linguistics on Computational Linguistics. 1999:3-10.
  6. Bae JH, Son JE, Song M. Analysis of twitter for 2012 South Korea presidential election by text mining techniques. Journal of Intelligence and Information Systems. 2013;19(3):141-56. https://doi.org/10.13088/jiis.2013.19.3.141
  7. Chou CH, Sinha AP, Zhao H. A text mining approach to Internet abuse detection. Information Systems and e-Business Management. 2008;6(4):419-39. https://doi.org/10.1007/s10257-007-0070-0
  8. Spasic I, Ananiadou S, McNaught J, Kumar A. Text mining and ontologies in biomedicine: making sense of raw text. Briefings in bioinformatics. 2005;6(3):239-51. https://doi.org/10.1093/bib/6.3.239
  9. Lee CY. Understanding Current Traditional Korean Medicine - Preliminary Study for Discussion on the Identity Issue of TKM. J Physiol & Pathol Korean Med. 2010;24(5):758-69.
  10. Lee CY. Discussion on the Issues of the Modernization of the Fundamental Theories and Terms in Korean Medicine. J Physiol & Pathol Korean Med. 2013;27(5):540-52.
  11. Lee JH, Kim WY, Oh JH. Study on quantization of Korean medicine terminology concept. J Korean Medical Classics. 2014;27(1):099-109. https://doi.org/10.14369/skmc.2014.27.1.099
  12. Song YS, Yang D-h, Park YJ, Park YB. A study of relationship between excrement and materia medica in Bangyakhappyeon based on the data mining analysis. The Journal of the Society of Korean Medicine Diagnostics. 2012;16(2):33-45.
  13. Bae HJ, Kim CE, Lee CY, Shin SW, Kim JH. Investigation of the Possibility of Research on Medical Classics Applying Text Mining-Focusing on the Huangdi's Internal Classic. Journal of Korean Medical classics. 2018;31(4):27-46. https://doi.org/10.14369/JKMC.2018.31.4.027
  14. Jung WM, Lee T, Lee IS, Kim S, Jang H, Kim SY, et al. Spatial patterns of the indications of acupoints using data mining in classic medical text: a possible visualization of the meridian system. Evidence-Based Complementary and Alternative Medicine. 2015;2015.
  15. Oh JH. Can similarities in Medical thought be Quantified? The Journal Of Korean Medical Classics. 2018;31(2):71-82. https://doi.org/10.14369/JKMC.2018.31.2.071
  16. Kim SW, Kim KW, Lee BW. A study on combination of prescription of Shanghanlun using database. J Korean Medical Classics. 2019;32(1):171-89. https://doi.org/10.14369/JKMC.2019.32.1.171
  17. Kim SU, Lee HK, Jung HJ. Suggestions for writing the medical records based on the symptoms in Sanghanron. The Journal of the Society of Korean Medicine Diagnostics. 2014;18(2):85-109.
  18. Shen CW, Chen M, Wang CC. Analyzing the trend of O2O commerce by bilingual text mining on social media. Computers in Human Behavior. 2019;101:474-83. https://doi.org/10.1016/j.chb.2018.09.031
  19. Wang H, Can D, Kazemzadeh A, Bar F, Narayanan S. A system for real-time twitter sentiment analysis of 2012 us presidential election cycle. Proceedings of the ACL 2012 system demonstrations. 2012:115-20.
  20. Oh J, Bae H, Kim CE. Construction And Analysis Of The Time-Evolving Pain-Related Brain Network Using Literature Mining. Journal of Pain Research. 2019;12:2891. https://doi.org/10.2147/JPR.S217036
  21. Kwon Y, Natori Y, Tanokura M. New approach to generating insights for aging research based on literature mining and knowledge integration. PloS one. 2017;12(8).
  22. LEE JH, Baik YS, Jeong CH. Yoshimasu Todo's medical theory extracted from Yakjing III. The Journal Of Korean Medical Classics. 2006;19(2):66-73.
  23. Yeo IS. Korean medicine, see it as science? Research Institute for Healthcare Policy Korean Medical Association. 2011;9(3):70-5.