Disease Prediction by Learning Clinical Concept Relations

  • Submitted: 2021.10.08
  • Reviewed: 2021.10.19
  • Published: 2022.01.31

Abstract

This paper proposes a method that supports clinical decision-making by extracting clinical concept relations from medical knowledge and predicting diseases with a deep learning model. Clinical terms found in the medical dictionary UMLS (Unified Medical Language System) and in cancer-related medical knowledge are classified into five categories. The classified terms are used to extract medical documents from Wikipedia, and clinical concept relations are built by matching the extracted documents against the extracted terms. A deep learning model is trained on these relations and then predicts the disease associated with a query from the medical terms expressed in that query. Medical terms related to the predicted disease are then selected as expansion terms, and the expanded query is used for clinical document retrieval. To validate the method, we run comparative evaluations on the TREC Clinical Decision Support (CDS) and TREC Precision Medicine (PM) test collections.
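The pipeline described in the abstract (match terms against documents, predict a disease from query terms, expand the query) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the term categories, the two toy documents, and the function names are hypothetical, and a simple co-occurrence count stands in for the deep learning model, whose architecture the abstract does not specify.

```python
from collections import Counter

# Hypothetical clinical terms grouped by category. The paper uses five
# categories that the abstract does not name; these three are illustrative.
CATEGORY_TERMS = {
    "symptom": ["cough", "chest pain", "skin lesion"],
    "gene": ["egfr", "braf"],
}

# Toy stand-ins for Wikipedia medical documents, keyed by disease name.
DOCS = {
    "lung cancer": "Lung cancer often presents with cough and chest pain. "
                   "EGFR mutations are common.",
    "melanoma": "Melanoma typically appears as a skin lesion. "
                "BRAF mutations are frequent.",
}


def build_relations(docs, category_terms):
    """Match each term against each document and count occurrences."""
    relations = {}
    for disease, text in docs.items():
        lowered = text.lower()
        counts = Counter()
        for terms in category_terms.values():
            for term in terms:
                hits = lowered.count(term)
                if hits:
                    counts[term] = hits
        relations[disease] = counts
    return relations


def predict_disease(query, relations):
    """Score each disease by the weights of its related terms in the query.

    This count-based score is a placeholder for the trained model.
    """
    lowered = query.lower()
    scores = {
        disease: sum(w for term, w in terms.items() if term in lowered)
        for disease, terms in relations.items()
    }
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


def expand_query(query, relations, k=2):
    """Append the predicted disease and up to k related terms to the query."""
    disease = predict_disease(query, relations)
    if disease is None:
        return query
    lowered = query.lower()
    extra = [t for t, _ in relations[disease].most_common() if t not in lowered]
    return " ".join([query, disease, *extra[:k]])


relations = build_relations(DOCS, CATEGORY_TERMS)
expanded = expand_query("patient with persistent cough and chest pain", relations)
print(expanded)
```

In this toy setting the query mentions two terms related to one disease, so that disease and its remaining related term are appended before retrieval. A real system would replace the count-based score with the learned model and draw terms from UMLS rather than a hand-written list.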

Keywords

Funding

This research was supported by the Basic Science Research Program through the National Research Foundation of Korea (NRF), funded by the Ministry of Education (NRF-2017R1D1A1B03036275).
