DOI QR코드

DOI QR Code

Choosing preferable labels for the Japanese translation of the Human Phenotype Ontology

  • Ninomiya, Kota (National Institute of Public Health) ;
  • Takatsuki, Terue (Database Center for Life Science, Research Organization of Information and Systems) ;
  • Kushida, Tatsuya (BioResource Research Center, RIKEN) ;
  • Yamamoto, Yasunori (Database Center for Life Science, Research Organization of Information and Systems) ;
  • Ogishima, Soichi (Advanced Research Center for Innovations in Next-Generation Medicine, Tohoku University)
  • 투고 : 2020.03.16
  • 심사 : 2020.05.22
  • 발행 : 2020.05.28

초록

The Human Phenotype Ontology (HPO) is the de facto standard ontology to describe human phenotypes in detail, and it is actively used, particularly in the field of rare disease diagnoses. For clinicians who are not fluent in English, the HPO has been translated into many languages, and there have been four initiatives to develop Japanese translations. At the Biomedical Linked Annotation Hackathon 6 (BLAH6), a rule-based approach was attempted to determine the preferable Japanese translation for each HPO term among the candidates developed by the four approaches. The relationship between the HPO and Mammalian Phenotype translations was also investigated, with the eventual goal of harmonizing the two translations to facilitate phenotype-based comparisons of species in Japanese through cross-species phenotype matching. In order to deal with the increase in the number of HPO terms and the need for manual curation, it would be useful to have a dictionary containing word-by-word correspondences and fixed translation phrases for English word order. These considerations seem applicable to HPO localization into other languages.

키워드

참고문헌

  1. Kohler S, Carmody L, Vasilevsky N, Jacobsen JO, Danis D, Gourdine JP, et al. Expansion of the Human Phenotype Ontology (HPO) knowledge base and resources. Nucleic Acids Res 2018;47:D1018-D1027. https://doi.org/10.1093/nar/gky1105
  2. Progress of localization of Human Phenotype Ontology. Crowdin, 2020. Accessed 2020 Mar 5. Available from: https://crowdin.com/project/hpo-translation.
  3. Vos RA, Katayama T, Mishima H, Kawano S, Kawashima S, Kim JD, et al. BioHackathon 2015: semantics of data for life sciences and reproducible research [version 1]. F1000Res 2020;9:136. https://doi.org/10.12688/f1000research.18236.1
  4. Robinson PN. Deep phenotyping for precision medicine. Hum Mutat 2012;33:777-780. https://doi.org/10.1002/humu.22080
  5. International Federation of Pharmaceutical Manufacturers and Associations. RARE DISEASES: shaping a future with no-one left behind. Geneva: IFPMA, 2017. Accessed 2020 Jan 25. Available from: https://www.ifpma.org/wp-content/uploads/2018/02IFPMA_Rare_Diseases_Brochure_28Feb2017_FINAL.pdf.
  6. European Commission. Policy: rare diseases: what are they? Brussels: European Commission, 2016. Accessed 2020 Jan 25. Available from: http://ec.europa.eu/health/rare_diseases/policy/index_en.htm.
  7. Shire Human Genetic Technologies. Rare Disease Impact Report: Insights from Patients and the Medical Community. Lexington: Shire Human Genetic Technologies, 2013.
  8. Sanofi. Conducted a "patient journey survey" to listen to the daily feelings and worries of patients with Fabry disease. Tokyo: Sanofi, 2018. Accessed 2020 Jan 25. Available from: https://www.sanofi.co.jp/-/media/Project/One-Sanofi-Web/Websites/Asia-Pacific/Sanofi-JP/Home/press-releases/PDF/2018/20181205.pdf?la=ja.
  9. Robinson PN, Kohler S, Oellrich A; Sanger Mouse Genetics Project, Wang K, Mungall CJ, et al. Improved exome prioritization of disease genes through cross-species phenotype comparison. Genome Res 2013;24:340-348. https://doi.org/10.1101/gr.160325.113
  10. Japanese translation of Human Phenotype Ontology (in progress). San Francisco: GitHub, 2020. Accessed 2020 Mar 5. Available from: https://github.com/ogishima/HPO-japanese.
  11. Kaneko S, Fujita N, Ugawa Y, Kawamoto T, Takeuchi H, Takekoshi M, et al. Life science dictionary: a versatile electronic database of medical and biological terms. In: Dictionaries and Language Learning: How can Dictionaries Help Human and Machine Learning. The 3rd Asialex Biennial International Conference (Murata M, Yamada S, Tono Y, eds.), 2003 Aug 27-29, Meikai University, Uruyasa, Chiba, Japan. Tokyo: The Asian Association for Lexicography, 2003. pp. 434-439.
  12. Japanese translation of Mammalian Phenotype. Tokyo: Riken, 2020. Accessed 2020 Mar 5. Available from: https:// ja.brc.riken.jp/lab/bpmp/ontology/ontology_mp_j.html.
  13. mp_hp-align-equiv.owl. San Francisco: GitHub, 2020. Available from: https://github.com/obophenotype/upheno/blob/ master/hp-mp/mp_hp-align-equiv.owl.
  14. Mecab: yet another part-of-speech and morphological analyzer. Mecab, 2006. Accessed 2020 Mar 5. Available from: http://taku910.github.io/mecab/.
  15. MANBYO dictionary. Ikoma: Social Computing Laboratory, NAIST, 2019. Accessed 2020 Mar 5. Available from: http://sociocom.jp/-data/2018-manbyo/index.html.
  16. Morita H, Kawahara D, Kurohashi S. Morphological analysis for unsegmented languages using recurrent neural network language model. In: Proceedings of EMNLP 2015: Conference on Empirical Methods in Natural Language Processing, 2015 Sep 17-21, Lisbon, Portugal. Stroudsburg: Association for Computational Linguistics, 2015. pp. 2292-2297.
  17. Takaoka K, Hisamoto S, Kawahara N, Sakamoto M, Uchida Y, Matsumoto Y. Sudachi: a Japanese Tokenizer for Business. Paris: European Language Resources Association, 2018.