DOI QR코드

DOI QR Code

open-japanese-mesh: assigning MeSH UIDs to Japanese medical terms via open Japanese-English glossaries

  • Received : 2020.03.24
  • Accepted : 2020.05.26
  • Published : 2020.05.28

Abstract

The Medical Subject Headings (MeSH) thesaurus is a controlled vocabulary for indexing biomedical documents that is used for document retrieval and other natural language processing purposes. However, although the oariginal English MeSH is freely available, its Japanese translation has a restricted license. We attempted to create an open alternative, and for this purpose we made a script for assigning MeSH UIDs to Japanese medical terms using Japanese-English glossaries. From the MeSpEn glossary and MEDUTX dictionary, we generated a 12,457-word Japanese-MeSH dictionary.

Keywords

References

  1. Postell WD. Medicines for the Union Army. Bull Med Lib Assoc 1963;51:144-146.
  2. Bodenreider O. The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Res 2004;32:D267-D270. https://doi.org/10.1093/nar/gkh061
  3. Tateisi Y. Resources for assigning MeSH IDs to Japanese medical terms. Genomics Inform 2019;17:e16. https://doi.org/10.5808/GI.2019.17.2.e16
  4. Villegas M, Intxaurrondo A, Gonzalez-Agirre A, Marimon M, Krallinger M. The MeSpEN Resource for English-Spanish medical machine translation and terminologies: census of parallel corpora, glossaries and term translations. In: Proceedings of the LREC 2018 Workship "MultilingualBIO: Multilingual Biomedical Text Processing" (Melero M, Krallinger M, Gonzalez-Agirre A, eds.), 2018 May 8, Miyazaki, Japan. Paris: European Language Resources Association, 2018. pp. 32-39.
  5. Asia-Pacific Association for Machine Translation. UTX glossaries. Kyoto: Asia Pacific Machine Translation Association, 2017. Accessed 2020 Mar 21. Available from: https://aamt.info/english/download/.
  6. Ikegami Y. jaconv: Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku and Zenkaku. San Francisco: GitHub Inc., 2020. Accessed 2020 Mar 21. Available from: https://github.com/ikegami-yukino/jaconv.