http://dx.doi.org/10.12654/JCS.2021.37.3.06

Digitization of Old Korean Texts with Obsolete Korean Characters and Suggestion for Improvement of Information Sharing  

Kim, Ha Young (Jangseogak, The Academy of Korean Studies)
Yoo, Woo Sik (WaferMasters, Inc.)
Publication Information
Journal of Conservation Science / v.37, no.3, 2021, pp. 255-269
Abstract
A vast amount of material written in old Korean, using old grammar and/or obsolete characters (prints, woodblock prints, manuscripts, old novels, and letters), is held by many institutions, including the Jangseogak at the Academy of Korean Studies. Digitization of these texts has required a prolonged manual input process. Individual researchers who majored in old Korean have read the characters and typed them into electronic documents. This process depends on individual skill, effort, and approach, and is particularly limiting because none of these can be increased significantly. To date, only a small proportion of the old Korean document collections currently kept in storage has been digitized and made available to the public. Even the electronic versions of the texts prove difficult to display correctly, owing to the incompatibility between the old Korean characters and the character sets on today's electronic devices. To improve the techniques and efficiency of digitizing old Korean texts, it is necessary to develop optical character recognition (OCR) that analyzes images of old Korean documents, along with methods for input, display, and storage.
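The character-set incompatibility mentioned above can be illustrated with a minimal sketch. It is not taken from the article; it assumes only Python's standard `unicodedata` module and the general Unicode behavior that NFC normalization composes modern Hangul jamo sequences into a single precomposed syllable, while a syllable containing an obsolete letter such as arae-a (U+119E) has no precomposed form and remains a sequence of conjoining jamo. The helper names `describe` and `has_uncomposed_jamo` are hypothetical, and the range check covers only the basic Hangul Jamo block (U+1100 to U+11FF), which is a simplification.

```python
import unicodedata


def describe(text: str) -> list[str]:
    """List each code point in text as 'U+XXXX NAME' (illustrative helper)."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch, '<unnamed>')}" for ch in text]


def has_uncomposed_jamo(text: str) -> bool:
    """Rough heuristic: True if conjoining jamo remain after NFC normalization.

    Only the basic Hangul Jamo block (U+1100-U+11FF) is checked; the
    extended jamo blocks are ignored in this sketch.
    """
    nfc = unicodedata.normalize("NFC", text)
    return any(0x1100 <= ord(ch) <= 0x11FF for ch in nfc)


# Modern syllable written as conjoining jamo: U+1112 + U+1161 + U+11AB
modern_jamo = "\u1112\u1161\u11AB"
# Same consonants with the obsolete vowel arae-a (U+119E) in the middle
old_jamo = "\u1112\u119E\u11AB"

modern_nfc = unicodedata.normalize("NFC", modern_jamo)
old_nfc = unicodedata.normalize("NFC", old_jamo)

print(len(modern_nfc), describe(modern_nfc))  # one precomposed syllable
print(len(old_nfc), describe(old_nfc))        # still three separate jamo
print(has_uncomposed_jamo(modern_jamo), has_uncomposed_jamo(old_jamo))
```

Because the old syllable survives only as a jamo sequence, whether it appears as one syllable block on screen depends on font and rendering support for such sequences, which is one plausible source of the display problems the abstract describes.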
Keywords
Old Korean characters; Old Korean documents; Old Korean font incompatibility; Digitization; Optical Character Recognition (OCR); Image analysis