Research and Development of Document Recognition System for Utilizing Image Data

Kwag, Hee-Kue;

doi:10.3745/KIPSTB.2010.17B.2.125

The KIPS Transactions:PartB (정보처리학회논문지B)

Volume 17B Issue 2
/
Pages.125-138
/
2010
/
1598-284X(pISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Research and Development of Document Recognition System for Utilizing Image Data

이미지데이터 활용을 위한 문서인식시스템 연구 및 개발

Kwag, Hee-Kue

곽희규 ((주)인지소프트)

Received : 2009.11.04
Accepted : 2010.03.04
Published : 2010.04.30

https://doi.org/10.3745/KIPSTB.2010.17B.2.125 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

The purpose of this research is to enhance document recognition system which is essential for developing full-text retrieval system of the document image data stored in the digital library of a public institution. To achieve this purpose, the main tasks of this research are: 1) analyzing the document image data and then developing its image preprocessing technology and document structure analysis one, 2) building its specialized knowledge base consisting of document layout and property, character model and word dictionary, respectively. In addition, developing the management tool of this knowledge base, the document recognition system is able to handle the various types of the document image data. Currently, we developed the prototype system of document recognition which is combined with the specialized knowledge base and the library of document structure analysis, respectively, adapted for the document image data housed in National Archives of Korea. With the results of this research, we plan to build up the test-bed and estimate the performance of document recognition system to maximize the utilization of full-text retrieval system.

본 연구는 공공기관이 소장한 이미지데이터의 검색 및 열람 등의 활용성을 높이기 위한 전문검색서비스 구현 시 필수적인 문서인식시스템의 고도화를 목표로 한다. 주요한 연구방향은 공공기관이 소장하고 있는 데이터를 사전에 분석하여 문서이미지 전처리 및 문서구조분석 기술을 개발하고, 문서인식 과정에서 활용하기 위한 이미지내용DB, 문자모델DB, 용어DB로 구성되는 특화된 지식베이스를 구축하는 것이다. 또한, 지식베이스 관리도구를 개발하여 향후 다양한 형태의 문서이미지로의 확장을 가능하게 한다. 최근 본 연구는 국가기록원에서 소장하고 있는 이미지데이터에 적합한 문서구조분석 라이브러리와 특화된 지식베이스를 결합한 문서인식 프로토타입 시스템 개발을 완료했다. 향후 본 연구의 결과는 방대한 소장자료의 검색 및 활용을 극대화할 전문검색시스템 연계를 위한 성능평가 및 테스트베드 구축에 활용될 것이다.

Keywords

References

김두식, 김상엽, 이성환, “한글문서 분석 및 인식기술의 최근 연구동향”, 전자공학회지, 제24권, 제9호, pp.1058-1070, 1997.
이준호, 이충식, 한선화, 김진형, “문자 인식에 의해 구축된 한글 문서 데이터베이스에 대한 정보 검색”, 한국정보처리논문지, 제6권, 제4호, pp.833-840, 1999.
정규식, 권희웅, “내용기반의 인쇄체 영문 문서 영상 검색을 위한 특징기반 단어 검색”, 한국정보과학논문지, 제26권, 제10호, pp.1204-1218, 1999.
오일석, 김수형, 유태웅, 곽희규, "문서 영상 처리 기술과 디지털 도서관", 한국정보과학회지, 제20권, 제8호, pp.24-34, 2002.
E. A. Galloway and G. V. Michalek, "The Heinz Electronic Library Interactive Online System(HELIOS): Building a digital archive using imaging, OCR, and natural language processing technologies," The Public-Access Computer Systems Review, Vol.6, No.4, pp.6-18, 1995.
K. Marukawa, T. Hu, H. Fujisawa and Y. Shima, "Document retrieval tolerating character recognition errors-evaluation and application," Pattern Recognition, Vol.30, No.8, pp.1361-1371, 1997. https://doi.org/10.1016/S0031-3203(96)00155-0
D. Doermann, "The indexing and retrieval of document images: A survey," Computer Vision and Image Understanding, Vol.70, No.3, pp.287-298, 1998. https://doi.org/10.1006/cviu.1998.0692
Digital Heritage Publishing Ltd., "The electronic version of Siku Quanshu," http://www.skqs.com.
T. Keaton, H. Greenspan and R. Goodman, "Keyword spotting for cursive document retrieval," Proceedings of the workshop on Document Image Analysis, pp.74-81, 1997. https://doi.org/10.1109/DIA.1997.627095
M. Droettboom, I. Fujinaga, K. MacMilan, G. S. Chouhury, T. DiLauro, M. Patton and T. Anderson, "Using the Gamera framework for the recognition of cultural heritage materials,” Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital Libraries, pp.11-17, 2002. https://doi.org/10.1145/544220.544223
S. Hara, “OCR for CJK classical texts preliminary examination,” Proc. Pacific Neighborhood Consortium(PNC) Annual Meeting, Taipei, Taiwan, pp.11-17, 2000.
M. Kojima, Y. Kawazoe and M. Kimura, “Automatic Tibetan Script Recognition by Computer,” Proceeding of the 7th Seminar of the International Association for Tibetan Studies, Graz, 1995, edited by Ernst Steinkellner, Vol.1, pp.527-533, 1997.
T. Shih, "Transformation of palace archives of Ming and Ching Dynasties onto CD-ROM and Internet," Proc. Pacific Neighborhood Consortium(PNC) Annual Meeting, Taipei, Taiwan, 2000.
Minsoo Kim, Kyutae Cho, Heegue Kwag, Jin Hyung Kim, "Segmentation Method of Handwritten Characters for Digitalizing Korean Historical Documents," The 6th international Conference on Document Analysis Systems, Florence, pp.114-124, 2004.
M. S. Kim, S. Ryu, K. T. Cho, T. H. Rhee, H. I. Choi, J. H. Kim, "Recognition-based Digitalization of Korean Historical Archives," Asian Information Retrieval Symposium(AIRS2004), Beijing, China, pp.186-189, 2004.
J. Beusekom, D. Keysers, F. Shafait, T. M. Breuel, “Example-Based Logical labeling of Document Title Page Images,” 2007, Proceedings of the 9th International Conference on Document Analysis and Recognition (ICDAR 2007), pp.919-923. https://doi.org/10.1109/ICDAR.2007.109
F. Shafait, J. Beusekom, D. Keysers, T. M. Breuel, “Structural Mixtures for Statistical layout Analysis,” 2008, Proc. 8th Int. Workshop on Document Analysis Systems (DAS) Accepted for publication. https://doi.org/10.1109/DAS.2008.61