A Syntactic Approach for Logical Structure Analysis of Document Images

문서 영상의 논리적인 구조 분석을 위한 구문론적인 접근 방식

  • Published : 2001.07.01

Abstract

본 논문에서는 다수의 페이지로 구성된 복잡한 구조의 문서로부터 SGML/XML에 기반한 전자 문서를 생성하기 위한 구문론적인 구조분석 방법을 제안한다. 특히 제안된 파싱 기법은 텍스트 라인을 기본 단위로 하는 기존 연구보다 논리적인 계층 구조를 보다 정확하고 빠르게 생성하기 위하여 텍스터 영역의 계층적인 트리 구조를 입력으로 받아들인다. 또한 문서 유형의 논리적인 구조 정보와 기하적인 특성을 효과적으로 기술할 수 있는 문서모델을 정의하고, 이의 자동 생성과 점증적인 학습 방법을 제안한다. 제안된 방법의 성능을 평가하기 위하여 과학 기술 논문으로부터 스캐닝한 372개의 논문 연상으로 실험한 결과, 제안된 방법은 기존 연구와 달리 다수의 문서 영상으로 구성된 문서에 대하여 논리적인 구조분석과 문서 모델의 자동 생성을 효율적으로 지원하였다. 특히 제안된 방법은 논리적인 구조분석의 최종 결과로서 SGML/XML 문서를 생성하기 때문에 문서의 재 사용성과 호환성을 높인다.

Keywords

References

  1. International Organization for Standardization, Information Processing-Text and Office Systems-Standard Generalized Markup Language(SGLM), ISO/IEC 8879, 1986
  2. World Wide Web Consortium, Extensible Markup Language(XML) 1.0, http://ww.w3c.org/TR/REC-xml, 2000
  3. K. M. Summers, 'Toward a Taxonomy of Logical Document Structures,' Proc. Dartmouth Institute for Advanced Graduate Studies(DAGS'95), pp.124-133, Boston, May 1995
  4. G. Nagy, J. Kanai, M. Krishnamoorthy, M. Thomas, and M. Viswanathan, 'Two Complementary Techniques for Digitized Document Analysis,' Proc. ACM Conf. Document Processing Systems, pp. 169-176, 1988 https://doi.org/10.1145/62506.62539
  5. G. Nagy, S. Seth, and M. Viswanathan, A Prototype Document Image Analysis System for Technical Journals, IEEE Computer, Vol. 25, No. 7, pp. 10-22, Jul. 1992 https://doi.org/10.1109/2.144436
  6. M. Krishnamoorthy, G. Nagy, S. Seth, and M. Viswanathan, 'Syntactic Segmentation and Labeling of Digitized Pages from technical Journals,' IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 15, No. 7, pp. 737-747, Jul. 1993 https://doi.org/10.1109/34.221173
  7. G.S.D. Farrow, C.S. Xydears, J.P. Oakley, A. Khorabi and N.G. Prelcic, 'A comparison of System Architectures for Intelligent Document Understanding,' Signal processing: Image Communication, Vol. 9, pp. 1-19, 1996 https://doi.org/10.1016/S0923-5965(96)00002-1
  8. D. Niyogi and S. N. Srihari, 'An Integrated Approach to Document Decomposition and Structural Analysis,' Int'l Journal of Imaging Systems and Technology, Vol. 7, pp.330-342, 1996 https://doi.org/10.1002/(SICI)1098-1098(199624)7:4<330::AID-IMA8>3.0.CO;2-9
  9. A. Dengel and G. Barth, High Level Document Analysis Guided By Geometric Aspects, Int'l Journal of Pattern Recognition and Artificial Intelligence, Vol. 2, No. 4, pp.641-655, 1988 https://doi.org/10.1142/S0218001488000406
  10. A. Dengel, R. Bleisinger, R. Hoch, F. Fein, and F. Hnes, From Paper to Office Document Standard Representation, IEEE Computer, Vol. 25, No. 7, pp. 63-67, Jul. 1992 https://doi.org/10.1109/2.144442
  11. S. Tsujimoto and H. Asada, Major Components of a Complete Text Reading System, Proc. IEEE, Vol. 80, No. 7, pp. 1133-1149, Jul. 1992 https://doi.org/10.1109/5.156475
  12. International Organization for Standardization, Information Technology-Text and Office Systems-Document Style Semantics and Specification(DSSSL), ISO/IEC10176, 1996
  13. L. O'Gorman and R. Kasturi, Document Image Analysis, IEEE Computer Society, 1995
  14. G. Nagy, Twenty Years of Document Image Analysis in PAMI, IEEE Trans. pattern Analysis and Machine Intelligence, Vol. 22, No. 1, pp. 38-62, Jan. 2000 https://doi.org/10.1109/34.824820
  15. G. A. Story, L. O'Gorman, D. Fox, L. L. Schaper, and H. V. Jagadish, The RightPages Image-Based Electronic Library for Alerting and Browsing, IEEE Computer, Vol. 25, No. 9, pp. 17-26, Sept. 1992 https://doi.org/10.1109/2.156379
  16. T. Hu and R. Ingold, 'A Mixed Approach toward an Efficient Logical Structure Recognition from Document Images,' Electronic Publishing: Origination, Dissemination and Design, Vol. 6, NO. 4, pp. 457-468, 1993
  17. A. Conway, 'Page Grammars and page parsing: A Syntactic Approach to Document Layout Recognition,' Proc. Second Int'l Conf. Document Analysis and Recognition, pp. 761-764, 1993 https://doi.org/10.1109/ICDAR.1993.395626
  18. Y. Tateisi and N. Itoh, 'Using Stochastic Syntactic Analysis for Extraction a Logical Structure from a Document Image,' Proc. Int'l Conf. Pattern Recognition, Vol. 2, pp. 391-394, oct. 1994
  19. B. Klein and P. Fankhauser, 'Error Tolerant Document Structure Analysis,' Proc. IEEE Int'l Forum on Research and Technology on Advances in Digital Libraries, pp. 116-127, 1997 https://doi.org/10.1109/ADL.1997.601207
  20. B. Klein and A. Abecker, 'Distributed knowledge-based parsing for Document Analysis and Understanding,' Proc. IEEE Int'l Forum on Research and Technology on Advances in Digital Libraries, pp. 6-15, May 1999 https://doi.org/10.1109/ADL.1999.777686
  21. D. Rus and K. Summers, Geometric Algorithms and Experiments for Automated Document Structuring, Mathematical and Computer Modelling, Vol. 26, No. 1, pp. 55-83, 1997 https://doi.org/10.1016/S0895-7177(97)00104-0
  22. K. M. Summers, Automatic Discovery of Logical Document Structure, Ph. D. Thesis, cornell University, Aug. 1998
  23. O. Hitz, L. Robadey, and R. Ingold, Analysis of synthetic document images, Proc. Fifth Int'l Conf. Document Analysis and Recognition, pp.374-377, Bangalore, India, Sep. 1999 https://doi.org/10.1109/ICDAR.1999.791802
  24. C. Lin, Y. Niwa, S. Narita, Logical Structure Analysis of Book Document Image Using contents Information, Proc. Fourth int'l Conf. Document Analysis and Recognition, Vol. II, pp. 1048-1051, 1997 https://doi.org/10.1109/ICDAR.1997.620669
  25. T. Kochi, and T. Saitoh, A Layout-Free Method for Element Extraction from Document Images, Proc. Workshop on Document Analysis System, pp. 336-345, 1998
  26. T. A. Bayer and H. Walischewski, 'Experiments on Extracting Structural Information from Paper Documents using Syntactic Pattern Analysis,' Proc. of the Third Int'l Conf. Document Analysis and Recognition, pp.476-479, 1995 https://doi.org/10.1109/ICDAR.1995.599039
  27. M. Worring, and A. W.M. Smeulders, 'Content-based Internet Access to paper Documents,' Int'l Journal on Document Analysis and Recognition, Vol.1, No. 4, pp. 209-220, 1999 https://doi.org/10.1007/s100320050020
  28. A. V. Aho, R. Sethi, and J. D. Ullman, Compilers: Principles, Techniques, and Tools, Addison-Wesley, 1986
  29. A. Bruggemann-Klein, 'Regular Expressions into Finite Automata,' Theoretical Computer Science, Vol. 120, No. 2, pp. 197-213, Nov. 1993 https://doi.org/10.1016/0304-3975(93)90287-4
  30. K. Koffka, Principles of Gestalt Psychology, Harcourt, Brace and World, New York, 1935
  31. A. R. R. Wang, Algorithms for Multi-level Logic Optimization, Ph.D. Thesis, The University of California, Berkeley, 1989
  32. K. H. Lee, Y. C. Choy and S. B. Cho, 'Geometric Structure Analysis of Document Images: A Knowledge-based Approach,' IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 22, No. 11, pp. 1224-1240, Nov. 2000 https://doi.org/10.1109/34.888708
  33. K. Zhang and D. Shasha, Simple Fast Algorithms for the Editing Distance between Trees and Related Problems, SIAM Journal on Computing, Vol. 18, No. 6, pp. 1245-1262, 1989 https://doi.org/10.1137/0218082