Efficient Skew Estimation for Document Images Based on Selective Attention

선택적 주의집중에 의한 문서영상의 효율적인 기울어짐 추정

  • Gwak, Hui-Gyu (Dept.of Computer Science, Chonnam National University) ;
  • Kim, Su-Hyeong (Dept.of Computer Science, Chonnam National University)
  • Published : 1999.10.01

Abstract

본 논문에서는 한글과 영문 문서 영상들에 대한 기울어짐 추정(skew estimation) 알고리즘을 제안한다. 제안 방법은 전체 문서 영상에서 텍스트 요소들이 밀집되어 있는 영역을 선별하고, 선별된 영역에 대해 허프 변환을 적용하는 선택적 주의집중(selective attention) 방식을 채택한다. 제안 방법의 기울기 추정 과정은 2단계로 구성되는데, coarse 단계에서는 전체 영상을 몇 개의 영역으로 나누고 동일한 영역에 속하는 데이타들간의 연결 각도를 계산하여 각 영역별 accumulator에 저장한다. accumulator에 저장된 빈도치를 기준으로 $\pm$45$^{\circ}$범위 내에서 최대 $\pm$1$^{\circ}$의 오차를 가진 각 영역별 기울기를 계산한 후, 이들 중 최대 빈도값을 갖는 영역을 선정하고 그 영역의 기울기 각도를 문서 영상의 대략적인 기울기 각도로 결정한다. Refine 단계에서는 coarse 단계에서 선정된 영역에 허프 변환을 적용하여 정확한 기울기를 계산하는데, coarse 단계에서 추정한 기울기의 $\pm$1$^{\circ}$범위 내에서 0.1$^{\circ}$간격으로 측정한다. 이와 같은 선택적 주의집중 방식을 통해 기울기 추정에 소요되는 시간 비용은 최소화하고, 추정의 정확도는 최대화 할 수 있다.제안 방법의 성능 평가를 위한 실험은 다양한 형태의 영문과 한글 문서 영상 2,016개에 적용되었다. 제안 방법의 평균 수행 시간은 Pentium 200MHz PC에서 0.19초이고 평균 오차는 $\pm$0.08$^{\circ}$이다. 또한 기존의 기울기 추정 방법과 제안 방법의 성능을 비교하여 제안 방법의 우수성을 입증하였다.Abstract In this paper we propose a skew estimation algorithm for English and Korean document images. The proposed method adopts a selective attention strategy, in which we choose a region of interest which contains a cluster of text components and then apply a Hough transform to this region. The skew estimation process consists of two steps. In the coarse step, we divide the entire image into several regions, and compute the skew angle of each region by accumulating the slopes of lines connecting any two components in the region. The skew angle is estimated within the range of $\pm$45 degree with a maximum error of $\pm$1 degree. Next we select a region which has the most frequent slope in the accumulators and determine the skew angle of the image roughly as the angle corresponding to the most frequent slope. In the refine step, a Hough transform is applied for the selected region within the range of $\pm$1 degree along the angle computed from the coarse step, with an angular resolution of 0.1 degree. Based on this selective attention strategy, we can minimize the time cost and maximize the accuracy of the skew estimation.We have measured the performance of the proposed method by an experiment with 2,016 images of various English and Korean documents. The average run time is 0.19 second on a Pentium 200MHz PC, and the average error is $\pm$0.08 degree. We also have proven the superiority of our algorithm by comparing the performance with that of other well-known methods in the literature.

Keywords

References

  1. Machine Vision and Applications v.2 Analysis of textual images using the Hough transform S.N. Srihari;V. Govindaraju
  2. Proc. 10th Int. Conf. on Pattern Recognition A document skew detection method using run length encoding and the Hough transform S.C. Hinds;J.L. Fisher;D.P. D'Amato
  3. Pattern Recognition v.27 no.10 Automated page orientation and skew angle detection for binary document image D.X. Le;G. Thoma;H. Weschler
  4. Pattern Recognition v.29 no.10 A robust and fast skew detection algorithm for generic documents B. Yu;A.K. Jain
  5. Journal of Electronic Imaging v.5 no.4 Comparative study of skew detection algorithms A. Amin;S. Fischer;A.F. Parkinson;R. Shiu
  6. Pattern Recognition Letters v.17 An improved document skew angle estimation technique U. Pal;B.B. Chaudhuri
  7. Pattern Recognition Letters v.18 A fast approach to the detection and correction of skew documents H.F. Jiang;C.C. Han;K.C. Fan
  8. Proc. SPSE 40th Conf. Symp. Hybrid Imaging Systems The skew angle of printed documents H.S. Baird
  9. Proc. 8th Int. Conf. on Pattern Recognition Detection of Linear oblique structures and skew scan in digitized documents W. Postl
  10. Pattern Recognition v.23 no.11 Automated entry system for printed documents T. Akiyama;N. Hagita
  11. Pattern Recognition v.30 no.9 Skew detection and text line position determination in digitized documents B. Gatos;N. Papamarkos;C. Chamzas
  12. Int. Journal on Document Analysis and Recognition v.1 no.1 Projection profile based skew estimation algorithm for JBIG compressed images J. Kanai;A.D. Bagdanov
  13. Pattern Recognition Letters v.4 A method of detecting the orientation of aligned components A. Hashizume;P.S. Yeh;A. Rosenfeld
  14. IEEE Trans. Pattern Anal. Mach. Intell. v.15 no.11 The document spectrum for page layout analysis L. O'Gorman
  15. Proc. 3rd Int. Conf. on Document Analysis and Recognition Cooperation of multi-layer perceptrons for the estimation of skew angle in text document images N. Rondel;G. Burel
  16. CVGIP: Graphical Models and Image Processing v.55 no.6 Skew correction of document images using interline cross-correlation H. Yan
  17. IEEE Trans. Image Processing v.6 no.2 Robust detection of skew in document images Avanindra;S. Chaudhuri
  18. 3rd IAPR Workshop on Document Analysis Systems Fast skew correction using block transformation H.K. Kwag;K.C. Lee;S.H. Kim;S.W. Jeong