Texture-based PCA for Analyzing Document Image

텍스처 정보 기반의 PCA를 이용한 문서 영상의 분석

  • Kim, Bo-Ram (Department of Computer Engineering, Yeungnam University) ;
  • Kim, Wook-Hyun (Department of Computer Engineering, Yeungnam University)
  • Published : 2006.06.21

Abstract

In this paper, we propose a novel segmentation and classification method using texture features for the document image. First, we extract the local entropy and then segment the document image to separate the background and the foreground using the Otsu's method. Finally, we classify the segmented regions into each component using PCA(principle component analysis) algorithm based on the texture features that are extracted from the co-occurrence matrix for the entropy image. The entropy-based segmentation is robust to not only noise and the change of light, but also skew and rotation. Texture features are not restricted from any form of the document image and have a superior discrimination for each component. In addition, PCA algorithm used for the classifier can classify the components more robustly than neural network.

Keywords