Search | Korea Science

A Method for Thresholding and Correction of Skew in Camera Document Images (카메라 문서 영상의 이진화 및 기울어짐 보정 방법)

Jang Dae-Geun;Chun Byung-Tae
- Journal of the Korea Society of Computer and Information
- /
- v.10 no.3 s.35
- /
- pp.143-150
- /
- 2005
Camera image is very sensitive to illumination that result in difficulties for recognizing character. Also Camera captured document images have not only skew but also vignetting effect and geometric distortion. Vignetting effect make it difficult to separate characters from the document images. Geometric distortion, occurred by the mismatch of angle and center position between the document image and the camera, make the shape of characters to be distorted, so that the character recognition is more difficult than the case of using scanner. In this paper, we propose a method that can increase the performance of character recognition by correcting the geometric distortion of document images using a linear approximation which changes the quadrilateral region to the rectangle one. The proposed method also determine the quadrilateral transform region automatically, using the alignment of character lines and the skewed angles of characters located in the edges of each character line. Proposed method, therefore, can correct the geometric distortion without getting positional information from camera.
PDF

Word Extraction from Table Regions in Document Images (문서 영상 내 테이블 영역에서의 단어 추출)

Jeong, Chang-Bu;Kim, Soo-Hyung
- The KIPS Transactions:PartB
- /
- v.12B no.4 s.100
- /
- pp.369-378
- /
- 2005
Document image is segmented and classified into text, picture, or table by a document layout analysis, and the words in table regions are significant for keyword spotting because they are more meaningful than the words in other regions. This paper proposes a method to extract words from table regions in document images. As word extraction from table regions is practically regarded extracting words from cell regions composing the table, it is necessary to extract the cell correctly. In the cell extraction module, table frame is extracted first by analyzing connected components, and then the intersection points are extracted from the table frame. We modify the false intersections using the correlation between the neighboring intersections, and extract the cells using the information of intersections. Text regions in the individual cells are located by using the connected components information that was obtained during the cell extraction module, and they are segmented into text lines by using projection profiles. Finally we divide the segmented lines into words using gap clustering and special symbol detection. The experiment performed on In table images that are extracted from Korean documents, and shows $99.16\%$ accuracy of word extraction.
https://doi.org/10.3745/KIPSTB.2005.12B.4.369 인용 PDF KSCI

A Kerword Spotting System of Korean Document Images (한글 문서 영상의 단어 검색 시스템)

최윤성;오일석
- Proceedings of the Korean Information Science Society Conference
- /
- 2002.10d
- /
- pp.586-588
- /
- 2002
본 논문은 한글 문서 영상의 단어 검색 시스템과 그 성능을 제시한다. 두 단계 검색 방법은 검색 속도 증가를 목적으로 하며, 첫 번째 단계에서는 매우 빠른 속도로 거친 정합을 통하여 후보 단어들을 추출한다. 두 번째 단계는 후보 단어들 중에서 미세한 정합을 통한 단어 검색이 이루어진다. 시스템은 문서 영상 구조 분석 모듈과 단어 검색 모듈로 구성된다. 실험 자료를 통해 시스템의 유용성을 입증한다.
PDF

Document Image Compression Using Binary Subband Analysis and Zerotree-based Arithmetic Coder (이진 대역분할과 Zerotree 기반 산술부호기를 이용한 문서 영상 압축)

김정권;김승환;이충웅
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 1999.06b
- /
- pp.45-50
- /
- 1999
이진 영상의 압축은 디지털 도서관, 팩시밀리 전송, 문서 입출력 시스템과 같이 한정된 대역폭과 저장 공간을 가진 응용 분야에서 절실히 요구되고 있다. 현재 많은 영상 압축 알고리즘이 채택하고 있는 대역분할 기법을 문서와 같은 이진 영상의 압축에 적용한다면, 점진적 전송, 축소영상을 통한 빠른 검색 등의 장점을 얻을 수 있다. 그러나, 이진 영상 신호가 두 단계의 휘도 값을 가지므로, 이에 적합한 대역분할 방법과 산술부호기를 선택하여야 한다. 본 논문에서는 표본화-XOR 대역분할 기법을 선택하여, 알파벳 수의 증가를 막고 공간영역에서 국부적인 성질을 얻을 수 있다 또한, 넓은 단일-색 영역을 Zerotree로 대표하여 부호화 되는 신호의 수를 줄이고, 대역분할 구조에서 예측성의 저하를 막기 위한 적절한 조건화문맥과 새로운 부호를 선택한다. 이진 영상에 적합한 대역분할 방법과 산술부호기를 선택하여, 대역분할의 장점과 우수한 압축 성능을 달성할 수 있다.
PDF

조응구조의 지시사상 (mapping) 이론

Park, Yeong-Gyu
- Annual Conference on Human and Language Technology
- /
- 1990.11a
- /
- pp.199-199
- /
- 1990
입력된 문서 영상으로부터 분리 추출된 문자 영상을 올바르게 인식하는 것은 문서 인식에서 가장 핵심적인 부분이다. 스캐너를 통해 입력되고 분리된 실제의 문자 영상은 많은 문제점들을 가지고 있다. 한글의 경우 이 중 개별 문자 영상내의 각 자소간의 접촉은 올바른 인식을 저해하는 주요한 원인이다. 이런 접촉의 문제를 효율적으로 해결하기 위해 한글의 구조적 특성을 지닌 "방향 필터"를 정의하고, 이것을 이용하여 세선화된 문자 영상을 추적하면서 선소들을 뽑아낸다. 이렇게 하여 얻은 선소들과 선소들간의 지식을 조합하여 한글자소 획을 추출케 되고 결국에는 이런 획의 조합을 통해 문자 영상을 인식하는 방법을 제안한다.
PDF

Segmentation and Contents Classification of Document Images Using Local Entropy and Texture-based PCA Algorithm (지역적 엔트로피와 텍스처의 주성분 분석을 이용한 문서영상의 분할 및 구성요소 분류)

Kim, Bo-Ram;Oh, Jun-Taek;Kim, Wook-Hyun
- The KIPS Transactions:PartB
- /
- v.16B no.5
- /
- pp.377-384
- /
- 2009
A new algorithm in order to classify various contents in the image documents, such as text, figure, graph, table, etc. is proposed in this paper by classifying contents using texture-based PCA, and by segmenting document images using local entropy-based histogram. Local entropy and histogram made the binarization of image document not only robust to various transformation and noise, but also easy and less time-consuming. And texture-based PCA algorithm for each segmented region was taken notice of each content in the image documents having different texture information. Through this, it was not necessary to establish any pre-defined structural information, and advantages were found from the fact of fast and efficient classification. The result demonstrated that the proposed method had shown better performances of segmentation and classification for various images, and is also found superior to previous methods by its efficiency.
https://doi.org/10.3745/KIPSTB.2009.16B.5.377 인용 PDF KSCI

Implementation of An Inappropriate Web-I mages Blocking System for Youth (청소년을 위한 유해 웹영상 차단시스템의 구현)

이은애;정명숙;김재건;하석운
- Proceedings of the Korea Multimedia Society Conference
- /
- 2000.04a
- /
- pp.319-323
- /
- 2000
인터넷이 활성화되면서 청소년에게 유해한 영상을 제공하는 사이트들이 급속히 범람하고 있으며, 이로 인해 청소년들의 정신 건강이 심각하게 훼손되고 있다. 본 논문에서는 청소년들이 접근하기 쉬운 유해 URL 의 웹문서에 대해 그 문서 내에 포함되어 있는 영상들의 유해성을 판별하여 유해 영상을 선택적으로 차단할 수 있는 시스템을 구현하여 제시한다. 유해 URL들에 대해 실험한 결과, 제안한 시스템의 효율은 full nudity의 경우에는 89.6% , 반라의 경우는 70.1% 의 차단 효율을 나타내었으며, 얼굴영상의 경우는 2%의 오판별이 있었다.
PDF

Block Classification of Document Images by Block Attributes and Texture Features (블록의 속성과 질감특징을 이용한 문서영상의 블록분류)

Jang, Young-Nae;Kim, Joong-Soo;Lee, Cheol-Hee
- Journal of Korea Multimedia Society
- /
- v.10 no.7
- /
- pp.856-868
- /
- 2007
We propose an effective method for block classification in a document image. The gray level document image is converted to the binary image for a block segmentation. This binary image would be smoothed to find the locations and sizes of each block. And especially during this smoothing, the inner block heights of each block are obtained. The gray level image is divided to several blocks by these location informations. The SGLDM(spatial gray level dependence matrices) are made using the each gray-level document block and the seven second-order statistical texture features are extracted from the (0,1) direction's SGLDM which include the document attributes. Document image blocks are classified to two groups, text and non-text group, by the inner block height of the block at the nearest neighbor rule. The seven texture features(that were extracted from the SGLDM) are used for the five detail categories of small font, large font, table, graphic and photo blocks. These document blocks are available not only for structure analysis of document recognition but also the various applied area.
PDF

Speed-up of Document Image Binarization Method Based on Water Flow Model (Water flow model에 기반한 문서영상 이진화 방법의 속도 개선)

오현화;김도훈;이재용;김두식;임길택;진성일
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.41 no.4
- /
- pp.75-86
- /
- 2004
This paper proposes a method to speed up the document image binarization using a water flow model. The proposed method extracts the region of interest (ROI) around characters from a document image and restricts pouring water onto a 3-dimensional terrain surface of an image only within the ROI. The amount of water to be filed into a local valley is determined automatically depending on its depth and slope. The proposed method accumulates weighted water not only on the locally lowest position but also on its neighbors. Therefore, a valley is filed enough with only one try of pouring water onto the terrain surface of the ROI. Finally, the depth of each pond is adaptively thresholded for robust character segmentation, because the depth of a pond formed at a valley varies widely according to the gray-level difference between characters and backgrounds. In our experiments on real document images, the Proposed method has attained good binarization performance as well as remarkably reduced processing time compared with that of the existing method based on a water flow model.
PDF KSCI

An Automatic Control System of a Camera Zoom and Focus Using the Fuzzy Inference and the Difference of the Light and Darkness (Fuzzy 추론 및 명암차이법을 이용한 카메라 줌, 포커스 자동 조절 시스템)

박홍선;박상욱;박정현;곽주원;손영선
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2002.12a
- /
- pp.406-409
- /
- 2002
본 논문에서는 문자인식이 가능하도록 줌, 포커스를 제어하여 한글 문서 영상을 확대/축소하는 시스템을 구현하였다. 한글 문서 영상에서 확대/축소 할 영역이 지정되면 그 영역의 가로, 세로 거리를 펄스 수로 변환한 후 Step모터를 제어하여 그 위치만큼 카메라를 이동시킨다. 문서 영상이 입력되면 문자인식이 가능한 크기만큼 줌을 제어하고, 피드백 되어진 영상으로부터 조정된 줌에 맞는 포커스로 근접 제어한 후, 더욱 선명한 영상을 얻기 위해 명암 차이에 의한 미세 조정을 하였다. 이 경우, 줌 및 포커스는 퍼지 추론으로 .제어하는 DC모터로 조정하였다.

Search Result 381, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)