• Title/Summary/Keyword: document image processing

Search Result 105, Processing Time 0.022 seconds

Adaptive Binarization for Camera-based Document Recognition (카메라 기반 문서 인식을 위한 적응적 이진화)

  • Kim, In-Jung
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.12 no.3
    • /
    • pp.132-140
    • /
    • 2007
  • The quality of the camera image is worse than that of the scanner image because of lighting variation and inaccurate focus. This paper proposes a binarization method for camera-based document recognition, which is tolerant to low-quality camera images. Based on an existing method reported to be effective in previous evaluations, we enhanced the adaptability to the image with a low contrast due to low intensity and inaccurate focus. Furthermore, applying an additional small-size window in the binarization process, it is effective to extract the fine detail of character structure, which is often degraded by conventional methods. In experiments, we applied the proposed method as well as other methods to a document recognizer and compared the performance for many cm images. The result showed the proposed method is effective for recognition of document images captured by the camera.

  • PDF

Design and implementation of XML document edit system that intend to MathML mathematical formula structure representation (MathML 수식 구조 표현을 지향하는 XML 문서 편집 시스템의 설계 및 구현)

  • 김철순;정회경
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2002.11a
    • /
    • pp.363-367
    • /
    • 2002
  • Represent of mathematical formula used within system handling document that is nonstructural in existent document editing system that is used in electron document processing that use computer is represented or processed by method that is nonstructural of image or text or etc. Such mathematical formular causes relative inconvenience to readablility and reusability of document and processing and exchange of document. Therefore, document editing system is required that can overcome such nonadvantage and apply MathML mathematical formula structure on efficiently structural document. Therefore, designed and implemented that document editing system for structural document creation of XML base that can mathematical formular editing of MathML base in this paper.

  • PDF

An Adaptive Binarization of Camera Document Image by Image Quality Estimation (화질 분석을 통한 카메라 문서 영상의 적응적 이진화)

  • Kim, In-Jung
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.9
    • /
    • pp.797-803
    • /
    • 2007
  • Adaptive binarization is very important for the camera-based document recognition. This paper proposes a binarization method which can effectively adapt to the variation of image Qualify. Firstly, it analyzes the effect of binarization parameters to the result and proposes a method to measure the image quality. Then, it statistically analyzes the relationship between the image quality and the binarization parameter. Finally, it proposes a binarization method that automatically adapts to the quality of the input image, using the analysis result. The experiment results show that there is a meaningful relationship between the image quality and the binarization parameter, and therefore, the proposed method can effectively adapt to the variation of image quality.

Mongolian Traditional Stamp Recognition using Scalable kNN

  • Gantuya., P;Mungunshagai., B;Suvdaa., B
    • International journal of advanced smart convergence
    • /
    • v.4 no.2
    • /
    • pp.170-176
    • /
    • 2015
  • The stamp is one of the crucial information of traditional historical and cultural for nations. In this paper, we purpose to detect official stamps from scanned document and recognize the Mongolian traditional, historical stamps. Therefore we performed following steps: first, we detect official stamps from scanned document based on red-color segmentation and document standard. Then we collected 234 traditional stamp images with 6 classes and 100 official stamp images from scanned document images. Also we implemented the processing algorithms for noise removing, resize and reshape etc. Finally, we proposed a new scale invariant classification algorithm based on KNN (k-nearest neighbor). In the experimental result, our proposed a method had shown proper recognition rate.

Design and Implementation of Two Dimensional Iconic Image Indexing Method using Signatures (시그니쳐를 이용한 2차원 아이코닉 이미지 색인 방법의 설계 및 구현)

  • Chang, Ki-Jin;Chang, Jae-Woo
    • The Transactions of the Korea Information Processing Society
    • /
    • v.3 no.4
    • /
    • pp.720-732
    • /
    • 1996
  • Spatial match retrieval methods for iconic image databases recognize an image document as several icon symbols. Therefore the iconic symbols are used as primary keys to index the image document. When a user requires content-based retrieval ofimages, a spatial match retrieval method converts a query image into iconic symbols and then retrieves relevant images by accessing stored images. In order to support content-based image retrieval efficiently, we, in this paper, propose spatial match retrieval methods using signatures for iconic image databases. For this, we design new index representations of two-dimensional iconic images and explain implemented system.. In addition, we compare the conventional 9-DLT and our two-dimensional image retrieval method in terms of retrieval precision and recall ratio. We show that our method is more efficient than the conventional method.

  • PDF

Structure Recognition Method of Invoice Document Image for Document Processing Automation (문서 처리 자동화를 위한 인보이스 이미지의 구조 인식 방법)

  • Dong-seok Lee;Soon-kak Kwon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.28 no.2
    • /
    • pp.11-19
    • /
    • 2023
  • In this paper, we propose the methods of invoice document structure recognition and of making a spreadsheet electronic document. The texts and block location information of word blocks are recognized by an optical character recognition engine through deep learning. The word blocks on the same row and same column are found through their coordinates. The document area is divided through arrangement information of the word blocks. The character recognition result is inputted in the spreadsheet based on the document structure. In simulation result, the item placement through the proposed method shows an average accuracy of 92.30%.

Character Shape Distortion Correction of Camera Acquired Document Images (카메라 획득 문서영상에서의 글자모양 왜곡보정)

  • Jang Dae-Geun;Kim Eui-Jeong
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.10 no.4
    • /
    • pp.680-686
    • /
    • 2006
  • Document images captured by scanners have only skewing distortion. But camera captured document images have not only skew but also vignetting effect and geometric distortion. Vignetting effect, which makes the border areas to be darker than the center of the image, make it difficult to separate characters from the document images. But this effect has being decreased, as the lens manufacturing skill is developed. Geometric distortion, occurred by the mismatch of angle and center position between the document image and the camera, make the shape of characters to be distorted, so that the character recognition is more difficult than the case of using scanner. In this paper, we propose a method that can increase the performance of character recognition by correcting the geometric distortion of document images using a linear approximation which changes the quadrilateral region to the rectangle one. The proposed method also determine the quadrilateral transform region automatically, using the alignment of character lines and the skewed angles of characters located in the edges of each character line. Proposed method, therefore, can correct the geometric distortion without getting positional information from camera.

Baseline Searching Method for Document Skew Detection (문서 영상의 기울기 검출을 위한 기준선 탐색 기법)

  • Shin, Myoung-Jin;Kim, Do-Hyeon;Cha, Eui-Young
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.2
    • /
    • pp.218-225
    • /
    • 2007
  • This paper presents a technique to detect a document skew that often occurs during document scanning. To correct a skewed document is essential for automatic processing system including character segmentation, character recognition and so on. The proposed algorithm can detect a skew angle exactly by searching characters baselines that have slant information of the document within a candidated area. To reduce processing time, we resized the image small and then established a ROI (region of interest) by morphology operations and connected components analysis. We compared our method with the existing method based on morphology operations and proved correctness and efficiency of the proposed algorithm through experiments and analysis with various kind of document images.

  • PDF

Fast Skew Detection of Document Image Using Morphological Operation (모폴로지 연산을 이용한 문서 이미지의 고속 기울기 검출 기법)

  • Shin Myoung-Jin;Kim Do-Hyun;Cha Eui-Young
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2006.05a
    • /
    • pp.796-799
    • /
    • 2006
  • This paper presents a new method for automatic detection of skew in a document image using mathematical morphology. To speed up processing, we use reduced image but it still requires long time to estimate the skew angle so the proposed method works with region of interest, not with whole image. Character strings are connected by using morphological closing operation and a component labeling is used to select region of interest. The method considers the lowermost pixels of characters in candidate regions in the binary image of original document image. Experimental results shows that the proposed method is extremely fast and robust as well as independent of script forms.

  • PDF

Simple Image Stenography Technology for Large Scale Text (대용량 텍스트를 위한 손실 없는 영상 은닉기술)

  • Rhee, Keun-Moo
    • Annual Conference of KIPS
    • /
    • 2008.05a
    • /
    • pp.1104-1107
    • /
    • 2008
  • These people where generally the image or the document nik technique silver document image, against the digital data of audio back all type the research is advanced being used with objective and the use which are various, is a d. Needs a low-end leveling instrument security text from the research which it sees and with substitution quantity the silver nik being simple it will be able to deliver the technique which is simple it embodied. It combined the text image first and the nose which is in the collar image of 24 bit depth which will reach ting it did and it rehabilitatedded and a higher officer technique and the result it used that the loss ratio of the text image to analyze is slight it was ascertained.