• Title/Summary/Keyword: 영상 전처리

Search Result 1,103, Processing Time 0.036 seconds

Development an Android based OCR Application for Hangul Food Menu (한글 음식 메뉴 인식을 위한 OCR 기반 어플리케이션 개발)

  • Lee, Gyu-Cheol;Yoo, Jisang
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.5
    • /
    • pp.951-959
    • /
    • 2017
  • In this paper, we design and implement an Android-based Hangul food menu recognition application that recognizes characters from images captured by a smart phone. Optical Character Recognition (OCR) technology is divided into preprocessing, recognition and post-processing. In the preprocessing process, the characters are extracted using Maximally Stable Extremal Regions (MSER). In recognition process, Tesseract-OCR, a free OCR engine, is used to recognize characters. In the post-processing process, the wrong result is corrected by using the dictionary DB for the food menu. In order to evaluate the performance of the proposed method, experiments were conducted to compare the recognition performance using the actual menu plate as the DB. The recognition rate measurement experiment with OCR Instantly Free, Text Scanner and Text Fairy, which is a character recognizing application in Google Play Store, was conducted. The experimental results show that the proposed method shows an average recognition rate of 14.1% higher than other techniques.

Depth Map Pre-processing using Gaussian Mixture Model and Mean Shift Filter (혼합 가우시안 모델과 민쉬프트 필터를 이용한 깊이 맵 부호화 전처리 기법)

  • Park, Sung-Hee;Yoo, Ji-Sang
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.5
    • /
    • pp.1155-1163
    • /
    • 2011
  • In this paper, we propose a new pre-processing algorithm applied to depth map to improve the coding efficiency. Now, 3DV/FTV group in the MPEG is working for standard of 3DVC(3D video coding), but compression method for depth map images are not confirmed yet. In the proposed algorithm, after dividing the histogram distribution of a given depth map by EM clustering method based on GMM, we classify the depth map into several layered images. Then, we apply different mean shift filter to each classified image according to the existence of background or foreground in it. In other words, we try to maximize the coding efficiency while keeping the boundary of each object and taking average operation toward inner field of the boundary. The experiments are performed with many test images and the results show that the proposed algorithm achieves bits reduction of 19% ~ 20% and computation time is also reduced.

The Noise Reduction of an Image using Expantion and Contraction Methods (확장과 수축방법을 이용한 차상의 잡음제거)

  • 신충호;오무송
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.04b
    • /
    • pp.664-666
    • /
    • 2002
  • 기울기 연산자나 표면접합에 의한 윤곽추출과는 다른 본 논문에서는 확장과 수축방법론을 이용한 윤곽추출 및 잡음제거 방법론을 제안한다. 제안한 알고리즘은 전처리과정을 거친 후 확장과 수축 방법론을 적용함으로써 영상 윤곽 추출률을 높이고자한다 특히, 기존의 윤곽선 추출방법인 소벨과 프리위트알고리즘은 전처리과정인 미디안필터를 거친 후에 사용하였으며, 제안한 방법은 기존의 윤곽선 추출 필터를 거친 후에 사용하였다. 구체적으로 서술하면 확장알고리즘에서는 일정한 임계치를 초과하면 흰색으로 대치하였고, 수축 알고리즘에서는 검정색으로 대치했다. 기존의 잡음제거과정은 윤곽선의 손실은 없었으나 잡음제거가 소량 이루어졌으며, 제안한 방법에서는 약간의 윤곽선 손실을 보였으나 완전하게 잡음을 제거시킬 수 있었다.

  • PDF

Development of Feature Extraction Algorithm for Finger Vein Recognition (지정맥 인식을 위한 특징 검출 알고리즘 개발)

  • Kim, Taehoon;Lee, Sangjoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.9
    • /
    • pp.345-350
    • /
    • 2018
  • This study is an algorithm for detecting vein pattern features important for finger vein recognition. The feature detection algorithm is important because it greatly affects recognition results in pattern recognition. The recognition rate is degraded because the reference is changed according to the finger position change. In addition, the image obtained by irradiating the finger with infrared light is difficult to separate the image background and the blood vessel pattern, and the detection time is increased because the image preprocessing process is performed. For this purpose, the presented algorithm can be performed without image preprocessing, and the detection time can be reduced. SWDA (Down Slope Trace Waveform) algorithm is applied to the finger vein images to detect the fingertip position and vein pattern. Because of the low infrared transmittance, relatively dark vein images can be detected with minimal detection error. In addition, the fingertip position can be used as a reference in the classification stage to compensate the decrease in the recognition rate. If we apply algorithms proposed to various recognition fields such as palm and wrist, it is expected that it will contribute to improvement of biometric feature detection accuracy and reduction of recognition performance time.

Metamorphosis Hierarchical Motion Vector Estimation Algorithm (변형계층적 모션벡터 추정알고리즘)

  • Kim Jeong-Woong;Yang Hae-Sool
    • Annual Conference of KIPS
    • /
    • 2006.05a
    • /
    • pp.709-712
    • /
    • 2006
  • 다양한 종류의 컴퓨터가 사람, 사물, 환경 속에 내재되어 있고, 이들이 서로 연결되어, 필요한 곳에서 활용할 수 있는 유비쿼터스 환경에서는 홈 네트워크를 통해 이 기종 기기간 다양한 데이터 교환을 요구한다. 더욱이 원활한 영상 데이터의 처리, 전송, 모니터링 기술은 핵심적 요소가 아닐 수 없다. 공간 및 시간적인 해상도, 컬러의 표현 그리고 화질의 측정방법 등 고전적 영상 처리 연구 분야뿐만 아니라 국한된 대역폭을 갖는 홈네트워크의 전송체계에서 전송률 문제에 대한 심도 있는 연구가 필요하다. 본 논문에서는 홈네트워크 상황에서 콘텐츠의 중심이 되는 영상 데이터의 전송과 처리 그리고 제어를 위하여 새로운 움직임 추정 알고리즘을 제안한다. 각도, 거리등 다양한 환경에서 전송되어지는 스테레오 카메라의 영상데이터들은 축소, 확대, 이동, 보정 등 전처리 후 제안된 변형계층 모션벡터 추정 알고리즘을 이용하여 압축 처리, 전송된다. 기존 모션벡터 추정 알고리즘의 장점을 계승하고 단점을 보완한 변형계층 알고리즘은 비정형, 소형 매크로 블록을 이용하여 휘도의 편차가 큰 영상의 효율적 움직임 추정에 이용된다. 본 논문에서 제안한 변형계층 알고리즘과 이를 이용해 구현된 영상시스템은 유비쿼터스 환경에서 다양하게 활용될 수 있다.

  • PDF

Analysis of Noise Power Spectrum According to Flat-Field Correction in Digital Radiography (디지털 의료영상에서 Flat-Field 보정에 따른 Noise Power Spectrum 분석)

  • Lee, Meena;Kwon, Soonmu;Chon, Kwon Su
    • Journal of the Korean Society of Radiology
    • /
    • v.7 no.3
    • /
    • pp.227-232
    • /
    • 2013
  • The pixels used in a digital X-ray detector have different sensitivities and offset values. A non-uniform image is consequently obtained. Flat-field correction was introduced to resolve this problem and carried out image preprocessing in a digital imaging system. Nevertheless, the non-uniform images caused by several reasons have been being occasionally acquired. In this study, the non-uniform images acquired in digital imaging systems were applied to flat-field correction, and NPSs were calculated and analyzed with those images before and after correction. It was confirmed that low frequency noise were effectively eliminated.

Application of ALF for Stereoscopic Video Coding (스테레오스코픽 비디오 부호화를 위한 적응루프필터 적용기법)

  • Lee, Byung-Tak;Kim, Jae-Gon;Lee, BongHo;Yun, Kugjin;Cheong, Won-Sik;Hur, Namho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2011.07a
    • /
    • pp.378-380
    • /
    • 2011
  • 스테레오스코픽(stereoscopic) 3D 비디오 서비스는 기존 2D 와의 호환성을 유지하면서 새로운 3D 비디오 서비스를 제공할 수 있는 것으로, 전송 대역이 제한된 지상파 방송에서 높은 부호화 효율을 갖는 스테레오스코픽 비디오 코덱이 요구된다. 따라서 3D 를 위한 부가영상의 부호화를 위해 H.264/AVC 등을 고려하고 있으며, 또한 부가영상을 비실시간으로 전송하는 비실시간(Non-Real Time: NRT) 3D 서비스도 고려되고 있다. 본 논문에서는 NRT 3D 서비스를 위한 스테레오스코픽 비디오 부호화에 있어서, HEVC 에서 고려중인 적응루프필터(ALF: Adaptive Loop Filter)를 전/후처리 필터로 적용하는 기법을 제시한다. 특히, 부가영상의 후처리에 ALF 를 적용하기 위하여 부호화 과정에 결정되는 CU(Coding Unit) 구조를 이용하는 HEVC 와 달리 H.264/MVC 로 부호화한 부가영상의 매크로블록(MB) 부호화 모드를 이용한 ALF 적용 기법을 제안한다. 부가영상 부호화에 있어서 전처리 및 후처리 과정으로 ALF 를 적용함으로써 최대 약 20.5%의 부가영상의 부호화 성능 향상을 확인하였다.

  • PDF

A Real-time Vision Inspection System at a Laver Production Line (해태 생산라인에서의 실시간 시각검사 시스템)

  • Kim, Gi-Weon;Kim, Bong-Gi
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2007.06a
    • /
    • pp.601-604
    • /
    • 2007
  • In this paper dose a laver surface check using a real time image process. This system does false retrieval of a laver at a laver production line. At first, a laver image was read in real time using a CCD camera. In this paper, we use an area scan CCD camera. Image is converted into a binary code image using a high-speed imaging process board afterwards. A laver feature is extracted by a binary code image. Surface false retrieval is finally executed using a laver feature. In this paper, we use an area feature of a laver image.

  • PDF

Organ Recognition in Ultrasound images Using Log Power Spectrum (로그 전력 스펙트럼을 이용한 초음파 영상에서의 장기인식)

  • 박수진;손재곤;김남철
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.9C
    • /
    • pp.876-883
    • /
    • 2003
  • In this paper, we propose an algorithm for organ recognition in ultrasound images using log power spectrum. The main procedure of the algorithm consists of feature extraction and feature classification. In the feature extraction, as a translation invariant feature, log power spectrum is used for extracting the information on echo of the organs tissue from a preprocessed input image. In the feature classification, Mahalanobis distance is used as a measure of the similarity between the feature of an input image and the representative feature of each class. Experimental results for real ultrasound images show that the proposed algorithm yields the improvement of maximum 30% recognition rate than the recognition algorithm using power spectrum and Euclidean distance, and results in better recognition rate of 10-40% than the recognition algorithm using weighted quefrency complex cepstrum.

A Study on Automatic Target Recognition Using SAR Imagery (SAR 영상을 이용한 자동 표적 식별 기법에 대한 연구)

  • Park, Jong-Il;Kim, Kyung-Tae
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.22 no.11
    • /
    • pp.1063-1069
    • /
    • 2011
  • NCTR(Non-Cooperative Target Recognition) and ATR(Automatic Target Recognition) are methodologies to identify military targets using radar, optical, and infrared images. Among them, a strategy to recognize ground targets using synthetic aperature radar(SAR) images is called SAR ATR. In general, SAR ATR consists of three sequential stages: detection, discrimination and classification. In this paper, a modification of the polar mapping classifier(PMC) to identify inverse SAR(ISAR) images has been made in order to apply it to SAR ATR. In addition, a preprocessing scheme can mitigate the effect from the clutter, and information on the shadow is employed to improve the classification accuracy.