• Title/Abstract/Keywords: Segmentation and feature extraction

Color Space Based Objects Detection System from Video Sequences

  • Alom, Md. Zahangir;Lee, Hyo Jong
    • 한국정보처리학회:학술대회논문집 / 한국정보처리학회 2011년도 추계학술발표대회 / pp.347-350 / 2011
  • This paper proposes a statistical color model for background extraction based on the Hue-Saturation-Value (HSV) color space instead of the traditional RGB space, and shows that it makes better use of color information. The HSV color space corresponds closely to human color perception and has proved more accurate at distinguishing shadows [3] [4]. The key feature of this segmentation method is that it processes the hue component of color in HSV space over the image area. Because the HSV components are analyzed and treated separately, the proposed algorithm can adapt to different illumination conditions and shadows. Polar and linear statistical operations are used to calculate the background from the video frames. The experimental results show that the proposed background subtraction method can automatically segment video objects robustly and accurately under various illumination and shadow conditions.
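
The "polar" statistics mentioned in the abstract above can be illustrated with a circular mean of hue. Hue is an angle, so it wraps around at 360 degrees and a plain arithmetic mean can be badly wrong near red. The sketch below is only an assumption about what such an operation might look like, not the authors' method; the names `circular_mean_hue`, `hue_distance`, `is_foreground`, the 30-degree threshold, and the sample hue history are all hypothetical.

```python
import math

def circular_mean_hue(hues_deg):
    """Circular (polar) mean of hue angles given in degrees."""
    s = sum(math.sin(math.radians(h)) for h in hues_deg)
    c = sum(math.cos(math.radians(h)) for h in hues_deg)
    return math.degrees(math.atan2(s, c)) % 360.0

def hue_distance(a_deg, b_deg):
    """Smallest angular distance between two hues, in the range [0, 180]."""
    d = abs(a_deg - b_deg) % 360.0
    return min(d, 360.0 - d)

def is_foreground(pixel_hue, background_hue, threshold_deg=30.0):
    """Flag a pixel whose hue is far from the background hue (hypothetical rule)."""
    return hue_distance(pixel_hue, background_hue) > threshold_deg

# Hue of one pixel over four frames (made-up values straddling 0 degrees).
history = [350.0, 355.0, 5.0, 10.0]
background = circular_mean_hue(history)
```

For this history an arithmetic mean would give 180 degrees (cyan), while the circular mean stays at the red end of the hue circle, which is why angular statistics suit the hue channel.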

Support Vector Machine Based Phoneme Segmentation for Lip Synch Application

  • Lee, Kun-Young;Ko, Han-Seok
    • 음성과학 / Vol. 11, No. 2 / pp.193-210 / 2004
  • In this paper, we develop a real-time lip-synch system that drives a 2-D avatar's lip motion in synch with an incoming speech utterance. To achieve real-time operation, we contain the processing time by invoking merge and split procedures that perform coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply the support vector machine (SVM) to reduce the computational load while retaining the desired accuracy. The coarse-to-fine phoneme classification is accomplished via two stages of feature extraction: first, each speech frame is acoustically analyzed for 3 classes of lip opening using Mel Frequency Cepstral Coefficients (MFCC) as a feature; second, the classification of each frame is refined for detailed lip shape using formant information. We implemented the system with 2-D lip animation, which shows the effectiveness of the proposed two-stage procedure in accomplishing a real-time lip-synch task. The method using phoneme merging and SVM was observed to be about twice as fast in recognition as the method employing the Hidden Markov Model (HMM). A typical per-frame latency observed for our method was on the order of 18.22 milliseconds, while an HMM method applied under identical conditions resulted in about 30.67 milliseconds.

A Rotation Invariant Image Retrieval with Local Features

  • You, Hee-Jun;Shin, Dae-Kyu;Kim, Dong-Hoon;Kim, Hyun-Sool;Park, Sang-Hui
    • International Journal of Control, Automation, and Systems / Vol. 1, No. 3 / pp.332-338 / 2003
  • Content-based image retrieval is the search for images in a database that are visually similar to given example images. Gabor functions and Gabor filters are regarded as excellent methods for feature extraction and texture segmentation. However, because the filters are orientation-selective, they perform poorly on rotated images. This paper proposes a method of extracting local texture features from blocks centered on interest points detected in an image, together with a rotation-invariant Gabor wavelet filter. We also propose a method of comparing, across images, pattern histograms of features classified by VQ (Vector Quantization).
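
One common way to make the responses of an oriented filter bank insensitive to rotation is to circularly shift the per-orientation energies so that the dominant orientation always comes first. The abstract above does not say how its rotation-invariant filter is built, so the following is a generic sketch under that assumption; `rotation_normalize` and the energy values are hypothetical.

```python
def rotation_normalize(energies):
    """Circularly shift a list of per-orientation energies so the strongest comes first."""
    k = max(range(len(energies)), key=lambda i: energies[i])
    return energies[k:] + energies[:k]

# Made-up filter energies of one block at 0, 30, 60, 90, 120, 150 degrees.
block = [0.2, 0.9, 0.4, 0.1, 0.3, 0.5]
# The same block rotated by 60 degrees: every energy moves two orientation bins.
rotated = block[-2:] + block[:-2]

normalized_block = rotation_normalize(block)
normalized_rotated = rotation_normalize(rotated)
```

Both inputs normalize to the same vector, so a distance computed on the normalized features does not change when the block is rotated by a multiple of the orientation step.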

자동차 부품 형상 결함 탐지를 위한 측정 방법 개발 (Development of An Inspection Method for Defect Detection on the Surface of Automotive Parts)

  • 박홍석;우펜드라 마니 툴라다르;신승철
    • 한국생산제조학회지 / Vol. 22, No. 3 / pp.452-458 / 2013
  • Over the past several years, many studies have been carried out in the field of 3D data inspection systems. Several attempts have been made to improve the quality of manufactured parts. The introduction of laser sensors for inspection has made it possible to acquire data at a remarkably high speed. In this paper, a robust inspection technique for detecting defects in 3D pressed parts using laser-scanned data is proposed. Point cloud data are segmented for the extraction of features. These segmented features are used for shape matching during the localization process. An iterative closest point (ICP) algorithm is used for the localization of the scanned model and CAD model. To achieve a higher accuracy rate, the ICP algorithm is modified and then used for matching. To enhance the speed of the matching process, a k-d tree algorithm is used. Then, the deviation of the scanned points from the CAD model is computed.
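
The last two steps of the abstract above, nearest-neighbor lookup with a k-d tree and computing the deviation of scanned points from the CAD model, can be sketched in a few lines. This is a minimal illustration that assumes the CAD model is available as a point set and that the scan is already aligned; it does not implement ICP or the authors' modified ICP. The functions `build_kdtree` and `nearest` and the coordinates are hypothetical.

```python
import math

def build_kdtree(points, depth=0):
    """Recursively build a k-d tree as nested (point, left, right) tuples."""
    if not points:
        return None
    axis = depth % len(points[0])
    points = sorted(points, key=lambda p: p[axis])
    mid = len(points) // 2
    return (points[mid],
            build_kdtree(points[:mid], depth + 1),
            build_kdtree(points[mid + 1:], depth + 1))

def nearest(node, query, depth=0, best=None):
    """Return (distance, point) for the tree point closest to query."""
    if node is None:
        return best
    point, left, right = node
    d = math.dist(point, query)
    if best is None or d < best[0]:
        best = (d, point)
    axis = depth % len(query)
    diff = query[axis] - point[axis]
    near, far = (left, right) if diff < 0 else (right, left)
    best = nearest(near, query, depth + 1, best)
    if abs(diff) < best[0]:  # the far side may still hold a closer point
        best = nearest(far, query, depth + 1, best)
    return best

# Made-up CAD sample points and already-aligned scan points.
cad_points = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (1.0, 1.0, 0.0)]
scan_points = [(0.1, 0.0, 0.0), (1.0, 1.0, 0.2)]

tree = build_kdtree(cad_points)
deviations = [nearest(tree, p)[0] for p in scan_points]
```

Each entry of `deviations` is the distance from a scanned point to its closest CAD point; comparing these distances against a tolerance is one simple way to flag candidate defects.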

비전 센서를 이용한 다층 아크 용접에서 용접선 추적에 관한 연구 (A Study on Joint Tracking for Multipass Arc Welding using Vision Sensor)

  • 이정익;장인선;이세현;엄기원
    • Journal of Welding and Joining / Vol. 16, No. 3 / pp.85-94 / 1998
  • Welding fabrication invariably involves three distinct sequential steps: preparation, actual process execution, and post-weld inspection. One of the major problems in automating these steps and developing an autonomous welding system is the lack of proper sensing strategies. Conventionally, machine vision is used in robotic arc welding only for the correction of pre-taught welding paths in a single pass. This paper details the vision processing techniques that were developed and covers their application in welding fabrication. Finally, software for a joint tracking system is proposed.

비전센서를 이용한 다층 용접선 추적 시스템 (The Multipass Joint Tracking System by Vision Sensor)

  • 이정익;고병갑
    • 한국공작기계학회논문집 / Vol. 16, No. 5 / pp.14-23 / 2007
  • Welding fabrication invariably involves three distinct sequential steps: preparation, actual process execution, and post-weld inspection. One of the major problems in automating these steps and developing an autonomous welding system is the lack of proper sensing strategies. Conventionally, machine vision is used in robotic arc welding only for the correction of pre-taught welding paths in a single pass. In this paper, however, multipass tracking rather than single-pass tracking is performed with both a conventional seam tracking algorithm and a newly developed one, and the tracking performances of the two algorithms are compared. As a result, the tracking performance in multipass welding shows the conventional seam tracking algorithm to be superior to the developed one.

약한 레이블을 이용한 확장 합성곱 신경망과 게이트 선형 유닛 기반 음향 이벤트 검출 및 태깅 알고리즘 (Dilated convolution and gated linear unit based sound event detection and tagging algorithm using weak label)

  • 박충호;김동현;고한석
    • 한국음향학회지 / Vol. 39, No. 5 / pp.414-423 / 2020
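  • This paper proposes the Dilated Convolution Gated Linear Unit (DCGLU) to mitigate the sparsity and insufficient receptive field that arise in time-frequency segmentation map extraction models for weakly labeled sound event detection. In deep learning, methods for sound event detection based on segmentation map extraction perform well in noisy environments. However, because the feature map size must be preserved to extract the segmentation map, the model is built without pooling operations. As a result, the method suffers degraded performance from sparsity and a limited receptive field. To mitigate these problems, this paper applies a gated linear unit, which can control the flow of information, and dilated convolution, which can widen the receptive field without additional parameters. The experiments used URBAN-SED and a self-collected bird-call dataset, and the proposed DCGLU model outperformed the existing baselines. In particular, the DCGLU model proved robust at three Signal to Noise Ratios (SNR) (20 dB, 10 dB, 0 dB) in environments mixed with natural sounds.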
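
The two ingredients named in the abstract above can be shown with small arithmetic. The sketch below is a pure-Python 1-D illustration, not the paper's network: it computes the receptive field of stacked stride-1 convolutions, applies a dilated convolution without padding, and gates the result with a sigmoid. The layer counts, dilation rates, weights, and function names are assumptions chosen for the example.

```python
import math

def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 convolutions with the given dilation rates."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)

def dilated_conv1d(x, weights, dilation):
    """Unpadded 1-D dilated convolution (cross-correlation form)."""
    span = (len(weights) - 1) * dilation
    return [sum(w * x[i + j * dilation] for j, w in enumerate(weights))
            for i in range(len(x) - span)]

def glu(a, b):
    """Gated linear unit: element-wise a * sigmoid(b)."""
    return [ai / (1.0 + math.exp(-bi)) for ai, bi in zip(a, b)]

# Three 3-tap layers: same number of weights, different receptive fields.
plain = receptive_field(3, [1, 1, 1])
dilated = receptive_field(3, [1, 2, 4])

x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0]
features = dilated_conv1d(x, [1.0, 0.0, -1.0], dilation=2)  # linear path
gates = dilated_conv1d(x, [0.0, 0.0, 0.0], dilation=2)      # gate path (all-zero weights)
out = glu(features, gates)
```

With three layers of kernel size 3, dilation rates 1, 2, 4 cover 15 input steps instead of 7 while using the same number of weights. With all-zero gate weights the sigmoid is 0.5 everywhere, so the gate passes exactly half of each feature value.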

A Survey of Real-time Road Detection Techniques Using Visual Color Sensor

  • Hong, Gwang-Soo;Kim, Byung-Gyu;Dogra, Debi Prosad;Roy, Partha Pratim
    • Journal of Multimedia Information System / Vol. 5, No. 1 / pp.9-14 / 2018
  • A road recognition system, or lane departure warning system, is an early-stage technology that was commercialized about 10 years ago, but it remains an optional feature found mainly on expensive premium vehicles and has very few users. Because a system installed on a vehicle must operate reliably and must not be error-prone, introducing robust feature extraction and tracking techniques requires developing algorithms that can provide reliable information. In this paper, we investigate and analyze various real-time road detection algorithms based on color information. Through these analyses, we suggest the algorithms that are applicable in practice.

Halftoning 영상을 이용한 3차원 특징 추출 (Feature Extraction of 3-D Object Using Halftoning Image)

  • 김도년;김소연;조동섭
    • 대한전기학회:학술대회논문집 / 대한전기학회 1992년도 하계학술대회 논문집 A / pp.465-467 / 1992
  • This paper presents a 3D vision system based on halftone image analysis. Every halftone image has its own surface vector normal to the surface patch. To classify the given 3D images, all the patches on the 3D object are transformed to black/white halftone. First, we extract the general learning patterns that represent the required slopes and their attributes. Next, we propose 3D segmentation by searching intensity, slope, and density. An artificial neural network is found to be very suitable for this approach because of its powerful learning ability and noise tolerance. In this study, the 3D shape is reconstructed using a pyramidal model. Our results are evaluated to enhance the quality.

영상 대 영상 매칭을 이용한 한글 문서 영상에서의 단어 검색 (Keyword Spotting on Hangul Document Images Using Image-to-Image Matching)

  • 박상철;손화정;김수형
    • 정보처리학회논문지B / Vol. 12B, No. 3 / pp.357-364 / 2005
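  • This paper proposes a system that retrieves user query words from Hangul document images quickly and accurately using two-stage image matching. The system consists of character segmentation, query image generation, feature extraction, and image matching. The matching stage uses two feature vectors of different dimensions. Eight pages of document images were downloaded from the website of the Korean Institute of Information Scientists and Engineers (한국정보과학회), and 1,600 Hangul word images obtained from those documents were used as experimental data. The results show that the proposed system performs substantially better than previously proposed image-based Hangul word retrieval systems.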
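
The abstract above states only that matching runs in two stages with two feature vectors of different dimensions. One plausible reading, assumed here rather than taken from the paper, is a cheap filter on the low-dimensional vector followed by a ranking on the high-dimensional one. The function `two_stage_match`, the threshold, and the feature values are all hypothetical.

```python
import math

def two_stage_match(query, candidates, coarse_threshold):
    """Return candidate names that pass a coarse filter, ranked by fine distance.

    query and every candidate value are (coarse_vec, fine_vec) pairs whose
    two vectors have different dimensions.
    """
    q_coarse, q_fine = query
    # Stage 1: discard candidates whose low-dimensional vector is far from the query.
    survivors = [(name, fine) for name, (coarse, fine) in candidates.items()
                 if math.dist(q_coarse, coarse) <= coarse_threshold]
    # Stage 2: rank the survivors by distance on the high-dimensional vector.
    survivors.sort(key=lambda item: math.dist(q_fine, item[1]))
    return [name for name, _ in survivors]

# Made-up word images described by a 2-D coarse and a 4-D fine feature vector.
candidates = {
    "word_a": ((0.10, 0.20), (1.0, 0.0, 1.0, 0.0)),
    "word_b": ((0.90, 0.80), (1.0, 0.0, 1.0, 0.0)),
    "word_c": ((0.15, 0.25), (0.0, 1.0, 0.0, 1.0)),
}
query = ((0.10, 0.20), (1.0, 0.0, 1.0, 1.0))
ranking = two_stage_match(query, candidates, coarse_threshold=0.2)
```

Here `word_b` is rejected in the first stage even though its fine vector is close to the query, which shows the usual trade-off of such a cascade: the coarse stage saves computation but can discard a true match if its threshold is too tight.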