• Title/Summary/Keyword: Segmentation and feature extraction

Search Result 190, Processing Time 0.022 seconds

Color Space Based Objects Detection System from Video Sequences

  • Alom, Md. Zahangir;Lee, Hyo Jong
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.11a
    • /
    • pp.347-350
    • /
    • 2011
  • This paper propose a statistical color model of background extraction base on Hue-Saturation-Value(HSV) color space, instead of the traditional RGB space, and shows that it provides a better use of the color information. HSV color space corresponds closely to the human perception of color and it has revealed more accuracy to distinguish shadows [3] [4]. The key feature of this segmentation method is based on processing hue component of color in HSV color space on image area. The HSV color model is used, its color components are efficiently analyzed and treated separately so that the proposed algorithm can adapt to different environmental illumination condition and shadows. Polar and linear statistical operations are used to calculate the background from the video frames. The experimental results show that the proposed background subtraction method can automatically segment video objects robustly and accurately in various illuminating and shadow environments.

Support Vector Machine Based Phoneme Segmentation for Lip Synch Application

  • Lee, Kun-Young;Ko, Han-Seok
    • Speech Sciences
    • /
    • v.11 no.2
    • /
    • pp.193-210
    • /
    • 2004
  • In this paper, we develop a real time lip-synch system that activates 2-D avatar's lip motion in synch with an incoming speech utterance. To realize the 'real time' operation of the system, we contain the processing time by invoking merge and split procedures performing coarse-to-fine phoneme classification. At each stage of phoneme classification, we apply the support vector machine (SVM) to reduce the computational load while retraining the desired accuracy. The coarse-to-fine phoneme classification is accomplished via two stages of feature extraction: first, each speech frame is acoustically analyzed for 3 classes of lip opening using Mel Frequency Cepstral Coefficients (MFCC) as a feature; secondly, each frame is further refined in classification for detailed lip shape using formant information. We implemented the system with 2-D lip animation that shows the effectiveness of the proposed two-stage procedure in accomplishing a real-time lip-synch task. It was observed that the method of using phoneme merging and SVM achieved about twice faster speed in recognition than the method employing the Hidden Markov Model (HMM). A typical latency time per a single frame observed for our method was in the order of 18.22 milliseconds while an HMM method applied under identical conditions resulted about 30.67 milliseconds.

  • PDF

A Rotation Invariant Image Retrieval with Local Features

  • You, Hee-Jun;Shin, Dae-Kyu;Kim, Dong-Hoon;Kim, Hyun-Sool;Park, Sang-Hui
    • International Journal of Control, Automation, and Systems
    • /
    • v.1 no.3
    • /
    • pp.332-338
    • /
    • 2003
  • Content-based image retrieval is the research of images from database, that are visually similar to given image examples. Gabor functions and Gabor filters are regarded as excellent methods for feature extraction and texture segmentation. However, they have a disadvantage not to perform well in case of a rotated image because of its direction-oriented filter. This paper proposes a method of extracting local texture features from blocks with central interest points detected in an image and a rotation invariant Gabor wavelet filter. We also propose a method of comparing pattern histograms of features classified by VQ (Vector Quantization) among images.

Development of An Inspection Method for Defect Detection on the Surface of Automotive Parts (자동차 부품 형상 결함 탐지를 위한 측정 방법 개발)

  • Park, Hong-Seok;Tuladhar, Upendra Mani;Shin, Seung-Cheol
    • Journal of the Korean Society of Manufacturing Technology Engineers
    • /
    • v.22 no.3
    • /
    • pp.452-458
    • /
    • 2013
  • Over the past several years, many studies have been carried out in the field of 3D data inspection systems. Several attempts have been made to improve the quality of manufactured parts. The introduction of laser sensors for inspection has made it possible to acquire data at a remarkably high speed. In this paper, a robust inspection technique for detecting defects in 3D pressed parts using laser-scanned data is proposed. Point cloud data are segmented for the extraction of features. These segmented features are used for shape matching during the localization process. An iterative closest point (ICP) algorithm is used for the localization of the scanned model and CAD model. To achieve a higher accuracy rate, the ICP algorithm is modified and then used for matching. To enhance the speed of the matching process, aKd-tree algorithm is used. Then, the deviation of the scanned points from the CAD model is computed.

A Study on Joint Tracking for Multipass Arc Welding using Vision Sensor (비전 센서를 이용한 다층 아크 용접에서 용접선 추적에 관한 연구)

  • 이정익;장인선;이세현;엄기원
    • Journal of Welding and Joining
    • /
    • v.16 no.3
    • /
    • pp.85-94
    • /
    • 1998
  • Welding fabrication invariantly involves three district sequential steps: preparation, actual process execution and post-weld inspection. One of the major problems in automating these steps and developing autonomous welding system, is the lack of proper sensing strategies. Conventionally, machine vision is used in robotic arc welding only for the correction of pre-taught welding paths in single pass. In this paper, developed vision processing techniques are detailed, and their application in welding fabrication is covered. The software for joint tracking system is finally proposed.

  • PDF

The Multipass Joint Tracking System by Vision Sensor (비전센서를 이용한 다층 용접선 추적 시스템)

  • Lee, Jeong-Ick;Koh, Byung-Kab
    • Transactions of the Korean Society of Machine Tool Engineers
    • /
    • v.16 no.5
    • /
    • pp.14-23
    • /
    • 2007
  • Welding fabrication invariantly involves three district sequential steps: preparation, actual process execution and post-weld inspection. One of the major problems in automating these steps and developing autonomous welding system is the lack of proper sensing strategies. Conventionally, machine vision is used in robotic arc welding only for the correction of pre-taught welding paths in single pass. However, in this paper, multipass tracking more than single pass tracking is performed by conventional seam tracking algorithm and developed one. And tracking performances of two algorithm are compared in multipass tracking. As the result, tracking performance in multi-pass welding shows superior conventional seam tracking algorithm to developed one.

Dilated convolution and gated linear unit based sound event detection and tagging algorithm using weak label (약한 레이블을 이용한 확장 합성곱 신경망과 게이트 선형 유닛 기반 음향 이벤트 검출 및 태깅 알고리즘)

  • Park, Chungho;Kim, Donghyun;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.39 no.5
    • /
    • pp.414-423
    • /
    • 2020
  • In this paper, we propose a Dilated Convolution Gate Linear Unit (DCGLU) to mitigate the lack of sparsity and small receptive field problems caused by the segmentation map extraction process in sound event detection with weak labels. In the advent of deep learning framework, segmentation map extraction approaches have shown improved performance in noisy environments. However, these methods are forced to maintain the size of the feature map to extract the segmentation map as the model would be constructed without a pooling operation. As a result, the performance of these methods is deteriorated with a lack of sparsity and a small receptive field. To mitigate these problems, we utilize GLU to control the flow of information and Dilated Convolutional Neural Networks (DCNNs) to increase the receptive field without additional learning parameters. For the performance evaluation, we employ a URBAN-SED and self-organized bird sound dataset. The relevant experiments show that our proposed DCGLU model outperforms over other baselines. In particular, our method is shown to exhibit robustness against nature sound noises with three Signal to Noise Ratio (SNR) levels (20 dB, 10 dB and 0 dB).

A Survey of Real-time Road Detection Techniques Using Visual Color Sensor

  • Hong, Gwang-Soo;Kim, Byung-Gyu;Dogra, Debi Prosad;Roy, Partha Pratim
    • Journal of Multimedia Information System
    • /
    • v.5 no.1
    • /
    • pp.9-14
    • /
    • 2018
  • A road recognition system or Lane departure warning system is an early stage technology that has been commercialized as early as 10 years but can be optional and used as an expensive premium vehicle, with a very small number of users. Since the system installed on a vehicle should not be error prone and operate reliably, the introduction of robust feature extraction and tracking techniques requires the development of algorithms that can provide reliable information. In this paper, we investigate and analyze various real-time road detection algorithms based on color information. Through these analyses, we would like to suggest the algorithms that are actually applicable.

Feature Extraction of 3-D Object Using Halftoning Image (Halftoning 영상을 이용한 3차원 특징 추출)

  • Kim, D.N.;Kim, S.Y.;Cho, D.S.
    • Proceedings of the KIEE Conference
    • /
    • 1992.07a
    • /
    • pp.465-467
    • /
    • 1992
  • This paper shows 3D vision system based on halftone image analysis. Any halftone image has its own surface vector normal to surface patch. To classily the given 3D images, all the patch on 3D object are transformed to black/white halftone. First we extract the general learning patterns which represents required slopes and their attributes. And next we propose 3D segmentation by searching intensity, slope and density. Artificial neural network is found to be very suitable in this approach, because it has powerful learning quality and noise tolerant. In this study, 3D shape reconstruct using pyramidian model. Our results are evaluated to enhance the quality.

  • PDF

Keyword Spotting on Hangul Document Images Using Image-to-Image Matching (영상 대 영상 매칭을 이용한 한글 문서 영상에서의 단어 검색)

  • Park Sang Cheol;Son Hwa Jeong;Kim Soo Hyung
    • The KIPS Transactions:PartB
    • /
    • v.12B no.3 s.99
    • /
    • pp.357-364
    • /
    • 2005
  • In this paper, we propose an accurate and fast keyword spotting system for searching user-specified keyword in Hangul document images by using two-level image-to-image matching. The system is composed of character segmentation, creating a query image, feature extraction, and matching procedure. Two different feature vectors are used in the matching procedure. An experiment using 1600 Hangul word images from 8 document images, downloaded from the website of Korea Information Science Society, demonstrates that the proposed system is superior to conventional image-based document retrieval systems.