• 제목/요약/키워드: Gaussian mixture method

검색결과 303건 처리시간 0.03초

An Intelligent Iris Recognition System (지능형 홍채 인식 시스템)

  • Kim, Jae-Min;Cho, Seong-Won;Kim, Soo-Lin
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • 제14권4호
    • /
    • pp.468-472
    • /
    • 2004
  • This paper presents an intelligent iris recognition system which consists of quality check, iris localization, feature extraction, and verification. For the quality check, the local statistics on the pupil boundary is used. Gaussian mixture model is used to segment and localized the iris region. The feature extraction method is based on an optimal waveform simplification. For the verification, we use an intelligent variable threshold.

Detection and Recognition of Illegally Parked Vehicles Based on an Adaptive Gaussian Mixture Model and a Seed Fill Algorithm

  • Sarker, Md. Mostafa Kamal;Weihua, Cai;Song, Moon Kyou
    • Journal of information and communication convergence engineering
    • /
    • 제13권3호
    • /
    • pp.197-204
    • /
    • 2015
  • In this paper, we present an algorithm for the detection of illegally parked vehicles based on a combination of some image processing algorithms. A digital camera is fixed in the illegal parking region to capture the video frames. An adaptive Gaussian mixture model (GMM) is used for background subtraction in a complex environment to identify the regions of moving objects in our test video. Stationary objects are detected by using the pixel-level features in time sequences. A stationary vehicle is detected by using the local features of the object, and thus, information about illegally parked vehicles is successfully obtained. An automatic alarm system can be utilized according to the different regulations of different illegal parking regions. The results of this study obtained using a test video sequence of a real-time traffic scene show that the proposed method is effective.

Improvement of Speech Reconstructed from MFCC Using GMM (GMM을 이용한 MFCC로부터 복원된 음성의 개선)

  • Choi, Won-Young;Choi, Mu-Yeol;Kim, Hyung-Soon
    • MALSORI
    • /
    • 제53호
    • /
    • pp.129-141
    • /
    • 2005
  • The goal of this research is to improve the quality of reconstructed speech in the Distributed Speech Recognition (DSR) system. For the extended DSR, we estimate the variable Maximum Voiced Frequency (MVF) from Mel-Frequency Cepstral Coefficient (MFCC) based on Gaussian Mixture Model (GMM), to implement realistic harmonic plus noise model for the excitation signal. For the standard DSR, we also make the voiced/unvoiced decision from MFCC based on GMM because the pitch information is not available in that case. The perceptual test reveals that speech reconstructed by the proposed method is preferred to the one by the conventional methods.

  • PDF

Adaptive Gaussian Mixture Learning for High Traffic Region (혼잡한 환경에서 적응적 가우시안 혼합 모델을 이용한 배경의 학습 및 객체 검출)

  • Park Dae-Yong;Kim Jae-Min;Cho Seong-Won
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • 제55권2호
    • /
    • pp.52-61
    • /
    • 2006
  • For the detection of moving objects, background subtraction methods are widely used. An adaptive Gaussian mixture model combined with probabilistic learning is one of the most popular methods for the real-time update of the complex and dynamic background. However, probabilistic learning approach does not work well in high traffic regions. In this paper, we Propose a reliable learning method of complex and dynamic backgrounds in high traffic regions.

Infrared Visual Inertial Odometry via Gaussian Mixture Model Approximation of Thermal Image Histogram (열화상 이미지 히스토그램의 가우시안 혼합 모델 근사를 통한 열화상-관성 센서 오도메트리)

  • Jaeho Shin;Myung-Hwan Jeon;Ayoung Kim
    • The Journal of Korea Robotics Society
    • /
    • 제18권3호
    • /
    • pp.260-270
    • /
    • 2023
  • We introduce a novel Visual Inertial Odometry (VIO) algorithm designed to improve the performance of thermal-inertial odometry. Thermal infrared image, though advantageous for feature extraction in low-light conditions, typically suffers from a high noise level and significant information loss during the 8-bit conversion. Our algorithm overcomes these limitations by approximating a 14-bit raw pixel histogram into a Gaussian mixture model. The conversion method effectively emphasizes image regions where texture for visual tracking is abundant while reduces unnecessary background information. We incorporate the robust learning-based feature extraction and matching methods, SuperPoint and SuperGlue, and zero velocity detection module to further reduce the uncertainty of visual odometry. Tested across various datasets, the proposed algorithm shows improved performance compared to other state-of-the-art VIO algorithms, paving the way for robust thermal-inertial odometry.

Design and Implementation of Harmful Video Detection Service using Audio Information on Android OS (안드로이드 OS 기반 음향 정보를 이용한 유해동영상 검출 서비스의 설계 및 구현)

  • Kim, Yong-Wun;Kim, Bong-Wan;Choi, Dae-Lim;Ko, Lag-Hwan;Kim, Tae-Guon;Lee, Yong-Ju
    • Journal of Korea Multimedia Society
    • /
    • 제15권5호
    • /
    • pp.577-586
    • /
    • 2012
  • The smartphone emerged due to the rapid development of the Internet has brought greater convenience to life in a positive manner. Recently, however, because of unconstrained exposure to harmful video, reckless use of smart phones has become a domestic issue in our society. In this paper, a service which detects harmful videos by using the acoustic information is designed and implemented on the Android OS. In order to implement the service of Android OS-based detection of the harmful movie, the speed of existing sound-based detection method for harmful videos is improved. The GMM(Gaussian Mixture Model) was used for classifier and the number of Gaussian Mixture was 18. The implemented service shows a detection rate of 97.02% for a total of 1,210 data files (approximately 687 hours) which comprises 669 general videos files (about 424 hours) and 541 harmful video files (about 263 hours). It's speed is 5.6 times faster than the traditional methods whitout reducing the detection rate.

Acoustic Model Transformation Method for Speech Recognition Employing Gaussian Mixture Model Adaptation Using Untranscribed Speech Database (미전사 음성 데이터베이스를 이용한 가우시안 혼합 모델 적응 기반의 음성 인식용 음향 모델 변환 기법)

  • Kim, Wooil
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • 제19권5호
    • /
    • pp.1047-1054
    • /
    • 2015
  • This paper presents an acoustic model transform method using untranscribed speech database for improved speech recognition. In the presented model transform method, an adapted GMM is obtained by employing the conventional adaptation method, and the most similar Gaussian component is selected from the adapted GMM. The bias vector between the mean vectors of the clean GMM and the adapted GMM is used for updating the mean vector of HMM. The presented GAMT combined with MAP or MLLR brings improved speech recognition performance in car noise and speech babble conditions, compared to singly-used MAP or MLLR respectively. The experimental results show that the presented model transform method effectively utilizes untranscribed speech database for acoustic model adaptation in order to increase speech recognition accuracy.

Efficient Speaker Identification based on Robust VQ-PCA (강인한 VQ-PCA에 기반한 효율적인 화자 식별)

  • Lee Ki-Yong
    • Journal of Internet Computing and Services
    • /
    • 제5권3호
    • /
    • pp.57-62
    • /
    • 2004
  • In this paper, an efficient speaker identification based on robust vector quantizationprincipal component analysis (VQ-PCA) is proposed to solve the problems from outliers and high dimensionality of training feature vectors in speaker identification, Firstly, the proposed method partitions the data space into several disjoint regions by roust VQ based on M-estimation. Secondly, the robust PCA is obtained from the covariance matrix in each region. Finally, our method obtains the Gaussian Mixture model (GMM) for speaker from the transformed feature vectors with reduced dimension by the robust PCA in each region, Compared to the conventional GMM with diagonal covariance matrix, under the same performance, the proposed method gives faster results with less storage and, moreover, shows robust performance to outliers.

  • PDF

Illumination Influence Minimization Method for Efficient Object (영상에서 효율적인 객체 추출을 위한 조명 영향 최소화 기법)

  • Kim, Jae-Seoung;Lee, Ki-Jung;Whangbo, Taeg-Keun
    • Journal of Digital Contents Society
    • /
    • 제14권1호
    • /
    • pp.117-124
    • /
    • 2013
  • This paper suggests the robust method of extraction for moving objects in illumination variation by using image sequence from an immovable camera. The most difficult part of the implication is the effect by illumination and noise. The object area is hardly estimated when the dusky area occurs in illumination variation by time change. This thesis describes the extraction of moving objects employed by Gaussian mixture model which is noise robust measure. Also, the report suggests the elimination method of illumination part in input image by the representative illumination image which is defined to minimize the illumination influence.

Spectrum Based Excitation Extraction for HMM Based Speech Synthesis System (스펙트럼 기반 여기신호 추출을 통한 HMM기반 음성합성기의 음질 개선 방법)

  • Lee, Bong-Jin;Kim, Seong-Woo;Baek, Soon-Ho;Kim, Jong-Jin;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • 제29권1호
    • /
    • pp.82-90
    • /
    • 2010
  • This paper proposes an efficient method to enhance the quality of synthesized speech in HMM based speech synthesis system. The proposed method trains spectral parameters and excitation signals using Gaussian mixture model, and estimates appropriate excitation signals from spectral parameters during the synthesis stage. Both WB-PESQ and MUSHRA results show that the proposed method provides better speech quality than conventional HMM based speech synthesis system.