• Title/Summary/Keyword: Mixture of Gaussian

A fixed-point implementation and performance analysis of EGML moving object detection algorithm (EGML 이동 객체 검출 알고리듬의 고정소수점 구현 및 성능 분석)

  • An, Hyo-sik;Kim, Gyeong-hun;Shin, Kyung-wook
    • Journal of the Korea Institute of Information and Communication Engineering / v.19 no.9 / pp.2153-2160 / 2015
  • This paper analyzes hardware design conditions for a moving object detection (MOD) algorithm based on effective Gaussian mixture learning (EGML). A simulation model of the EGML algorithm is implemented using OpenCV, and the effects of several parameter values on background learning time and MOD sensitivity are analyzed for various images. In addition, optimal design conditions for a hardware implementation of the EGML-based MOD algorithm are derived from fixed-point simulations over various parameter bit-widths. The proposed fixed-point model of the EGML-based MOD uses only half of the bit-width, at the cost of an MOD performance loss within 0.5% compared with floating-point MOD results.
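
As a rough illustration of what a fixed-point simulation over several bit-widths involves, the sketch below quantizes one parameter to a given number of fractional bits and reports the rounding error. The helper names and the example value 0.005 are hypothetical and are not taken from the paper.

```python
def to_fixed(value: float, frac_bits: int) -> int:
    """Quantize a real value to an integer with `frac_bits` fractional bits."""
    return round(value * (1 << frac_bits))


def to_float(fixed: int, frac_bits: int) -> float:
    """Convert a fixed-point integer back to a real value."""
    return fixed / (1 << frac_bits)


def quantization_error(value: float, frac_bits: int) -> float:
    """Absolute error introduced by a round trip through fixed point."""
    return abs(value - to_float(to_fixed(value, frac_bits), frac_bits))


if __name__ == "__main__":
    learning_rate = 0.005  # hypothetical parameter value, for illustration only
    for bits in (8, 12, 16):
        approx = to_float(to_fixed(learning_rate, bits), bits)
        print(bits, approx, quantization_error(learning_rate, bits))
```

With round-to-nearest, the error is bounded by half of one least significant bit, so each extra fractional bit halves the worst-case error.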

Height Estimation of pedestrian based on image (영상기반 보행자 키 추정 방법)

  • Kim, Sung-Min;Song, Jong-Kwan;Yoon, Byung-Woo;Park, Jang-Sik
    • The Journal of the Korea institute of electronic communication sciences / v.9 no.9 / pp.1035-1042 / 2014
  • Object recognition is one of the key technologies in monitoring systems for preventing various kinds of crime. Height is a physical attribute of a person and can be important for identification. This paper proposes a method that detects pedestrians in CCTV images and estimates the height of the detected objects. A Gaussian mixture model (GMM) is used to separate moving objects from the background, and pedestrians are detected using conditions such as the width-to-height ratio and the size of the candidate objects. The proposed method was applied to CCTV video, the height of the same person was estimated at far, middle, and near distances, and the accuracy was evaluated. Experimental results showed that the proposed method estimates pedestrian height with an accuracy of 97% at short range, 98% at medium range, and more than 97% at far range. Although the image size of the same pedestrian varies with position in the image, the results show that the proposed algorithm estimates pedestrian height effectively at various positions.

Data Detection Algorithm Based on GMM in the Acoustic Data Transmission System (음향 데이터 전송 시스템의 강인한 데이터 검출 성능을 위한 Gaussian Mixture Model 기반 연구)

  • Song, Ji-Hyun;Chang, Joon-Hyuk;Kim, Moon-Kee;Kim, Dong-Keon
    • Journal of the Institute of Electronics Engineers of Korea SP / v.48 no.4 / pp.136-141 / 2011
  • In this paper, we propose an approach to improve the data detection performance of an acoustic data transmission system based on the modulated complex lapped transform (MCLT). We first analyze the features and the data detection method of the acoustic data transmission system. Feature vectors for the Gaussian mixture model (GMM) are then selected from relevant parameters of the previous system for efficient data detection. To evaluate the proposed algorithm, the bit error rate (BER) of the received data was measured in different environments: several music genres (rock, pop, classical, jazz) and distances of 1 m to 5 m from the loudspeaker to the microphone in an office room. The proposed algorithm yields better results than the conventional MCLT-based acoustic data transmission scheme.
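
The bit error rate used as the evaluation metric is the fraction of received bits that differ from the transmitted bits. A minimal sketch follows; the function name and the example bit sequences are hypothetical and are not data from the paper.

```python
def bit_error_rate(sent: list[int], received: list[int]) -> float:
    """Fraction of bit positions where the received bit differs from the sent bit."""
    if len(sent) != len(received):
        raise ValueError("sequences must have the same length")
    if not sent:
        raise ValueError("sequences must not be empty")
    errors = sum(s != r for s, r in zip(sent, received))
    return errors / len(sent)


if __name__ == "__main__":
    sent = [1, 0, 1, 1, 0, 0, 1, 0]      # made-up transmitted bits
    received = [1, 0, 0, 1, 0, 1, 1, 0]  # made-up received bits, two flipped
    print(bit_error_rate(sent, received))
```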

A Hardware Implementation of EGML-based Moving Object Detection Algorithm (EGML 기반 이동 객체 검출 알고리듬의 하드웨어 구현)

  • Kim, Gyeong-hun;An, Hyo-sik;Shin, Kyung-wook
    • Journal of the Korea Institute of Information and Communication Engineering / v.19 no.10 / pp.2380-2388 / 2015
  • A hardware implementation of a moving object detection (MOD) algorithm is described that uses effective Gaussian mixture learning (EGML)-based background subtraction to detect moving objects in video. Some approximations of the EGML calculations are applied to reduce hardware complexity, and pipelining is adopted to improve operating speed. The MOD processor, designed in Verilog-HDL, has been verified by FPGA-in-the-loop verification using MATLAB/Simulink. The MOD processor occupies 2,218 slices on the Virtex5-XC5VSX95T FPGA device, and its throughput is 102 MSamples/s at a 102 MHz clock frequency. Evaluation of the MOD processor on 12 images from the IEEE CDW-2012 dataset shows an average recall of 0.7631, an average precision of 0.7778, and an average F-measure of 0.7535.
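
For reference, the F-measure is conventionally the harmonic mean of precision and recall, F = 2PR / (P + R). The sketch below applies that standard definition to the two averages quoted in the abstract; the function name is hypothetical, and the paper's own averaging procedure is not stated here.

```python
def f_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (the F1 score)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    # Average precision and recall as quoted in the abstract.
    print(round(f_measure(0.7778, 0.7631), 4))
```

Applied to the averaged precision and recall, this gives roughly 0.770, which differs from the reported average F-measure of 0.7535. One plausible explanation, stated here only as an assumption, is that the reported figure averages the per-image F-measures rather than combining the averaged precision and recall.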

Unmanned Enforcement System for Illegal Parking and Stopping Vehicle using Adaptive Gaussian Mixture Model (적응적 가우시안 혼합 모델을 이용한 불법주정차 무인단속시스템)

  • Youm, Sungkwan;Shin, Seong-Yoon;Shin, Kwang-Seong;Pak, Sang-Hyon
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.3 / pp.396-402 / 2021
  • As cities around the world work to become smart cities, unmanned vehicle control systems are being widely used. This paper describes an unmanned parking control system that uses an adaptive background image modeling method. The background image, modeled with an adaptive Gaussian mixture model, is updated both globally and locally according to the moving object. Specifically, the paper proposes two methods: one that minimizes the influence of a moving object on the background image, and one that updates the background image accurately by quickly removing afterimages of moving objects within the monitored area of interest. Through an implementation of the unmanned vehicle control system, we show that the proposed system can quickly and accurately distinguish both moving and static objects, such as vehicles, from the background image.

Performance Comparison of GMM and HMM Approaches for Bandwidth Extension of Speech Signals (음성신호의 대역폭 확장을 위한 GMM 방법 및 HMM 방법의 성능평가)

  • Song, Geun-Bae;Kim, Austin
    • The Journal of the Acoustical Society of Korea / v.27 no.3 / pp.119-128 / 2008
  • This paper analyzes the relationship between two representative statistical methods for bandwidth extension (BWE), the Gaussian mixture model (GMM) and the hidden Markov model (HMM), and compares their performance. The HMM method is a memory-based system developed to exploit the inter-frame dependency of speech signals. It can therefore be expected to better estimate the frame-to-frame transitional information of the original spectra. To verify this, a dynamic measure that approximates the first-order time derivative of the spectral function was introduced in addition to a static measure. The comparison shows that the two methods are similar in the static measure, whereas in the dynamic measure the HMM method clearly outperforms the GMM method. Moreover, this difference increases in proportion to the number of HMM states. This indicates that the HMM method would be more appropriate, at least for the 'blind BWE' problem. Nevertheless, the GMM method could be a preferable alternative to the HMM method in applications where static performance and algorithm complexity are critical.

A block-based real-time people counting system (블록 기반 실시간 계수 시스템)

  • Park Hyun-Hee;Lee Hyung-Gu;Kim Jai-Hie
    • Journal of the Institute of Electronics Engineers of Korea SP / v.43 no.5 s.311 / pp.22-29 / 2006
  • In this paper, we propose a block-based real-time people counting system that can be used in various environments, including shopping mall entrances, elevators, and escalators. The main contributions of this paper are robust background subtraction, the block-based decision method, and real-time processing. For robust background subtraction obtained from a number of image sequences, we used a mixture of K Gaussians. The block-based decision method was used to determine the size of the given objects (moving people) in each block. We divided the images into $6 \times 12$ blocks and trained the mean and variance values of the specific objects in each block. This was done in order to provide real-time processing for up to 4 channels. Finally, we analyzed various actions that can occur with moving people in real-world environments.

Natural Scene Text Binarization using Tensor Voting and Markov Random Field (텐서보팅과 마르코프 랜덤 필드를 이용한 자연 영상의 텍스트 이진화)

  • Choi, Hyun Su;Lee, Guee Sang
    • Smart Media Journal / v.4 no.4 / pp.18-23 / 2015
  • In this paper, we propose a method for detecting the number of clusters. By using tensor voting, the method improves the performance of the Gaussian mixture model in the conventional Markov random field method. The key point of the proposed method is that it extracts the number of centers from the continuity of the saliency map of the input data of the tensor voting tokens. First, we separate the foreground and background region candidates in a given natural image. Next, we extract the appropriate number of clusters for each candidate region by applying tensor voting. The detected number of clusters allows the Gaussian mixture model to be fitted accurately. The binarized text image is then obtained by calculating the unary and pairwise terms of the Markov random field. Experiments confirm that the proposed method returns the optimal number of clusters and improves the text binarization results.

Dynamic Control of Learning Rate in the Improved Adaptive Gaussian Mixture Model for Background Subtraction (배경분리를 위한 개선된 적응적 가우시안 혼합모델에서의 동적 학습률 제어)

  • Kim, Young-Ju
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / v.9 no.2 / pp.366-369 / 2005
  • Background subtraction is mainly used for real-time extraction and tracking of moving objects in image sequences. In outdoor environments there are many changing factors, such as gradually changing illumination, swaying trees, and suddenly moving objects, that must be considered for adaptive processing. A Gaussian mixture model (GMM) is normally used to subtract the background adaptively in the presence of such scene changes, and adaptive GMMs with improved real-time performance have been developed. For on-line background subtraction, this paper applies the improved adaptive GMM, which uses a small constant for the learning rate $\alpha$ and therefore cannot adapt quickly to sudden object movement. This paper therefore proposes and evaluates a dynamic control method for $\alpha$ that uses adaptive selection of the number of component distributions and the global variances of pixel values.
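
To see why a small constant learning rate adapts slowly, consider the component-weight update commonly used in adaptive GMM background models, w ← (1 − α)·w + α·m, where m is 1 when the component matches the current pixel and 0 otherwise. The sketch below counts how many consecutive matching frames a new component needs before its weight reaches a threshold. The update rule is the common textbook form and may differ from the exact rule in the paper; the function names and the 0.5 threshold are hypothetical.

```python
def update_weight(weight: float, alpha: float, matched: bool) -> float:
    """One step of the weight update w <- (1 - alpha) * w + alpha * m."""
    return (1.0 - alpha) * weight + alpha * (1.0 if matched else 0.0)


def frames_until_background(alpha: float, threshold: float = 0.5) -> int:
    """Count matching frames needed for a new component's weight to reach `threshold`."""
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    if not 0.0 < threshold < 1.0:
        raise ValueError("threshold must be in (0, 1)")
    weight, frames = 0.0, 0
    while weight < threshold:
        weight = update_weight(weight, alpha, matched=True)
        frames += 1
    return frames


if __name__ == "__main__":
    for alpha in (0.01, 0.05, 0.1):
        print(alpha, frames_until_background(alpha))
```

A ten-fold smaller α needs roughly ten times as many frames, which is the trade-off that motivates controlling α dynamically instead of fixing it.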

Adaptation of Classification Model for Improving Speech Intelligibility in Noise (음성 명료도 향상을 위한 분류 모델의 잡음 환경 적응)

  • Jung, Junyoung;Kim, Gibak
    • Journal of Broadcast Engineering / v.23 no.4 / pp.511-518 / 2018
  • This paper deals with improving speech intelligibility by applying a binary mask to the time-frequency units of speech in noise. The binary mask is set to "0" or "1" according to whether speech or noise is dominant, which is determined by comparing the signal-to-noise ratio with a pre-defined threshold. A Bayesian classifier trained with Gaussian mixture models is used to estimate the binary mask of each time-frequency unit. The binary-mask-based noise suppressor improves speech intelligibility only in noise conditions that are included in the training data. In this paper, speaker adaptation techniques from speech recognition are applied to adapt the Gaussian mixture model to a new noise environment. Experiments with noise-corrupted speech demonstrate the improvement in speech intelligibility obtained by employing adaptation techniques in a new noise environment.
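
The thresholding rule that defines the binary mask can be sketched as follows. This shows only the target mask computed from known local SNR values, not the GMM-based classifier that estimates it. The convention that 1 marks a speech-dominant unit, the 0 dB threshold, the frames-by-bins list layout, and all names are assumptions made for illustration.

```python
def binary_mask(snr_db: list[list[float]], threshold_db: float = 0.0) -> list[list[int]]:
    """Return 1 where the local SNR exceeds the threshold, else 0."""
    return [[1 if snr > threshold_db else 0 for snr in frame] for frame in snr_db]


def apply_mask(magnitudes: list[list[float]], mask: list[list[int]]) -> list[list[float]]:
    """Keep time-frequency units where the mask is 1 and zero out the rest."""
    return [[m * k for m, k in zip(mag_row, mask_row)]
            for mag_row, mask_row in zip(magnitudes, mask)]


if __name__ == "__main__":
    snr = [[5.0, -3.0, 0.5], [-10.0, 2.0, -0.1]]  # made-up local SNRs in dB
    mags = [[1.0, 0.8, 0.6], [0.4, 0.9, 0.3]]     # made-up spectral magnitudes
    mask = binary_mask(snr)
    print(mask)
    print(apply_mask(mags, mask))
```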