• Title/Summary/Keyword: gaussian mixture model

Search Result 417, Processing Time 0.025 seconds

A Study on Intelligent Control Algorithm Development for Cooperation Working of Human and Robot (인간과 로봇 협력작업을 위한 로봇 지능제어알고리즘 개발에 관한 연구)

  • Lee, Woo-Song;Jung, Yang-Guen;Park, In-Man;Jung, Jong-Gyu;Kim, Hui-Jin;Kim, Min-Seong;Han, Sung-Hyun
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.20 no.4
    • /
    • pp.285-297
    • /
    • 2017
  • This study proposed a new approach to develop an Intelligent control algorithm for cooperative working of human and robot based on voice recognition. In general case of speaker verification, Gaussian Mixture Model is used to model the feature vectors of reference speech signals. On the other hand, Dynamic Time Warping based template matching techniques were presented for the voice recognition about several years ago. We converge these two different concepts in a single method and then implement in a real time voice recognition enough to make reference model to satisfy 95% of recognition performance. In this paper it was illustrated the reliability of voice recognition by simulation and experiments for humanoid robot with 18 joints.

Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition

  • Lee, Sung-Joo;Kang, Byung-Ok;Jung, Ho-Young;Lee, Yun-Keun;Kim, Hyung-Soon
    • ETRI Journal
    • /
    • v.32 no.5
    • /
    • pp.801-809
    • /
    • 2010
  • This paper presents a statistical model-based noise suppression approach for voice recognition in a car environment. In order to alleviate the spectral whitening and signal distortion problem in the traditional decision-directed Wiener filter, we combine a decision-directed method with an original spectrum reconstruction method and develop a new two-stage noise reduction filter estimation scheme. When a tradeoff between the performance and computational efficiency under resource-constrained automotive devices is considered, ETSI standard advance distributed speech recognition font-end (ETSI-AFE) can be an effective solution, and ETSI-AFE is also based on the decision-directed Wiener filter. Thus, a series of voice recognition and computational complexity tests are conducted by comparing the proposed approach with ETSI-AFE. The experimental results show that the proposed approach is superior to the conventional method in terms of speech recognition accuracy, while the computational cost and frame latency are significantly reduced.

Speech/Mixed Content Signal Classification Based on GMM Using MFCC (MFCC를 이용한 GMM 기반의 음성/혼합 신호 분류)

  • Kim, Ji-Eun;Lee, In-Sung
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.2
    • /
    • pp.185-192
    • /
    • 2013
  • In this paper, proposed to improve the performance of speech and mixed content signal classification using MFCC based on GMM probability model used for the MPEG USAC(Unified Speech and Audio Coding) standard. For effective pattern recognition, the Gaussian mixture model (GMM) probability model is used. For the optimal GMM parameter extraction, we use the expectation maximization (EM) algorithm. The proposed classification algorithm is divided into two significant parts. The first one extracts the optimal parameters for the GMM. The second distinguishes between speech and mixed content signals using MFCC feature parameters. The performance of the proposed classification algorithm shows better results compared to the conventionally implemented USAC scheme.

Data Preprocessing and ML Analysis Method for Abnormal Situation Detection during Approach using Domestic Aircraft Safety Data (국내 항공기 위치 데이터를 활용한 이착륙 접근 단계에서의 항공 위험상황 탐지를 위한 데이터 전처리 및 머신 러닝 분석 기법)

  • Sang Ho Lee;Ilrak Son;Kyuho Jeong;Nohsam Park
    • Journal of Platform Technology
    • /
    • v.11 no.5
    • /
    • pp.110-125
    • /
    • 2023
  • In this paper, we utilize time-series aircraft location data measured based on 2019 domestic airports to analyze Go-Around and UOC_D situations during the approach phase of domestic airports. Various clustering-based machine learning techniques are applied to determine the most appropriate analysis method for domestic aviation data through experimentation. The ADS-B sensor is solely employed to measure aircraft positions. We designed a model using clustering algorithms such as K-Means, GMM, and DBSCAN to classify abnormal situations. Among them, the RF model showed the best performance overseas, but through experiments, it was confirmed that the GMM showed the highest classification performance for domestic aviation data by reflecting the aspects specialized in domestic terrain.

  • PDF

Berg Balance Scale Score Classification Study Using Inertial Sensor (관성센서를 이용한 버그균형검사 점수 분류 연구)

  • Hong, Sangpyo;Kim, Yeon-wook;Cho, WooHyeong;Joa, Kyung-Lim;Jung, Han-Young;Kim, K.S.;Lee, S.M.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.11 no.1
    • /
    • pp.53-62
    • /
    • 2017
  • In this paper, we present the score classification accuracy of BBS(Berg Balance Scale) which is the most commonly used balance evaluation tool using machine learning. Data acquisition was performed using the Noraxon system and an inertial sensor of Noraxon system was attached to the body in 8 locations (left and right ankle, left and right upper buttocks, left and right wrists, back, forehead). Based on the 3-axis accelerometer of the inertial sensor, the feature vector STFT(Short Time Fourier Transform) and SAM(Signal Area Magnitude) were extracted. Then, the items of the BBS were divided into static movement and dynamic movement depending on the operation characteristics, and the feature vectors were selected according to the sensor attachment positions which affect the score for each item of the BBS. Feature vectors selected for each item of BBS were classified using GMM(Gaussian Mixture Model). As a result of the accuracy calculation for 40 subjects, 55.5%, 72.2%, 87.5%, 50%, 35.1%, 62.5%, 43.3%, 58.6%, 60.7%, 33.3%, 44.8%, 89.2%, 51.8%, 85.1%, respectively.

On-Road Car Detection System Using VD-GMM 2.0 (차량검출 GMM 2.0을 적용한 도로 위의 차량 검출 시스템 구축)

  • Lee, Okmin;Won, Insu;Lee, Sangmin;Kwon, Jangwoo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.11
    • /
    • pp.2291-2297
    • /
    • 2015
  • This paper presents a vehicle detection system using the video as a input image what has moving of vehicles.. Input image has constraints. it has to get fixed view and downward view obliquely from top of the road. Road detection is required to use only the road area in the input image. In introduction, we suggest the experiment result and the critical point of motion history image extraction method, SIFT(Scale_Invariant Feature Transform) algorithm and histogram analysis to detect vehicles. To solve these problem, we propose using applied Gaussian Mixture Model(GMM) that is the Vehicle Detection GMM(VDGMM). In addition, we optimize VDGMM to detect vehicles more and named VDGMM 2.0. In result of experiment, each precision, recall and F1 rate is 9%, 53%, 15% for GMM without road detection and 85%, 77%, 80% for VDGMM2.0 with road detection.

Development of the Topography Restoration Method for Debris Flow Area Using Airborne LiDAR Data (항공 라이다 자료를 이용한 토석류 발생지역의 지형복원기법 개발)

  • Woo, Choong-Shik;Youn, Ho-Joong;Lee, Chang-Woo;Lee, Kyu-Sung
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.14 no.3
    • /
    • pp.174-187
    • /
    • 2011
  • The flowed soil is able to be estimated from topographic data of before and after the debris flow. However, it is often difficult to obtain airborne LiDAR data before the debris flow area. Thus, this study tries to develop a topographic restoration method that can provide spatial distribution of flowed soil and reconstruct the topography before the debris flow using airborne LiDAR data. The topographic restoration method can express a numerical formula induced from a Gaussian mixture model after extracting the cross sections of linear or non-linear in debris flowed area. The topographic restoration method was verified by two ways using airborne LiDAR data of before and after the debris flow. First, each cross section extracted from the debris flow sites to restore the topography was compared with airborne LiDAR data of before the debris flow. Also, the topographic data produced after the topographic restoration method applied to the debris flow sites was verified by airborne LiDAR DEM. Verifying the results of the topographic restoration method, overall fitting accuracy showed high accuracy close to 0.5m.

Smartphone-User Interactive based Self Developing Place-Time-Activity Coupled Prediction Method for Daily Routine Planning System (일상생활 계획을 위한 스마트폰-사용자 상호작용 기반 지속 발전 가능한 사용자 맞춤 위치-시간-행동 추론 방법)

  • Lee, Beom-Jin;Kim, Jiseob;Ryu, Je-Hwan;Heo, Min-Oh;Kim, Joo-Seuk;Zhang, Byoung-Tak
    • KIISE Transactions on Computing Practices
    • /
    • v.21 no.2
    • /
    • pp.154-159
    • /
    • 2015
  • Over the past few years, user needs in the smartphone application market have been shifted from diversity toward intelligence. Here, we propose a novel cognitive agent that plans the daily routines of users using the lifelog data collected by the smart phones of individuals. The proposed method first employs DPGMM (Dirichlet Process Gaussian Mixture Model) to automatically extract the users' POI (Point of Interest) from the lifelog data. After extraction, the POI and other meaningful features such as GPS, the user's activity label extracted from the log data is then used to learn the patterns of the user's daily routine by POMDP (Partially Observable Markov Decision Process). To determine the significant patterns within the user's time dependent patterns, collaboration was made with the SNS application Foursquare to record the locations visited by the user and the activities that the user had performed. The method was evaluated by predicting the daily routine of seven users with 3300 feedback data. Experimental results showed that daily routine scheduling can be established after seven days of lifelogged data and feedback data have been collected, demonstrating the potential of the new method of place-time-activity coupled daily routine planning systems in the intelligence application market.

Identification of shear layer at river confluence using (RGB) aerial imagery (RGB 항공 영상을 이용한 하천 합류부 전단층 추출법)

  • Noh, Hyoseob;Park, Yong Sung
    • Journal of Korea Water Resources Association
    • /
    • v.54 no.8
    • /
    • pp.553-566
    • /
    • 2021
  • River confluence is often characterized by shear layer and the associated strong mixing. In natural rivers, the main channel and its tributary can be separated by the shear layer using contrasting colors. The shear layer can be easily observed using aerial images from satellite or unmanned aerial vehicles. This study proposes a low-cost identification method extracting geographic features of the shear layer using RGB aerial image. The method consists of three stages. At first, in order to identify the shear layer, it performs image segmentation using a Gaussian mixture model and extracts the water bodies of the main channel and tributary. Next, the self-organizing map simplifies the flow line of the water bodies into the 1-dimensional curve grid. After that, the curvilinear coordinate transformation is performed using the water body pixels and the curve grid. As a result, the shear layer identification method was successfully applied to the confluence between Nakdong River and Nam River to extract geometric shear layer features (confluence angle, upstream- and downstream- channel widths, shear layer length, maximum shear layer thickness).

A Study on the Perlormance Variations of the Mobile Phone Speaker Verification System According to the Various Background Speaker Properties (휴대폰음성을 이용한 화자인증시스템에서 배경화자에 따른 성능변화에 관한 연구)

  • Choi, Hong-Sub
    • Speech Sciences
    • /
    • v.12 no.3
    • /
    • pp.105-114
    • /
    • 2005
  • It was verified that a speaker verification system improved its performances of EER by regularizing log likelihood ratio, using background speaker models. Recently the wireless mobile phones are becoming more dominant communication terminals than wired phones. So the need for building a speaker verification system on mobile phone is increasing abruptly. Therefore in this paper, we had some experiments to examine the performance of speaker verification based on mobile phone's voices. Especially we are focused on the performance variations in EER(Equal Error Rate) according to several background speaker's characteristics, such as selecting methods(MSC, MIX), number of background speakers, aging factor of speech database. For this, we constructed a speaker verification system that uses GMM(Gaussin Mixture Model) and found that the MIX method is generally superior to another method by about 1.0% EER. In aspect of number of background speakers, EER is decreasing in proportion to the background speakers populations. As the number is increasing as 6, 10 and 16, the EERs are recorded as 13.0%, 12.2%, and 11.6%. An unexpected results are happened in aging effects of the speech database on the performance. EERs are measured as 4%, 12% and 19% for each seasonally recorded databases from session 1 to session 3, respectively, where duration gap between sessions is set by 3 months. Although seasons speech database has 10 speakers and 10 sentences per each, which gives less statistical confidence to results, we confirmed that enrolled speaker models in speaker verification system should be regularly updated using the ongoing claimant's utterances.

  • PDF