• Title/Summary/Keyword: System GMM

Search Result 193, Processing Time 0.028 seconds

Forensic Automatic Speaker Identification System for Korean Speakers (과학수사를 위한 한국인 음성 특화 자동화자식별시스템)

  • Kim, Kyung-Wha;So, Byung-Min;Yu, Ha-Jin
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.95-101
    • /
    • 2012
  • In this paper, we introduce the automatic speaker identification system 'SPO(Supreme Prosecutors Office) Verifier'. SPO Verifier is a GMM(Gaussian mixture model)-UBM(universal background model) based automatic speaker recognition system and has been developed using Korean speakers' utterances. This system uses a channel compensation algorithm to compensate recording device characteristics. The system can give the users the ability to manage reference models with utterances from various environments to get more accurate recognition results. To evaluate the performance of SPO Verifier on Korean speakers, we compared this system with one of the most widely used commercial systems in the forensic field. The results showed that SPO Verifier shows lower EER(equal error rate) than that of the commercial system.

Multi-layer Speech Processing System for Point-Of-Interest Recognition in the Car Navigation System (차량용 항법장치에서의 관심지 인식을 위한 다단계 음성 처리 시스템)

  • Bhang, Ki-Duck;Kang, Chul-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.12 no.1
    • /
    • pp.16-25
    • /
    • 2009
  • In the car environment that the first priority is a safety problem, the large vocabulary isolated word recognition system with POI domain is required as the optimal HMI technique. For the telematics terminal with a highly limited processing time and memory capacity, it is impossible to process more than 100,000 words in the terminal by the general speech recognition methods. Therefore, we proposed phoneme recognizer using the phonetic GMM and also PDM Levenshtein distance with multi-layer architecture for the POI recognition of telematics terminal. By the proposed methods, we obtained high performance in the telematics terminal with low speed processing and small memory capacity. we obtained the recognition rate of maximum 94.8% in indoor environment and of maximum 92.4% in the car navigation environments.

  • PDF

People Detection Algorithm in the Beach (해변에서의 사람 검출 알고리즘)

  • Choi, Yu Jung;Kim, Yoon
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.5
    • /
    • pp.558-570
    • /
    • 2018
  • Recently, object detection is a critical function for any system that uses computer vision and is widely used in various fields such as video surveillance and self-driving cars. However, the conventional methods can not detect the objects clearly because of the dynamic background change in the beach. In this paper, we propose a new technique to detect humans correctly in the dynamic videos like shores. A new background modeling method that combines spatial GMM (Gaussian Mixture Model) and temporal GMM is proposed to make more correct background image. Also, the proposed method improve the accuracy of people detection by using SVM (Support Vector Machine) to classify people from the objects and KCF (Kernelized Correlation Filter) Tracker to track people continuously in the complicated environment. The experimental result shows that our method can work well for detection and tracking of objects in videos containing dynamic factors and situations.

Research of Gesture Recognition Technology Based on GMM and SVM Hybrid Model Using EPIC Sensor (EPIC 센서를 이용한 GMM, SVM 기반 동작인식기법에 관한 연구)

  • CHEN, CUI;Kim, Young-Chul
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2016.05a
    • /
    • pp.11-12
    • /
    • 2016
  • SVM (Support Vector machine) is powerful machine-learning method, and obtains better performance than traditional methods in the applications of muti-dimension nonlinear pattern classification. For the case of SVM model training and low efficiency in large samples, this paper proposes a combination of statistical parameters of the GMM-UBM (Universal Background Model) model. It is very effective to solve the problem of the large sample for the SVM training. The experiment is carried on four special dynamic hand gestures using the EPIC sensors. And the results show that the improved dynamic hand gesture recognition system has a high recognition rate up to 96.75%.

  • PDF

GMM based Nonlinear Transformation Methods for Voice Conversion

  • Vu, Hoang-Gia;Bae, Jae-Hyun;Oh, Yung-Hwan
    • Proceedings of the KSPS conference
    • /
    • 2005.11a
    • /
    • pp.67-70
    • /
    • 2005
  • Voice conversion (VC) is a technique for modifying the speech signal of a source speaker so that it sounds as if it is spoken by a target speaker. Most previous VC approaches used a linear transformation function based on GMM to convert the source spectral envelope to the target spectral envelope. In this paper, we propose several nonlinear GMM-based transformation functions in an attempt to deal with the over-smoothing effect of linear transformation. In order to obtain high-quality modifications of speech signals our VC system is implemented using the Harmonic plus Noise Model (HNM)analysis/synthesis framework. Experimental results are reported on the English corpus, MOCHA-TlMlT.

  • PDF

The Effect of Exports on Growth of Small and Medium-Sized Enterprises: Evidence from Vietnamese Manufacturing Firms

  • LE, Ngan Thi Thanh
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.1
    • /
    • pp.35-42
    • /
    • 2022
  • The paper aims to examine the impact of exports on the growth of Vietnamese manufacturing small and medium-sized enterprises (SMEs) by exploring the information of 36,053 enterprises across 24 manufacturing sectors from the Vietnam Annual Enterprise Survey (VAES) in the period 2014-2019. To deal with the problem of variable variance, autocorrelation, and endogeneity of the model, the paper uses the OLS regression method with a strong standard error method and system GMM. Export participation by SMEs is positively associated with business growth in terms of sales and total assets, according to the findings. The GMM estimate shows that the rate of sales growth among exporters is 36.5 percent greater than that of non-exporting enterprises in the case of the sales growth measure. Exporters' average total asset growth rate is 19% greater than the rate estimated for non-exporting businesses. The study's findings indicate the need of adopting policies that promote SMEs in transition economies like Vietnam to engage in exporting activities. Furthermore, the findings show that financial assistance and suitable ownership would enable SMEs to take advantage of export opportunities to increase sales and total assets.

Ultrasound imaging for age-related differences of lower extremity muscle architecture

  • Kim, Min Kyu;Ko, Young Jun;Lee, Hwang Jae;Ha, Hyun Geun;Lee, Wan Hee
    • Physical Therapy Rehabilitation Science
    • /
    • v.4 no.1
    • /
    • pp.38-43
    • /
    • 2015
  • Objective: To investigate and compare the size of the rectus femoris (RF), tibialis anterior (TA), and medial gastrocnemius (GMM) using ultrasound (US) imaging in young, elderly, and very elderly groups. Design: Cross sectional study. Methods: This study consisted of 25 young (age 20 years), 24 elderly (age 65-74 years), and 25 very elderly (age 75-90 years) people with no physical dysfunctions. The cross sectional area (CSAs) of the RF and muscle thickness of the TA and GMM were measured at rest and during contraction using an US system. Results: The CSA of the RF and thickness of the TA and GMM were significantly smaller in the elderly and very elderly groups than in the young group (p<0.05). There was a significant difference of the CSA of the RF at rest and during contraction between elderly and very elderly group (p<0.05). In the comparison of the TA and GMM thickness between elderly and very elderly groups, there were no significant differences except for the TA thickness during contraction. There was a significant difference in the percentage change in RF CSA among the three groups (p<0.05). Conclusions: Our results revealed loss of muscle mass in the RF, TA, and GMM in elderly and very elderly people (${\geq}65$ years old). In particular, the greatest age-related decline in muscle mass was observed for the RF. Furthermore, the CSA of the RF declined with aging in the very elderly groups (${\geq}75$ years old).

Real-Time Vehicle License Plate Detection Based on Background Subtraction and Cascade of Boosted Classifiers

  • Sarker, Md. Mostafa Kamal;Song, Moon Kyou
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.39C no.10
    • /
    • pp.909-919
    • /
    • 2014
  • License plate (LP) detection is the most imperative part of an automatic LP recognition (LPR) system. Typical LPR contains two steps, namely LP detection (LPD) and character recognition. In this paper, we propose an efficient Vehicle-to-LP detection framework which combines with an adaptive GMM (Gaussian Mixture Model) and a cascade of boosted classifiers to make a faster vehicle LP detector. To develop a background model by using a GMM is possible in the circumstance of a fixed camera and extracts the motions using background subtraction. Firstly, an adaptive GMM is used to find the region of interest (ROI) on which motion detectors are running to detect the vehicle area as blobs ROIs. Secondly, a cascade of boosted classifiers is executed on the blobs ROIs to detect a LP. The experimental results on our test video with the resolution of $720{\times}576$ show that the LPD rate of the proposed system is 99.14% and the average computational time is approximately 42ms.

Multiple Camera-based Person Correspondence using Color Distribution and Context Information of Human Body (색상 분포 및 인체의 상황정보를 활용한 다중카메라 기반의 사람 대응)

  • Chae, Hyun-Uk;Seo, Dong-Wook;Kang, Suk-Ju;Jo, Kang-Hyun
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.15 no.9
    • /
    • pp.939-945
    • /
    • 2009
  • In this paper, we proposed a method which corresponds people under the structured spaces with multiple cameras. The correspondence takes an important role for using multiple camera system. For solving this correspondence, the proposed method consists of three main steps. Firstly, moving objects are detected by background subtraction using a multiple background model. The temporal difference is simultaneously used to reduce a noise in the temporal change. When more than two people are detected, those detected regions are divided into each label to represent an individual person. Secondly, the detected region is segmented as features for correspondence by a criterion with the color distribution and context information of human body. The segmented region is represented as a set of blobs. Each blob is described as Gaussian probability distribution, i.e., a person model is generated from the blobs as a Gaussian Mixture Model (GMM). Finally, a GMM of each person from a camera is matched with the model of other people from different cameras by maximum likelihood. From those results, we identify a same person in different view. The experiment was performed according to three scenarios and verified the performance in qualitative and quantitative results.

Emergency Detection Method using Motion History Image for a Video-based Intelligent Security System

  • Lee, Jun;Lee, Se-Jong;Park, Jeong-Sik;Seo, Yong-Ho
    • International journal of advanced smart convergence
    • /
    • v.1 no.2
    • /
    • pp.39-42
    • /
    • 2012
  • This paper proposed a method that detects emergency situations in a video stream using MHI (Motion History Image) and template matching for a video-based intelligent security system. The proposed method creates a MHI of each human object through image processing technique such as background removing based on GMM (Gaussian Mixture Model), labeling and accumulating the foreground images, then the obtained MHI is compared with the existing MHI templates for detecting an emergency situation. To evaluate the proposed emergency detection method, a set of experiments on the dataset of video clips captured from a security camera has been conducted. And we successfully detected emergency situations using the proposed method. In addition, the implemented system also provides MMS (Multimedia Message Service) so that a security manager can deal with the emergency situation appropriately.