• Title/Summary/Keyword: ZCR

Search Result 59, Processing Time 0.037 seconds

A Voice Boundary Detection Method Using Dynamic Parameters Based On Neural Network (신경망 기반의 동적 파라미터들을 이용한 음성 경계 추출)

  • 마창수;김계영;최형일
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2002.10d
    • /
    • pp.616-618
    • /
    • 2002
  • 본 논문에서는 음성인식 성능을 높이기 위한 기본적 단계인 음성과 비음성 부분의 경계를 추출하는 음성 경계 추출 방법을 제안한다. 음성경계 추출을 위한 특징들로는 시간영역 분할 파라미터인 ZCR, MA를 사용하고 주파수 영역 분할 파라미터로 주파수 대역 파워 에너지 (Frequency band power energy), 포만트 계수 (Formant coefficient)를 사용하였고 각 파라미터들을 이용하여 음성 경계를 결정할 때 경험에 의해 임계치를 결정하는 단점을 보안하기 위해서 신경망을 이용한다. 신경망의 가중치와 임계치들은 지도 학습을 통해 최적화 되고, 학습을 통해 구성된 망을 음성과 비음성의 경계치 구분에 사용한다.

  • PDF

The EMG Measurement of Simple and Iterative Worker′s Muscle Fatigue (단순반복 근로자의 근육피로도에 관한 EMG분석)

  • 서승록;임완희
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.3
    • /
    • pp.79-86
    • /
    • 2001
  • The CTD(Cumulative Trauma Disorder) as a new kind of occupational disease occurs mainly to workers on handling line under the highly-specialized industrial environments. This study took into account their exposure to Cumulative Trauma Disorders(CTD) by the utilization of EMG system, with respect to worker's muscle fatigue test according to fulfillment of iterative and simple task. The findings of this study were as follows : From the result of AEMG test analysis, worker's fatigue extent according to elapsed time of task was inclined to be increased continually. On the other hand, after its task ending, their fatigue extent was inclined to be decreased than before-circumstance of refractory brick lifting. The transference of MF(Median Frequency) and MPF(Mean Power Frequency) had highly significant difference between muscle fatigue and the elapsed time of work. Especially, their fatigue extent to erectorspinae and multifidus to lift firebrick was increased in the mean time. The transference of ZCR(Zero Crossing Rate) had considerable significant difference between muscle fatigue and the elapsed time of work. In short, as the work went of the muscle fatigue extent increased gradually. Thus, it can be concluded that the fatigue of erectorspinae and multifidus extent according to fulfillment of iterative and simple task is gradually being increased.

  • PDF

Feature-Vector Normalization for SVM-based Music Genre Classification (SVM에 기반한 음악 장르 분류를 위한 특징벡터 정규화 방법)

  • Lim, Shin-Cheol;Jang, Sei-Jin;Lee, Seok-Pil;Kim, Moo-Young
    • Journal of the Institute of Electronics Engineers of Korea SC
    • /
    • v.48 no.5
    • /
    • pp.31-36
    • /
    • 2011
  • In this paper, Mel-Frequency Cepstral Coefficient (MFCC), Decorrelated Filter Bank (DFB), Octave-based Spectral Contrast (OSC), Zero-Crossing Rate (ZCR), and Spectral Contract/Roll-Off are combined as a set of multiple feature-vectors for the music genre classification system based on the Support Vector Machine (SVM) classifier. In the conventional system, feature vectors for the entire genre classes are normalized for the SVM model training and classification. However, in this paper, selected feature vectors that are compared based on the One-Against-One (OAO) SVM classifier are only used for normalization. Using OSC as a single feature-vector and the multiple feature-vectors, we obtain the genre classification rates of 60.8% and 77.4%, respectively, with the conventional normalization method. Using the proposed normalization method, we obtain the increased classification rates by 8.2% and 3.3% for OSC and the multiple feature-vectors, respectively.

The CTD Evaluation or Simple and Iterative Task through the Improvement of Working Conditions (작업환경개선을 통한 단순반복작업의 누적외상평가)

  • 서승록;임완희
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.6 no.4
    • /
    • pp.12-21
    • /
    • 2001
  • Recently, as work strength is deepened, as well, labor environments is changed, simple and iterative worker's Cumulative Trauma Disorders(CTD) is gradually being increased. Accordingly, this study was designed to represent its system design to carry out their iterative and simple task by machine through the difference of muscle fatigue between worker on handling line and worker under the work environments by Air Balance System for the purpose of analyzing their muscle fatigue test according to fulfillment of iterative and simple task. From the result of this study, with regard to the comparison of muscle fatigue between work on handling line and work on automation line on the occasion of refractory brick loading, their muscle fatigue extent under the work environments by Air Balance system was lower than it of handling by AMEG(64.1%), MF(65.3%), MPF(64.3%), ZCR(63.6%) respectively. And also, generally there showed similar transfer at the aspect of muscle mobilization. In other words, we can say that work environments by Air Balance System is beneficial at the aspect of alleviating works' fatigue extent on handling line. As well, the result of this study shows that worker's exposure to Cumulative Trauma Disorders(CTD) is relatively low.

  • PDF

A Study on Emotion Recognition of Chunk-Based Time Series Speech (청크 기반 시계열 음성의 감정 인식 연구)

  • Hyun-Sam Shin;Jun-Ki Hong;Sung-Chan Hong
    • Journal of Internet Computing and Services
    • /
    • v.24 no.2
    • /
    • pp.11-18
    • /
    • 2023
  • Recently, in the field of Speech Emotion Recognition (SER), many studies have been conducted to improve accuracy using voice features and modeling. In addition to modeling studies to improve the accuracy of existing voice emotion recognition, various studies using voice features are being conducted. This paper, voice files are separated by time interval in a time series method, focusing on the fact that voice emotions are related to time flow. After voice file separation, we propose a model for classifying emotions of speech data by extracting speech features Mel, Chroma, zero-crossing rate (ZCR), root mean square (RMS), and mel-frequency cepstrum coefficients (MFCC) and applying them to a recurrent neural network model used for sequential data processing. As proposed method, voice features were extracted from all files using 'librosa' library and applied to neural network models. The experimental method compared and analyzed the performance of models of recurrent neural network (RNN), long short-term memory (LSTM) and gated recurrent unit (GRU) using the Interactive emotional dyadic motion capture Interactive Emotional Dyadic Motion Capture (IEMOCAP) english dataset.

Speech Emotion Recognition Based on GMM Using FFT and MFB Spectral Entropy (FFT와 MFB Spectral Entropy를 이용한 GMM 기반의 감정인식)

  • Lee, Woo-Seok;Roh, Yong-Wan;Hong, Hwang-Seok
    • Proceedings of the KIEE Conference
    • /
    • 2008.04a
    • /
    • pp.99-100
    • /
    • 2008
  • This paper proposes a Gaussian Mixture Model (GMM) - based speech emotion recognition methods using four feature parameters; 1) Fast Fourier Transform(FFT) spectral entropy, 2) delta FFT spectral entropy, 3) Mel-frequency Filter Bank (MFB) spectral entropy, and 4) delta MFB spectral entropy. In addition, we use four emotions in a speech database including anger, sadness, happiness, and neutrality. We perform speech emotion recognition experiments using each pre-defined emotion and gender. The experimental results show that the proposed emotion recognition using FFT spectral-based entropy and MFB spectral-based entropy performs better than existing emotion recognition based on GMM using energy, Zero Crossing Rate (ZCR), Linear Prediction Coefficient (LPC), and pitch parameters. In experimental Results, we attained a maximum recognition rate of 75.1% when we used MFB spectral entropy and delta MFB spectral entropy.

  • PDF

Design and Implementation of Speech Music Discrimination System per Block Unit on FM Radio Broadcast (FM 방송 중 블록 단위 음성 음악 판별 시스템의 설계 및 구현)

  • Jang, Hyeon-Jong;Eom, Jeong-Gwon;Im, Jun-Sik
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2007.11a
    • /
    • pp.25-28
    • /
    • 2007
  • 본 논문은 FM 라디오 방송의 오디오 신호를 블록 단위로 음성 음악을 판별하는 시스템을 제안하는 논문이다. 본 논문에서는 음성 음악 판별 시스템을 구축하기 위해 다양한 특정 파라미터와 분류 알고리즘을 제안 한다. 특정 파라미터는 신호처리 분야(Centroid, Rolloff, Flux, ZCR, Low Energy), 음성 인식 분야(LPC, MFCC), 음악 분석 분야(MPitch, Beat)에서 각각 사용되는 파라미터를 사용하였으며 분류 알고리즘으로는 패턴인식 분야(GMM, KNN, BP)와 퍼지 신경망(ANFIS)을 사용하였고, 거리 구현은 Mahalanobis 거리를 사용하였다.

  • PDF

An Experiment of a Spoken Digits-Recognition System (숫자음성 자동 인식에 관한 일실험)

  • ;安居院猛
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.15 no.6
    • /
    • pp.23-28
    • /
    • 1978
  • This paper describes a speech recognition system for ten isolated spoken digits. In this system, acoustic parameters such as zero crossing rate, log energy and three formant frequencies estimated by linear prediction method were extracted for classification and/or recognition purpose(s). The former two parameters were used for the classification of unvoiced consonants and the latter one for the recognition of vowels and voiced consonants. Promising recognition results were obtained in this experiment for ten digit utterances spoken by a male speaker.

  • PDF

A Study on Muscle Fatigue Changes using AR Model-based Median Frequency in EMG (AR모델을 이용한 중앙주파수의 근피로 변화에 관한 연구)

  • Cho, EunSeuk;Cha, Sam;Lee, Sangsik;Lee, Kiyoung
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.2 no.1
    • /
    • pp.17-22
    • /
    • 2009
  • In this paper, we extract well-known parameters such as zero crossing rate(ZCR), low band energy(Band) and median frequency(MDF) from surface electromyogram (EMG), and compare to evaluate themselves as measures for fatigue. In experiments, 3 males and 3 females volunteered to participate in surface EMG recordings placed on the biceps brachii and each recording experiment continued until exhaustion.

  • PDF

A Study on Order Decision of AR Model for Median Frequency in Fatiguing EMG (근피로 중앙주파수를 위한 AR모델의 차수결정에 관한 연구)

  • Cho, Eun Seuk;Cha, Sam;Lee, Ki Young
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.3 no.1
    • /
    • pp.8-12
    • /
    • 2010
  • In this paper, we studied on AR model order decision for extraction of EMG median frequency by t-test and ANOVA and comparison of median frequency. And we extracted well-known parameters such as zero crossing rate(ZCR), low band energy(Band) and median frequency(MDF) from surface electromyogram (EMG). And we compared to evaluate themselves as measures for fatigue.

  • PDF