Search | Korea Science

On a Detection of the ZCR-Parameter for Higher Formants of Speech Signals (음성신호의 상위 포만트에 대한 ZCR-파라미터 검출에 관한 연구)

유건수
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1992.06a
- /
- pp.49-53
- /
- 1992
In many applications such as speech analysis, speech coding, speech recognition, etc., the voiced-unvoiced decision should be performed correctly for efficient processing. One of the parameters which are used for voice-unvoiced decision is zero-crossing. But the information of higher formants have not represented as the zero-crossing rate for higher formants of speech signals.
PDF

Implementation of Variable Threshold Dual Rate ADPCM Speech CODEC Considering the Background Noise (배경잡음을 고려한 가변임계값 Dual Rate ADPCM 음성 CODEC 구현)

Yang, Jae-Seok;Han, Kyong-Ho
- Proceedings of the KIEE Conference
- /
- 2000.07d
- /
- pp.3166-3168
- /
- 2000
This paper proposed variable threshold dual rate ADPCM coding method which is modified from the standard ADPCM of ITU G.726 for speech quality improvement. The speech quality of variable threshold dual rate ADPCM is better than single rate ADPCM at noisy environment without increasing the complexity by using ZCR(Zero Crossing Rate). In this case, ZCR is used to divide input signal samples into two categories(noisy & speech). The samples with higher ZCR is categorized as the noisy region and the samples with lower ZCR is categorized as the speech region. Noisy region uses higher threshold value to be compressed by 16Kbps for reduced bit rates and the speech region uses lower threshold value to be compressed by 40Kbps for improved speech quality. Comparing with the conventional ADPCM, which adapts the fixed coding rate. the proposed variable threshold dual rate ADPCM coding method improves noise character without increasing the bit rate. For real time applications, ZCR calculation was considered as a simple method to obtain the background noise information for preprocess of speech analysis such as FFT and the experiment showed that the simple calculation of ZCR can be used without complexity increase. Dual rate ADPCM can decrease the amount of transferred data efficiently without increasing complexity nor reducing speech quality. Therefore result of this paper can be applied for real-time speech application such as the internet phone or VoIP.
PDF

A Novel Algorithm for Discrimination of Voiced Sounds (유성음 구간 검출 알고리즘에 관한 연구)

Jang, Gyu-Cheol;Woo, Soo-Young;Yoo, Chang-D.
- Speech Sciences
- /
- v.9 no.3
- /
- pp.35-45
- /
- 2002
A simple algorithm for discriminating voiced sounds in a speech is proposed. In addition to low-frequency energy and zero-crossing rate (ZCR), both of which have been widely used in the past for identifying voiced sounds, the proposed algorithm incorporates pitch variation to improve the discrimination rate. Based on TIMIT corpus, evaluation result shows an improvement of 13% in the discrimination of voiced phonemes over that of the traditional algorithm using only energy and ZCR.
PDF

The Study on effect of the Muscle Activities for Dietshoes (Backless) (다이어트신발(Barkless)이 근육 활성도에 미치는 영향에 관한 연구)

Lee, Chang-Min;Oh, Yeon-Ju;Lee, Kyung-Deuk;Park, Seung-Bum;Lee, Hoon-Sik
- Korean Journal of Applied Biomechanics
- /
- v.16 no.3
- /
- pp.117-124
- /
- 2006
The modern convenient life formed by industrial development becomes lack of exercise and takes an interest in diet. Specially, professional walking shoes is developed as people take an interest in jogging, Those shoes, professional walking shoes or Dietshoes, increase exercise effects by change of heel types. Therefore, this study investigated motility effects by EMG experiment in order to measure Muscle Activities (MA) while wearing diet shoes (backless). Experiment was conducted by EMG measurement, from calf (gastrocnemius muscle), thigh (vastus muscle) and waist (erector spinae muscle), of 12 high school students. Exercise effects between the two shoes were analyzed by EMG (MF; Median Frequency, MPF; Mean Power Frequency, ZCR; Zero Crossing Rate). Results showed that the Dietshoes(MF: 48.21Hz, MPF: 65.0Hz, ZCR: 100.6Hz) had larger EMG value than that of Normal shoes(MF: 40.47Hz, MPF: 58.04Hz, ZCR: 82.09Hz). Also, in MA, the highest activities are showed in the calf, the second one is in waist, and last one is in thigh during gate. ANOVA between shoes in measurement parts showed significant effects in MF (gastrocnemius: p-value=.022, vastus laterals: p-value=.037, erector spinae: p-value=.082), MPF (gastrocnemius: p-value=.032, vastus laterals: p-value=.046, erector spinae: p-value=.090), and ZCR (gastrocnemius: p-value=.000, vastus laterals: p-value=.004, erector spinae: p-value=.134). And MA of Dietshoes is higher than that of Normal shoes, and decreasing rate of MA in Dietshoes is less than that of Normal shoes. Thus, this study validates exercise effects of Dietshoes.
https://doi.org/10.5103/KJSB.2006.16.3.117 인용 PDF

Coding Method of Variable Threshold Dual Rate ADPCM Speech Considering the Background Noise (배경 잡음환경에서 가변 임계값에 의한 Dual Rate ADPCM 음성 부호화 기법)

한경호
- Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
- /
- v.17 no.6
- /
- pp.154-159
- /
- 2003
In this paper, we proposed variable threshold dual rate ADPCM coding method which adapts two coding rates of the standard ADPCM of ITU G.726 for speech quality improvement at a comparably low coding rates. The ZCR(Zero Crossing Rate) is computed for speecd data and under the noisy environment, noise data dominant region showed higher ZCR and speech data dominant region showed lower ZCR. The speech data with the higher ZCR is encoded by low coding rate for reduced coded data and the speech data with the lower ZCR is encoded by high coding rate for speech quality improvements. For coded data, 2 bits are assigned for low coding rate of 16[Kbps] and 5 bits are is assigned for high coding rate of 40[Kbps]. Through the simulation, the proposed idea is evaluated and shown that the variable dual rate ADPCM coding technique shows the qood speech quality at low coding rate.
https://doi.org/10.5207/JIEIE.2003.17.6.154 인용 PDF KSCI

Spoken digit recognition Using the ZCR and PARCOR Coefficient (ZCR과 PARCOR 계수를 이용한 숫자음성 인식)

김학윤
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1985.10a
- /
- pp.75-78
- /
- 1985
본 연구는 시간 영역의 parament를 이용하여 한국어 숫자음(영, 일, 이, 삼, 사, 오, 육, 칠, 팔, 구)을 인식했다. 입력 음성 신호 X(n)의 Beginning Point와 Ending point를 ZCR(Zero-crossing Rate), Magnitude, Energy, Autocorrelation을 이용 Beginning point와 Ending point를 구하고 자음부의 인식은 위 계수들을 이용하여 행했다. 또, 유성음 부분에서는 PARCOR(Partial Autocorrelation), LPC(Linear Predictive Coding)를 이용 모음부와 유성자음을 인식하여 모음을 6개 부류(ㅏ, ㅑ, ㅗ, ㅜ, ㅠ, ㅣ)로 구분 인식했다. 이 방법에 의하면 입력 음성 신호 X(n)의 B.P(Beginning Point)와 E.P(Ending Point)를 쉽게 추출 가능하며 또한 각 Parameter를 이용하여 94.4%의 인식율을 얻었다.
PDF

Real-time implementation and performance evaluation of speech classifiers in speech analysis-synthesis

Kumar, Sandeep
- ETRI Journal
- /
- v.43 no.1
- /
- pp.82-94
- /
- 2021
In this work, six voiced/unvoiced speech classifiers based on the autocorrelation function (ACF), average magnitude difference function (AMDF), cepstrum, weighted ACF (WACF), zero crossing rate and energy of the signal (ZCR-E), and neural networks (NNs) have been simulated and implemented in real time using the TMS320C6713 DSP starter kit. These speech classifiers have been integrated into a linear-predictive-coding-based speech analysis-synthesis system and their performance has been compared in terms of the percentage of the voiced/unvoiced classification accuracy, speech quality, and computation time. The results of the percentage of the voiced/unvoiced classification accuracy and speech quality show that the NN-based speech classifier performs better than the ACF-, AMDF-, cepstrum-, WACF- and ZCR-E-based speech classifiers for both clean and noisy environments. The computation time results show that the AMDF-based speech classifier is computationally simple, and thus its computation time is less than that of other speech classifiers, while that of the NN-based speech classifier is greater compared with other classifiers.
https://doi.org/10.4218/etrij.2019-0364 인용 PDF KSCI

Fault Detection for Ceramic Heater in CVD Equipment using Zero-Crossing Rate and Gaussian Mixture Model (영교차율과 가우시안 혼합모델을 이용한 박막증착장비의 세라믹 히터 결함 검출)

Ko, JinSeok;Mu, XiangBin;Rheem, JaeYeol
- Journal of the Semiconductor & Display Technology
- /
- v.12 no.2
- /
- pp.67-72
- /
- 2013
Temperature is a critical parameter in yield improvement for wafer manufacturing. In chemical vapor deposition (CVD) equipment, crack defect in ceramic heater leads to yield reduction, however, there is no suitable ceramic heater fault detection system for conventional CVD equipment. This paper proposes a short-time zero-crossing rate based fault detection method for the ceramic heater in CVD equipment. The proposed method measures the output signal ($V_{pp}$) of RF filter and extracts the zero-crossing rate (ZCR) as feature vector. The extracted feature vectors have a discriminant power and Gaussian mixture model (GMM) based fault detection method can detect fault in ceramic heater. Experimental results, carried out by measured signals provided by a CVD equipment manufacturer, indicate that the proposed method detects effectively faults in various process conditions.
PDF KSCI

A Study on the Simple Algorithm for Discrimination of Voiced Sounds (유성음 구간 검출을 위한 간단한 알고리즘에 관한 연구)

장규철;우수영;박용규;유창동
- The Journal of the Acoustical Society of Korea
- /
- v.21 no.8
- /
- pp.727-734
- /
- 2002
A simple algorithm for discriminating voiced sounds in a speech is proposed in this paper. In addition to low-frequency energy and zero-crossing rate (ZCR), both of which have been widely used in the past for identifying voiced sounds, the proposed algorithm incorporates pitch variation to improve the discrimination rate. Based on TIMIT corpus, evaluation result shows an improvement of 13% in the discrimination of voiced phonemes over that of the traditional algorithm using only energy and ZCR.
PDF KSCI

Speech Emotion Recognition with SVM, KNN and DSVM

Hadhami Aouani ;Yassine Ben Ayed
- International Journal of Computer Science & Network Security
- /
- v.23 no.8
- /
- pp.40-48
- /
- 2023
Speech Emotions recognition has become the active research theme in speech processing and in applications based on human-machine interaction. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: the first one is extracting only 13 Mel-frequency Cepstral Coefficient (MFCC) from emotional speech samples and the second one is applying features fusions between the three features: Zero Crossing Rate (ZCR), Teager Energy Operator (TEO), and Harmonic to Noise Rate (HNR) and MFCC features. Secondly, we use two types of classification techniques which are: the Support Vector Machines (SVM) and the k-Nearest Neighbor (k-NN) to show the performance between them. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning. A large set of experiments are conducted on Surrey Audio-Visual Expressed Emotion (SAVEE) dataset for seven emotions. The results of our experiments showed given good accuracy compared with the previous studies.
https://doi.org/10.22937/IJCSNS.2023.23.8.6 인용 PDF

Search Result 33, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)