Search | Korea Science

Jang, Kwang-Dong;Kwon, Oh-Wook
- Proceedings of the KSPS conference
- /
- 2005.11a
- /
- pp.163-166
- /
- 2005
We propose a speech emotion recognition method for natural human-robot interface. In the proposed method, emotion is classified into 6 classes: Angry, bored, happy, neutral, sad and surprised. Features for an input utterance are extracted from statistics of phonetic and prosodic information. Phonetic information includes log energy, shimmer, formant frequencies, and Teager energy; Prosodic information includes pitch, jitter, duration, and rate of speech. Finally a patten classifier based on Gaussian support vector machines decides the emotion class of the utterance. We record speech commands and dialogs uttered at 2m away from microphones in 5different directions. Experimental results show that the proposed method yields 59% classification accuracy while human classifiers give about 50%accuracy, which confirms that the proposed method achieves performance comparable to a human.
PDF

Ahn, Jung-Mo;Yim, Kye-Jong;Kay, Young-Chul;Koo, Myoung-Wan
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.2
- /
- pp.41-48
- /
- 1997
This paper proposes the methods for improving the recognition rate of theARS, especially equipped with the speech recognition capability. Telephone speech, which is the input to the ARS, is usually affected by the announcements from the system, channel noise, and channel distortion, thus directly applying the recognition algorithm developed for clean speech to the noisy telephone speech will bring the significant performance degradation. To cope with this problem, this paper proposes three methods: 1)the accurate detection of the inputting instant of the speech in order to immediately turn off the announcements from the system at that instant, 2)the effective end-point detection of the noisy telephone speech on the basis of Teager energy, and 3)the SDCN-based compensation of the channel distortion. Experiments on speaker-independent, noisy telephone speech reveal that the combination of the above three proposed methods provides great improvements on the recognition rate over the conventional method, showing about 77% in contrast to only 23%.
PDF

채장범
- Proceedings of the Korean Society of Precision Engineering Conference
- /
- 1996.04a
- /
- pp.259-263
- /
- 1996
In diagnosing of mechanical machinery, it is often improtant to get information about the movement inside the machine casing. If the values of internal tities may be derived from the measurement using sensors installed on the external casing, it would be much better in many senses. This paper discusses extracting internal gear movements byfrequencydemodulation from gear meshing force signatures which can be recovered from the vibrations though inverse filter. There are several way in demodulating signals. In this paper, especially, Hibert Transform, Wigner-Ville distribution, and Teager energy operator are examined and compared. Effects of noise on the frequency demodulation methods and the behavior of bandpass filtered noisy signal are discussed using simulated time-varying frequency signals.
PDF

Hadhami Aouani ;Yassine Ben Ayed
- International Journal of Computer Science & Network Security
- /
- v.23 no.8
- /
- pp.40-48
- /
- 2023
Speech Emotions recognition has become the active research theme in speech processing and in applications based on human-machine interaction. In this work, our system is a two-stage approach, namely feature extraction and classification engine. Firstly, two sets of feature are investigated which are: the first one is extracting only 13 Mel-frequency Cepstral Coefficient (MFCC) from emotional speech samples and the second one is applying features fusions between the three features: Zero Crossing Rate (ZCR), Teager Energy Operator (TEO), and Harmonic to Noise Rate (HNR) and MFCC features. Secondly, we use two types of classification techniques which are: the Support Vector Machines (SVM) and the k-Nearest Neighbor (k-NN) to show the performance between them. Besides that, we investigate the importance of the recent advances in machine learning including the deep kernel learning. A large set of experiments are conducted on Surrey Audio-Visual Expressed Emotion (SAVEE) dataset for seven emotions. The results of our experiments showed given good accuracy compared with the previous studies.
https://doi.org/10.22937/IJCSNS.2023.23.8.6 인용 PDF

Shin, Hoon-Chul;Han, Soo-Kyeong;Lyu, Joon-Soo;Cho, Soo-Hwan
- The Transactions of The Korean Institute of Electrical Engineers
- /
- v.67 no.7
- /
- pp.816-823
- /
- 2018
The Auto-synchronizer is essential equipment for synchronizing a generator to the power system. It is performing that measurement of the magnitude, frequency and phase of the voltage signal of the power system and generator. It is important to select the appropriate measurement algorithm for preventing various problem such as mechanical stress and Electrical problem. Teager Energy Operator(TEO) and Discrete separation algorithm(DESA) is measurable the instantaneous parameters of a sine wave using 5 samples and can be measured at a fast and with a simple operation. Therefore it has many advantages in measuring the parameters. In this paper, it confirmed measurement results using matlab simulations when there are synchronized in order of frequency, magnitude. Also it presented methods using digital filters and sample intervals to improve accuracy.
https://doi.org/10.5370/KIEE.2018.67.7.816 인용 PDF KSCI