Search | Korea Science

The Emotion Recognition System through The Extraction of Emotional Components from Speech (음성의 감성요소 추출을 통한 감성 인식 시스템)

Park Chang-Hyun;Sim Kwee-Bo
- Journal of Institute of Control, Robotics and Systems, v.10 no.9, pp.763-770, 2004
The key issues in recognizing emotion from speech are feature extraction and pattern classification. Features should carry the information essential for classifying emotions, so feature selection is needed to decompose speech into its components and analyze the relation between features and emotions. Pitch, in particular, carries much emotional information. Accordingly, this paper investigates the relation between emotion and features such as loudness and pitch, and classifies emotions using statistics of the collected data. The paper deals with a method of recognizing emotion from sound. The most important emotional component of sound is tone. The inference ability of the brain also plays a part in emotion recognition. The paper empirically identifies the emotional components of speech, conducts experiments on emotion recognition, and proposes a recognition method that uses these emotional components and transition probabilities.
https://doi.org/10.5302/J.ICROS.2004.10.9.763

Assessment of Telephone Speech Transmission Quality by Opinion Test (오피니언 테스트에 의한 전화 음성품질 평가)

Kwon, Yoon-Ju;Jang, Dae-Young;Kang, Kyeong-Ok;Kang, Seong-Hoon
- The Journal of the Acoustical Society of Korea, v.11 no.1, pp.14-21, 1992
To establish the speech transmission quality of networks, a series of subjective tests was carried out for two transmission impairments: loudness rating (LR) and sidetone masking rating (STMR). The tests yielded the relationships of the mean opinion score (MOS) to LR and to STMR. We also obtained the cumulative MOS characteristics, which represent the percentage of subjects voting for each score. These results make it easier to achieve the strategic objective of customer satisfaction for present networks and new services.
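As an illustration of the two quantities named in this abstract, the following minimal Python sketch computes a mean opinion score and a cumulative vote distribution. It assumes a 5-point rating scale and reads "cumulative" as the percentage of votes at or above each score; the abstract states neither, and the function names and sample votes are hypothetical.

```python
from collections import Counter


def mean_opinion_score(ratings):
    """Arithmetic mean of the opinion ratings."""
    return sum(ratings) / len(ratings)


def cumulative_percentages(ratings, scale=(1, 2, 3, 4, 5)):
    """Percentage of ratings at or above each score on the scale (assumed reading)."""
    counts = Counter(ratings)
    total = len(ratings)
    return {
        score: 100.0 * sum(counts[s] for s in scale if s >= score) / total
        for score in scale
    }


# Hypothetical votes from ten listeners for one test condition.
votes = [5, 4, 4, 3, 4, 2, 5, 3, 4, 4]
mos = mean_opinion_score(votes)
cumulative = cumulative_percentages(votes)
print(mos)
print(cumulative)
```

Repeating this for each LR or STMR setting would give one MOS value and one cumulative curve per condition.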

Separation of Voiced Sounds and Unvoiced Sounds for Corpus-based Korean Text-To-Speech (한국어 음성합성기의 성능 향상을 위한 합성 단위의 유무성음 분리)

Hong, Mun-Ki;Shin, Ji-Young;Kang, Sun-Mee
- Speech Sciences, v.10 no.2, pp.7-25, 2003
Predicting the right prosodic elements is a key factor in improving the quality of synthesized speech. Prosodic elements include break, pitch, duration, and loudness. Pitch, which is realized as fundamental frequency (F0), is the element most important to the quality of synthesized speech. However, the previous method for predicting F0 has some problems. If voiced and unvoiced sounds are not classified correctly, the result is wrong pitch prediction, wrong triphone unit selection when synthesizing voiced and unvoiced sounds, and audible clicks or vibration. These errors commonly occur at transitions from voiced to unvoiced sound or from unvoiced to voiced sound. The problem cannot be resolved by grammatical methods, and it strongly affects the synthesized sound. Therefore, to obtain correct pitch values reliably, this paper proposes a new model for predicting and classifying voiced and unvoiced sounds using the CART tool.
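A CART model ultimately encodes a sequence of threshold tests on input features. The sketch below is not the paper's model: it is a hand-written two-test rule on short-time energy and zero-crossing rate, two features commonly used for voiced/unvoiced decisions. The features, thresholds, and function names are all assumptions chosen for illustration, since the abstract does not list the features used.

```python
import math


def frame_features(frame):
    """Return (mean energy, zero-crossing rate) for one frame of samples."""
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    zcr = crossings / (len(frame) - 1)
    return energy, zcr


def is_voiced(frame, energy_min=1e-3, zcr_max=0.25):
    """Two threshold tests, the kind of rule a small decision tree encodes.

    The thresholds are illustrative assumptions, not trained values.
    """
    energy, zcr = frame_features(frame)
    return energy >= energy_min and zcr <= zcr_max


# A 30 ms frame of a 120 Hz tone at 8 kHz stands in for a voiced sound.
voiced_like = [0.5 * math.sin(2 * math.pi * 120 * n / 8000) for n in range(240)]
# A rapidly alternating low-level signal stands in for an unvoiced sound.
unvoiced_like = [0.05 if n % 2 == 0 else -0.05 for n in range(240)]

print(is_voiced(voiced_like))
print(is_voiced(unvoiced_like))
```

In a trained tree the thresholds and the order of the tests would be learned from labeled frames rather than fixed by hand.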

The Effects of Korean Traditional Rhythm Therapy on Voice of Parkinson's Disease Patients: A Preliminary Study

Heo, Soo-Min;Jeong, Ok-Ran
- Speech Sciences, v.12 no.2, pp.59-72, 2005
The purpose of this study was to investigate the effects of a rhythm therapy program on maximum phonation time (MPT) and acoustic parameters in patients with Parkinson's disease. The therapy program used five Korean traditional rhythms: jinyang, jungmori, jungjungmori, jajinmori, and semachi. The therapy consisted of counseling on vocal hygiene and the actual therapy procedures. Six subjects with Parkinson's disease participated in the study: three in the experimental group and three in the control group. Pre- and post-therapy acoustic analyses were performed in both groups. The results were as follows: 1) MPT increased significantly in the experimental group, 2) mono-pitch improved significantly in the experimental group, 3) mono-loudness improved significantly in the experimental group, and 4) HNR increased significantly in the experimental group compared to the control group.

Acoustic Variations in Epileptic Patients with Topiramate (간질 치료제 복용으로 인한 음성학적인 변화에 대한 연구)

Choi, Yoon-Mi;Kim, Sun-Jun;Kim, Hyun-Gi
- Speech Sciences, v.14 no.4, pp.221-232, 2007
Topiramate (TPM) is a new antiepileptic drug that produces a clinically effective reduction in seizure frequency and is useful in a wide range of epileptic patients. Known side effects include weight loss, hypohidrosis, anorexia, sedation, nephrolithiasis, cognitive complaints, and language disorders. This study examines the acoustic characteristics of the speech of patients taking TPM. Fifteen patients were assessed with a Computerized Speech Lab (CSL) before the beginning of TPM therapy and 3 months after the medication had been stabilized. Tests were chosen to assess voice onset time (VOT), total duration (TD), vowel formants, loudness, pitch, speaking rate, and articulation patterns. We compared the data from patients with data from healthy volunteers. Statistical analysis showed no changes in the acoustic tests except for TD, which increased. The increase in TD is interpreted as a deterioration of fluency. Our results suggest that patients taking TPM did not experience acoustic speech changes other than a decline in fluency. Unlike previous studies, this study found that TPM medication was not associated with speech problems in patients with epilepsy.

Audio Loudness Measurement Methods and Control Technologies amid the Audio Loudness War (오디오 사운드 크기 경쟁 속 오디오 사운드 크기 측정 방법 및 컨트롤 기술)

Jo, Chung-Sang;Jang, Gyu-Sik;Kim, Je-U
- Information and Communications Magazine, v.29 no.4, pp.15-21, 2012
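As audio files (music, songs, and so on) encoded with MP3 (MPEG-1 Layer III), one of the digital audio technologies, have become widespread, portable multimedia devices have become essential items for modern people. At the same time, auditory fatigue is rising sharply, owing not only to portable devices but also to many kinds of noise. In addition, commercial music keeps increasing the loudness of audio sources in order to capture listeners' attention. These factors adversely affect people's hearing. This article describes the problems arising from the spread of portable multimedia devices and the increase in audio loudness, and reviews the audio loudness measurement and loudness control technologies proposed to address them.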
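To make the idea of measuring and controlling level concrete, here is a minimal Python sketch that computes the RMS level of a signal in dBFS and the gain needed to reach a target level. RMS level is only a rough proxy and is not a perceptual loudness measure such as ITU-R BS.1770; the article's own methods are not reproduced here. The function names, the test tone, and the -20 dBFS target are assumptions for illustration.

```python
import math


def rms_dbfs(samples):
    """RMS level in dB relative to full scale, for samples in [-1.0, 1.0]."""
    mean_square = sum(x * x for x in samples) / len(samples)
    if mean_square == 0.0:
        return float("-inf")  # digital silence has no finite level
    return 10.0 * math.log10(mean_square)


def gain_to_target(samples, target_dbfs):
    """Linear gain that moves the RMS level of the samples to target_dbfs."""
    return 10.0 ** ((target_dbfs - rms_dbfs(samples)) / 20.0)


# One second of a 440 Hz sine at half of full scale, sampled at 8 kHz.
tone = [0.5 * math.sin(2 * math.pi * 440 * n / 8000) for n in range(8000)]
level = rms_dbfs(tone)

# Scale the tone so that its RMS level sits at an assumed -20 dBFS target.
gain = gain_to_target(tone, -20.0)
normalized = [gain * x for x in tone]
print(round(level, 2), round(rms_dbfs(normalized), 2))
```

Applying one gain to a whole track in this way changes its overall level without altering its dynamics, which is the basic idea behind level normalization.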

Study on the annoyance response of subway station noise using jury evaluation test (청감실험을 통한 도시철도 승강장 소음의 성가심 반응에 대한 연구)

Kim, Dong-Jun;Kim, Deuk-Sung;Son, Jin-Hee;Chang, Seo-Il
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference, 2009.04a, pp.670-677, 2009
The purpose of this study is to establish a quantitative dose-response relationship between the noise on subway station platforms and the public response. Noise measured on subway station platforms was used for a jury test. To find the factors that influence the annoyance response to platform noise, jurors were examined for differences in annoyance response and for the relation between sound quality parameters and annoyance response. The platform noise level was 77.2 to 83.9 dBA, and most passengers on the platform were highly annoyed. Screen doors contributed to reducing the annoyance of platform noise. Analysis of the sound quality parameters shows that loudness and annoyance response are highly correlated.
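The correlation coefficient mentioned in this abstract can be illustrated with a short Python sketch of the Pearson coefficient. Whether the study used Pearson's coefficient is an assumption, and the loudness and annoyance values below are hypothetical numbers invented for the example; they are not data from the study.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


# Hypothetical values for illustration only; not data from the study.
loudness_sone = [20.0, 24.0, 27.0, 31.0, 35.0]
annoyance_rating = [4.1, 5.0, 5.4, 6.3, 6.9]

r = pearson_r(loudness_sone, annoyance_rating)
print(round(r, 3))
```

A value of r close to 1 would indicate that annoyance ratings rise almost linearly with loudness across the test sounds.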

Psychoacoustical Evaluation of Floor Impact Noise (바닥충격음의 심리음향학적 평가)

전진용;정정호;조문재
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference, 2001.05a, pp.253-258, 2001
Floor impact noises in apartment buildings were investigated because they are the most annoying noises in the living environment. Several experiments were undertaken to compare the perceived noisiness of floor impact noises generated by a bang machine and a tapping machine with those generated by children jumping and running. The results show that bang noise is more annoying than tapping noise, and that floor impact noise generated by children is less annoying than noise generated by the machines. The floor impact noise generated by children's jumping and running corresponds well with the bang-machine noise in terms of loudness, unbiased annoyance, $\Phi_0$, and IACC. The noise generated by children nevertheless differs somewhat from machine noise: in spatial impression the real noise is similar to tapping-machine noise, but it is less annoying than the machine noises.

Between Invention and Discovery: A. G. Bell's Photophone and Photoacoustic Research (발명과 발견의 사이에서: 앨릭잰더 그레이엄 벨의 포토폰과 광음향학 연구)

Ku, Ja-Hyon
- The Journal of the Acoustical Society of Korea, v.31 no.2, pp.73-78, 2012
The photophone, Alexander Graham Bell's device for transmitting sound through light, was patented in 1880. Its transmitter modulated strong light, such as sunlight, and reflected it to a distant receiver, which produced sound. The discovery that materials emit sound under illumination was essential to the working of the photophone. Eager for recognition in the scientific community, Bell carried out intensive research on the photoacoustic effect, focusing on presenting various methods of producing sound and of maximizing its loudness. Bell's research on photoacoustics succeeded in establishing him as a scientist and laid a foundation for photoacoustic analysis. His invention also became a basis for other researchers' subsequent technologies, such as fiber-optic communication.
https://doi.org/10.7776/ASK.2012.31.2.073

Design and Implementation of Speech-Training System for Voice Disorders (발성장애아동을 위한 발성훈련시스템 설계 및 구현)

정은순;김봉완;양옥렬;이용주
- Journal of Internet Computing and Services, v.2 no.1, pp.97-106, 2001
In this paper, we design and implement a complement-based speech training system for voice disorders. The system consists of three levels of training: precedent training, training for speech apprehension, and training for speech enhancement. To analyze the speech of people with voice disorders, we extract speech features such as loudness, amplitude, and pitch using digital signal processing techniques. The system converts the extracted features into a graphical interface that gives visual feedback on speech.

