• Title/Summary/Keyword: 포먼트

Search Result 98, Processing Time 0.029 seconds

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

  • Lee, Chang-Heon;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.29 no.1
    • /
    • pp.62-68
    • /
    • 2010
  • This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.

Influence of standard Korean and Gyeongsang regional dialect on the pronunciation of English vowels (표준어와 경상 지역 방언의 한국어 모음 발음에 따른 영어 모음 발음의 영향에 대한 연구)

  • Jang, Soo-Yeon
    • Phonetics and Speech Sciences
    • /
    • v.13 no.4
    • /
    • pp.1-7
    • /
    • 2021
  • This study aims to enhance English pronunciation education for Korean students by examining the impact of standard Korean and Gyeongsang regional dialect on the articulation of English vowels. Data were obtained through the Korean-Spoken English Corpus (K-SEC). Seven Korean words and ten English mono-syllabic words were uttered by adult, male speakers of standard Korean and Gyeongsang regional dialect, in particular, speakers with little to no experience living abroad were selected. Formant frequencies of the recorded corpus data were measured using spectrograms, provided by the speech analysis program, Praat. The recorded data were analyzed using the articulatory graph for formants. The results show that in comparison with speakers using standard Korean, those using the Gyeongsang regional dialect articulated both Korean and English vowels in the back. Moreover, the contrast between standard Korean and Gyeongsang regional dialect in the pronunciation of Korean vowels (/으/, /어/) affected how the corresponding English vowels (/ə/, /ʊ/) were articulated. Regardless of the use of regional dialect, a general feature of vowel pronunciation among Korean people is that they show more narrow articulatory movements, compared with that of native English speakers. Korean people generally experience difficulties with discriminating tense and lax vowels, whereas native English speakers have clear distinctions in vowel articulation.

Study of Emotion in Speech (감정변화에 따른 음성정보 분석에 관한 연구)

  • 장인창;박미경;김태수;박면웅
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2004.10a
    • /
    • pp.1123-1126
    • /
    • 2004
  • Recognizing emotion in speech is required lots of spoken language corpus not only at the different emotional statues, but also in individual languages. In this paper, we focused on the changes speech signals in different emotions. We compared the features of speech information like formant and pitch according to the 4 emotions (normal, happiness, sadness, anger). In Korean, pitch data on monophthongs changed in each emotion. Therefore we suggested the suitable analysis techniques using these features to recognize emotions in Korean.

  • PDF

Teaching Method of Correct Pronunciation from Formant Statistics (포먼트 통계치를 이용한 발음교정 지시 방법에 관하여)

  • Bak Il-Suh;Jo Cheol-Woo
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.69-72
    • /
    • 2004
  • In this paper, we tried to develop a vowel training assistant method using vowel formant statistics. Formant statistics were obtained from PBW set consists of 452 words from 8 persons. Then, we calculated distance from input formants to each center of vowel formant space. Based on the distance, direct ions to correct the speaker's manner of articulation, i .e. position of jaw and tongue.

  • PDF

한국어 단모음의 분석 및 인식에 관한 고찰

  • Lee, Yong-Ju
    • ETRI Journal
    • /
    • v.8 no.1
    • /
    • pp.6-15
    • /
    • 1986
  • 본고는 보상훈련 기간 중 일본 동북대학 응용정보학 연구센타에서 수행한 연구 결과를 기술한 것이다. 음소 단위에 의한 한국어의 대용량 단어인식을 위한 기초연구로서, 그 기본이 되는 단모음을 대상으로 포먼트 주파수에 의한 음운간의 특징 및 발성자간의 개인성의 분산을 살펴보고 Battacharyya 거리를 구하여 음운간의 식별의 곤란성을 도출하였다. 또한, Karbunen-Loeve변환 및 Bayes결정에 의한 인식 그리고 spectral local peak에 의한 인식등의 실험에 의해 효과적인 인식 방법에 관하여 고찰하였다 .

  • PDF

On the Pitch Alteration Methods for a High Quality Speech Synthesis (고음질 합성을 위한 피치변경법)

  • 배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.12 no.2
    • /
    • pp.66-77
    • /
    • 1993
  • 고음질 합성을 위해서는 파형부호화법이 바람직하다. 파형부호화법을 규칙에 의한 음성합성기법에 적용하기 위해서는 메모리용량의 문제와 피치변경법이 해결되어져야 한다.메모리 용량의 문제는 최근 반도체 기술에 의해 극복되어 졌으며 이제는 음원피치변경의 문제가 남아있다. 따라서 본 논문에서는 성도 포먼트의 특성은 변화시키지 않고, 음원피치를 변경시키는 문제에 대해 정리하였다. 먼저 기존의 제안된 몇가지 기법들의 장단점들을 열거한 다음에 우리 연구실에서 제안했던 방법들에 대해 논의하고자 한다.

  • PDF

A Study on Monitoring of Liver Function Based on Voice Signal Analysis for u-Health System (u-Health 시스템을 위한 음성신호 분석 기반의 간 기능 모니터링에 관한 연구)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The KIPS Transactions:PartB
    • /
    • v.18B no.6
    • /
    • pp.389-396
    • /
    • 2011
  • There is getting worse to various liver diseases due to change in eating habits, stress, alcohol etc in modern society. Therefore, we proposed methodology to diagnose early for liver disease to study the influence on voice in liver diseases. To this end, we carried out experiment to apply parameter of voice analysis to collect each voice inpatients and patients by treatment of liver diseases patients. Particularly, we carried out experiment to apply element value of pronunciation and the third formant frequency bandwidths about velar sounds associated liver in oriental medicine, then to produce objective index resonance cavity and influence vocalization in liver diseases. In addition, we carried out to study about design of system to monitoring a liver function in u-Health environment based on result by experiment.

A Study on the Channel Normalized Pitch Synchronous Cepstrum for Speaker Recognition (채널에 강인한 화자 인식을 위한 채널 정규화 피치 동기 켑스트럼에 관한 연구)

  • 김유진;정재호
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.61-74
    • /
    • 2004
  • In this paper, a contort- and speaker-dependent cepstrum extraction method and a channel normalization method for minimizing the loss of speaker characteristics in the cepstrum were proposed for a robust speaker recognition system over the channel. The proposed extraction method creates a cepstrum based on the pitch synchronous analysis using the inherent pitch of the speaker. Therefore, the cepstrum called the 〃pitch synchronous cepstrum〃 (PSC) represents the impulse response of the vocal tract more accurately in voiced speech. And the PSC can compensate for channel distortion because the pitch is more robust in a channel environment than the spectrum of speech. And the proposed channel normalization method, the 〃formant-broadened pitch synchronous CMS〃 (FBPSCMS), applies the Formant-Broadened CMS to the PSC and improves the accuracy of the intraframe processing. We compared the text-independent closed-set speaker identification on 56 females and 112 males using TIMIT and NTIMIT database, respectively. The results show that pitch synchronous km improves the error reduction rate by up to 7.7% in comparison with conventional short-time cepstrum and the error rates of the FBPSCMS are more stable and lower than those of pole-filtered CMS.

A Proposal for Effect Analysis Techniques of Kidney Hand Acupuncture through Face Image and Voice Signal Measurement (얼굴 영상 및 음성신호 측정을 통한 신장 수지침 효과 분석 기법의 제안)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.37 no.3C
    • /
    • pp.217-223
    • /
    • 2012
  • In this paper, we would like to propose techniques to analyze effect according to stimulation kidney associated hand acupuncture by applying technique to measure changes of facial image and voice signal. To this end, we measured color change of JIGAK(jaw) area associated kidney in facial image and voice signal stimulation before and after of kidney associated hand acupuncture. In addition, we measured changes of the first formant frequency bandwidth and Shimmer to element of voice signal analysis in connection with kidney in experiment. We can be measured reduction of the first formant frequency bandwidth and Shimmer, black of JIGAK area according to stimulation of kidney associated hand acupuncture. Finally, we would like to demonstrate objective effect of kidney associated hand acupuncture through the analysis of statistical significance by measurement techniques of facial image and voice signal.

Analysis of Association Relationship Between A16 Acupuncture Point and Heart Function Using Voice Signals (음성신호를 이용한 A16 혈자리와 심장 기능의 연관관계 분석)

  • Kim, Bong-Hyun;Cho, Dong-Uk
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.35 no.11B
    • /
    • pp.1651-1658
    • /
    • 2010
  • As indicators of life quality have recently shown great improvement, early stage medical examination and health care patterns are usually preformed before diseases occur. Thus, hand acupuncture, as an alternative medicine to reflect these movements of preventative work and health care, is widely used these days. Therefore, in this paper, we measured the change of voice signals elements associated with heart by stimulating the heart A16 acupuncture point, and then we investigated possible improvements of cardiac function through analysis of cross-comparisons between measurements of cardiac changes. With this in mind, we collected voice samples associated with heart before and after stimulating the corresponding A16 acupuncture point, and we performed an experiment by applying the second formant bandwidth and Jitter. As result, stimulating the A16 acupuncture point results to lowering the second formant bandwidth and Jitter. The result has proven that using voice signal processing technology can help improvement of heart function.