• 제목/요약/키워드: Pitch frequency

검색결과 473건 처리시간 0.023초

제주 방언의 낱말 악센트 (Word Accent of Cheju Dialects in Korean)

  • 박순복
    • 대한음성학회지:말소리
    • /
    • 제55권
    • /
    • pp.33-43
    • /
    • 2005
  • This paper investigates the word accent pattern of Cheju dialects in Korean and determines whether it varies according to the age as well as the word itself and where the speakers come from. On the basis on the theory of pitch accent, which was suggested by Koo(1993) and Jung(1965) for the Korean standard accent, the fundamental frequency of each syllable is measured. The syllable that has the highest frequency is labelled for 2, while the rests for 1. The results of the experiment are that the two syllabic words have 21 accent pattern, while the three syllabic words 121 pattern and the four syllabic words 1211. In addition to this characteristic of accent pattern in Cheju dialects, it is interesting that the older the speakers, the less accent pattern the utterance has as suggested above.

  • PDF

외 후두부 길이와 발화기본주파수 간의 상관관계 (Correlation Between the External Laryngeal Length and the Habitual Speaking Fundamental Frequency)

  • 남도현;임성수;최홍식
    • 말소리와 음성과학
    • /
    • 제1권4호
    • /
    • pp.187-193
    • /
    • 2009
  • For this study, the external laryngeal lengths of 9 females and 9 males with normal voices were measured together with their ages, heights, and weights, and after they read aloud sentences for 3 minutes, their habitual speaking fundamental frequencies, speaking low pitches, speaking high pitches, and vocal fold closed quotients were measured. The Spearman rank correlation analysis on these data showed a significant negative correlation between the external laryngeal length and the habitual speaking fundamental frequency for both females and males, a significant negative correlation between the external laryngeal length and the speaking high pitch for only males, a significant negative correlation between the external laryngeal length and the speaking low pitch for both females and males, and a significant positive correlation between the external laryngeal length and the vocal fold closed quotient for only males.

  • PDF

음성으로부터 감성인식 요소 분석 (Analyzing the element of emotion recognition from speech)

  • 박창현;심재윤;이동욱;심귀보
    • 한국지능시스템학회:학술대회논문집
    • /
    • 한국퍼지및지능시스템학회 2001년도 추계학술대회 학술발표 논문집
    • /
    • pp.199-202
    • /
    • 2001
  • 일반적으로 음성신호로부터 사람의 감정을 인식할 수 있는 요소는 (1)대화의 내용에 사용한 단어, (2)톤 (Tone), (3)음성신호의 피치(Pitch), (4)포만트 주파수(Formant Frequency), 그리고 (5)말의 빠르기(Speech Speed) (6)음질(Voice Quality) 등이다. 사람의 경우는 주파수 같은 분석요소 보다는 론과 단어, 빠르기, 음질로 감정을 받아들이게 되는 것이 자연스러운 방법이므로 당연히 후자의 요소들이 감정을 분류하는데 중요한 인자로 쓰일 수 있다. 그리고, 종래는 주로 후자의 요소들을 이용하였는데, 기계로써 구현하기 위해서는 조금 더 공학적인 포만트 주파수를 사용할 수 있게 되는 것이 도움이 된다. 그러므로, 본 연구는 음성 신호로부터 피치와 포만트, 그리고 말의 빠르기 등을 이용하여 감성 인식시스템을 구현하는 것을 목표로 연구를 진행하고 있는데, 그 1단계 연구로서 본 논문에서는 화가 나서 내뱉는 알과 기쁠 때 간단하게 사용하는 말들을 기반으로 하여 극단적인 두 가지 감정의 독특한 특성을 찾아낸다.

  • PDF

인공후두 제어원으로서의 흉골설골근 사용의 타당성 검증 (Electromyographic Study of the Sternohyoid Muscle to Control an Electrolarynx)

  • 민혜정;봉정표
    • 대한의용생체공학회:의공학회지
    • /
    • 제17권2호
    • /
    • pp.201-208
    • /
    • 1996
  • We have been studying an implant type EMG-controlled electrolarynx. First of all, we propose the sternohyoid muscle(SH) as a control source of the electrolarynx. The purpose of this study is to investigate the possibility that subjects control voluntarily the constriction of their SH, and produce the control signals of electrolarynx. For this pwnan, we carried out four experiments regarding the control of the electrolarynx. At the results, we found that subjects can control the start/stop of constriction and the amplitude of EMG of their SH. Also, we ascertained the possibility that the start/stop of contraction of SH controls OW/OFF of sound source of the electrolarynx and the amplitude of UG of SH controls the pitch frequency of the electrolarynx.

  • PDF

Noise Cancellation System Based on Frequency Domain Adaptive Filter Using Modified DFT Pair

  • Nakanishi, Isao;Nakamura, Youichi;Itoh, Yoshio;Fukui, Yutaka
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2000년도 ITC-CSCC -1
    • /
    • pp.225-228
    • /
    • 2000
  • It is well known that a Frequency Domain Adaptive Filter (FDAF) converges faster than a Time Domain Adaptive Filter (TDAF) even when the input signal is colored such as a speech signal. We have proposed the FDAF using the Modified Discrete Fourier Transform Pair (MDFTP) and its realization and effectiveness has been confirmed through the computer simulations. In this paper, we apply the FDAF using the MDFTP to the noise cancellation system. The proposed system is based on the Adaptive Line Enhancer (ALE) and utilizes single microphone; therefore it is suitable for the portable electronic equipment. Moreover, we propose to utilize the MDFT for detecting of the pitch in the speech because the number of data points in the MDFT must be equal to the pitch to confirmed that the noise can be removed to near the level of SNR.

  • PDF

위상 보상된 고조파 스케일링에 의한 음성합성용 피치변경법 (On a Pitch Alteration Method using Scaling the Harmonics Compensated with the Phase for Speech Synthesis)

  • 배명진
    • 한국음향학회지
    • /
    • 제13권6호
    • /
    • pp.91-97
    • /
    • 1994
  • 신호처리에서, 파형부화법은 음성신호의 잉여성분을 감소시킴으로써 파형을 유지하는 부호화 방법이다. 음성 합성의 경우, 고음질의 파형부호화법은 주로 분석에 의한 합성법에 이용된다. 그러나, 파형부호화법은 여기 파라미터와 성도 파라미터로 분리하지 않고 처리하기 때문에 규칙에 의한 합성에 적용되기 어렵다. 따라서 파형부호화법을 규칙에 의한 합성에 이용하기 위해서는 피치변경이 필요하다. 본 논문에서, 우리는 파형부호화법에서 음성신호를 성도 파라미터와 여기 파라미터로 분리함으로써 피치 주기를 바꿀 수 있는 새로운 피치변경법을 제안한다. 이 방법은 시-주파수 혼성영억 방법으로 시간영역에서 파형의 위상성분과 주파수영역에서 파형의 진폭성분을 보존한다. 따라서 파형부호화법은 음성처리에 있어 규칙에 의한 합성을 할 수 있다. 본 논문에서 제안한 알고리즘을 이용한 경우, 단지 $2.94\%의$ 스펙트럼 왜곡만이 일어났다. 즉, 스펙트럼 왜곡이 시간영역에서의 피치변경법보다 $5.06\%$ 이상 감소되었다.

  • PDF

정지비행 조건에서의 축소 로터 실험을 통한 소음 예측 기법 검증 (Validation of Noise Prediction Theory Using Scaled Rotor Experiment for Hovering Condition)

  • 민안기;이재하;이욱;최종수
    • 한국항공우주학회지
    • /
    • 제40권3호
    • /
    • pp.201-208
    • /
    • 2012
  • 본 논문에서는 정지비행 조건에서의 무향실 내 축소 로터 실험을 이용해 Lowson의 하중 소음식과 FW-H의 음향상사식으로 예측한 이산 주파수 소음(Discrete frequency noise)을 검증하였다. 소음 예측 기법의 방향성(Directivity) 검증은 전반적으로 실험결과와 유사하게 예측되었으며, 거리에 대한 검증의 경우 근거리(Near-field)에서는 FW-H식의 예측결과가, 원거리(Far-field)에서는 Lowson식의 예측결과가 실험결과와 더 유사한 것을 확인하였다. 피치 각(Collective pitch angle)에 대한 검증의 경우 낮은 피치각에서는 FW-H식의 예측결과가, 높은 피치각에서는 Lowson식의 예측결과가 실험결과와 더 유사한 것을 확인하였다.

Speaker Verification System with Hybrid Model Improved by Adapted Continuous Wavelet Transform

  • Kim, Hyoungsoo;Yang, Sung-il;Younghun Kwon;Kyungjoon Cha
    • The Journal of the Acoustical Society of Korea
    • /
    • 제18권3E호
    • /
    • pp.30-36
    • /
    • 1999
  • In this paper, we develop a hybrid speaker recognition system [1] enhanced by pre-recognizer and post-recognizer. The pre-recognizer consists of general speech recognition systems and the post-recognizer is a pitch detection system using adapted continuous wavelet transform (ACWT) to improve the performance of the hybrid speaker recognition system. Two schemes to design ACWT is considered. One is the scheme to search basis library covering the whole band of speech fundamental frequency (speech pitch). The other is the scheme to determine which one is the best basis. Information cost functional is used for the criterion for the latter. ACWT is robust enough to classify the pitch of speech very well, even though the speech signal is badly damaged by environmental noises.

  • PDF

개선된 혼성영역 교차상관법에 의한 G.723.1의 피치검색시간 단축에 관한 연구 (A Study on the Pitch Search Time Reduction of G.723.1 Vocoder by Improved Hybrid Domain Cross-correlation)

  • 조왕래;최성영;배명진
    • 전기학회논문지
    • /
    • 제59권12호
    • /
    • pp.2324-2328
    • /
    • 2010
  • In this paper we proposed a new algorithm that can reduce the open-loop pitch estimation time of G.723.1. The time domain cross-correlation method is simple but has long processing time by recursive multiplication. For reduction of processing time, we use the method that compute the cross-correlation by multiplying the Fourier value of speech by it's complex conjugate. Also, we can reduce the processing time by omitting the bit-reversing of FFT and IFFT for time-frequency domain transform. As a result, the processing time of improved hybrid domain cross-correlation algorithm is reduced by 67.37% of conventional time domain cross-correlation.

음성과 사상체질: 음원을 중심으로 (Voice and Sasang Constitution: In terms of source functions)

  • 문승재;박종주;황혜정
    • 대한음성학회지:말소리
    • /
    • 제48호
    • /
    • pp.19-33
    • /
    • 2003
  • Sasang Constitutional Medicine, a branch of traditional Korean medicine, believes that the health of human beings can be promoted by taking advantage of the fact that people have different constitutions. It utilizes the characteristics in human voice to diagnose the constitution of the patients. This study aims at establishing the relationship between Sasang constitutions and their corresponding voice characteristics by investigating source-related variables. Voice recordings of 23 patients from three different constitutions were obtained whose constitutions had been already diagnosed by the experts in the fields. Fundamental frequency related variables (average pitch, maximum/minimum pitch, pitch range), phonation type, speaking tempo were measured and analyzed for each group. The phonation type seemed to be a possible candidate for a successful variable to determine constitution. No statistically significant relationship was manifested between other variables and constitutions. Despite its failure to firmly establish the relationship between voice and constitutions, the current study suggests that future research should include not only source-related variables

  • PDF