• Title/Abstract/Keywords: Pitch Contour

Search results: 68 items (processing time: 0.024 s)

An Analysis of Tonal Characteristics in Pre-school Children's Word Utterance

  • 이수연;정현주
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 7, No. 2 / pp. 85-94 / 2015
  • This study investigates the characteristics of tonal elements in the word utterances of 30 pre-school children. For the analysis, 240 utterances of 4-syllable words were processed to extract acoustic values, and the data were then transformed into tonal heights in order to examine the contour. The results show that the mean pitch of a note is $C4\frac{1}{2}$ (271.17 Hz), and that the high- and low-pitched notes are $C5\frac{1}{2}$ (452.57 Hz) and $G\sharp3\frac{1}{2}$ (192.54 Hz), respectively. The pitch patterns of the 4 syllables measured at the frication and aspiration portions are $E4\frac{1}{2}$-$F4$-$B3\frac{1}{2}$-$A3$ and $F4$-$E4$-$B3$-$A3$, respectively. The pitch patterns of consonant clusters are $B3\frac{1}{2}$-$D4$-$B3\frac{1}{2}$-$A3\frac{1}{2}$ and $A\sharp3\frac{1}{2}$-$C4$-$A3$-$D4\frac{1}{2}$. The analysis of tonal elements in this study provides evidentiary data on tonal height that can help in developing melodic contours.

A Simple Pitch Tracking Algorithm based on the Energy Operator

  • Tai-Ho Lee
    • Journal of the Institute of Convergence Signal Processing (융합신호처리학회논문지) / Vol. 5, No. 1 / pp. 1-5 / 2004
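  • A new method is presented for estimating the pitch-frequency trajectory of voiced speech. The method is based on applying the energy operator [1] twice. Kaiser's energy operator can extract the amplitude and frequency information of a sinusoid. According to the modulation model, voiced speech can be regarded as a sum of formants modulated by the pitch signal, so extracting the amplitude envelope of this waveform yields a waveform resembling the pitch signal. The pitch frequency is then obtained by detecting the average frequency of that waveform. The first stage is the same as Gopalan's approach [9], but in place of the later LPC-spectrum analysis and related steps, the energy operator is applied once more, giving a greatly simplified algorithm that can run online. The estimates are rough, but the method is expected to be useful for obtaining a general online sketch of the pitch trajectory.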
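
The abstract does not give formulas, so the following is only a minimal sketch of the building block it names: the discrete Teager-Kaiser energy operator, plus a DESA-2 style frequency estimate, checked on a pure sinusoid. DESA-2 is swapped in here as one well-known way to turn energy-operator outputs into a frequency; the function names, the 8 kHz sampling rate, and the 200 Hz test tone are illustrative assumptions, and this is not the paper's full two-stage algorithm.

```python
import math

def tkeo(x):
    """Discrete Teager-Kaiser energy: psi[n] = x[n]^2 - x[n-1] * x[n+1]."""
    return [x[n] * x[n] - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]

def desa2_mean_frequency(x, fs):
    """Average frequency (Hz) of a near-sinusoidal signal (DESA-2 estimate).

    Assumption: x is close to a single sinusoid below fs / 4.
    """
    # Symmetric difference y[n] = x[n+1] - x[n-1]; y[k] aligns with x[k+1].
    y = [x[n + 1] - x[n - 1] for n in range(1, len(x) - 1)]
    psi_x = tkeo(x)  # psi_x[k] aligns with x[k+1]
    psi_y = tkeo(y)  # psi_y[j] aligns with x[j+2]
    estimates = []
    for j, energy_y in enumerate(psi_y):
        energy_x = psi_x[j + 1]  # same time index as psi_y[j]
        if energy_x > 1e-12:     # skip samples with no usable energy
            c = 1.0 - energy_y / (2.0 * energy_x)
            c = max(-1.0, min(1.0, c))  # guard acos against rounding
            omega = 0.5 * math.acos(c)  # radians per sample
            estimates.append(omega * fs / (2.0 * math.pi))
    return sum(estimates) / len(estimates) if estimates else 0.0

# Hypothetical test tone: 200 Hz sampled at 8 kHz.
fs = 8000
tone = [math.cos(2 * math.pi * 200 * n / fs) for n in range(200)]
print(desa2_mean_frequency(tone, fs))
```

For a sinusoid of amplitude A and digital frequency Ω, the operator output is the constant A² sin²(Ω), which is why a ratio of two operator outputs can isolate the frequency. In the method described above, the input to a second pass would be the amplitude envelope recovered from voiced speech, not a clean tone.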

Prosodic Disambiguation of Low versus High Syntactic Attachment across Lexical Biases in English

  • Jeon, Yoon-Shil;Yoon, Kyu-Chul
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 4, No. 1 / pp. 55-65 / 2012
  • In this study, the prosodic disambiguation of the syntactic attachment differences was investigated in relation to the effect of lexical bias. Speech materials were composed of N1-conj-N2-PP phrases such as "walkers and runners with dogs." The results show that the use of durational pattern is dominant over the pitch pattern to differentiate the attachment differences. The characteristic pitch contour was the rise and fall over N1 and N2 in the high attachment. The pitch contour in the low attachment was the rise and fall over N2 and N3 although the frequency of such patterns was lower for the low attachment case. For the durational pattern, the lengthening in the N2 region plays a significant role in the disambiguation of the syntactic attachments. The interaction between the lexical bias and the syntactic attachment was not statistically significant in the duration data.

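On a Detection for the Fundamental Frequency of Speech Signals

  • 배명진
    • Acoustical Society of Korea: Conference Proceedings (한국음향학회:학술대회논문집) / Proceedings of the 11th Workshop on Speech Communication and Signal Processing, Acoustical Society of Korea, 1994 (SCAS Vol. 11, No. 1) / pp. 42-47 / 1994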
  • A pitch detector is an essential component in a variety of speech processing systems. Besides providing valuable insights into the nature of the excitation source for speech production, the pitch contour of an utterance is useful for speaker recognition and for aids to the handicapped, and it is required in almost all speech analysis-synthesis systems. Because of the importance of pitch detection, a wide variety of pitch detection algorithms have been proposed in the speech processing literature. In this paper, we discuss the various types of pitch detection algorithms that have been proposed so far. We then provide performance measurements for seven pitch detection algorithms.
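
The abstract does not say which seven algorithms were measured. As an illustration of what a pitch detector does, here is a minimal sketch of one widely used approach, picking the peak of the short-time autocorrelation; the function name, the 80-400 Hz search range, and the synthetic 200 Hz frame are assumptions made for this example only.

```python
import math

def autocorr_pitch(frame, fs, fmin=80.0, fmax=400.0):
    """Estimate F0 (Hz) of one voiced frame from its autocorrelation peak.

    Returns None when no lag in the search range has positive correlation.
    """
    min_lag = int(fs / fmax)                        # shortest period searched
    max_lag = min(int(fs / fmin), len(frame) - 1)   # longest period searched
    best_lag, best_value = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Unnormalized autocorrelation at this lag.
        value = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if value > best_value:
            best_lag, best_value = lag, value
    return fs / best_lag if best_lag else None

# Hypothetical voiced frame: a 200 Hz sinusoid, 50 ms at 8 kHz.
fs = 8000
frame = [math.sin(2 * math.pi * 200 * n / fs) for n in range(400)]
print(autocorr_pitch(frame, fs))
```

Running this over successive frames of an utterance gives a pitch contour. Real speech usually also needs a voiced/unvoiced decision and some protection against octave errors, which this sketch leaves out.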

Pitch Accent Realization in North Kyungsang Korean: Tonal Alignment as a Function of Nasal Position in Syllables

  • Sohn, Hyang-Sook
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 3, No. 2 / pp. 37-52 / 2011
  • This study investigates patterns of the alignment of the accentual peaks in bisyllabic words of the CVNCV, CVNV, and CVNNV structures in North Kyungsang Korean. Based on the tonal alignment, patterns of the F0 pitch excursion are discussed relative to one another. Issues are addressed concerning how the tonal targets are aligned, and how the tonal specifications of nasals in postvocalic, intervocalic, and prevocalic environments are supplied in the LH, HL, and HH classes. Tonal specification of nasals in various environments is accounted for by extension of the L target, displacement of the pitch peak, and interpolation between two tonal targets, depending on the tonal class. The results in this study provide preliminary evidence that the categorical alignment of the tonal targets is implemented by simply checking the presence or absence of a nasal before or after the nucleus vowel on the segmental string, without reference to the constituency of the nasal in the syllable structure. However, the prosodic structure has a key role to play in explaining speaker-dependent variations in the tonal alignment. Sensitivity to tautosyllabicity has an effect on the shape of the F0 contour, and disparity in the patterns of the pitch excursion is represented as a function of syllable structure correlated with segmental composition of the nasal.

The Rule of Korean Pitch Variation for a Natural Synthetic Female Voice

  • 김중원;박대덕;김보현;권철홍
    • The Journal of the Acoustical Society of Korea (한국음향학회지) / Vol. 15, No. 6 / pp. 26-32 / 1996
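  • This paper establishes pitch variation rules for a natural synthetic female voice. The basic unit to which the rules apply, the intonational phrase, consists mainly of one or more eojeol (word phrases). Its pitch variation curve is formed by connecting the pitch values of its first, second, and last syllables: the pitch values of the first and second syllables are determined by the initial consonant of each syllable, and the pitch value of the last syllable by the type of function word. Between intonational phrases there is either a boundary with a pause or a boundary without a pause, and relaxation occurs at boundaries with a pause. Together, these intonational-phrase pitch curves and boundary phenomena make up the pitch pattern of a sentence.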

A New EGG System Design and Speech Analysis for Quantitative Analysis of Human Glottal Vibration Patterns

  • 김종찬;이재천;김덕원;오명환;윤대희;차일환
    • Korean Society of Medical and Biological Engineering: Journal of Biomedical Engineering Research (대한의용생체공학회:의공학회지) / Vol. 20, No. 4 / pp. 427-433 / 1999
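  • This paper studies real-time detection of pitch information, an important parameter for improving the performance of high-quality speech coding and compression, speech recognition, and speech synthesis. To this end, a new EGG (electroglottograph) measurement system using modulation-demodulation spot electrodes was developed and used to study a stable pitch detection algorithm. Specifically, an EGG-based pitch-tracking algorithm was developed by determining glottal closure instants in real time from the EGG signal, and a comparison with a pitch tracker that uses the speech signal alone showed the EGG-based tracker to perform better. In addition, speech analysis using the EGG signal was carried out: by measuring and analyzing the various glottal vibration patterns of Korean speakers, a quantitative interpretation of the Korean voice-source model and of glottal signal patterns was obtained.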

A New Stylization Method using Least-Square Error Minimization on Segmental Pitch Contour

  • 이정철
    • Acoustical Society of Korea: Conference Proceedings (한국음향학회:학술대회논문집) / Proceedings of the 11th Workshop on Speech Communication and Signal Processing, Acoustical Society of Korea, 1994 (SCAS Vol. 11, No. 1) / pp. 107-110 / 1994
  • In this paper, we describe the features of the fundamental frequency (F0) contour of Korean read speech and propose a new stylization method to characterize the F0 pattern of segments. Our algorithm consists of three stylization processes: the segment level, the syllable level, and the word level. To stylize the F0 contour at the segment level, we applied a least-square error minimization method to determine the F0 values at the initial, medial, and final positions in a segment. At the syllable level, we determine the stylized F0 pattern of a syllable. Using the mean F0 value of each word and the style information for each word, syllable, and segment, we reconstruct the F0 contour of sentences. The simulation results show that the error is less than 10% of the actual F0 contour for each sentence. In a perception test, there was little difference between the speech synthesized with the original F0 contour and the speech synthesized with the stylized F0 contour.
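
The abstract does not spell out the segment-level fit, so the sketch below shows only one possible reading of it: model the F0 track of a segment as a piecewise-linear curve with knots at the initial, medial, and final positions, and choose the three knot values that minimize the squared error. The knot placement at relative times 0, 0.5, and 1, the evenly spaced frames, and all function names are assumptions for illustration.

```python
def hat_basis(t):
    """Weights of the (initial, medial, final) knots at relative time t in [0, 1]."""
    if t <= 0.5:
        return (1.0 - 2.0 * t, 2.0 * t, 0.0)
    return (0.0, 2.0 - 2.0 * t, 2.0 * t - 1.0)

def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            factor = m[r][col] / m[col][col]
            for c in range(col, 4):
                m[r][c] -= factor * m[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):  # back substitution
        tail = sum(m[r][c] * x[c] for c in range(r + 1, 3))
        x[r] = (m[r][3] - tail) / m[r][r]
    return x

def stylize_segment(f0):
    """Least-squares F0 values at the initial, medial, and final positions.

    Assumes at least three evenly spaced F0 samples covering the segment.
    """
    n = len(f0)
    ata = [[0.0] * 3 for _ in range(3)]  # normal-equation matrix A^T A
    atb = [0.0] * 3                      # right-hand side A^T b
    for i, value in enumerate(f0):
        w = hat_basis(i / (n - 1))
        for r in range(3):
            atb[r] += w[r] * value
            for c in range(3):
                ata[r][c] += w[r] * w[c]
    return solve3(ata, atb)

# Hypothetical F0 samples (Hz) for one segment.
print(stylize_segment([100.0, 110.0, 120.0, 115.0, 110.0]))
```

Three values per segment are enough to represent a rise, a fall, or a rise-fall, which fits the initial, medial, and final positions named in the abstract.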

The continuous or categorical effects for HH vs. HL and HH vs. LH in lexical pitch accent contrasts of Korean

  • Kim, Jungsun
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 6, No. 4 / pp. 53-65 / 2014
  • The current research examines whether pitch contour shapes in North Kyungsang pitch accent contrasts provide a phonetic dimension for phonological discreteness in a mimicry task. Two resynthesized pitch accent continua were created for HH vs. HL and HH vs. LH. To confirm a phonetic dimension that accounts for pitch accent categories in North Kyungsang Korean, the mimicries of speakers of two dialects (i.e., North Kyungsang and South Cholla) were compared. One of the findings showed that, for North Kyungsang speakers, the range of mean f0 peak times was a phonetic dimension undergoing a continuous shift within a stimulus continuum for both HH vs. HL and HH vs. LH. On the other hand, for South Cholla speakers, there were no apparent shifts around categorical boundaries for either HH vs. HL or HH vs. LH. Individual mimicries varied considerably in f0 peak timing. For HH vs. LH, three North Kyungsang speakers showed a discrete pattern reflecting a shift in phonological categories, but for HH vs. HL, there was no such distinction showing a categorical shift, though there were statistically significant differences for two speakers. Interestingly, one of the North Kyungsang speakers showed a continuous phonetic dimension for both HH vs. HL and HH vs. LH. Lastly, the f0 valley timing did not exhibit a discrete or gradient phonetic dimension for speakers of either dialect. These results suggest that a tonal target, such as the high tone in North Kyungsang pitch accent categories within the autosegmental-metrical (AM) theory, may be realized within individual cognitive systems for representing the interaction of perception and production.

An acoustical analysis of emotional speech using close-copy stylization of intonation curve

  • 이서배
    • Phonetics and Speech Sciences (말소리와 음성과학) / Vol. 6, No. 3 / pp. 131-138 / 2014
  • A close-copy stylization of intonation curve was used for an acoustical analysis of emotional speech. For the analysis, 408 utterances of five emotions (happiness, anger, fear, neutral and sadness) were processed to extract acoustical feature values. The results show that certain pitch point features (pitch point movement time and pitch point distance within a sentence) and sentence level features (pitch range of a final pitch point, pitch range of a sentence and pitch slope of a sentence) are affected by emotions. Pitch point movement time, pitch point distance within a sentence and pitch slope of a sentence show no significant difference between male and female participants. The emotions with high arousal (happiness and anger) are consistently distinguished from the emotion with low arousal (sadness) in terms of these acoustical features. Emotions with higher arousal show steeper pitch slope of a sentence. They have steeper pitch slope at the end of a sentence. They also show wider pitch range of a sentence. The acoustical analysis in this study implies the possibility that the measurement of these acoustical features can be used to cluster and identify emotions of speech.