• 제목/요약/키워드: Pitch Contour

검색결과 68건 처리시간 0.024초

직교 벡터 공간 변환을 이용한 음성 개성 변환 (Voice personality transformation using an orthogonal vector space conversion)

  • 이기승;박군종;윤대희
    • 전자공학회논문지B
    • /
    • 제33B권1호
    • /
    • pp.96-107
    • /
    • 1996
  • 본 논문에서는 직교 벡터 공간 변환을 이용한 새로운 음성 개성 변환 알고리즘을 제안하였다. 음성 개성 변환이란 임의 환자(source)가 가지고 있는 몇 개의 특징 변수를 다른 화자(target)의 특징 변수로 변환하는 기법이다. 본 논문에서는 LPC 켑스트럼 계수와 여기 신호의 스펙트럼, 그리고 피치 궤적을 변환하여 음성 개성변환을 구현하였다. LPC 켑스트럼 계수의 변환을 위해 직교 벡터 공간 변환 기법이 제안되었다. 이 기법은 KL(Karhunen-Loeve)변환을 이용한 principle component의 분리와 최소 자승 오차를 갖는 선형 좌표 변환을 통해 LPC 켑스트럼의 변환을 수행한다. 또한, 화자간의 운율적인 특징을 변환하기 위해 피치 궤적 변환 기법이 제안되었다. 피치 궤적 변환을 위하여 먼저 두 화자간의 기준 피치 패턴의 작성하고 기준 패턴간의 대응 관계를 추정한 후 이를 이용하여 source 화자의 피치 패턴이 target 피치 패턴으로 변환되도록 하였다. 컴퓨터를 이용한 모의 실험 결과 제안된 알고리즘은 객관적인 평가와 주관적인 평가에 있어서 우수한 성능을 나타내었다.

  • PDF

인공와우이식 난청인의 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인 간 관련성 (The Relationship Between Perception of Prosody, Pitch Discrimination, and Melodic Contour Identification in Cochlear Implants Recipients)

  • 김은연;문일준;조양선;정원호;홍성화
    • 인간행동과 음악연구
    • /
    • 제14권2호
    • /
    • pp.1-18
    • /
    • 2017
  • 본 연구에서는 인공와우이식 난청인(N = 15)을 대상으로 말소리 운율변화에 따른 구어 이해와 음도 변별, 선율윤곽 확인(Melodic contour identification: MCI) 간 관련성을 살펴보았다. 말소리 운율 변화에 따른 구어이해를 살펴보기 위해 말소리 운율지각 검사를 시행하였고, 긍정적인 운율과 부정적인 운율 조건에 따른 의미 변화를 피검자에게 판단하게 하였다. 검사 시 긍정적인 의미(Positive meaning: PW)와 중립적인 의미(Neutral meaning: NW)를 갖는 낱말 및 낱말 조합 형태를 제시하고, 긍정적인 운율과 부정적인 운율 조건에 따른 의미 변화를 피검자에게 판단하게 하였다. 음도 변별 검사를 위해서는 단음도 변화 변별 과제와 3개 음으로 구성된 패턴에서의 음도 변별 과제가 실시되었다. MCI 검사는 기대 확률을 달리한 세부 검사 1, 2로 구성하여 시행하였다. 실시한 검사 간 관련성을 살펴본 결과, 말소리 운율지각 검사 결과는 보청기 착용으로도 청지각적 이득을 기대할 수 없었던 기간과 유의한 관련성을 보였다. PW와 NW 검사에서 운율 조건에 따라 유의한 수행 차를 보였지만, 단어조합 형태에 따른 통계적 유의성은 발견하지 못하였다. 말소리 운율지각 검사 결과는 MCI 1과 유의한 상관을 보인 반면(p < .01), 말지각 검사 수행력과는 유의한 관련성을 보이지 않았다. 이는 인공와우이식 후 시각적 단서 없이 말소리, 음소 지각이 가능해졌다 하더라도 미묘한 운율 변화에 따른 의미 지각의 제한은 계속될 수 있음을 시사한다. 또한 인공와우이식 후 선율윤곽 변화 확인은 음도 변별에 비해 제한을 보이며, 운율지각과 관련 있음을 확인할 수 있었다.

가정의 물리적, 인적 음악 환경과 아동의 음악성 발달에 관한 연구 (A Study on Musical Home Environment and Children's Musical Development)

  • 김명순;이소희
    • 대한가정학회지
    • /
    • 제37권7호
    • /
    • pp.83-94
    • /
    • 1999
  • The purpose of this study was to explore musical development of 3- to S-year-old children and their musical home environment. The subjects were one hundred ninety-four children and their mothers enrolled in four kindergartens in Seoul. Each child sang the birthday song with peers in a birthday play setting. It was audiotaped for the children to sing the song. Questionnaire of musical home environment developed by the researchers was used for the mothers. The children's rhythm and pitch development were coded by the scoring categories of Project Spectrum(Krechevsky, 1994). The data were analyzed by t-test, ANOVA, Scheffe, and Pearson correlation. The results of this study were as follows: Firstly, there was no a significant difference in the children's rhythm development among three age-groups as well as between boys and girls. Among rhythm subcategories, the unit of note was ranked in the highest score and the pulse the next. Secondly, there were significant differences in children's pitch development among three age-groups and between boys and girls. The older children significantly achieved higher scores than the younger. Among pitch subcategories, the contour was ranked in the highest score and the interval the next. Thirdly, the children's musical development and their physical home environment related to music were correlated positively. The children's pitch development was significantly related to the mothers' musical attitude and the children's rhythm development to the mothers' educational levels.

  • PDF

Electroglottography를 사용한 한국어 폐쇄자음의 특성 및 임상적 적용 (Characteristics of Korean Stop Consonants by Using Electroglottography and Its Clinical Application)

  • 채윤정;김현기;홍기환
    • 음성과학
    • /
    • 제4권2호
    • /
    • pp.157-177
    • /
    • 1998
  • An electroglottography (EGG) was used to investigate the function of the vocal folds during their vibration. In this study, four Korean native speakers and 10 vocal polyp patients were selected. To investigate the dynamic change of EGG waveforms for the three-way distinction of Korean stops, a DSP-Sona graph model 5500, a Rino- Laryngeal stroboscope, a CSL model 4300B and a Laryngograph were used. An EGG Model 4338 was used to exam the vocal polyp of patients' voices during high, low, comfortable pitch production. The purpose of this study is to investigate the characteristics of Korean stop consonants in relation to pitch and to observe laryngeal movement during vocal fold vibration and speech production. The basic data accumulated during this research can be applied in clinical treatment. The results are as follows: on the Korean stop consonants, the aspirated stop is the highest in the GOT and PC1. On the angle of vowel contour, the angle of lenis is smaller than the angle of heavily aspirated and glottalized stops. The fundamental frequency is lowest at the lenis stop, In vocal polyp patients', the low pitch range is smaller than in normal speakers'. The pitch break and the vocal fry were observed. The jitter and OQ value are higher in vocal polyp patients than in those of normal speakers'.

  • PDF

RFC모델의 한국어 억양 곡선에의 적용 (Application of Rise/Fall/connection(RFC) Model to Korean Intonation)

  • 표경란;김형순;최규수
    • 대한음성학회지:말소리
    • /
    • 제35_36호
    • /
    • pp.157-173
    • /
    • 1998
  • This is a pilot study on applying the Rise/Fall/connection(RFC) model to Korean intonation tot speech synthesis. RFC model contains successive intonation events, which can be pitch accents and intonation boundary tones. The intonation contour of RFC model is composed of piecewise linear curves of rise, fall, and connection elements, and each element can have any amplitude and duration. In this paper, elements of RFC model is slightly modified to accommodate the characteristics of Korean intonation. Subjective preference test was conducted to compare the modified RFC model with the original one. The results show that the intonation contour produced by the modified RFC model is perceptually indistinguishable from that of the original RFC model, while the former requires less number of labels than the latter.

  • PDF

Stem-ML에 기반한 한국어 억양 생성 (Korean Prosody Generation Based on Stem-ML)

  • 한영호;김형순
    • 대한음성학회지:말소리
    • /
    • 제54호
    • /
    • pp.45-61
    • /
    • 2005
  • In this paper, we present a method of generating intonation contour for Korean text-to-speech (TTS) system and a method of synthesizing emotional speech, both based on Soft template mark-up language (Stem-ML), a novel prosody generation model combining mark-up tags and pitch generation in one. The evaluation shows that the intonation contour generated by Stem-ML is better than that by our previous work. It is also found that Stem-ML is a useful tool for generating emotional speech, by controling limited number of tags. Large-size emotional speech database is crucial for more extensive evaluation.

  • PDF

톱니형 휜이 부착된 원주의 근접후류특성 연구 (III) - 속도회복 메카니즘에 관하여 - (Characteristics of Near Wake Behind a Circular Cylinder with Serrated Fins (III) - Mechanism of Velocity Recovery -)

  • 류병남;김경천;부정숙
    • 대한기계학회논문집B
    • /
    • 제27권3호
    • /
    • pp.347-356
    • /
    • 2003
  • The characteristics of near wakes of circular cylinders with serrated fins are investigated experimentally using a hot-wire anemometer for various freestream velocities. Near wake structures of the fin tubes are observed using a phase average technique. With increasing fin height and decreasing fin pitch. oscillation of streamwise velocity increases. It file oscillation of lateral velocity decreases. The time averaged V-component velocity distribution of the finned tube is contrary to that of the circular cylinder due to the different strength of entrainment flow. This strength is affected by the distance of (equation omitted) = 1.0 contour lines. (equation omitted) = 1.0 contour line approaches to the wake center line when the fin density is increased. When the distance between (equation omitted) = 1.0 contour lines comes close the shear force should be increased and the flow toward the wake center line can be more strengthened because of the shear force. Factors related to the velocity recovery in the near wake of the finned tube are attributed to tile turbulent intensity, the boundary layer thickness. the position and strength of entrainment process.

영어 학습 시의 발성 교정 기술에 관한 연구 (Study on the pronunciation correction in English Learning)

  • 김재민;백승권;한민수
    • 한국음향학회:학술대회논문집
    • /
    • 한국음향학회 2000년도 하계학술발표대회 논문집 제19권 1호
    • /
    • pp.119-122
    • /
    • 2000
  • In this paper, we implement an elementary system to correct accent, pronunciation, and intonation in English spoken by non-native English speakers. In case of the accent evaluation, energy and pitch information are used to find stressed syllables, and then we extract the segment information of input patterns using a dynamic time warping method to discriminate and evaluate accent position. For the pronunciation evaluation. we utilize the segment information using the same algorithm as in accent evaluation and calculate the spectral distance measure for each phoneme between input and reference. For the intonation evaluation. we propose nine pattern of slope to estimate pitch contour, then we grade test sentences by accumulated error obtained by the distance measure and estimated slope. Our result shows that 98 percent of accent and 71 percent of pronunciation evaluation agree with perceptual measure. As the result of the intonation evaluation. system represent the similar order of grade for the four sentences having different intonation patterns compared with perceptual evaluation.

  • PDF

A Pedagogical Choice for Improving the Perception of English Intonation

  • Kim, Sung-Hye;Jeon, Yoon-Shil
    • 영어어문교육
    • /
    • 제15권4호
    • /
    • pp.95-108
    • /
    • 2009
  • One of the learning difficulties for Korean learners of English is the intonation of English focused yes/no questions. Focused words in English yes/no questions are realized as low pitch accents which contrast with high pitch accents in Korean counterparts. In order to improve Korean students' intonation, direct and metalinguistic explanations on the intonation of English focused yes/no questions were given to Korean learners of English. In pre-tests and post-tests, students' perceptions on the target items were measured. The study results showed that phonetic explanation using intonation contour enhanced students' perception on English intonation. With respect to the position of focused words, sentence initial and medial focused questions were more difficult than sentence final focused questions. The perception was most improved in sentence initial focused questions. The study showed the immediate effects of the explicit instruction on perceptions of English intonation.

  • PDF

표준 중국어의 경계억양에 관한 연구 (Study of Boundary Tone in Mandarin Chinese)

  • 손남호
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2003년도 5월 학술대회지
    • /
    • pp.43-47
    • /
    • 2003
  • This paper is phonetic study of $F_{0}$ range and boundary tone in Mandarin Chinese. The production data from 6 Chinese speakers show that there are declination, pitch resetting and tonal variation of boundary tone. In declarative sentence, $F_{0}$ declines gradually over the utterance but mid-sentence boundary prevents $F_{0}$ of following syllable from declining because of pitch resetting. $F_{0}$ range of syllable is expanded before the mid- and final sentence boundaries. In interrogative one, $F_{0}$ ascends gradually over the utterance and mid-sentence boundary makes $F_{0}$ of following syllable rise more. $F_{0}$ range of sentence final syllable is expanded and $F_{0}$ contour shows rising curve.

  • PDF