• 제목/요약/키워드: fundamental frequency (F0)

검색결과 138건 처리시간 0.026초

L1-L2 Transfer in VOT and f0 Production by Korean English Learners: L1 Sound Change and L2 Stop Production

  • Kim, Mi-Ryoung
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.31-41
    • /
    • 2012
  • Recent studies have shown that the stop system of Korean is undergoing a sound change in terms of the two acoustic parameters, voice onset time (VOT) and fundamental frequency (f0). Because of a VOT merger of a consonantal opposition and onset-f0 interaction, the relative importance of the two parameters has been changing in Korean where f0 is a primary cue and VOT is a secondary cue in distinguishing lax from aspirated stops in speech production as well as perception. In English, however, VOT is a primary cue and f0 is a secondary cue in contrasting voiced and voiceless stops. This study examines how Korean English learners use the two acoustic parameters of L1 in producing L2 English stops and whether the sound change of acoustic parameters in L1 affects L2 speech production. The data were collected from six adult Korean English learners. Results show that Korean English learners use not only VOT but also f0 to contrast L2 voiced and voiceless stops. However, unlike VOT variations among speakers, the magnitude effect of onset consonants on f0 in L2 English was steady and robust, indicating that f0 also plays an important role in contrasting the [voice] contrast in L2 English. The results suggest that the important role of f0 in contrasting lax and aspirated stops in L1 Korean is transferred to the contrast of voiced and voiceless stops in L2 English. The results imply that, for Korean English learners, f0 rather than VOT will play an important perceptual cue in contrasting voiced and voiceless stops in L2 English.

요들송에 대한 전기성문파형검사(EGG)를 이용한 발성학적 접근 (A Phonetic Analysis of Yodel Singing by the Electroglottographic(EGG) Measurement)

  • 서동일;최헝식
    • 음성과학
    • /
    • 제7권2호
    • /
    • pp.113-126
    • /
    • 2000
  • A comparative phonetic analysis of Yodel singing and Belcanto singing by the electroglottographic(EGG) measurement was done in three singers. One professional tenor singer(SDI) who is also well trained in Yodel singing, another yodler(KWS) who is not so trained in Belcanto singing, and the other training tenor singer(CSK) who is not well trained both yodel and Belcanto singing. Closed quotient(CQ), speed quotient(SQ) and fundamental frequency (F0) at the initial modal part(I) , middle falsetto part(M), and final modal part(F) of the same phrase were measured by EGG machine and program(Kay model 4338). In the middle part, not only CQ but also SQ of the Yodel singing were much smaller than that of Belcanto singing in all three singers. However, accuracy of parameters in Belcanto singing of the yodler(KWS) and both Yodel singing and Belcanto singing of the training singer(CSK) were inferior to that of trained tenor singer(SDI). Possible advantages of utilizing Yodel singing training under the guidance of feedback control by the EGG for hyperfunctional voice disorders such as vocal nodules were discussed.

  • PDF

AMDF 함수를 이용한 음성 신호의 피치 추정 Algorithm들에 관한 연구 (A Study of the Pitch Estimation Algorithms of Speech Signal by Using Average Magnitude Difference Function (AMDF))

  • 소신애;이강희;유광복;임하영;박지수
    • 예술인문사회 융합 멀티미디어 논문지
    • /
    • 제7권4호
    • /
    • pp.235-242
    • /
    • 2017
  • 본 논문은 음성 신호의 Average Magnitude Difference Function (AMDF)에서 peaks (혹은 nulls)들을 찾는 알고리즘들을 제안하였다. AMDF 함수는 Autocorrelation Function (ACF)과 같이 음성 신호의 피치를 추정하는 함수로 널리 사용 하고 있다. 음성신호에서 fundamental frequency (F0)를 estimation하는 것은 매우 중요한 task이며 또한 상당한 어려움이 따른다는 것이 여러 연구들을 통해서 잘 알려진 사실이다. 본 논문에서는 AMDF 함수의 특성을 이용하여 개발한 두 가지의 알고리즘을 제시하였다. 첫째는 Local Minima에 Threshold 값을 적용하여 피치 주기를 측정 할 수 있는 nulls들을 찾아내는 알고리즘이고, 다음은 AMDF 함수와 ACF 함수 사이의 관계식을 응용한 알고리즘이다. 한국어의 감정 표현 언어들로 구성된 제시문을 널리 사용하고 있는 상용 기기로 녹음한 음성 신호를 본 논문이 제안한 알고리즘들에 적용하여서 시뮬레이션을 통해 음성 신호의 피치 주기를 측정하여서 그 성능을 알아보았다.

입출력 공동 주파수 동조를 통한 VCO의 성능 개선에 관한 연구 (A Study on the Improvement of Performance in VCO Using In/Out Common Frequency Tuning)

  • 서경환;장정석
    • 한국전자파학회논문지
    • /
    • 제21권5호
    • /
    • pp.468-474
    • /
    • 2010
  • 본 논문에서는 K-band(18.6 GHz) 대역에서 동작하는 VCHO(Voltage Controlled Harmonic Oscillator)를 설계 및 제작하였다. 제안된 구조의 발진기는 두 개의 hair-pin 공진기들이 각각 능동소자의 입력단과 출력단에 위치한다. 또한 두 개의 공진기를 동시에 동조하는 구조를 통하여 기본 주파수 억압 특성과 2차 고조파($2f_0$)의 출력을 개선하였다. VCHO의 제작 및 측정 결과에 의하면 출력 전력은 -2.41 dBm, 기본 주파수 억압 특성은 -21.84 dBc 그리고 위상 잡음은 -101.44 dBc/Hz @ 100 kHz의 특성을 얻을 수 있었다. 또한 바렉터 다이오드의 전압 변화에 따른 주파수 동조 범위는 약 10.58 MHz를 얻었으며, 이 때 ${\pm}0.19\;dB$의 전력 평탄도를 얻을 수 있었다.

체배기 이론을 이용한 Ka-대역 고조파 믹서 설계 (A Ka-band Harmonic Miter Design Using Multiplier Theory)

  • 고민호;강석엽;박효달
    • 한국통신학회논문지
    • /
    • 제30권11A호
    • /
    • pp.1104-1109
    • /
    • 2005
  • 본 논문에서는 주파수 채배기 이론에 근거하여 단일 능동소자로 입력된 기본 LO 주파수($f_{LO}$)의 3차 고조파 성분($3f_{LO}$)의 진폭이 최대가 되는 바이어스 전압을 선택하여 두 입력신호($f_{RF}$, $f_{LO}$)에 대해서 고차 출력신호성분($f_{RF}{\pm}3f_{LO}$)이 최대가 되는 고조파 먹서(harmonic mixer)를 설계 및 제작하였다. 제안된 설계 방법에 의해서 제작된 고조파 먹서는 플라스틱(Plastic) 패키지의 MESFET 소자를 사용하여 기존 Ka-대역에서 동작하는 믹서 회로들이 나타내는 높은 부품 가격, 생산성 및 회로의 복잡도 문제를 해결할 수 있었으며 RF 주파수신호($f_{RF}$=33.5GHz)에 대해서 LO 주파수 신호($f_{LO}$=11.5 GHz)의 3차 고조파 신호($3f_{LO}$=34.5 GHz)가 최대가 되는 게이트 바이어스 전압을 선택하여 중간주파수($3f_{LO}-f_{RF}$=1.0GHz)에서 -10 dB의 낮은 변환 손실 특성을 나타내었다.

성전환자와 정상인이 발성한 모음의 음향분석과 지각실험 (An Acoustic Analysis and Perceptual Study of Korean Vowels Produced by Transgenders and Noraml Adults)

  • 조성미;정옥란
    • 음성과학
    • /
    • 제10권3호
    • /
    • pp.145-155
    • /
    • 2003
  • This study compared $F_{0}$ and the first three formants of eight Korean monophthongs produced by nine transgenders (male to female) to those of eighteen normal adults. Voice analysis was done by Praat (version 4.049). A one-way ANOVA with Tukey HSD post hoc tests were performed to determine statistical differences in $F_{0}$ and formant values obtained from transgenders, and normal male and female subjects. Results indicated that there was no significant difference in $F_{1}$ of /u/, /$\Lambda$/, and /o/, $F_{2}$ of /u/, /$\Lambda$/, and /i/ and $F_{3}$ of /u/ among the 3 groups (transgenders, normal males and normal females). However, in the comparison of transgenders vs. males, a significant difference was observed in $F_{0}$ of /o/, and $F_{2}$ of /i/, /a/, /e/, and /${\ae}$/ and $F_{3}$ of /e/. Furthermore, in the comparison of transgenders vs. females, a significant difference was also observed in $F_{0}$ of all vowels, $F_{1}$ of /i/, /$\alpha$/, /e/, /${\ae}$/, and /i/. $F_{2}$ of /i/, and /${\ae}$/, and $F_{3}$ of /i/, /$\alpha$/, /$\Lambda$/, /e/, /${\ae}$/, /i/, and /o/. Also, perceptual judgment of the transgenders' voice came out somewhat correlated strongly with their $F_{0}$ values but not much with the formant values. It was concluded that the transgenders' acoustic parameters are placed in between those of the normal males and females in. terms of fundamental and formant frequency analyses of vowels. Thus, it was assumed that those differences might stem from the transgenders' original big resonating cavities.

  • PDF

전기성문전도(EGG) 시스템의 개발 및 평가 (Implementation and Evaluation of Electroglottograph System)

  • 김기련;김광년;왕수건;허승덕;이승훈;전계록;최병철;정동근
    • 대한의용생체공학회:의공학회지
    • /
    • 제25권5호
    • /
    • pp.343-349
    • /
    • 2004
  • 전기성문전도는 발성시에 성문의 진동이 전기적 임피던스를 이용하여 검출되는 신호이다. 본 연구는 이러한 전기성문전도를 기록하기 위한 장비를 구현하고 음성분석 및 후두질환 진단에 대한 적용생을 평가하고자 하였다. 전기성문전도의 하드웨어는 2 쌍의 링전극, 동조증폭기, 검파기, 저역통과필터, 자동이득조절부 등으로 구성되며, 2.7MHz의 반송파 신호를 이용하고 진폭 변조 방식의 검파를 통해 임피던스 신호를 추출하도록 하였다. 추출된 신호는 PC 사운드 카드의 라인 입력을 통해 샘플링되고 양자화되었다. 검출 신호를 분석하기 위한 파라미터는 패래 시간을(CQ), 개폐 속도율(SQ), 개폐속도지수(SI), 성대진동 주파수(F0), 성대진동 주파수변동지수(Jitter), 성대진동 진폭변동지수(Shimmer) 등을 추출하였다. 전기성문전도를 분석한 결과, F0가 증가할수록 CQ는 커지고, SQ와 SI는 작아지는 경향을 보였으며, 전기성문전도와 음성 선호의 기본주파수가 일치함을 알 수 있었다. CQ, SQ, SI는 정상인과 후두암 환자를 비교한 결과 유의한 차이를 보였다. 이러한 결과는 성대의 운동을 관찰할 수 있는 휴대용 전기성문전도 계측기의 구현이 가능하게 하였고, 성대 기능 이상 검사가 가능함을 시사하였다.

읽기과제에서 나타난 뇌성마비인의 기본주파수 및 진폭의 분포 특성 (Distributions on F0 and Amplitude of Persons with Cerebral Palsy in the Reading Task)

  • 남현욱;최양규
    • 대한음성학회지:말소리
    • /
    • 제66호
    • /
    • pp.1-20
    • /
    • 2008
  • The purpose of this study was to investigate the characteristics of fundamental frequency(F0) and amplitude distributions in persons with cerebral palsy(CP) in the reading task. Participants were divided into three groups: 6 persons with spastic CP, 6 persons with athetoid CP and 6 normal persons who are around 15-20 years old. On the results of this study, firstly, in F0 distributions, most of the spastic CPs tended to appear narrow distributions on the basis of mode, but most of the athetoid CPs were opposite, and both of the CP groups tended to distribute highly on lower and higher frequencies than mean and mode. On the other hand, normal persons had a tendency to appear narrow distributions on the basis of mode. Finally, in amplitude distributions, the spastic CPs showed a tendency that there are little differences between the distribution of mode and the others, and most of the athetoid CPs showed a tendency that the distributions of mode were higher than the others. In addition to, the normal persons had a tendency that the distributions of mode were remarkably higher than both of the CP groups.

  • PDF

Closure Duration and Pitch as Phonetic Cues to Korean Stop Identity in AP-medial Position: Perception Test

  • Kang, Hyun-Sook;Dilley, Laura
    • 음성과학
    • /
    • 제14권4호
    • /
    • pp.25-39
    • /
    • 2007
  • The present study investigated some perceptual phonetic attributes of two Korean stop types, aspirated and lax, in medial position of an accentual phrase. The intonational pattern across syllables (Jun, 1993) is argued to depend on the type of stop (aspirated vs. lax) only in the initial position of an accentual phrase. In Kang & Dilley (2007), we showed that significant differences between aspirated and lax stops in medial position of an accentual phrase exist in closure duration, voice-onset time, and fundamental frequency (F0) values for post-stop vowels. In the present perception experiment, we investigated whether these phonetic attributes contribute to the perception of these two types of stops: The closure durations and/or F0's of post-stop vowels on accentual-phrase medial words were altered and twenty native Korean speakers then judged these words as beginning with an aspirated or lax stop. Both closure duration and F0 significantly affected judgments of stop identity. These results indicate that a wider range of acoustic cues that distinguish aspirated and lax Korean stops in production also plays a role in perception. To account for these results we suggest some phonetic and phonological models of consonant-tone interactions for Korean.

  • PDF

L2 Proficiency Effect on the Acoustic Cue-Weighting Pattern by Korean L2 Learners of English: Production and Perception of English Stops

  • Kong, Eun Jong;Yoon, In Hee
    • 말소리와 음성과학
    • /
    • 제5권4호
    • /
    • pp.81-90
    • /
    • 2013
  • This study explored how Korean L2 learners of English utilize multiple acoustic cues (VOT and F0) in perceiving and producing the English alveolar stop with a voicing contrast. Thirty-four 18-year-old high-school students participated in the study. Their English proficiency level was classified as either 'high' (HEP) or 'low' (LEP) according to high-school English level standardization. Thirty different synthesized syllables were presented in audio stimuli by combining a 6-step VOTs and a 5-step F0s. The listeners judged how close the audio stimulus was to /t/ or /d/ in L2 using a visual analogue scale. The L2 /d/ and /t/ productions collected from the 22 learners (12 HEP, 10 LEP) were acoustically analyzed by measuring VOT and F0 at the vowel onset. Results showed that LEP listeners attended to the F0 in the stimuli more sensitively than HEP listeners, suggesting that HEP listeners could inhibit less important acoustic dimensions better than LEP listeners in their L2 perception. The L2 production patterns also exhibited a group-difference between HEP and LEP in that HEP speakers utilized their VOT dimension (primary cue in L2) more effectively than LEP speakers. Taken together, the study showed that the relative cue-weighting strategies in L2 perception and production are closely related to the learner's L2 proficiency level in that more proficient learners had a better control of inhibiting and enhancing the relevant acoustic parameters.