• 제목/요약/키워드: Voiced

검색결과 282건 처리시간 0.023초

유성음 구간 검출을 위한 간단한 알고리즘에 관한 연구 (A Study on the Simple Algorithm for Discrimination of Voiced Sounds)

  • 장규철;우수영;박용규;유창동
    • 한국음향학회지
    • /
    • 제21권8호
    • /
    • pp.727-734
    • /
    • 2002
  • 본 논문에서는 유ㆍ무성음 구간을 검출하기 위한 간단한 알고리즘을 제안한다. 제안된 방법은 음성의 유ㆍ무성음의 주기성에 대한 특성을 보완할 수 있는 저대역 에너지와 영교차율, 그리고 주기성의 안정성을 판단하기 위한 피치 변화량을 파라미터로 사용하였다. 유ㆍ무성음의 구간검출을 음소단위의 검출이라는 측면에서 접근하여 음소군의 검출율과 음소군내의 음소의 검출율을 얻었다. TIMIT코퍼스 (corpus)를 데이터베이스로 사용하여 실험했을 때 유성음 음소 검출율이 약 13% 향상되었다.

한국인 영어학습자의 영어 어말자음 유/무성에 따른 모음길이 변화현상에 대한 실험음성학적 연구 - 마찰음, 폐찰음 중심으로 한 발성실험을 통하여 - (An Experimental Studies on Vowel Duration Differences before Voiced and Voiceless Consonants pronounced by Korean Learners of English - From Fricatives and Affricates sounds -)

  • 신동진;사재진
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.91-95
    • /
    • 2005
  • The aim of this paper is to investigate the effects of postvocalic voicing(Contrasting voiceless fricative and affricate with voiced fricative and affricate) on vowel duration. In particular we focused on the durational differences between vowels followed by voiceless and voiced consonants across three groups of speakers: English speakers, English bilinguals and Korean learners of English. the result of experimental I showed that durations of vowels preceding voiced fricative and affricates as well as voiced stops are significantly longer than those preceding voiceless counterparts. Experiment Ⅱ indicated that as the subjects exposed themselves longer to English speaking society, their pronunciation was increasingly similar to those of English native speakers.

  • PDF

정보이론 기반 중세국어 'ㅸ'의 음운론적 대립에 대한 연구 (Information Theoretic Approach to Middle Korean [ß])

  • 박선우
    • 한국어학
    • /
    • 제79권
    • /
    • pp.63-89
    • /
    • 2018
  • This study explores contrastive relation among voiced bilabial fricative [${\ss}$], voiceless bilabial stop [p] and glide [w] in Middle Korean consonant system based on Probabilistic Model. Preceding researches about voiced bilabial fricative [${\ss}$] proposed two influential arguments. One is voiced bilabial fricative [${\ss}$] was an independent phoneme, the other is it was not an independent phoneme but an allophone of voiceless bilabial stop [p] in Middle Korean. This study applies Probabilistic Phonological Relationship Model (PPRM) for solving the problem of dichotomy about contrastive and allophonic relations. The analysis result of the contrastive entropy by PPRM suggests that voiced bilabial fricative [${\ss}$] was just an allophone of voiceless bilabial stop [p] or glide [w] in Middle Korean. Comparing the entropies between [p] and other consonants with the entropies between [${\ss}$] and other consonants, a continuum defined in terms of entropy reveals that [${\ss}$] in Middle Korean was more allophonic than phonemic.

유성음과 무성음의 경계를 이용한 연속 음성의 세그먼테이션 (Segmentation of continuous Korean Speech Based on Boundaries of Voiced and Unvoiced Sounds)

  • 유강주;신욱근
    • 한국정보처리학회논문지
    • /
    • 제7권7호
    • /
    • pp.2246-2253
    • /
    • 2000
  • In this paper, we show that one can enhance the performance of blind segmentation of phoneme boundaries by adopting the knowledge of Korean syllabic structure and the regions of voiced/unvoiced sounds. eh proposed method consists of three processes : the process to extract candidate phoneme boundaries, the process to detect boundaries of voiced/unvoiced sounds, and the process to select final phoneme boundaries. The candidate phoneme boudaries are extracted by clustering method based on similarity between two adjacent clusters. The employed similarity measure in this a process is the ratio of the probability density of adjacent clusters. To detect he boundaries of voiced/unvoiced sounds, we first compute the power density spectrum of speech signal in 0∼400 Hz frequency band. Then the points where this paper density spectrum variation is greater than the threshold are chosen as the boundaries of voiced/unvoiced sounds. The final phoneme boundaries consist of all the candidate phoneme boundaries in voiced region and limited number of candidate phoneme boundaries in unvoiced region. The experimental result showed about 40% decrease of insertion rate compared to the blind segmentation method we adopted.

  • PDF

영어의 유무성 폐쇄음 앞 모음 길이 차이에 대한 몇 가지 문제들 (Further Issues on the Duration Differences in Vowels due to the Voicing of the Following Stops in English)

  • 오은진
    • 말소리와 음성과학
    • /
    • 제4권3호
    • /
    • pp.85-92
    • /
    • 2012
  • It is a well-known phenomenon that vowel duration in English is generally longer before a voiced stop than a voiceless one. Past research has postulated that the closure duration of the voiceless stop is generally longer than that of the voiced stop and that the duration of a preceding vowel is determined complementarily by the closure duration of the stop. To shed further light on the phenomenon, this study examined fourteen native speakers of American English who read the monosyllabic words [bVC] (V = [i, ɪ, eɪ, ɛ, æ, ʌ, ɑ], C = [t, d]). First, we found that mean vowel duration was 38 ms longer before the voiced stop than the voiceless (mean duration ratio = 1.24). Second, mean closure duration of the voiced stop was only shorter by 5 ms compared to the voiceless stop (mean duration ratio = 0.97). Therefore, for our subjects, vowel duration was not determined complementarily by the closure duration of the following stop. Third, vowels with longer inherent durations (viz., tense, diphthong, and low vowels) tended to show larger duration ratios in the voiced and voiceless contexts than the vowels with shorter durations (viz., lax vowels). This indicates that the lengthening of inherently shorter vowels before a voiced stop is limited in order to avoid overlapping with longer vowels in the duration range. Fourth, there was no significant gender difference in vowel duration ratios in the contexts of voiced and voiceless stops. Finally, considerable individual differences were found in the vowel and consonant duration ratios.

음성신호에서 천이구간의 근사합성에 관한 연구 (A Study on Approximation-Synthesis of Transition Segment in Speech Signal)

  • 이시우
    • 한국콘텐츠학회논문지
    • /
    • 제5권3호
    • /
    • pp.167-173
    • /
    • 2005
  • 유성음원과 무성음원을 사용하는 음성부호화 방식에 있어서, 같은 프레임 안에 모음과 무성자음이 있는 경우에 음질저하현상이 나타난다. 본 논문에서는 같은 프레임 안에 유성음과 무정자음이 같이 존재하지 않도록 Zero Crossing Rate과 개별피치 펄스를 사용하여 무성자음을 포함한 천이구간을 추출하는 방법과 주파수대역을 분할하여 TSIUVC를 근사합성하는 방법을 제안한다. 실험결과, 0.547kHz 이하 2.813kHz 이상의 주파수 정보를 사용하여 TSIUVC 음성파형을 양호하게 근사합성 할 수 있었으며, TSIUVC의 추출율은 여자와 남자음성에서 각각 $91\%$$96.2\%$를 얻었다. 이 방법은 음성합성, 음성분석, 새로운 Voiced/Silence/TSIUVC의 음성부호화 방식에 활용할 수 있을 것으로 기대된다.

  • PDF

주파수 분할 및 최소 자승법을 이용한 TSIUVC 근사합성법에 관한 연구 (A Study on TSIUVC Approximate-Synthesis Method using Least Mean Square and Frequency Division)

  • 이시우
    • 한국멀티미디어학회논문지
    • /
    • 제6권3호
    • /
    • pp.462-468
    • /
    • 2003
  • 유성음원과 무성음원을 사용하는 음성부호화 방식에 있어서, 같은 프레임 안에 모음과 무성자음이 있는 경우에 음질저하 현상이 나타난다. 본 연구에서는 같은 프레임안에 유성음과 무성자음이 존재하지 않도록 FIR-STREAK 필터 와 zerocrossing rate을 이용한 개별피치 펄스를 사용하여 연속음성에서 무성자음을 포함한 천이구간(TSIUVC)을 탐색, 추출하는 방법을 제안한다. 또한 본 논문에서는 최송 자승법과 주파수 대역 분할을 이용한 TSIUVC 근사합성법을 제안하였다. 실험 결과, 0.547KHz 이하 2.813KHz 이상의 주파수 정보를 사용하여 TSIUVC 음성파형을 양호하게 근사합성할 수 있었으며, 최대 오차신호가 일그러짐이 적은 TSIUVC 근사합성 파형에 중요한 역할을 한다는 것을 알 수 있었다. 이 방법은 음성합성, 음성분석, 새로운 Voiced/Silence/TSIUVC의 음성부호화 방식에 활용할 수 있을 것으로 기대된다.

  • PDF

Level Crossing과 DPCM을 사용한 유성음/무성음/묵음의 분류 (Voiced/Unvoiced/Silence Classification of Speech Signal by Level Crossing and DPCM)

  • 김진영;성굉모
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1987년도 전기.전자공학 학술대회 논문집(II)
    • /
    • pp.1615-1618
    • /
    • 1987
  • 시간 영역에서 만들어진 음성신호의 파라미터을 이용하여 주어진 음성신호의 구간이 유성음, 무성음, 혹은 묵음인지를 분류하는 새로운 알고리듬을 제시하였다. 이에 사용한 파라미터은 구간내에서 샘플링된 값의 절대치 합과 일정한 level 이상의 peak의 합(T-peak), T-peak와 절대치 합의 비 그리고, DPCM의 절대치 합들이다. 이를 파라미터를 이용하여 간단히 유성음/무성음/묵음 구간을 분류 할였다. This paper proposes new algorithm for classifying speech signal frame into voiced, unvoiced, silence frame, using the parameters extracted from time domain behavior of speech signal The parameters used in this paper are absolute magnitude, the sum of peaks lager than reference level (T-peak), the ratio of T-peak to absolute magnitude and the magnitude of signal outputs of DPCM. Using this parameters, speech signal is more easily classified into voiced/unvoiced/silence frame.

  • PDF

입술 트릴의 방법에 따른 음향학적 및 전기성문파형검사 측정치 비교 (A comparison of acoustic & electroglottographic measures according to voiced lip trill methods)

  • 이승진;이광용;임재열;최홍식
    • 말소리와 음성과학
    • /
    • 제9권4호
    • /
    • pp.107-114
    • /
    • 2017
  • The purpose of the current study was to compare selected acoustic and electroglottographic measures (closed quotient, pitch, and loudness) among vowel phonation, traditional voiced lip trill ($VLT_T$), modified voiced lip trill methods ($VLT_M$). A total of 21 participants without voice complaints produced 4-second long samples using each phonation method. Results indicated that mean closed quotient of $VLT_M$ was higher than that of vowel phonation and $VLT_T$, while its range and standard deviation measures were higher than those of vowel phonation. Mean, range, standard deviation, maximum of pitch measures of $VLT_M$ were higher than those of vowel phonation. Lastly, mean and maximum loudness of the $VLT_M$ were higher than $VLT_T$. In conclusion, the current data indicate the possibility to use the $VLT_M$ as a training method for singing or a strategy to facilitate generalization effect of voice therapy. Current results also reflect the necessity for further study pertaining to the long-term effect of the $VLT_M$ training method. Clinical implications are discussed.

Using Korean Phonetic Alphabet (KPA) in Teaching English Stop Sounds to Koreans

  • Jo, Un-Il
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2000년도 7월 학술대회지
    • /
    • pp.165-165
    • /
    • 2000
  • In the phoneme level, English stop sounds are classified with the feature of 'voicing': voiceless and voiced (p/b, t/d, k/g). But when realized, a voiceless stop is not alwats the same sound. For example, the two 'p' sounds in 'people' are different. The former is pronounced with much aspiration, while the latter without it. This allophonic differnece between [$P^h$] and [p] out of an English phoneme /p/ can be well explained to Koreans because in Korean these two sounds exist as two different phonemes {/ㅍ/ and /ㅃ/ respectively). But difficulties lie in teaching the English voiced stop sounds (/b, d, g/) to Koreans because in Korean voiced stops do not exist as phonemes but as allophones of lenis sounds (/ㅂ, ㄷ, ㄱ/). For example, the narrow transcription of '바보' (a fool) is [baboo]. In the word initial position, Korean lenis stops are pronounced voiceless and even with a slight aspiration while in the inrervocalic environments they become voiced, That is in Korean voiced stops do not occur independently and neither they have their own letters. To explain all these more effectively to Koreans, it is very helpful to use Korean Phenetic Alphabet (KPA) which is devised by Dr. LEE Hyunbok (a professor of phonetics at Seoul National Univ. and chairman of Phonetic Society of Koera.)(omitted)

  • PDF