통합 검색 | Korea Science

러시아어 파열음에 나타나는 연자음의 음향음성학적 연구 (A Phonetic Study of Russian Soft Plosives)

변군혁
- 대한음성학회지:말소리
- /
- 제61호
- /
- pp.15-29
- /
- 2007
The present study investigates acoustic cues of russian soft plosive consonants. In previous studies, russian soft consonants are distinguished from hard consonants by F1, F2 of following vowels. The result showed: (1) that F0 of soft plosive consonants in following vowels were lower than those of hard plosive consonants; (2) and that VOT of soft plosive consonants were longer than those of hard plosive consonants. Hence, the present that, in addition to F1, F2, VOT and F0 are detected as acoustic cues that differentiate soft plosive consonants from hard plosive consonant in Russian.
PDF

한국어 파열음 인식을 위한 피쳐 셉 입력 인공 신경망 모델에 관한 연구 (A STUDY ON THE IMPLEMENTATION OF ARTIFICIAL NEURAL NET MODELS WITH FEATURE SET INPUT FOR RECOGNITION OF KOREAN PLOSIVE CONSONANTS)

김기석;김인범;황희융
- 대한전기학회:학술대회논문집
- /
- 대한전기학회 1990년도 하계학술대회 논문집
- /
- pp.535-538
- /
- 1990
The main problem in speech recognition is the enormous variability in acoustic signals due to complex but predictable contextual effects. Especially in plosive consonants it is very difficult to find invariant cue due to various contextual effects, but humans use these contextual effects as helpful information in plosive consonant recognition. In this paper we experimented on three artificial neural net models for the recognition of plosive consonants. Neural Net Model I used "Multi-layer Perceptron ". Model II used a variation of the "Self-organizing Feature Map Model". And Model III used "Interactive and Competitive Model" to experiment contextual effects. The recognition experiment was performed on 9 Korean plosive consonants. We used VCV speech chains for the experiment on contextual effects. The speech chain consists of Korean plosive consonants /g, d, b, K, T, P, k, t, p/ (/ㄱ, ㄷ, ㅂ, ㄲ, ㄸ, ㅃ, ㅋ, ㅌ, ㅍ/) and eight Korean monothongs. The inputs to Neural Net Models were several temporal cues - duration of the silence, transition and vot -, and the extent of the VC formant transitions to the presence of voicing energy during closure, burst intensity, presence of asperation, amount of low frequency energy present at voicing onset, and CV formant transition extent from the acoustic signals. Model I showed about 55 - 67 %, Model II showed about 60%, and Model III showed about 67% recognition rate.
PDF

LPC를 이용한 평안방언의 음향지표에 관한 연구 (A Study for Acoustic Cues of Pyoung-An Do Dialect Using LPC)

송철규;이명호;김영배
- 대한의용생체공학회:의공학회지
- /
- 제13권3호
- /
- pp.195-200
- /
- 1992
This paper deal with the acoustic cues of Pyoung-An Do dialect using linear prediction. Also, this paper descrbes a statistical comparison between standard tone speech data and Pyoung-An Do dia lects. The analysis done mainly focused on the distribution of formants and pitch periods accord to ac- cents variation. For the purpose of objective comparison, the experiments are performed by extracts for- mant LPC spectrum and pithch periods from average magnitude difference function waveforms. Summing up the results, In disyllable words (VCV pattern) , prepositioned vowels have longer phona lion time than postpositioned vowels and the intrin, iii phonation time is whore longer in the low vowels than in the high ones. The africative consonants show the mixed characteristics of the plosive and frlc ative consonants. The remarkable acoustic cues are the low frequency noise-like waves just before the 1st formants in the plosive consonants, the high frequency noise-like waves in the fricative consonants, and phonation time is not affected by the kinds of prepositioned or postpositioned vowels.
PDF

연속음성에서 천이구간의 탐색, 추출, 근사합성에 관한 연구 (A Study on a Searching, Extraction and Approximation-Synthesis of Transition Segment in Continuous Speech)

이시우
- 한국정보처리학회논문지
- /
- 제7권4호
- /
- pp.1299-1304
- /
- 2000
In a speed coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and an unvoiced consonants in a frame. So, I propose TSIUVC(Transition Segment Including UnVoiced Consonant) searching, extraction ad approximation-synthesis method in order to uncoexistent with a voiced and unvoiced consonants in a frame. This method based on a zerocrossing rate and pitch detector using FIR-STREAK Digital Filter. As a result, the extraction rates of TSIUVC are 84.8% (plosive), 94.9%(fricative), 92.3%(affricative) in female voice, and 88%(plosive), 94.9%(fricative), 92.3%(affricative) in male voice respectively, Also, I obain a high quality approximation-synthesis waveforms within TSIUVC by using frequency information of 0.547kHz below and 2.813kHz above. This method has the capability of being applied to speech coding of low bit rate, speech analysis and speech synthesis.
PDF

조음 특성과 음소 대표 구간을 이용한 우리말 파열음의 인식 (Plosive consonants recognition using acoustic properties with the frames representing each phoneme)

박찬응;이쾌희
- 전자공학회논문지S
- /
- 제34S권4호
- /
- pp.33-41
- /
- 1997
Korean unvoiced phonemes consist of nonstationary parts comparing that the vowels and nasal consonants consist of quasi-stationary part. And some phonemes, which have smae point of articulation but differnt manner of articulation, has similar characteristics, so it makes to be hard to distinguish each other. A new method usin gchanges and characteristics of acoustic properties of these phonemes to improve recognition rate are proposed. And because these changes and cahracteristics evidently occur in continuous speech except some unvoiced consonants are articulated as voiced phoneme in case to be used as an midial between voiced phonemes, this method can be applied easily. The features of the frames extracted to represent each phonemes are used asinputs to the hierarchical neural network. And with these results final decision for phoneme recognition is made thorugh post processing which the new method is applied to. Through the experimental recognition results for 9 unvoiced consonants which belong to bilabial, alveolar, and velar phoneme series, 89.4% recognition rate to distinguish in same phoneme series is obtained, and 85.6% recognition rate is obtained in case of including cistinguishing phoneme series.
PDF

순환 신경망 모델을 이용한 한국어 음소의 음성인식에 대한 연구 (A Study on the Speech Recognition of Korean Phonemes Using Recurrent Neural Network Models)

김기석;황희영
- 대한전기학회논문지
- /
- 제40권8호
- /
- pp.782-791
- /
- 1991
In the fields of pattern recognition such as speech recognition, several new techniques using Artifical Neural network Models have been proposed and implemented. In particular, the Multilayer Perception Model has been shown to be effective in static speech pattern recognition. But speech has dynamic or temporal characteristics and the most important point in implementing speech recognition systems using Artificial Neural Network Models for continuous speech is the learning of dynamic characteristics and the distributed cues and contextual effects that result from temporal characteristics. But Recurrent Multilayer Perceptron Model is known to be able to learn sequence of pattern. In this paper, the results of applying the Recurrent Model which has possibilities of learning tedmporal characteristics of speech to phoneme recognition is presented. The test data consist of 144 Vowel+ Consonant + Vowel speech chains made up of 4 Korean monothongs and 9 Korean plosive consonants. The input parameters of Artificial Neural Network model used are the FFT coefficients, residual error and zero crossing rates. The Baseline model showed a recognition rate of 91% for volwels and 71% for plosive consonants of one male speaker. We obtained better recognition rates from various other experiments compared to the existing multilayer perceptron model, thus showed the recurrent model to be better suited to speech recognition. And the possibility of using Recurrent Models for speech recognition was experimented by changing the configuration of this baseline model.

한극파열자음발음시의 생리기전 (STUDIES ON KOREAN PHONOLOGY (PART II) -;HYSIOLOGICAL PRODUCTION MECHANISMS OF KOREAN STOP CONSONANTS(Summarized Version)-)

김병욱
- 대한치과의사협회지
- /
- 제10권9호
- /
- pp.605-625
- /
- 1972
한글의 파열자음(Plosive Consonants of stop consonants)인 ㄱ, R, ㅋ, ㅃ, ㅍ이나, ㄷ, ㄸ, ㅌ의 삼중 구별은 외국어에서는 찾아 볼수 없는 한글 고유의 특성이라고 생각되어 왔다. 이는 한국인 학자에게 뿐만 아니라 외국인 학자에게도 크나큰 관심사가 되어 왔다. 그 가장 큰 이유중의 하나가 소위 파열자음의 생리기전면에서 볼때, p,b나 k,g의 구별이 단순히 무성(Voiceless)대 유성(Voiced)의 차이냐 또는 근육의 힘의 차이냐 하는 문제가 오랫동안 국제학계에서 논난의 대상이 되어왔기 때문이다. 둘째로는 기이하게도 한글의 파열자음은 모두 무성자음으로 외국어에서 볼수있는 유성대 무성의 대조는 없으며, 다만 근육의 gal의 대조가 있지 않을까 생각되어 왔기 때문이다. 그러나 현재까지는 이 세가지 종류의 한글파열자음 발음시의 생리기전의 차이를 규명한 연구가 없었다. 저자는 이점에 착안하여 미국 위스컨신대학교 언어병리학 및 생리학 실험실에서 고도로 발달된 최신 연구기구를 이용하여 한글파열 자음 발음시의 생리기전의 차이를 규명하였다. 서론 연구방법, 연구결과, 결론에 관한 자세한 내용은 영문초록에 기술되어 있다.
PDF

연령에 따른 VCV 문맥에서 한국어 폐쇄음의 성대진동개시시간 (Effect of Age on the Voice Onset Time of Korean Stops in VCV contexts)

이슬기;이영미
- 말소리와 음성과학
- /
- 제7권3호
- /
- pp.37-44
- /
- 2015
This study investigated the effects of the age of Korean speakers, place of articulation, and phonation types on voice onset time (VOT) of stops. Twenty-five preschoolers, 25 schoolers, and 25 adults who had no history of speech and language impairment produced plosives in /VCV/ words in isolation. A three-way ($3{\times}3{\times}3$) mixed design was used with the age of speakers (preschoolers, schoolers, adults) as a between-subject factor, the place of articulation (bilabials, alveolars, velars) and phonation types (plain, tense, aspirated consonants) as a within-subject factor. The dependent measure was the VOT values. Results revealed that three main effects were statistically significant. Preschoolers exhibited longer VOTs than adults (p<.05). There were significant differences in VOTs among the place of articulation, showing that speakers had the longest VOTs for velars (velars > alvelars > bilabials) (all p<.05). In addition, the VOTs for aspirated consonants were longer than those for plain and tense consonants, and the differences were significant among three phonation types (aspirated > tense > plain) (all p<.05). The current results suggested that VOTs would be linked to age and development, and schoolers over the age of 11 years had achieved adult-like VOTs. Moreover, the place of articulation and phonation types in Korean stops showed marked factors in normal speakers' VOT patterns.
https://doi.org/10.13064/KSSS.2015.7.3.037 인용 PDF KSCI

포르만트 위치비교를 이용한 구개열 환자의 발음분석 (Sound Analysis of Cleft Platate Patinents Using Formant Position)

김덕원;송철규
- 대한의용생체공학회:의공학회지
- /
- 제11권2호
- /
- pp.283-288
- /
- 1990
As one of the main purpose of the physical management of cleft palate is to provide for the anatomic and physiologic requisites for speech, the speech must be as one of the criteria for determining when physical management has been achieved. But there is no objective methods to evaluate the speech of cleft palate patients. The authors tried to analyze the speech of adult cleft palate patients using sound spectrog raphy and compared with normal adults. The results were obtained as follows ; 1. In Vowels, cleft palate patients of both sexes showed reduction of frequency of the first and second formant as compared to normal. There was minimal difference in front vowels (i, e, ae) 2. In consonants, cleft palate patients showed reduction of frequency of the first formant in both sexes but reduction of frequency of the second formant was noticed only in fe- male patients. 3. There was no statistical difference in sound spectrograph between plosive, fricative, africative, nasal, and glide consonants.
PDF

청각장애 성인의 말명료도 예측 요인: 조음정확도를 중심으로 (Predicting Variables of Speech Intelligibility in Adults with Hearing Impairment: Focusing on Correct Articulation)

성희정;최은아;윤미선
- 대한음성학회지:말소리
- /
- 제61호
- /
- pp.1-14
- /
- 2007
The purpose of this study was to analyze the relationship between segmental correctness and speech intelligibility in adults with hearing impairment. Segmental correctness was measured by percentage of correct vowels(PCV) and percentage of correct consonants(PCC). The results were shown as follows: First, PCV and PCC could predict speech intelligibility with statistical significance. Second, in consonant classes divided by place and manner of articulation, the PCC of plosives and alveolar sounds were significant predicting variables in each group ($R^{2}=50%;\;59%$). According to this study, the importance of segmental correctness on speech intelligibility of adults with hearing impairment was confirmed. Also correctness of plosive sounds in manner and alveolar sounds in place were significant factors to speech intelligibility.
PDF

검색결과 13건 처리시간 0.168초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)