Search | Korea Science

Analysis of Speech Signals by linear prediction and It's Application (선형 예측법에 의한 음성신호의 분석과 그 응용 방안)

김명규
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.18 no.4
- /
- pp.27-33
- /
- 1981
In this paper, the effect of tone variation of speech signals is discussedty showing the variations of the linear prediction model spectra and the estimated vocal tract shape for Korean vowels. As an application of the analysis results a speech spenthesis scheme by combination of phonemes is also discussed based on experimental results.
PDF

Analysis and Comparisons of Acoustical Characteristics of Pathologic Voice before and after Surgery (후두질환에 대한 술전 술후 음성의 음향적 특성비교 분석)

Kim, Dae-Hyun;Jo, Cheol-Woo;Baek, Moo- Jin;Wang, Soo-Geun
- Speech Sciences
- /
- v.7 no.3
- /
- pp.285-294
- /
- 2000
In this paper the acoustic characteristics of pathological voice, which are measured before and after surgical operation, are compared. This experiment is conducted for the purpose of predicting patients' speech after operation. The voices are recorded from the same patients. Jitter, shimmer and other parameters are. computed and their statistical characteristics are compared. Also spectral changes, such as formant frequency shift and spectral slope change, are compared. From the experimental results, it is verified that not only source characteristics but also vocal tract components vary. And this indicates that the modification of source parameters are not enough for the prediction. Also the result indicates that the operation causes change to both the physical shape of vocal folds and the manner of articulation.
PDF

Hunminjeongeum Phonetics (I): Phonetic and Phoniatric Consideration for Explanation of Designs of Middle Vowel Letters (훈민정음 음성학(I): 중성자(홀소리) 제자해에 대한 음성언어의학적 고찰)

Choi, Hong-Shik
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.33 no.2
- /
- pp.77-82
- /
- 2022
Hunminjeongeum was made by the Great King Sejong, and composed of 17 consonant and 11 vowel letters. All the 28 letters were made according to the shape of vocal organ or space at the point of articulation for each letters. This review article focused on phonetic and phoniatric consideration for explanation of the designs of the middle vowel letters, especially three main vowel letters [ • (天, heaven), ㅡ (地, earth), ㅣ (人, human)] using video-fluoroscopic evaluation as well as computed tomography scanning, etc. During articulating / • / sound, a ball-like space at frontal portion of the oral cavity was found, tongue was contracted, and sound was deep (舌縮而聲深). During /ㅡ/ sound, a flat air space between oral tongue and hard palate was created. Tongue was slightly contacted neither deep nor shallow (舌小縮而聲不深不淺). During /ㅣ/ sound, tongue was not contacted and Sound is light (舌不縮而聲淺). Tongue was moved forward making longitudinal oro-pharyngeal air space. So, I'd like to suggest that we had better change the explanation drawing from a philosophical modeling to a more scientific modeling from real vocal tract space modeling during articulating middle vowels of Hunminjeongeum.
https://doi.org/10.22469/jkslp.2022.33.2.77 인용 PDF KSCI

On a Pitch Alteration Method Compensated with the Spectrum for High Quality Speech Synthesis (스펙트럼 보상된 고음질 합성용 피치 변경법)

문효정
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.123-126
- /
- 1995
The waveform coding are concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the wave form coding with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alteration method that can change the pitch period in waveform coding by using scaling the time-axis and compensating the spectrum. This is a time-frequency domain method that is preserved in the phase components of the waveform and that has a little spectrum distortion with 2.5% and less for 50% pitch change.
PDF

The Study on the Acoustical Characteristics and Speech Intelligibility of Vowels Produced by the Maxillectomized Patients before and after Obturator-Wearing (Palatal Cancer환자의 Obturator 장착전후 모음의 음향학적 특성과 말 명료도에 관한 연구)

최성희;정문규;김호중;표화영;심현섭;최홍식
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
- /
- v.10 no.2
- /
- pp.140-148
- /
- 1999
The use of obturator is the prosthetic rehabilitation approach for restoration of the defected maxillary shape and function for the patients with palatal defect. The obturator can change the shape of vocal tract and nasality, but few reports on the effects of the change were presented. So, the authors performed the experimental study to compare the difference between the sizes of vowel triangles produced by maxillectomized patients before and after obturator-wearing and to consider how much improvement in speech intelligibility can be expected by obturator wearing. The 8 patients who were totally maxillectomized due to palatal cancer were participated as subjects. They produced 5 vowels(/a/, /i/, /u/, /e/, /o/) before and after obturator-wearing. The formants of the vowels were analyzed by the spectrogram of CSL, and their speech intelligibility were judged by normal 8 listeners. As results, the frequency of the first and the second formant showed no significant difference between the articulation before and after wearing, but the comparison of the sizes of vowel triangles, related with the speech intelligibility, showed significant difference. The vowel triangle of the articulation after wearing was larger than that of the articulation before wearing. /i/ showed the lowest speech intelligibility score among the vowel articulation before wearing. After wearing obturators, their scores increased on the whole, especially, in /a/, but the intelligibility of /u/ decreased after wearing.
PDF

On a Pitch Alteration Technique in the V/UV Spectrum for High Quality Speech Synthesis Technique (고음질 합성방식용 V/UV 스펙트럼상의 피치변경법에 관한 연구)

Jo, Wang-Rae;Bae, Myung-Jin;Kim, Dong-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.6
- /
- pp.99-103
- /
- 1996
Most waveform coding techniques attempt to reduce redundancy of speech signal while preserving the shape of the waveform. In speech synthesis, wavefrom coding methods are used to the synthesis by rule for high quality speech. However, it is difficult to apply the waveform coding to the synthesis by rule because the parameters of the wavefrom coding cannot be classified as either the excitation or the vocal tract parameters. The proposed method shows little spectrum distortion of 2.7% or less for 50% pitch changes. It also achieves smooth connection of wavefrom magnitudes among the frames by compensating the phase in time domain.
PDF

On a Pitch Change of the Waveform Coding by the Cepstrum Analysis of Speech Waveforms (켑스트럼 분석에 의한 파형부호화의 피치변경에 관한 연구)

Bae, Myung-Jin;Lee, Mi-Suk
- The Journal of the Acoustical Society of Korea
- /
- v.11 no.4
- /
- pp.14-21
- /
- 1992
The waveform coding is concerned with simply preserving the wave shape of speech signal through a redundancy reduction process. In area of the speech synthesis, the waveform codings with high quality are mainly used to the synthesis by analysis. However, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In this paper, we proposed a new pitch alternation method that can change the pitch periods in the waveform coding by using the cepstrum analysis. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.
PDF

A Comparison of Resonance Parameters before and after Pharyngeal Flap Surgery:A Preliminary Report (인두피판술 전.후의 공명파라미터의 비교: 예비연구)

Kang, Young-Ae;Kang, Nak-Heon;Lee, Tae-Yong;Seong, Cheol-Jae
- Phonetics and Speech Sciences
- /
- v.1 no.3
- /
- pp.133-144
- /
- 2009
Pharyngeal flap surgery changes the space and shape of the oral cavity and vocal tract, and these changing conditions bring resonance change. The purpose of this study was to determine the most reliable and valuable parameters for evaluating hypernasality to distinguish two patients before and after pharyngeal flap surgery. Each patient was asked to clearly speak the vowels /a/, /i/, /u/, /e/, /o/ for voice recording. There were nine parameters: Formant (F1, F2, F3), Bandwidth (BW1, BW2, BW3), LPC energy slope ($\Delta$ |A2-A1/F2-F1|), and Band Energy (0-500 Hz, 500-1000 Hz) by each vowel. From the results of discrimination analyses on acoustic parameters, the vowels /a/, /e/ appeared to be insignificant but vowels /i/, /u/, /o/ appeared to be efficient in the separation. A 95%, 100%, and 100% recognition score could be reached when vowels /i/, /u/, and /o/ were analyzed. The results showed that F2, BW3, and LPC slope are more important parameters than the others. Finally, there is a relation between perceptual evaluation score and LPC energy slope of acoustic parameters by least square slope.
PDF

An Acoustical Study of English Diphthongs Produced by American Males and Females (미국인 남성과 여성이 발음한 영어이중모음의 음향적 연구)

Yang, Byung-Gon
- Phonetics and Speech Sciences
- /
- v.2 no.2
- /
- pp.43-50
- /
- 2010
English vowels can be divided into monophthongs and diphthongs depending on the number of vocal tract shapes. Diphthongs are usually produced with more than one shape. This study attempts to collect acoustical data of English diphthongs published by Hillenbrand et al.(1995) online and to examine acoustic features of the diphthongs for phoneticians and English teachers. Sixty three American males and females were chosen after excluding those subjects with different target vowels or ambiguous formant tracks. The author used Praat to obtain the acoustical data systematically at eleven equidistant timepoints over the diphthongal segment. Obvious errors were corrected based on the spectrographic display of each diphthong. Results show that the formant trajectories of the diphthongs produced by the American males and females appeared quite similar. When the female formant values were uniformly normalized to those of the males, almost a perfect collapse occurred. Secondly, the diphthongal movements on the vowel space appeared not linear due to the coarticulatory gesture for the following consonant. Thirdly, the average duration of the diphthongs produced by the females was 1.156 times longer than that of the males while the pitch ratio between the two groups turned out to be 1.746 with a similar contour over measurement points. The author concludes that English diphthongs produced by various groups can be compared systematically when the acoustical values are obtained at proportional timepoints. Further studies will be desirable on the comparison of English diphthongs produced by native and nonnative speakers.
PDF

On a Pitch Alteration Method by Time-axis Scaling Compensated with the Spectrum for High Quality Speech Synthesis (고음질 합성용 스펙트럼 보상된 시간축조절 피치 변경법)

Bae, Myung-Jin;Lee, Won-Cheol;Im, Sung-Bin
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.4
- /
- pp.89-95
- /
- 1995
The waveform coding technique has concerned with simply preserving the waveform shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the waveform coding with high sound quality is mainly used to the synthesis by analysis. However, since the parameters of this coding are not classified into either excitation or vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In order to apply the waveform coding to the synthesis by rule, the pitch alteration technique is required in prosody control. In this paper, we propose a new pitch alteration method that can change the pitch period in waveform coding by scaling the time-axis and compensating the spectrum. This is relevant to the time-frequency domain method were the phase components of the waveform is preserved with a little spectrum distortion of 2.5 % and less for 50% pitch change.
PDF

Search Result 21, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)