Search | Korea Science

Improvement of Bit Rate applying the Speaking Rate and PSOLA Technique of Speech in CELP Vocoder (음성신호의 발성율과 PSOLA기법을 적용한 음성 보코더 전송률 개선에 관한 연구)

장경아;서지호;배명진
- Proceedings of the IEEK Conference / 2003.11a / pp.45-48 / 2003
In general, speech coding methods are classified into three categories: waveform coding, source coding, and hybrid coding. Fast speech can be encoded with less information than slow speech. With respect to speaking rate, the low-frequency band matters more to the listener than the high-frequency band. Speech vocoding techniques are developing toward low bit rate, low complexity, and high sound quality. CELP-type vocoders provide very good sound quality at a low bit rate, but they do not take the speaking rate into account. When each frame is encoded according to its speaking rate, the bit rate can be reduced below that of a conventional vocoder. We propose a technique that estimates the speaking rate and applies the PSOLA technique to frames with a slow speaking rate. In simulation, the bit rate was reduced by about 300 bps.

A Study on Measuring the Speaking Rate of Speaking Signal by Using Line Spectrum Pair Coefficients

Jang, Kyung-A;Bae, Myung-Jin
- The Journal of the Acoustical Society of Korea / v.20 no.3E / pp.18-24 / 2001
Speaking rate represents how many phonemes a speech signal contains in a given time. It varies with the speaker and with the characteristics of each phoneme. Current speech recognition systems need preprocessing to remove the effect of speaking-rate variation before recognition, so if the speaking rate can be estimated in advance, recognition performance can be improved. Conventional speech vocoders, however, determine the transmission rate by analyzing fixed-length periods regardless of the rate of phoneme variation. If the speaking rate can be estimated in advance, it is also very useful information for speech coding: it can improve the sound quality of the vocoder and allow a variable transmission rate. In this paper, we propose a method for representing the speaking rate as a parameter in a speech vocoder. The speaking rate is estimated from the phoneme variation, which is in turn estimated using Line Spectrum Pairs. Comparing the proposed algorithm with manual measurement by visual inspection, the error between the two methods is 5.38% for fast utterances and 1.78% for slow utterances, and the accuracy between the two methods is 98% for slow utterances and 94% for fast utterances at 30 dB SNR and 10 dB SNR, respectively.

Increase in Speaking Rate by $3{\sim}8$-year-old Korean Children (한국어 발화 속도의 연령별 증가에 관한 연구 －만 $3{\sim}8$ 세 아동을 대상으로－)

Kim, Tae-Kyung;Chang, Kyung-Hee;Lee, Phil-Young
- Speech Sciences / v.13 no.3 / pp.83-95 / 2006
This study attempts to suggest a criterion for Korean language development. For this purpose, we investigated the speaking rates of spontaneous utterances produced by 144 children aged 3 to 8. We analyzed each subject's speaking rate and its relation to the speaker's age, gender, and utterance length. To determine the relative contributions of these variables to the speaking rate, multiple regression was conducted. The results can be summarized as follows: (1) The mean and maximum values of the speaking rate increased with age. (2) A statistically significant increase in speaking rate appeared at two-year intervals. (3) There was no significant difference in speaking rate between the male and female groups. (4) The multiple regression analysis showed that, along with the speaker's age, utterance length (the mean number of syllables per utterance) is also important in estimating speaking rate.

On a Study of Measurement Method of Utterance Velocity for the Reduction of Transmission Rate in CELP Vocoder. (LSP 파라미터를 이용한 발성측정법)

장경아;배명진
- Proceedings of the IEEK Conference / 2000.11d / pp.199-202 / 2000
Speaking rate varies with the situation and the habits of the speaker. It has been studied extensively in speaker recognition. In speech recognition, speaking rate is an important consideration, and it is measured using large speech databases and complicated estimation procedures for the sake of accuracy. Conventional vocoders, by contrast, encode and transmit the speech signal without regard to speaking rate, so applying speaking rate in a vocoder calls for a simpler algorithm with less computation than the conventional speaking-rate methods used in speech recognition. We propose a speaking-rate algorithm that uses a simple parameter based on Line Spectrum Pairs (LSP); the speaking rate is measured from the LSP information of the speech. We measured the phoneme variation rate of utterances spoken at different speeds. The results show a distinct difference in phoneme variation rate between fast and slow utterances: the rate is 42.8% higher for fast utterances than for slow ones.

Asymmetric effects of speaking rate on the vowel/consonant ratio conditioned by coda voicing in English

Ko, Eon-Suk
- Phonetics and Speech Sciences / v.10 no.2 / pp.45-50 / 2018
The vowel/consonant ratio is a well-known cue for the voicing of postvocalic consonants. This study investigates how this ratio changes as a function of speaking rate. Seven speakers of North American English read sentences containing target monosyllabic words that contrasted in coda voicing at three different speaking rates. Duration measures were taken for the voice onset time (VOT) of the onset consonant, the vowel, and the coda. The results show that the durations of the onset VOT and vowel are longer before voiced codas, and that the durations of all segments increase monotonically as speaking rate decreases. Importantly, the vowel/consonant ratio, a primary acoustic cue for coda voicing, was found to pattern asymmetrically for voiced and voiceless codas; it increases for voiced codas but decreases for voiceless codas with the decrease in speaking rate. This finding suggests that there is no stable ratio in the duration of preconsonantal vowels that is maintained in different speaking styles.
https://doi.org/10.13064/KSSS.2018.10.2.045

An Improvement of Korean Speech Recognition Using a Compensation of the Speaking Rate by the Ratio of a Vowel length (모음길이 비율에 따른 발화속도 보상을 이용한 한국어 음성인식 성능향상)

박준배;김태준;최성용;이정현
- Proceedings of the IEEK Conference / 2003.11b / pp.195-198 / 2003
The accuracy of an automatic speech recognition system depends on background noise and on speaker variability such as sex, intonation, and speaking rate. In particular, inter-speaker and intra-speaker variation in speaking rate is a serious cause of misrecognition. In this paper, we propose a method that compensates for the speaking rate using the ratio of each vowel's length in a phrase. First, the number of feature vectors in a phrase is estimated from the speaking-rate information. Second, the estimated number of feature vectors is allocated to each syllable of the phrase according to the ratio of its vowel length. Finally, feature vectors are extracted according to the number assigned to each syllable in the phrase. As a result, the accuracy of automatic speech recognition improved with the proposed speaking-rate compensation method.

A study of speaking rate on Parkinson's disease with palilalia (동어반복증을 동반한 파킨슨병 환자의 말속도 연구)

Kim, Sun Woo
- Phonetics and Speech Sciences / v.8 no.3 / pp.61-66 / 2016
The purpose of this study is to examine the speaking rate (overall speaking rate and articulatory rate) of Parkinson's disease patients with palilalia (PDP). Palilalia is traditionally characterized not only by compulsive repetitions of words and phrases, but also by an increased rate of speech as judged by auditory perception. Since Souques (1908) first characterized palilalia as fast speech from the perspective of auditory perception, few studies have evaluated PDP speech using acoustic methods. To compare the speech rate of PDP and normal subjects, we included five PDP and eight control subjects (age over 55), using data acquired in a reading task (a standardized Korean paragraph). The difference in median overall speaking rate between the PDP group (median 5.25, IQR 1.30) and the normal group (median 4.76, IQR 0.71) was not statistically significant. The PDP group, however, had a significantly higher articulatory rate in syllables per second (median 6.60, IQR 1.04) than the normal subjects (median 5.60, IQR 0.52). The results indicated no differences between the two groups in pauses over 250 ms or in disfluency duration. To provide useful insight into PDP speech, multiple levels of analysis should be employed.
https://doi.org/10.13064/KSSS.2016.8.3.061

Study on the Improvement of Speech Recognizer by Using Time Scale Modification (시간축 변환을 이용한 음성 인식기의 성능 향상에 관한 연구)

이기승
- The Journal of the Acoustical Society of Korea / v.23 no.6 / pp.462-472 / 2004
In this paper, a method is proposed for compensating for the performance degradation of automatic speech recognition (ASR) that is mainly caused by speaking rate variation. Before the new method is proposed, a quantitative analysis of the performance of an HMM-based ASR system as a function of speaking rate is first performed. This analysis showed that significant performance degradation often occurred for rapidly spoken speech signals. A quantitative measure that is able to represent speaking rate is then introduced. Time scale modification (TSM) is employed to compensate for the speaking rate difference between input speech signals and training speech signals. Finally, a method for compensating for the performance degradation caused by speaking rate variation is proposed, in which TSM is selectively employed according to speaking rate. The results of ASR experiments on 10-digit mobile phone numbers confirm that the error rate was reduced by 15.5% when the proposed method was applied to speech signals with a high speaking rate.

Effects of Speaking Rate on Korean Vowels (발화속도에 따른 한국어 모음의 음향적 특성)

이숙향;고현주;한양구;김종진
- The Journal of the Acoustical Society of Korea / v.22 no.1 / pp.14-22 / 2003
In this study, we examined the acoustic characteristics of Korean vowels through a production test under three speaking-rate conditions (slow, normal, fast). The effects of a change in speaking rate on vowel duration were found to be very strong: the faster the speaking rate, the shorter the total duration of the vowels. However, the duration ratio of the two components of a diphthong did not change significantly with speaking rate. Unlike the temporal aspects, the formant values of vowels at their steady state and the formant change ratio of semivowels were not strongly affected by the change in speaking rate.

The Noise Effect on Stuttering and Overall Speech Rate: Multi-talker Babble Noise (다화자잡음이 말더듬의 비율과 말속도에 미치는 영향)

Park, Jin;Chung, In-Kie
- Phonetics and Speech Sciences / v.4 no.2 / pp.121-126 / 2012
This study deals with how stuttering changes in its frequency in a situation where adult participants who stutter are exposed to one type of background noise, that is, multi-talker babble noise. Eight American English-speaking adults who stutter participated in this study. Each of the subjects read aloud sentences under each of three speaking conditions (i.e., typical solo reading (TSR), typical choral reading (TCR), and multi-talker babble noise reading (BNR)). Speech fluency was computed based on a percentage of syllables stuttered (%SS) and speaking rate was also assessed to examine if there was significant change in rates as a measure of vocal change under each of the speaking conditions. The study found that participants read more fluently both during BNR and during TCR than during TSR. The study also found that participants did not show significant changes in speaking rate across the three speaking conditions. Some discussion was provided in relation to the effect of multi-talker babble noise on the frequency of stuttering and its further speculation.
https://doi.org/10.13064/KSSS.2012.4.2.121
