Search | Korea Science

Aerodynamic Characteristics of Whispered and Normal Speech during Reading Paragraph Tasks (문단낭독 시 속삭임 발화와 정상 발화의 공기역학적 특성)

Pyo, Hwayoung
- Phonetics and Speech Sciences / v.6 no.3 / pp.57-62 / 2014
This study investigated the aerodynamic characteristics of whispered and normal speech during paragraph-reading tasks. Thirty-nine normal females (18-23 yrs.) read the 'Autumn' paragraph with whispered and normal phonation. Their readings were recorded and analyzed with 'Running Speech' in the Phonatory Aerodynamic System (PAS) instrument. During whispered speech, the total duration was longer and inspirations were more frequent than in normal speech. The peak expiratory and inspiratory rates were higher in normal speech, but the expiratory and inspiratory volumes were higher in whispered speech. In correlation analysis, both whispered and normal speech showed significantly high correlations between total duration and expiratory/inspiratory airflow duration; between the number of inspirations and inspiratory airflow duration; and between expiratory and inspiratory volume. These results show that whispered speech needs more respiratory effort but shows poorer aerodynamic efficacy during phonation than normal speech.
https://doi.org/10.13064/KSSS.2014.6.3.057

Usefulness of Speech Therapy for Patients with Submucous Cleft Palate Treated with Furlow Palatoplasty (점막하 구개열 치료에 있어 Furlow 구개성형술 전후 언어 치료의 유용성)

Baek, Rongmin;Park, Mikyong;Heo, Chanyeong
- Archives of Plastic Surgery / v.32 no.3 / pp.375-380 / 2005
Furlow palatoplasty has been favored by many plastic surgeons as the primary treatment for the velopharyngeal insufficiency associated with submucous cleft palate. The purpose of this article is to present the efficacy of Furlow palatoplasty and speech therapy in patients who were diagnosed late with submucous cleft palate. From 2002 to 2004, four submucous cleft palate patients over 5 years of age with velopharyngeal insufficiency received Furlow palatoplasty. The patients were evaluated through preoperative perceptual speech assessment, nasometry, and videonasopharyngoscopy. Postoperatively, two patients achieved competent velopharyngeal function in running speech. Of the remaining two, one achieved competent velopharyngeal function with visual biofeedback speech therapy, and the other could not use her new velopharyngeal function in running speech because of her age. Speech therapy can correct articulation errors and thus improve velopharyngeal function to a certain extent by eliminating some compensatory articulations that might adversely affect velopharyngeal function. This study shows that Furlow palatoplasty can successfully correct velopharyngeal insufficiency in submucous cleft palate patients and that speech therapy has a role in reinforcing the surgical result. However, age remains a restrictive factor even when the surgery is performed well.

Speech Enhancement Using Lip Information and SFM (입술정보 및 SFM을 이용한 음성의 음질향상알고리듬)

Baek, Seong-Joon;Kim, Jin-Young
- Speech Sciences / v.10 no.2 / pp.77-84 / 2003
In this research, we find the onset of speech and detect the stationary speech region using lip information. By taking a running average of the estimated speech signal in the stationary region, we reduce the effect of musical noise, which is inherent in the conventional MMSE (Minimum Mean Square Error) speech enhancement algorithm. In addition, the SFM (Spectral Flatness Measure) is incorporated to reduce the speech signal estimation error caused by speaking habits and missing lip information. According to an MOS (Mean Opinion Score) test, the proposed algorithm with Wiener filtering outperforms the conventional methods.
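As a point of reference for the SFM mentioned in this abstract, the spectral flatness measure is commonly defined as the ratio of the geometric mean to the arithmetic mean of a power spectrum. The sketch below shows only that textbook definition; the function name `spectral_flatness`, the `eps` floor, and the toy spectra are assumptions for illustration and are not taken from the paper, which does not describe how SFM enters its estimator.

```python
import math


def spectral_flatness(power_spectrum, eps=1e-12):
    """Return geometric mean / arithmetic mean of a power spectrum.

    Values near 1 indicate a flat, noise-like spectrum; values near 0
    indicate a peaky, tonal spectrum.
    """
    if not power_spectrum:
        raise ValueError("power_spectrum must be non-empty")
    # Floor each bin so log() is defined for zero-power bins (assumed floor).
    floored = [max(p, eps) for p in power_spectrum]
    log_mean = sum(math.log(p) for p in floored) / len(floored)
    geometric_mean = math.exp(log_mean)
    arithmetic_mean = sum(floored) / len(floored)
    return geometric_mean / arithmetic_mean


# Toy spectra, not measured data.
flat = spectral_flatness([1.0] * 8)             # equal power in every bin
peaky = spectral_flatness([8.0] + [1e-6] * 7)   # almost all power in one bin
```

A flat spectrum scores close to 1 and a single-peak spectrum scores close to 0, which is why the measure is often used to separate noise-like frames from voiced ones.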

A Real-Time Implementation of Speech Recognition System Using Oak DSP core in the Car Noise Environment (자동차 환경에서 Oak DSP 코어 기반 음성 인식 시스템 실시간 구현)

Woo, K.H.;Yang, T.Y.;Lee, C.;Youn, D.H.;Cha, I.H.
- Speech Sciences / v.6 / pp.219-233 / 1999
This paper presents a real-time implementation of a speaker-independent speech recognition system based on a discrete hidden Markov model (DHMM). The system is developed for a car navigation system, with the aim of designing an on-chip VLSI speech recognition system using the fixed-point Oak DSP core of DSP GROUP LTD. We analyze the recognition procedure in C to implement fixed-point real-time algorithms. Based on these analyses, we improve the algorithms so that they can operate in real time and, by processing all recognition routines within a frame, can deliver the recognition result as soon as the speech ends. Car noise is colored noise concentrated heavily in the low-frequency band below 400 Hz. For noise-robust processing, high-pass filtering and liftering on the distance measure of feature vectors are applied to the recognition system. Recognition experiments were performed on twelve isolated command words. The recognition rates of the baseline recognizer were 98.68% when the car was stopped and 80.7% when it was running. With the noise processing methods, the recognition rate in the running condition improved to 89.04%.
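To make the DHMM scoring step concrete, the following is a minimal sketch of log-domain Viterbi scoring for a discrete HMM, the kind of per-word score an isolated-word recognizer could compare across word models. It is a floating-point illustration only, not the paper's fixed-point Oak DSP implementation; the names `safe_log` and `viterbi_log_score` and the two-state toy model are assumptions.

```python
import math


def safe_log(p):
    """Natural log that maps zero probability to -inf instead of raising."""
    return math.log(p) if p > 0.0 else float("-inf")


def viterbi_log_score(observations, log_start, log_trans, log_emit):
    """Best-path log-probability of a symbol sequence under a discrete HMM."""
    if not observations:
        raise ValueError("observations must be non-empty")
    num_states = len(log_start)
    # Initialise with the start probability times the first emission.
    scores = [log_start[s] + log_emit[s][observations[0]]
              for s in range(num_states)]
    for symbol in observations[1:]:
        # For each state, keep only the best predecessor path.
        scores = [
            max(scores[p] + log_trans[p][s] for p in range(num_states))
            + log_emit[s][symbol]
            for s in range(num_states)
        ]
    return max(scores)


# Hypothetical two-state left-to-right model over a two-symbol codebook.
start = [safe_log(p) for p in (1.0, 0.0)]
trans = [[safe_log(p) for p in row] for row in ((0.5, 0.5), (0.0, 1.0))]
emit = [[safe_log(p) for p in row] for row in ((0.9, 0.1), (0.2, 0.8))]

score = viterbi_log_score([0, 1], start, trans, emit)
```

For the symbol sequence `[0, 1]` the best path stays in state 0 for the first symbol and moves to state 1 for the second, giving a path probability of 0.9 × 0.5 × 0.8 = 0.36.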

A New Least Mean Square Algorithm Using a Running Average Process for Speech Enhancement

Lee, Soo-Jeong;Ahn, Chan-Sik;Yun, Jong-Mu;Kim, Soon-Hyob
- The Journal of the Acoustical Society of Korea / v.25 no.3E / pp.123-130 / 2006
The adaptive echo canceller (AEC) has become an important component in speech communication systems, including mobile stations. In these applications, the acoustic echo path has a long impulse response. We propose a running-average least mean square (RALMS) algorithm with a detection method for acoustic echo cancellation. Using colored input models, the results clearly show that the RALMS detection algorithm has convergence performance superior to that of the least mean square (LMS) detection algorithm alone. The computational complexity of the new RALMS algorithm is only slightly greater than that of the standard LMS detection algorithm, but it confers a major improvement in stability.
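For orientation, the sketch below shows the plain LMS weight update that the abstract uses as its baseline, applied to a toy echo path. It is not the RALMS algorithm, whose running-average step and detection method are not specified in the abstract. The function name `lms_filter`, the tap count, the step size, and the two-tap echo path are all assumptions for illustration.

```python
import random


def lms_filter(reference, desired, num_taps=4, step_size=0.05):
    """Adapt FIR weights so the filtered `reference` tracks `desired`.

    Returns (weights, errors), where errors[n] is the residual at sample n.
    """
    weights = [0.0] * num_taps
    history = [0.0] * num_taps  # recent reference samples, newest first
    errors = []
    for x, d in zip(reference, desired):
        history = [x] + history[:-1]
        estimate = sum(w * h for w, h in zip(weights, history))
        error = d - estimate
        # Standard LMS update: w <- w + mu * e * x
        weights = [w + step_size * error * h
                   for w, h in zip(weights, history)]
        errors.append(error)
    return weights, errors


# Toy setup: white far-end signal and a hypothetical two-tap echo path.
rng = random.Random(0)
far_end = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
echo_path = [0.5, -0.3]
echo = [
    echo_path[0] * far_end[n] + (echo_path[1] * far_end[n - 1] if n > 0 else 0.0)
    for n in range(len(far_end))
]

weights, errors = lms_filter(far_end, echo)
```

With a noise-free echo and a white input, the first two weights converge toward the echo-path taps and the residual error shrinks toward zero. Colored inputs, as used in the paper, slow this convergence, which is the situation the proposed algorithm targets.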

Comparison of Speech Intelligibility & Performance of Speech Recognition in Real Driving Environments (자동차 주행 환경에서의 음성 전달 명료도와 음성 인식 성능 비교)

Lee Kwang-Hyun;Choi Dae-Lim;Kim Young-Il;Kim Bong-Wan;Lee Yong-Ju
- MALSORI / no.50 / pp.99-110 / 2004
The normal transmission characteristics of sound are hard to obtain in a running car because of various noises and structural factors. The resulting channel distortion of the source sound recorded by microphones seriously degrades speech recognition performance in real driving environments. In this paper, we analyze the degree of intelligibility under the sound distortion of each channel at different driving speeds in terms of the speech transmission index (STI), and compare the STI with speech recognition rates. We examine the correlation between intelligibility measures, which depend on sound pick-up patterns, and speech recognition performance, and thereby consider the optimal location of a microphone in a single-channel environment. In our experiments, we find a high correlation between STI and speech recognition rates.
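The comparison described here amounts to correlating two paired series, one STI value and one recognition rate per condition. A minimal sketch using the Pearson coefficient follows; the abstract does not say which correlation statistic was used, so Pearson is an assumption, and the numbers below are placeholder values chosen to be perfectly linear, not measurements from the paper.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0.0 or var_y == 0.0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(var_x * var_y)


# Placeholder values only: one (STI, recognition rate) pair per condition.
sti_values = [0.45, 0.55, 0.65, 0.75]
recognition_rates = [60.0, 70.0, 80.0, 90.0]

r = pearson_r(sti_values, recognition_rates)
```

A value of `r` near 1 would correspond to the "high correlation" the abstract reports; the placeholder series above is exactly linear, so it yields the limiting case.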

A Comparison Study of Breath Groups during Reading Paragraph Tasks in Normal Adults and Adult Patients with Voice Disorders: A Preliminary Study (정상 성인 화자와 음성장애 성인 화자의 문단낭독 시 호흡단락에 대한 비교 연구: 예비연구)

Pyo, Hwayoung;Kim, Soyeon;Baek, Seungkuk
- Phonetics and Speech Sciences / v.6 no.4 / pp.181-187 / 2014
This study investigated the characteristics of breath groups during paragraph reading in normal adults and adult patients with voice disorders. Ten normal females (avg. 20.6 yrs.), 10 young females with voice disorders (avg. 33.5 yrs., P1 group), and 10 older females with voice disorders (avg. 56.3 yrs., P2 group) read a paragraph of 210 syllables. Using the 'Running Speech' program of the Phonatory Aerodynamic System (PAS), total duration, number of breath groups, duration per breath group, and number of syllables per breath group were measured, and their correlations with the aerodynamic measurements of the reading were analyzed. For total duration and number of breath groups, normal speakers scored highest and P2 group speakers lowest. Normal speakers showed the longest duration per breath group, but the difference was not significant. P2 group speakers showed the highest number of syllables per breath group. Correlation analysis showed significantly high correlations between total duration and expiratory airflow, and between the number of breath groups and inspiratory volume.
https://doi.org/10.13064/KSSS.2014.6.4.181

A Study on Speech Recognition in a Running Automobile (주행중인 자동차 환경에서의 음성인식 연구)

양진우;김순협
- The Journal of the Acoustical Society of Korea / v.19 no.5 / pp.3-8 / 2000
In this paper, we study the design and implementation of a robust speech recognition system for a noisy car environment. The reference pattern used in the system is DMS (Dynamic Multi-Section). We propose two separate acoustic models, one for speech in a car moving below 80 km/h and one for speech in a car moving above 80 km/h, which are selected automatically depending on the noise environment. PLP (Perceptual Linear Predictive) coefficients of order 13 are used as the feature vector, and OSDP (One-Stage Dynamic Programming) is used for decoding. The system also provides a function for editing the phone book for voice dialing. The system yields a recognition rate of 89.75% for male speakers in SI (speaker-independent) mode in a car running on a cement expressway at over 80 km/h with a vocabulary of 33 words, and 92.29% for male speakers in SI mode in a car running on a paved expressway at over 80 km/h.
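The decoding step rests on dynamic-programming alignment between an input and stored reference patterns. As a simpler relative of that idea, the sketch below implements classical dynamic time warping (DTW) between two one-dimensional sequences. It is neither the DMS reference model nor the OSDP decoder from the paper; the function name `dtw_distance`, the absolute-difference local cost, and the scalar "features" are assumptions for illustration.

```python
def dtw_distance(a, b):
    """Classical DTW distance between two sequences of numbers.

    Uses |a[i] - b[j]| as the local cost and allows match, insertion,
    and deletion steps.
    """
    inf = float("inf")
    n, m = len(a), len(b)
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats
                cost[i][j - 1],      # b advances, a repeats
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


# A time-stretched copy aligns at zero cost; a shifted level does not.
stretched = dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0])
offset = dtw_distance([0.0, 0.0], [1.0, 1.0])
```

In a recognizer of this family, the scalars would be replaced by feature vectors (such as the order-13 PLP vectors mentioned above) with a vector distance as the local cost, and the word whose reference gives the lowest accumulated cost would be chosen.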

Fast short length running FIR structure in discrete wavelet adaptive algorithm

Lee, Chae-Wook
- Journal of the Institute of Convergence Signal Processing / v.13 no.1 / pp.19-25 / 2012
An adaptive system is a well-known method for removing noise from noise-corrupted speech. In this paper, we apply a least mean square (LMS) algorithm based on a wavelet adaptive algorithm. It achieves a faster convergence rate than the time-domain algorithm because of its eigenvalue distribution width. This paper also provides the basic tool required for an FIR algorithm that reduces arithmetic complexity. We consider a new fast short-length running FIR structure in the discrete wavelet adaptive algorithm. We compare the arithmetic complexity of the FIR algorithm and the short-length fast running FIR algorithm (SFIR) with that of the proposed fast short-length running FIR algorithm (FSFIR).

Improvement of Speech Recognition Performance in Running Car by Considering Wind Noise (바람잡음을 고려한 자동차에서의 음성인식 성능 향상)

Lee, Ki-Hoon;Lee, Chul-Hee;Kim, Chong-Kyo
- Proceedings of the KSPS conference / 2004.05a / pp.231-234 / 2004
This paper describes an efficient method for improving the noise robustness of speech recognition in a running car by considering wind noise. In a moving car, three main kinds of noise (engine noise, tire noise, and wind noise) severely affect recognition performance. Wind noise is an especially important factor when driving with a window open. We analyzed wind noise under various driving conditions: 60, 80, and 100 km/h with the window fully open or half open. We found that the recognition rate degrades significantly when the wind noise components in the frequency range above 200 Hz are large. We developed a preprocessing method to improve noise robustness in the presence of wind noise: we adaptively changed the cutoff frequency of the front-end high-pass filter from 100 to 200 Hz according to the level of the wind noise components. With this method, the recognition rate improved considerably for all driving conditions.
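The preprocessing idea, a high-pass cutoff that moves between 100 and 200 Hz with the wind noise level, can be sketched as follows. The abstract gives only the cutoff range, so everything else here is an assumption: the normalized level in [0, 1], the linear mapping in `cutoff_for_wind_level`, and the first-order RC filter in `high_pass` stand in for whatever level estimate and filter design the authors actually used.

```python
import math


def cutoff_for_wind_level(level, low_hz=100.0, high_hz=200.0):
    """Map a normalized wind-noise level in [0, 1] to a cutoff frequency.

    The linear mapping is a hypothetical choice; levels outside [0, 1]
    are clamped.
    """
    clamped = min(max(level, 0.0), 1.0)
    return low_hz + clamped * (high_hz - low_hz)


def high_pass(samples, cutoff_hz, sample_rate_hz):
    """First-order RC high-pass filter (discrete-time approximation)."""
    rc = 1.0 / (2.0 * math.pi * cutoff_hz)
    dt = 1.0 / sample_rate_hz
    alpha = rc / (rc + dt)
    output = []
    prev_in = 0.0
    prev_out = 0.0
    for x in samples:
        # y[n] = alpha * (y[n-1] + x[n] - x[n-1])
        y = alpha * (prev_out + x - prev_in)
        output.append(y)
        prev_in, prev_out = x, y
    return output


# A constant (0 Hz) input should be removed by the filter.
cutoff = cutoff_for_wind_level(0.5)
filtered = high_pass([1.0] * 8000, cutoff, sample_rate_hz=8000)
```

Raising the cutoff when wind noise is strong removes more low-frequency energy at the cost of some speech content, which is presumably why the cutoff is adapted rather than fixed at the upper limit.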

Search Result 36, Processing Time 0.023 seconds
