• Title/Summary/Keyword: Speech function

Search Result 696, Processing Time 0.02 seconds

Development of Speech-Language Therapy Program kMIT for Aphasic Patients Following Brain Injury and Its Clinical Effects (뇌 손상 후 실어증 환자의 언어치료 프로그램 kMIT의 개발 및 임상적 효과)

  • Kim, Hyun-Gi;Kim, Yun-Hee;Ko, Myoung-Hwan;Park, Jong-Ho;Kim, Sun-Sook
    • Speech Sciences
    • /
    • v.9 no.4
    • /
    • pp.237-252
    • /
    • 2002
  • MIT has been applied for nonfluent aphasic patients on the basis of lateralization of brain hemisphere. However, its applications for different languages have some inquiry for aphasic patients because of prosodic and rhythmic differences. The purpose of this study is to develop the Korean Melodic Intonation Therapy program using personal computer and its clinical effects for nonfluent aphasic patients. The algorithm was composed to voice analog signal, PCM, AMDF, Short-time autocorrelation function and center clipping. The main menu contains pitch, waveform, sound intensity and speech files on window. Aphasic patients' intonation patterns overlay on selected kMIT patterns. Three aphasic patients with or without kMIT training participated in this study. Four affirmative sentences and two interrogative sentences were uttered on CSL by stimulus of ST. VOT, VD, Hold and TD were measured on Spectrogram. In addition, articulation disorders and intonation patterns were evaluated objectively on spectrogram. The results indicated that nonfluent aphasic patients with kMIT training group showed some clinical effects of speech intelligibility based on VOT, TD values, articulation evaluation and prosodic pattern changes.

  • PDF

A Study on Pitch Period Detection of Speech Signal Using Modified AMDF (변형된 AMDF를 이용한 음성 신호의 피치 주기 검출에 관한 연구)

  • Seo, Hyun-Soo;Bae, Sang-Bum;Kim, Nam-Ho
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • v.9 no.1
    • /
    • pp.515-519
    • /
    • 2005
  • Pitch period that is a important factor in speech signal processing is used in various applications such as speech recognition, speaker identification, speech analysis and synthesis. So many pitch detection algoritms have been studied until now. AMDF which is one of pitch period detection algorithms chooses the time interval from valley point to valley point as pitch period. In selection of valley point to detect pitch period, complexity of the algoritm is increased. So in this paper we proposed the simple algorithm using modified AMDF that detects global minimum valley point as pitch period of speech signal and compared existing methods with it through simulation.

  • PDF

Perceptual Speech Assessment after Maxillary Advancement Osteotomy in Patients with a Repaired Cleft Lip and Palate

  • Kim, Seok-Kwun;Kim, Ju-Chan;Moon, Ju-Bong;Lee, Keun-Cheol
    • Archives of Plastic Surgery
    • /
    • v.39 no.3
    • /
    • pp.198-202
    • /
    • 2012
  • Background : Maxillary hypoplasia refers to a deficiency in the growth of the maxilla commonly seen in patients with a repaired cleft palate. Those who develop maxillary hypoplasia can be offered a repositioning of the maxilla to a functional and esthetic position. Velopharyngeal dysfunction is one of the important problems affecting speech after maxillary advancement surgery. The aim of this study was to investigate the impact of maxillary advancement on repaired cleft palate patients without preoperative deterioration in speech compared with non-cleft palate patients. Methods : Eighteen patients underwent Le Fort I osteotomy between 2005 and 2011. One patient was excluded due to preoperative deterioration in speech. Eight repaired cleft palate patients belonged to group A, and 9 non-cleft palate patients belonged to group B. Speech assessments were performed preoperatively and postoperatively by using a speech screening protocol that consisted of a list of single words designed by Ok-Ran Jung. Wilcoxon signed rank test was used to determine if there were significant differences between the preoperative and postoperative outcomes in each group A and B. And Mann-Whitney U test was used to determine if there were significant differences in the change of score between groups A and B. Results : No patients had any noticeable change in speech production on perceptual assessment after maxillary advancement in our study. Furthermore, there were no significant differences between groups A and B. Conclusions : Repaired cleft palate patients without preoperative velopharyngeal dysfunction would not have greater risk of deterioration of velopharyngeal function after maxillary advancement compared to non-cleft palate patients.

A Study of Nasalance Change in Submucosal Type Cleft Palate Patients by Surgery (점막하 구개열 환자의 수술 전후 비음도 변화에 대한 연구)

  • Choi, Ju-Seok;Leem, Dae-Ho;Baek, Jin-A;Kim, Oh-Hwan;Kim, Hyun-Ki;Shin, Hyo-Keun
    • Korean Journal of Cleft Lip And Palate
    • /
    • v.8 no.2
    • /
    • pp.53-62
    • /
    • 2005
  • Submucosal type cleft palate is a kind of cleft palate. A submucosal cleft may result in shortening of the anteroposterior dimension of the hard or soft palates or both. The increased distance along with the lack of muscle connection in the soft palate usually accounts for the lack of palatopharyngeal function in patients with submucosal cleft. Resonance disorders which is found in cleft patients show hypernasality or hyponasality. Many cases of submucosal type cleft palate patients visit our clinics due to hypernasality. In this study, resonance disorders was evaluated through nasalance test. Experimental group was composed of submucosal type cleft palate patients. The patients were treated by a so-called combined therapy, i.e., operation and speech training. To observe the changing pattern by surgery, nasalance test was carried out one time before surgery and three times after surgery. Nasometer II was used as a examination. The questionaire was filled with single vowels & diphthongs. The mean nasalance score of the child was significantly lower than that of the adult at every vowel. An early age at operation (under 10 years) was that a better functional result was achieved with patients. The mean nasalance score of /i/ was highest and that of /a/ was the lowest. The result of corrective surgery in selected cases has achieved improvement in all cases. Hypernasality has been consistently diminished. he operation.

  • PDF

Combining multi-task autoencoder with Wasserstein generative adversarial networks for improving speech recognition performance (음성인식 성능 개선을 위한 다중작업 오토인코더와 와설스타인식 생성적 적대 신경망의 결합)

  • Kao, Chao Yuan;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea
    • /
    • v.38 no.6
    • /
    • pp.670-677
    • /
    • 2019
  • As the presence of background noise in acoustic signal degrades the performance of speech or acoustic event recognition, it is still challenging to extract noise-robust acoustic features from noisy signal. In this paper, we propose a combined structure of Wasserstein Generative Adversarial Network (WGAN) and MultiTask AutoEncoder (MTAE) as deep learning architecture that integrates the strength of MTAE and WGAN respectively such that it estimates not only noise but also speech features from noisy acoustic source. The proposed MTAE-WGAN structure is used to estimate speech signal and the residual noise by employing a gradient penalty and a weight initialization method for Leaky Rectified Linear Unit (LReLU) and Parametric ReLU (PReLU). The proposed MTAE-WGAN structure with the adopted gradient penalty loss function enhances the speech features and subsequently achieve substantial Phoneme Error Rate (PER) improvements over the stand-alone Deep Denoising Autoencoder (DDAE), MTAE, Redundant Convolutional Encoder-Decoder (R-CED) and Recurrent MTAE (RMTAE) models for robust speech recognition.

Speech Verification using Similar Word Information in Isolated Word Recognition (고립단어 인식에 유사단어 정보를 이용한 단어의 검증)

  • 백창흠;이기정홍재근
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1255-1258
    • /
    • 1998
  • Hidden Markov Model (HMM) is the most widely used method in speech recognition. In general, HMM parameters are trained to have maximum likelihood (ML) for training data. This method doesn't take account of discrimination to other words. To complement this problem, this paper proposes a word verification method by re-recognition of the recognized word and its similar word using the discriminative function between two words. The similar word is selected by calculating the probability of other words to each HMM. The recognizer haveing discrimination to each word is realized using the weighting to each state and the weighting is calculated by genetic algorithm.

  • PDF

Prosodic characteristics of French language in conversational discourse (프랑스어의 대화 담화에 나타난 운율 연구)

  • Ko, Young-Lim;Yoon, Ae-Sun
    • Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.165-180
    • /
    • 2001
  • In this paper prosodic characteristics of French language are analysed with a corpus of radio interview. Intonation patterns are interpreted in terms of raising pattern, focal raising pattern and falling pattern. Accentual prominence is classified in two types, rhythmic accent and focal accent. Focal accent permit to explain the cohesion in a utterance or between two utterances. As a prosodic variable of discourse pauses are described by their form of realization (filled pause, silent pause, hesitation etc), their distribution and their function in utterance.

  • PDF

Real Time Implementation of a Korean Speech Synthesizer (한국어 음성합성기의 실시간 구현에 관한 연구)

  • 임광일;이규태;조철우;이우선;신인철;이태원
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.25 no.2
    • /
    • pp.176-181
    • /
    • 1988
  • In this paper, the LPC speech synthesizer with Multipulsse excitation is implemented using general-purpose DSP \ulcornerD7720. As the driving function for synthesis filter is used in the amplitude and position of pulse, the Voice/Unvoice decision and pitch period detectioncan be excluded. The synthesizer is implemented with DSP device which is operated on the interrupt mehtod with main computer and on the DMA mehtod with D/A converter. The comparision of synthetic and original waveform, alogn with the listening test, proves the validity of this system.

  • PDF

AN ALGORITHM TO REDUCE THE PITCH SEARCHING TIME USING MODIFIED DELTA SEARCH IN CELP VOCODER (개선된 델타검색기법을 이용한 피치검색시간의 단축)

  • 이주헌
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.214-217
    • /
    • 1994
  • The major drawback in the Code Excited Linear Prediction type vocoders is their large computational requirements. In this paper, a simple method is proposed to reduce the pitch searching time in the pitch filter almost without degradation of quality. On the basis of the observational regularity of the correlation function of speech, only the limited numbers of pitch lags are considered to be an optimum pitch. This is done by skipping the negative envelope side of the correlation function and limiting the maximum number of lags to be considered preliminarily. By doing so, we can reduce the computational time of pitch searching more than 51% with negligible quality degradation. In addition to that, by combining that method with the conventional delta search technique, we can reduce the computational time requirements more than 60% without serious lowering the speech quality in segmental SNR measure compared to the conventional full search method.

  • PDF

The Vowel Length as a Function of the Articulatory Force of the Following Consonants in Korean

  • Kim, Dae-Won
    • Speech Sciences
    • /
    • v.9 no.3
    • /
    • pp.143-153
    • /
    • 2002
  • This study was designed to determine (1) the effects of the following stop consonant on the vowel length in isolated bi-syllabic words, (2) the mechanism which renders vowels longer in duration before lax stops than tense stops, (3) where the aspiratory interval is included, in the vowel portion or the preceding consonantal portion and (4) the influence of the preceding consonants upon the duration of the following vowel. Measurements were made of five timing variables on acoustic signals as three native Korean speakers uttered isolated bi-syllabic /VCV/ words in which the vowel was identical, /$\alpha$/, and the C slot was filled with bilabial stops. Findings: (1) the vowel length before the lax stops was significantly longer than before the tense stops, while the difference in the vowel duration between the tense stops was insignificant or negligible, (2) the vowel length varied as a function of the articulatory force of the following consonants, regardless of the phonological unit of syllable, (3) The aspiratory interval is interpreted as a portion of the preceding consonant and (4) The effects of the preceding consonants on the final vowel length were not rule-governed.

  • PDF