• Title/Summary/Keyword: Phoneme

Search Result 458, Processing Time 0.022 seconds

COMPARATIVE STUDY UPON THE CHARACTERISTICS OF WRITING BETWEEN THE PATIENTS WITH WRITING DISABILITIES AND NORMAL ELEMENTARY SCHOOL STUDENTS (쓰기 장애 환자와 정상 초등학교 학생의 쓰기 특성 비교)

  • Cho, Soo-Churl;Shin, Sung-Woong
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.12 no.1
    • /
    • pp.51-70
    • /
    • 2001
  • Characteristics of handwriting were investigated and compared between the patients with writing disabilities and normal elementary school pupils. Generally, the heights of the letters of the patients were significantly larger than those of normal children, and letters of the patients were more sparsely distributed than those of controls. The distance between the words were significantly reduced in the patients’ writings, which indicated that patients had much more problems of space-leaving than normal pupils. Letter heights differences were significant across all grades in the patients and normal controls. The heights of the letters decreased as they grew older, and the slope of the decrements were more steeper in normal girls(r=-0.45) than girls with writing disabilities(r=-0.16). Sex differences were found in the letter spacings in low grades(grades 1, 2), that is, the distances between the letters were significantly narrower in the male patients than normal boys in these grades, and the differences were almost indiscriminating in grades 3 through 5, and finally, in sixth grade, letter spacings were signifycantly broader in normal boys than male dysgraphics. In girls, letter spacings were significantly broader in the patients across all grades. These findings supports the hypothesis that male and female writings were qualitatively different and that distinct mechanisms served in boys and girls dysgraphics. Across all grades and sexes, spaces between the words of the patients were significantly broader than normal pupils, which suggested that space-leaving between the words was important in Korean writings. There was trend that letter spacings and word spacings decreased across grades, but in girls, no correlations between the letter spacings and grades were found. Correlation analyses revealed that letter heights and letter spacings had mild correlation(r=0.11-0.15), and that letter spacings and word spacings had robust correlation(r=0.99). Phonological errors were mostly found in last phoneme(Jong-seong), especially double-phoneme(ㄳ, ㄵ, ㄶ, ㄺ, ㄻ, ㄼ, ㄾ, ㄿ, ㅀ, ㅄ), and in the case the sound values changed due to assimilations of phonemes. Semantic errors were rare in both groups. Space-leaving errors were correlated with phonological errors, and more frequent in boys than girls. In conclusion, significant differences existed in the letter heights, letter spacings, word spacings, and frequencies of phonological errors and spaceleaving errors between the patients with writing disabilities and normal pupils. The characteristics of writings changed across grades and the developmental profiles were somewhat quantitatively different between the groups. The differences became obvious from the second-third grades.

  • PDF

A Study on the Differences of Cognitive Functions, Neurobehavioral Symptoms and Daily Living Functions According to the Lateralization of Lesion in Patients with Non-Traumatic Subcortical Cerebrovascular Disease (비외상성 피질하 뇌혈관질환 환자에서 병소의 편측성에 따른 인지기능, 정신행동증상 및 일상생활기능의 차이에 대한 연구)

  • Park, Young-Soo;Lee, Young-Ho;Choi, Young-Hee;Ko, Dae-Kwan;Chung, Young-Cho;Park, Byoung-Kwan;Kim, Soo-Ji;Chung, Suk-Haui;Ko, Byoung-Hee;Song, Il-Byoung;Park, Kun-Woo;Lee, Dae-Hie
    • Sleep Medicine and Psychophysiology
    • /
    • v.3 no.1
    • /
    • pp.56-67
    • /
    • 1996
  • Objectives : This study was designed to find clinical factors that could be differentiated by the lateralization of lesion and also find clinical factors to predict the lateralization of lesion. Methods : The subjects were 65 cooperative inpatients and outpatients with non-traumatic subcortical cerebrovascular disease without neurologic and psychiatric history from January 1995 to September 1995 ; 48 patients in Kyung Hee University, Oriental Medicine Hospital, 35 patients in Anam Hospital, Korea University were examined as subjects, but authors excluded 20 patients whose data were incomplete or who had uncertain lesions on brain CT or MRI. The 65 patients were divided into three groups-group with left hemispheric lesion, group with right hemispheric lesion, group with both hemispheric lesion-according to the finding of brain imaging study. Their cognitive functions were evaluated by the Benton Neuropsychological Assessment(BNA), their subjective neurobehavioral symptoms by Symptom Check List-90-R(SCL-90-R), their objective neurobehavioral symptoms by Neurobehavioral Rating Scale, and their daily living functions by Geriatric Evaluation by Relative's Rating Instrument(GERRl) and Instrumental Activities of Daily Living Scale(IADLs). Results : The results were as follows : 1) The results of cognitive function test indicated that the group with right hemispheric lesion showed low functions in Tactile Form Perception(left), the group with left hemispheric lesion showed low functions in Finger localization(right), the group with right hemispheric lesion showed low functions in Finger Localization(left). 2) Though, there were little significant differences in subjective neurobehavioral symptoms, the group with right hemispheric lesion showed higher scores in all symptoms except hostility. 3) Though, there were little significant differences in objective neurobehavioral symptoms, the group with both hemispheric lesion showed higher scores in cognition, guilty/disinhibition, the group with left hemispheric lesion showed higher scores in lability of mood, the group with right hemispheric lesion showed highest scores in psychotism, neurotism, agitation-hostility and decreased motivation/emotional withdrawal. 4) There were little significant differences among three groups in Daily Living Functions, but the group with right hemispheric lesion showed the lowest functions in Instrumental Activities of Daily Living. 5) As a result of discriminant analysis on each factor's contribution to the prediction of lesion, Finger Localization(left), Phoneme Discrimination and Tactile Form Perception(right) showed that they had the potentiality to predict lesion. Conclusion : The results suggest that there are little significant differences among the groups of three non-traumatic subcortical cerebrovascular disease in cognitive functions, but the group with right hemispheric lesion showed more serious and various changes in subjective and objective neurobehavioral symptoms, and showed low functions in Instrumental Activities of Daily Living. This results suggest the possibility that the decline of the daily living function in the group with right hemispheric lesion were due to various symptoms, not due to cognitive dysfunction. The confirmation of the possibility should be worked out through the follow-up study of some groups containing cortical lesion. Apart from these findings, Finger Localization, Tactile Form Perception(right) and Phoneme Discrimination suggest that they can be used as clinically valuable cognitive parameters that predict the lateralization of lesion in non-traumatic cerebrovascular disease.

  • PDF

A Study on the Rejection Capability Based on Anti-phone Modeling (반음소 모델링을 이용한 거절기능에 대한 연구)

  • 김우성;구명완
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.3
    • /
    • pp.3-9
    • /
    • 1999
  • This paper presents the study on the rejection capability based on anti-phone modeling for vocabulary independent speech recognition system. The rejection system detects and rejects out-of-vocabulary words which were not included in candidate words which are defined while the speech recognizer is made. The rejection system can be classified into two categories by their implementation methods, keyword spotting method and utterance verification method. The keyword spotting method uses an extra filler model as a candidate word as well as keyword models. The utterance verification method uses the anti-models for each phoneme for the calculation of confidence score after it has constructed the anti-models for all phonemes. We implemented an utterance verification algorithm which can be used for vocabulary independent speech recognizer. We also compared three kinds of means for the calculation of confidence score, and found out that the geometric mean had shown the best result. For the normalization of confidence score, usually Sigmoid function is used. On using it, we compared the effect of the weight constant for Sigmoid function and determined the optimal value. And we compared the effects of the size of cohort set, the results showed that the larger set gave the better results. And finally we found out optimal confidence score threshold value. In case of using the threshold value, the overall recognition rate including rejection errors was about 76%. This results are going to be adapted for stock information system based on speech recognizer which is currently provided as an experimental service by Korea Telecom.

  • PDF

영어 발음 교육

  • 이영길
    • Proceedings of the KSPS conference
    • /
    • 1997.07a
    • /
    • pp.258-259
    • /
    • 1997
  • 1. 외국어로서의 영어 교육에 있어서 발음 지도는 어느 정도의 영어 수준에 도달하기를 기다릴 필요없이 가능한 한 저학년에서부터 직접 지도되어야 한다. 즉 영어 교육은 영어 발음 교육부터 시작되는 것이 가장 바람직하다. 어느 정도의 수준 높은 문법 이론을 알고 있는 (대)학생들이라도 발음에 관한 한 많은 연습이 요구되는 경우가 흔히 있다. 바꿔 말하면 이러한 학생들은 그들이 갖고 있는 문법 지식만큼 발음에 대한 적극적인 구사력도 당연히 발휘할 수 있어야할 것이다. 영어 교육을 강조할 때 문장 구조와 어휘 교육이 중요시된다면 발음 또한 조기 교육 단계부터 영어 교육 프로그램의 필수불가결한 요소로 인식되어야 한다. 그렇다면 제일 처음 무엇을 어떻게 시작 해야할 것인가\ulcorner 흔히 음소(phoneme)라는 말의 최소 단위부터 시작하여 자음군(consonant cluster)과 같은 음 결합체를 가르친 다음 단어 강세(word stress)를 다루며, 마지막으로 문장 강세(sentence stress), 리듬(rhythm), 억양(intonation) 등을 포함함 이음말(connected speech)을 가르치는 순서가 될 수 있을 것이다. 그러나 이러한 방법이 이론상 논리적이긴 하지만 실제로 영어를 외국어로 배우는 우리 학생들에게는 얼마나 효과를 거둘 수 있는지 매우 의심스렵다. 오히려 가장 유익한 순서는 기본 억양 과 같은 적절한 표현과 함께 주어진 화맥 속에서의 의미 있는 문장 강세를 가르치고 그 다음에 그에 수반되는 중요한 소리의 발음을 지적해 주는 것이다. 예를 들면 Give it to him과 같은 구조를 교사가 구두로 제시할 때 단어 하나 하나를 강조한 나머지 너무 천천히 말하게 되면 전체 문장의 발음을 오히려 어렵게 만들어버린다. 중요한 것 은 기본 의사소통에 필요한 부분에 초점을 맞추는 일이다. 개별 단어에 부수되는 문제점은 '보충 지도'(remedial teaching)로 교정이 가능하다. 2. 우리의 초등학교 영어 교육의 현황을 고려할 때 비록 발음 지도가 쉬운 일은 아니지만 미래 지향적 결과를 기대할 때 우선 두 가지를 생각할 수 있다. 첫째로 현재의 교육대학교의 교사양성에 있어서 영어교육의 교과과정을 염두에 두지 않을 수 없다. 1981년도부터 교육대학교가 4년제가 명실공히 영어과로 운영되기는 수년밖에 되지 않는 실정이다. 현재의 교과과정도 현장에서 영어교육을 담당하기에는 불충분할 뿐만 아니라 영어발음에 관한 뚜렷한 과정이 없는 실정이다. 혼히 외국인 강사가 담당하는 이른바 영어회화 시간이 곧 발음 시간도 될 수 있다고 생각하기 쉬우나 이것은 전적으로 별개의 문제이다. 따라서 체계적인 발음 교육을 할 수 있는 교과과정이 되기를 바란다. 3. 앞에서 언급했듯이 4년제 이전에 졸업한 현직 교사들은 재학 중 영어 발음에 관한 지도를 받아본 적이 없다. 여기서 중요한 것은 이들 교사들에게 적절하고도 충분한 발음 교육을 시켜야 하는 연수 과정이다. 소리로 듣고 말해야 하는 초둥 영어 교육에 서 교사의 발음에 관한 지식은 그 중요성을 아무리 과대평가해도 지나치지 않을 것이다. 문제는 연수 내용이다. 적어도 현재까지 실시되어 온 초둥영어교육 담당자 연수 교과목 내용은 핵심을 찾기 힘들 정도로 교파목이 다양하고 산만하다. 따라서 예를 들면 영어발음 지도에 관한 과목도 마지못해 끼워 넣는 식의 과목 배정이다. 여기에 고작 할당된 시간은 많아야 4시간 정도이다. 대학에서 한 학기에도 부족한 영어 발음을 아 무런 배경 지식도 없는 초등 교사들에게 4시간 동안 무엇을 어떻게 가르칠 것인가\ulcorner

  • PDF

A Phoneme-based Approximate String Searching System for Restricted Korean Character Input Environments (제한된 한글 입력환경을 위한 음소기반 근사 문자열 검색 시스템)

  • Yoon, Tai-Jin;Cho, Hwan-Gue;Chung, Woo-Keun
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.10
    • /
    • pp.788-801
    • /
    • 2010
  • Advancing of mobile device is remarkable, so the research on mobile input device is getting more important issue. There are lots of input devices such as keypad, QWERTY keypad, touch and speech recognizer, but they are not as convenient as typical keyboard-based desktop input devices so input strings usually contain many typing errors. These input errors are not trouble with communication among person, but it has very critical problem with searching in database, such as dictionary and address book, we can not obtain correct results. Especially, Hangeul has more than 10,000 different characters because one Hangeul character is made by combination of consonants and vowels, frequency of error is higher than English. Generally, suffix tree is the most widely used data structure to deal with errors of query, but it is not enough for variety errors. In this paper, we propose fast approximate Korean word searching system, which allows variety typing errors. This system includes several algorithms for applying general approximate string searching to Hangeul. And we present profanity filters by using proposed system. This system filters over than 90% of coined profanities.

A Study on Regression Class Generation of MLLR Adaptation Using State Level Sharing (상태레벨 공유를 이용한 MLLR 적응화의 회귀클래스 생성에 관한 연구)

  • 오세진;성우창;김광동;노덕규;송민규;정현열
    • The Journal of the Acoustical Society of Korea
    • /
    • v.22 no.8
    • /
    • pp.727-739
    • /
    • 2003
  • In this paper, we propose a generation method of regression classes for adaptation in the HM-Net (Hidden Markov Network) system. The MLLR (Maximum Likelihood Linear Regression) adaptation approach is applied to the HM-Net speech recognition system for expressing the characteristics of speaker effectively and the use of HM-Net in various tasks. For the state level sharing, the context domain state splitting of PDT-SSS (Phonetic Decision Tree-based Successive State Splitting) algorithm, which has the contextual and time domain clustering, is adopted. In each state of contextual domain, the desired phoneme classes are determined by splitting the context information (classes) including target speaker's speech data. The number of adaptation parameters, such as means and variances, is autonomously controlled by contextual domain state splitting of PDT-SSS, depending on the context information and the amount of adaptation utterances from a new speaker. The experiments are performed to verify the effectiveness of the proposed method on the KLE (The center for Korean Language Engineering) 452 data and YNU (Yeungnam Dniv) 200 data. The experimental results show that the accuracies of phone, word, and sentence recognition system increased by 34∼37%, 9%, and 20%, respectively, Compared with performance according to the length of adaptation utterances, the performance are also significantly improved even in short adaptation utterances. Therefore, we can argue that the proposed regression class method is well applied to HM-Net speech recognition system employing MLLR speaker adaptation.

English Phoneme Recognition using Segmental-Feature HMM (분절 특징 HMM을 이용한 영어 음소 인식)

  • Yun, Young-Sun
    • Journal of KIISE:Software and Applications
    • /
    • v.29 no.3
    • /
    • pp.167-179
    • /
    • 2002
  • In this paper, we propose a new acoustic model for characterizing segmental features and an algorithm based upon a general framework of hidden Markov models (HMMs) in order to compensate the weakness of HMM assumptions. The segmental features are represented as a trajectory of observed vector sequences by a polynomial regression function because the single frame feature cannot represent the temporal dynamics of speech signals effectively. To apply the segmental features to pattern classification, we adopted segmental HMM(SHMM) which is known as the effective method to represent the trend of speech signals. SHMM separates observation probability of the given state into extra- and intra-segmental variations that show the long-term and short-term variabilities, respectively. To consider the segmental characteristics in acoustic model, we present segmental-feature HMM(SFHMM) by modifying the SHMM. The SFHMM therefore represents the external- and internal-variation as the observation probability of the trajectory in a given state and trajectory estimation error for the given segment, respectively. We conducted several experiments on the TIMIT database to establish the effectiveness of the proposed method and the characteristics of the segmental features. From the experimental results, we conclude that the proposed method is valuable, if its number of parameters is greater than that of conventional HMM, in the flexible and informative feature representation and the performance improvement.

A Study on Rhythm Information Visualization Using Syllable of Digital Text (디지털 텍스트의 음절을 이용한 운율 정보 시각화에 관한 연구)

  • Park, seon-hee;Lee, jae-joong;Park, jin-wan
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2009.05a
    • /
    • pp.120-126
    • /
    • 2009
  • As the information age grows rapidly, the amount of digital texts has been increasing as well. It has brought an increasing of visualization case in order to figure out lots of digital texts. Existing visualized design of digital text is merely concentrating on figuration of subject word through adoption of stemming algorithm and word frequency extraction, prominence of meaning of text, and connection in between sentences. So it is a fact that expression of rhythm that can visualize sentimental feeing of digital text was insufficient. Syllable is a phoneme unit that can express rhythm more efficiently. In sentences, syllable is a most basic pronunciation unit in pronouncing word, phase and sentence. On this basis, accent, intonation, length of rhythm factor and others are based on syllable. Sonority, which is most closely associated with definitions of syllable, is expressed through air flow of igniting lung and acoustic energy that is specified kinetic energy into sonority. Seen from this perspective, this study examines phonologic definition and characteristics based on syllable, which is properties of digital text, and research the way to visualize rhythm through diagram. After converting digital text into phonetic symbol by the experiment, rhythm information are visualized into images using degree of resonance, which was started from rhythm in all languages, and using syllable establishment of digital text. By visualizing syllable information, it provides syllable information of digital text and express sentiment of digital text through diagram to assist user's understanding by systematic formula. Therefore, this study is aimed at planning for easy understanding of text's rhythm and realizing visualization of digital text.

  • PDF

A STUDY ON THE INFLUENCE OF THE PALATAL PLATES UPON THE DURATION OF KOREAN SOUNDS (구개상 장착에 따른 한국어 어음의 조음시간 변화에 관한 연구)

  • Koh, Yeo-Joon;Kim, Chang-Whe;Kim, Yong-Soo
    • The Journal of Korean Academy of Prosthodontics
    • /
    • v.32 no.1
    • /
    • pp.77-102
    • /
    • 1994
  • Many studies have been made on the masticatory and esthetic effects of prosthodontic treatments, but few on the restoration of pronunciation, especially in complete denture wearers. The purpose of this study is to provide a basis that could be of help to the complete denture wearers' speech adaptation by analyzing the influence of the palatal coverage upon the duration of consonants and vowels with the method of experimental phonetics. For this study, metal plates and resin plates were made for 3 male subjects in their twenties, who have good occlusion, and do not have speech and hearing disorders. Then 8 Korean consonants and 4 Korean vowels were selected, systemically considering phonetic variants such as the place and manner of articulation, lenis/fortis, mutual effect of each phoneme, etc. They were combined into meaningless tested words in the form of /VCV/, and were included in the carrier sentences. Each informant uttered the sentences 1) without the plate, 2) with the metal plate, 3) with the resin plate. The recorded data were analyzed through the waveform of sounds and spectrogram by using the program SoundEdit, Signalize, Statview 512+for the Macintosh computer. The duration of each segment was measured by searching for the boundaries between the preceding vowels and consonants, and between the consonants and the following vowels. The study led to the conclusion that. 1. With the palatal plate, the duration of all the tested words increased and the duration increased more with the resin plate than with the metal plate. 2. With the palatal plate, the duration of all the preceding vowels, consonants, and following vowels increased, but the temporal structure of the tested words was maintained. 3. As for the manner of articulation, fricative /s/(ㅅ) was greatly influenced by both kinds of palatal plates. 4. As for the place of articulation, alveolar sounds /d/(ㄷ), /n/(ㄴ) were greatly influnced by the kinds of palatal plates, and the velar sounds /n/(ㅇ), /g/(ㄱ) were influenced by the platal plates, but the kind of the palatal plates did not show any significance. 5. As for the lenis/fortis, lenis was influenced more by the kind of the palatal plates. 6. As for the influence of vowels upon each segment in the tested words, palatal vowel /i/(ㅣ) had greater influence than pharyngeal vowel /a/(ㅏ), and following vowels than preceding vowels.

  • PDF

Corpus-based Korean Text-to-speech Conversion System (콜퍼스에 기반한 한국어 문장/음성변환 시스템)

  • Kim, Sang-hun; Park, Jun;Lee, Young-jik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.24-33
    • /
    • 2001
  • this paper describes a baseline for an implementation of a corpus-based Korean TTS system. The conventional TTS systems using small-sized speech still generate machine-like synthetic speech. To overcome this problem we introduce the corpus-based TTS system which enables to generate natural synthetic speech without prosodic modifications. The corpus should be composed of a natural prosody of source speech and multiple instances of synthesis units. To make a phone level synthesis unit, we train a speech recognizer with the target speech, and then perform an automatic phoneme segmentation. We also detect the fine pitch period using Laryngo graph signals, which is used for prosodic feature extraction. For break strength allocation, 4 levels of break indices are decided as pause length and also attached to phones to reflect prosodic variations in phrase boundaries. To predict the break strength on texts, we utilize the statistical information of POS (Part-of-Speech) sequences. The best triphone sequences are selected by Viterbi search considering the minimization of accumulative Euclidean distance of concatenating distortion. To get high quality synthesis speech applicable to commercial purpose, we introduce a domain specific database. By adding domain specific database to general domain database, we can greatly improve the quality of synthetic speech on specific domain. From the subjective evaluation, the new Korean corpus-based TTS system shows better naturalness than the conventional demisyllable-based one.

  • PDF