Search | Korea Science

Development of a Machine Learning-based Language Corrector for AI Speakers of Patients with Articulation Disorders (조음장애인용 AI스피커를 위한 머신러닝 기반 언어교정기 개발)

Lee, DongHeon;Moon, Mikyeong
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2020.07a
- /
- pp.371-372
- /
- 2020
최근 인공지능의 발달로 인해 AI스피커에 대한 연구가 활발히 이루어지고 있다. 조음장애는 구강 안에서 말소리를 제대로 만들지 못해서 제대로 된 언어를 구사하지 못하는 장애를 말한다. 조음장애인들이 AI스피커를 사용하면 발음을 제대로 인식하지 못하기 때문에 사용의 어려움이 있다. 본 논문에서는 경증 조음장애인들이 AI스피커를 이용할 수 있도록 머신러닝 기반 언어교정기의 개발내용에 관하여 기술한다. 이는 언어로 명령 줄 수 있는 여러 시스템에 활용될 수 있을 것으로 기대한다.
PDF

Post-Processing of Speech Recognition Using Phonological Variables and Improved Edit-distance (발음 변이와 개선된 편집 거리를 이용한 음성 인식 후처리)

Kim, Yejin;Park, Youngmin;Kang, Sangwoo;Jung, Sangkeon;Lee, Cheongjae;Seo, Jungyun
- Annual Conference on Human and Language Technology
- /
- 2014.10a
- /
- pp.9-12
- /
- 2014
본 논문에서는 오인식된 고유명사의 후처리 방법을 제안한다. 최근 음성 인식 후처리를 위해 통계적 방법을 이용하는 연구가 활발히 진행되어 왔다. 하지만 고유명사의 음성 인식 후처리는 대용량의 데이터 수집에 많은 비용이 필요하므로 통계적 방법을 효과적으로 적용하기 어렵다. 따라서 본 논문에서는 발음 변이 현상을 고려하여 편집 거리 알고리즘을 개선한 기법을 제안한다. 본 논문에서는 고유명사의 음성 오인식 교정 성능을 검증하였고, 그 결과 P@3의 결과가 비교 모델보다 55%의 성능 향상률을 보였다.
PDF

교정적 치료

Yang, Won-Sik
- The Journal of the Korean dental association
- /
- v.20 no.9 s.160
- /
- pp.753-758
- /
- 1982
선천이상 중에서도 순열과 구개열은 특상한 성질의 것으로서 안모의 변형, 발음장애, 신체타부와의 관계기형, 이비인후과질환을 병발하기 쉽고, 호흡기, 소화기 질환에 이환되기 쉬으며 환자는 정신적, 사회적으로도 매우 불리한 위치에 놓여 있는 신체장애자로, Fogh-Anderson은 순열과 구개열은 열성반성유전(recessive sexlinked heredity)을 한다고 보고했다. 그러나 구개열은 유전적추적이 불가능하기 때문에 그 발생원인으로 물리적원인을 우선적으로 들고있다. 발생기전은 태생 8~12조경의 제2차구개형성기에 어떤 이유로 양측의 구개돌기(palatine process)와 비중격이 분리되있는 채로 있든가, 접근해서도 유합되지 않고 있든가의 조해요인으로 태아의 모태내에 있어서의 자세, 특히 흉부에 의한 하악골압박에 의해 발생하는 소악증(micromandible), 설하수(glossoptosis)를 동반하는 Pierr Robin syndrome, 태생 8~10조경까지의 혀의 만기정유, 지, 제대의 원시구강내로의 미입등을 들수있다. 순열 및 구개열환자의 치료에 있어서 종래에는 전과정이 외과의사에게만 맡겨졌었으나, 구순, 비부의 추형등의 문제, 특징적으로 발생되는 하악의 열성장, 이별궁이 왜형, 발음장애등의 문제점해결을 위하여 필연적으로 이에 관계있는 명기다른 전문분야의 전문의가 시술에 임하게되는 multidiscipline approach로서 종합진단, 장기치료계획의 입안, 전문적 의견의 교환을 통해서 치료시기, 치료순서의 결정으로 성공적인 순열및 구개열의 치료목표를 달성하리라고 생각한다.
PDF

A Method for Correcting English Vowel Pronunciation by Wooden Chopsticks (나무젓가락에 의한 영어모음 발음교정 방안)

Yang, Byung-Gon
- Phonetics and Speech Sciences
- /
- v.2 no.4
- /
- pp.51-58
- /
- 2010
English vowels play an important role in the daily communication between Korean students and international visitors. However, many Korean students still have difficulty producing them distinctively. Vowels vary according to shapes of oral and pharyngeal cavities, which are mainly determined by the degree of jaw opening and tongue position. Yang (2008a) proposed a simplified chart of English and Korean vowels for an educational purpose. He also suggested to use wooden chopsticks to secure distinguishable jaw openings. The purpose of this study is to tap whether wooden chopsticks can be applicable to a method for correcting English vowel pronunciation. Twelve male and female students participated in the recordings of eight /hVd/ words followed by additional recordings with wooden chopsticks between upper and lower teeth. The first and second formant trajectories of both natural and controlled vowel productions were obtained and compared at six equidistant measurement points using Praat. Results showed that the formant values of natural vowel productions were comparable to those of controlled productions. Vowels with similar formant trajectories of male students were separated with the aid of chopsticks. The width of each chopstick could be controlled similarly in the experiment. The author concludes that wooden chopsticks can be useful to correct vowel pronunciation. Further studies are desirable for native speakers to make perceptual evaluations of controlled vowel productions by nonnative speakers.
PDF

English Learning Applications Using Big Data Development (빅데이터를 활용한 영어학습 애플리케이션 설계 및 구현)

Lee, Jae-hoon;Kim, Seung-beom;Kim, Chang-young;Yang, Won-seok;Kim, Do-woo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2020.11a
- /
- pp.644-647
- /
- 2020
최근 교육분야에서는 IT 기술을 활용하여 교육을 혁신하는 것을 의미하는 에듀테크에 대한 관심이 높아지고 있다. 단순한 지식의 전달이 아닌 사용자의 수준에 맞춰진 학습을 하고 자신의 학습 내용을 스스로 모니터링할 수 있는 새로운 교육시스템이 필요하다. 이에 본 논문에서는 빅데이터를 활용한 영어학습 애플리케이션를 제안한다. 제안하는 애플리케이션은 영어뉴스 기사에서 추출한 빅데이터를 활용하여 사용자 수준에 맞춘 유용한 문장을 분석해 자동으로 문제를 생성하고 사용자의 음성데이터를 강세 분석 알고리즘으로 원어민 발음과 비교분석 하여 발음 및 강세를 교정할 수 있도록 설계 및 구현하였다.
https://doi.org/10.3745/PKIPS.y2020m11a.644 인용 PDF

THE EFFECT OF ORTHODONTIC TREATMENT BY PREMOLAR EXTRACTION ON THE PRONUNCIATION OF THE KOREAN CONSONATS (소구치 발거를 통한 교정치료가 한국어 자음의 발음에 미치는 영향)

Lee, Jeong-Hee;Yoon, Young-Jooh;Kim, Kwang-Won
- The korean journal of orthodontics
- /
- v.27 no.1
- /
- pp.91-103
- /
- 1997
This paper aimed to study what the influences of orthodontic treatment of pronunciation are. We compared the duration and the acoustic wave patterns of Korean consonants pronounced by a control group with those of a patient who had his four premolars extracted and had been given orthodontic treatment The results were as follows : 1. Compared to the control group, the treatment group had a longer duration time of consonant pronunciation for all consonants but "ㅅ(s)" and "ㅌ($(t^h)$" in CV(consonant-vowel) pairs. Especially in the case of "ㅈ(dz)", "ㅆ$({\varphi}^h)$" for CV-pairs, and "ㄷ(d)" in VCV(vowel-consonant-vowel) clusters, the duration of consonant sound showed a sharp contrast between the control group and the treatment group. 2. There were clear differences in the acoustic wave patterns of "ㅉ(ts)", "ㅆ$({\varphi}^h)$" and "ㅊ$(c^h)$", all of which were in VCV-clusters. The acoustic wave pattern of "ㅉ(ts)", when pronounced by the treatment group, was stronger than the control group's. This phenomenon was most remarkable in the transitive section where the "ㅉ(ts)" sound flowed into the following vowel. When a preceding vowel shifted to the consonant "ㅆ$({\varphi}^h)$", the attack property of the appeared clearly in the acoustic waves of the treament group, while in the control group the starting point of consonart was indistinctive. Consonant duration for the treatment group was longer, and the appearance of a zero crossing point in the acoustic wave was more frequent. In the case of "ㅊ$(c^h)$", the treatment group produced a strong acoustic wave, and the property of aspiration was obvious in it. 3. When the treatment group pronounced "ㄷ(d)" and "ㅈ(dz)" in CV-pairs, the acoustic-wave was similar to that of aspirated "ㅌ$(t^h)$" and "ㅊ$(c^h)$". 4. The aspirated "ㅌ$(t^h)$" and "ㅊ$(c^h)$" pronounced by the treatment group showed the stronger airstream and acoustic wave form.
PDF

Types of malocclusion and oral health effect index(OHIP-14) according to recognition of orthodontic treatment (부정교합 종류에 따른 교정치료의 인식과 구강건강영향지수(OHIP-14))

Yoon, Hyun-Seo
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.18 no.12
- /
- pp.434-442
- /
- 2017
The purpose of this study was to examine the influence of type of malocclusion and orthodontic treatment awareness on quality of life among orthodontic patients in the region of Busan as well as to develop an educational program tailored to the type of malocclusion as a way to improve quality of life. A survey was conducted for approximately 6 months from December, 2015, and the answer sheets from 472 respondents were analyzed. The most common painful area was the teeth, and this case was most predominant in the respondents with level 2 malocclusion, who differed from others in that regard (p<0.001). Regarding the relationship between satisfaction with orthodontic treatment and quality of life, respondents who were more satisfied currently and who were neither quite confident nor quite unconfident were ahead of their counterparts in quality of life. Concerning the reason for receiving orthodontic treatment, quality of life was lower among patients who started to receive treatment due to pronunciation problems (p=0.013), chewing difficulty (p<0.001), and temporomandibular joint click sound (p<0.001). With regard to influential factors on oral health-related quality of life, time for starting to receive orthodontic treatment was most influential (p<0.001), followed by current satisfaction (p<0.001), changes in confidence (p=0.003), self-rated teeth status (p=0.008), and type of occlusion (p=0.019). Therefore, accurate analysis of the oral health status of orthodontic patients and customized oral health education are required to improve quality of life even during the period of orthodontic treatment.
https://doi.org/10.5762/KAIS.2017.18.12.434 인용 PDF KSCI

A Study on the Utilization of Speech Recognition Technology in Foreign Language Learning Applications - Focusing on English and French Speech - (외국어 학습용 어플리케이션의 음성 인식 기술 활용 현황 - 영어와 프랑스어 말하기 학습을 중심으로 -)

Kim, Sunhee;Jung, Hyunhoon
- Journal of Digital Contents Society
- /
- v.19 no.4
- /
- pp.621-630
- /
- 2018
This paper presents a case study on foreign language learning applications based on the speech recognition technology, aiming to grasp their current status and limitations of the technology applied to the foreign language speaking education, especially for English and French. As a result of examining the characteristics of the selected English and French applications by drawing on speech learning, it is shown that the use of speech recognition technology has the advantage of creating a speaking practice environment and giving feedback. However, in the case of feedback, there is a lack of appropriate calibration feedback which can help learners correct errors by themselves.
https://doi.org/10.9728/dcs.2018.19.4.621 인용 PDF KSCI

Speech Visualization of Korean Vowels Based on the Distances Among Acoustic Features (음성특징의 거리 개념에 기반한 한국어 모음 음성의 시각화)

Pok, Gouchol
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.12 no.5
- /
- pp.512-520
- /
- 2019
It is quite useful to represent speeches visually for learners who study foreign languages as well as the hearing impaired who cannot directly hear speeches, and a number of researches have been presented in the literature. They remain, however, at the level of representing the characteristics of speeches using colors or showing the changing shape of lips and mouth using the animation-based representation. As a result of such approaches, those methods cannot tell the users how far their pronunciations are away from the standard ones, and moreover they make it technically difficult to develop such a system in which users can correct their pronunciation in an interactive manner. In order to address these kind of drawbacks, this paper proposes a speech visualization model based on the relative distance between the user's speech and the standard one, furthermore suggests actual implementation directions by applying the proposed model to the visualization of Korean vowels. The method extract three formants F1, F2, and F3 from speech signals and feed them into the Kohonen's SOM to map the results into 2-D screen and represent each speech as a pint on the screen. We have presented a real system implemented using the open source formant analysis software on the speech of a Korean instructor and several foreign students studying Korean language, in which the user interface was built using the Javascript for the screen display.
https://doi.org/10.17661/jkiiect.2019.12.5.512 인용 PDF KSCI

An Introduction to 'Dr.Speaking' - English Pronunciation Tutoring System for Korean - (한국인을 위한 영어발음교정 시스템 'Dr.Speaking' 소개)

김효숙
- Proceedings of the KSPS conference
- /
- 2002.11a
- /
- pp.47-50
- /
- 2002
This paper is to introduce 'Dr. Speaking', which was recently developed by Eonon Inc.. 'Dr. Speaking' is an English pronunciation tutoring system. This has three distinguishing features. First, it teaches how to organize a speaker's vocal organs to pronounce accurately. Second, after it compares a speaker's pronunciation with that of a native speaker's, it grades that speaker's pronunciation level according to phonetic standards. Third, it provides proper information necessary for correcting a speaker's incorrect pronunciation. It is not always easy for a tutoring system to execute the above three almost simutaneously. However, 'Dr. Speaking' proved itself that it is possible by adding speech technology (e.g. speech recognition) to phonetic knowledge.
PDF

Search Result 59, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)