• Title/Abstract/Keywords: Speech Confidence

70 search results (processing time: 0.028 s)

A Study of the Use of Allophonic Cues in the Perception of English Word Boundaries by Korean Learners of English

  • 장수영;박한상
    • 말소리와 음성과학 / Vol. 3, No. 3 / pp.63-68 / 2011
  • This study investigates how Korean students employ acoustic-phonetic cues in perceiving word boundaries in near-homophonous English phrases. Sixty Korean college students took part in an experiment in which they discriminated word boundaries for 42 pairs of stimuli containing the allophonic cues of aspiration and glottal stop. Results were analysed in terms of the correctness of responses and the correlation between correctness and confidence. Stimulus pairs with the glottal stop cue yielded higher correctness, whereas those with the aspiration cue yielded relatively lower correctness. Comparison with previous studies of English and Japanese speakers showed that Korean and Japanese speakers of English achieve substantially lower correctness than native speakers of English, and that Korean learners of English as a foreign language achieve lower correctness than Japanese speakers of English as a second language.


DTW based Utterance Rejection on Broadcasting News Keyword Spotting System

  • 박경미;박정식;오영환
    • 대한음성학회:학술대회논문집 / 대한음성학회 2005 Fall Conference Proceedings / pp.155-158 / 2005
  • Keyword spotting is effective for finding keywords in continuously spoken speech. However, non-keywords may be accepted as keywords when environmental noise occurs or the speaker changes. To overcome this performance degradation, utterance rejection techniques that apply a confidence measure to the recognition result have been developed. In this paper, we apply DTW to an HMM-based broadcasting news keyword spotting system to reject non-keywords. Experimental results show that the false acceptance rate is decreased to 50%.
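
The DTW-based rejection idea in the abstract above can be illustrated with a minimal sketch. This is not the authors' implementation: the Euclidean local cost, the length normalization, the reference `template`, and the `threshold` value are all assumptions made for illustration.

```python
import math

def dtw_distance(a, b):
    """Length-normalized DTW distance between two sequences of feature vectors.

    Assumption: the local cost is the Euclidean distance between frames.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal accumulated cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])
            cost[i][j] = d + min(cost[i - 1][j],      # step in a only
                                 cost[i][j - 1],      # step in b only
                                 cost[i - 1][j - 1])  # step in both
    return cost[n][m] / (n + m)

def reject(hypothesis, template, threshold=0.5):
    """Hypothetical rejection rule: reject when the detected segment is too
    far from a reference template of the keyword."""
    return dtw_distance(hypothesis, template) > threshold
```

In this sketch a detected keyword segment is kept only if it warps closely onto a stored template of that keyword; in practice the threshold would have to be tuned on held-out data.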


The Effect of Voice Generalization on Puberphonia Patients via Generalization-Reinforced Visual Feedback Program: A Case Study

  • 권순복;박희준;정옥란;왕수건
    • 음성과학 / Vol. 15, No. 2 / pp.145-156 / 2008
  • The purpose of this study was to investigate why puberphonia patients revisit hospitals after completing treatment, and the effect of visual voice therapy on voice improvement. The subjects were two puberphonia patients who had been diagnosed by laryngologists, treated by a speech pathologist, and had completed their treatment before revisiting the hospital. The study used laryngoscopy and acoustic and aerodynamic analysis before and after voice treatment to investigate what changes occurred and why the treatment effect did not generalize naturally to daily life. As a result, fundamental frequency (F0) was significantly lowered in terms of acoustic change, and maximum phonation time (MPT) increased to some extent in terms of aerodynamic change. In addition, there was a laryngoscopic change: the commissure glottic chink generally disappeared during phonation. Generalization did not occur naturally in daily life mainly because high-pitched voicing had been used for a long time. Beyond that, negative reactions or attitudes of surrounding people and a lack of confidence were also to blame for the failure of generalization.


Academic Performance, Communication, and Psychosocial Development of Prelingual Deaf Children with Cochlear Implants in Mainstream Schools

  • Choi, Ji Eun;Hong, Sung Hwa;Moon, Il Joon
    • Journal of Audiology & Otology / Vol. 24, No. 2 / pp.61-70 / 2020
  • Background and Objectives: To assess the academic performance, communication skills, and psychosocial development of prelingual deaf children with cochlear implants (CIs) attending mainstream schools, and to evaluate the impact of auditory speech perception on their classroom performance. Subjects and Methods: Sixty-seven children with CIs attending mainstream schools were included as participants. A survey was conducted using a structured questionnaire on academic performance in the native language, second language, mathematics, social studies, science, art, communication skills, self-esteem, and social relations. Additionally, auditory and speech performances at the last follow-up were reviewed retrospectively. Results: Most implanted children attending mainstream schools appeared to have positive self-esteem and confidence, and had little difficulty conversing in a quiet classroom. Also, half of the implanted children (38/67) scored above average in general academic achievement. However, academic achievement in the second language (English), social studies, and science was usually poorer than general academic achievement. Furthermore, half of the implanted children had difficulty understanding the class content (30/67) or conversing with peers in a noisy classroom (32/67). These difficulties were significantly associated with poor speech perception. Conclusions: Improving the listening environment for implanted children attending mainstream schools is necessary.

New Postprocessing Methods for Rejecting Out-of-Vocabulary Words

  • Song, Myung-Gyu
    • The Journal of the Acoustical Society of Korea / Vol. 16, No. 3E / pp.19-23 / 1997
  • The goal of postprocessing in automatic speech recognition is to improve recognition performance through utterance verification at the output of the recognition stage. It focuses on the effective rejection of out-of-vocabulary words based on the confidence score of the hypothesized candidate word. We present two methods for computing confidence scores. Both methods are based on the distance between each observation vector and the representative code vector, which is defined as the most likely code vector at each state. While the first method employs simple time normalization, the second uses a normalization technique based on the concept of the on-line garbage model [1]. In a speaker-independent isolated-word recognition experiment with discrete-density HMMs, the second method outperforms both the first one and the conventional likelihood ratio scoring method [2].


Improvement of Confidence Measure Performance using Background Model Set Algorithm

  • 김병돈;이경록;김진영;최승호
    • 대한음성학회:학술대회논문집 / 대한음성학회 May 2003 Conference Proceedings / pp.79-82 / 2003
  • In this paper, we propose the Background Model Set (BMS) algorithm for speaker verification to address a shortcoming in how the conventional confidence measure (CM) is calculated. The CM expresses the relative likelihood between recognized models and unrecognized models, where the unrecognized models are known as antiphone models. In the conventional process, the probability and standard deviation used to compose an antiphone model are calculated over all phonemes. This degrades the antiphone CM and also increases recognition time. To address this problem, we studied a method that uses the BMS algorithm to reconstitute the mean and standard deviation from only the antiphonemes close to the phoneme for which the CM is calculated.
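
The notion of a confidence measure as a relative likelihood between a recognized model and a set of background (antiphone) models can be sketched as a log-likelihood ratio. This is only an illustration under stated assumptions, not the BMS algorithm itself: the function names, the averaging of background log-likelihoods, and the threshold are hypothetical, and the caller is assumed to supply the background set.

```python
def confidence_measure(target_loglik, background_logliks):
    """Log-likelihood ratio between the recognized model and a background set.

    Assumption: the background score is the mean log-likelihood of the
    supplied antiphone models; the paper's exact formula is not given here.
    """
    if not background_logliks:
        raise ValueError("at least one background model score is required")
    background = sum(background_logliks) / len(background_logliks)
    return target_loglik - background

def accept(target_loglik, background_logliks, threshold=0.0):
    """Hypothetical decision rule: accept when the ratio clears a threshold."""
    return confidence_measure(target_loglik, background_logliks) >= threshold
```

Restricting `background_logliks` to a small set of nearby models, rather than all phonemes, is what would reduce the computation in a scheme like the one the abstract describes.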


Development of Autonomous Mobile Robot with Speech Teaching Command Recognition System Based on Hidden Markov Model

  • 조현수;박민규;이현정;이민철
    • 제어로봇시스템학회논문지 / Vol. 13, No. 8 / pp.726-734 / 2007
  • Generally, a mobile robot moves according to pre-loaded programs. However, it is very hard for a non-expert to change the program that generates the moving path of a mobile robot, because a non-expert knows little about the teaching commands and operating methods for driving the robot. Therefore, a teaching method based on speech commands is increasingly required for people without the use of their hands and for non-experts who lack the expert knowledge needed to generate the path. In this study, an autonomous mobile robot with a speech recognition function is developed so that its moving path can be taught easily. Using the human voice as the teaching method provides a more convenient user interface for the mobile robot. To implement the teaching function, the designed robot system is composed of three separate control modules: a speech preprocessing module, a DC servo motor control module, and a main control module. We design and implement a speaker-dependent isolated-word recognition system for creating the moving path of an autonomous mobile robot in an unknown environment. The system uses word-level Hidden Markov Models (HMMs) for the designated command vocabulary to control the mobile robot, and it applies postprocessing by a neural network conditioned on the confidence score. For spectral analysis, we use a filter-bank analysis model to extract features of the voice. The proposed word recognition system is tested using 33 Korean words for controlling mobile robot navigation, and we also evaluate the navigation performance of a mobile robot using only voice commands.

Articulation error of children with adenoid hypertrophy

  • Eom, Tae-Hoon;Jang, Eun-Sil;Kim, Young-Hoon;Chung, Seung-Yun;Lee, In-Goo
    • Clinical and Experimental Pediatrics / Vol. 57, No. 7 / pp.323-328 / 2014
  • Purpose: Adenoid hypertrophy is a physical alteration that may affect speech, and a speech disorder can have other negative effects on a child's life. Airway obstruction leads to constricted oral breathing and causes postural alterations of several oro-facial structures, including the mouth, tongue, and hyoid bone. The postural modifications may affect several aspects of speech production. Methods: In this study, we compared articulation errors in 19 children with adenoid hypertrophy (subject group) to those of 33 children with functional articulation disorders independent of anatomical problems (control group). Results: The mean age of the subject group was significantly higher (P=0.016). Substitution was more frequent in the subject group (P=0.003; odds ratio [OR], 1.80; 95% confidence interval [CI], 1.23-2.62), while omission was less frequent (P<0.001; OR, 0.43; 95% CI, 0.27-0.67). Articulation errors were significantly less frequent in the palatal affricative in the subject group (P=0.047; OR, 0.25; 95% CI, 0.07-0.92). The number of articulation errors in other consonants was not different between the two groups. Nasalization and aspiration were significantly more frequent in the subject group (P=0.007 and 0.014; OR, 14.77 and 0.014; 95% CI, [1.62-135.04] and NA, respectively). Otherwise, there were no differences between the two groups. Conclusion: We identified the characteristics of articulation errors in children with adenoid hypertrophy, but our data did not show the relationship between adenoid hypertrophy and oral motor function that has been observed in previous studies. The association between adenoid hypertrophy and oral motor function remains doubtful.

A Method of Automated Quality Evaluation for Voice-Based Consultation

  • 이건수;김중연
    • 인터넷정보학회논문지 / Vol. 22, No. 2 / pp.69-75 / 2021
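  • With the onset of the contact-free ("untact") era, the growth of online industries is steadily accelerating. As online industries grow, customer management becomes more important, and the contact center market that sits at that point of contact is growing as well. To overcome the irony that contact center work, a major service area of the untact era, is labor-intensive, and to raise contact center efficiency, research on various task automation technologies is actively under way. This study proposes a method for automating quality evaluation, a representative contact center task for which automation is highly effective because the work itself is routine yet highly important. The proposed method obtains speech recognition results from channel-separated consultation recordings, analyzes the utterances sentence by sentence, performs the quantitative evaluation items (evaluation of the opening, evaluation of listening and silence during the response, and evaluation of the closing), and then outputs the results in the format of the evaluation sheet. The proposed method showed a 92.7% agreement rate with experts' evaluation results. The mismatched cases were mainly due to speech recognition errors. Therefore, if the reliability of the speech recognition results is ensured, the method proposed in this paper can increase the efficiency of this task through automated quality evaluation.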