• Title/Summary/Keyword: Pronunciation Training

Pronunciation Variation Patterns of Loanwords Produced by Korean and Grapheme-to-Phoneme Conversion Using Syllable-based Segmentation and Phonological Knowledge (한국인 화자의 외래어 발음 변이 양상과 음절 기반 외래어 자소-음소 변환)

  • Ryu, Hyuksu;Na, Minsu;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.7 no.3
    • /
    • pp.139-149
    • /
    • 2015
  • This paper aims to analyze pronunciation variations of loanwords produced by Korean speakers and to improve the performance of pronunciation modeling of loanwords in Korean by using syllable-based segmentation and phonological knowledge. The loanword text corpus used for our experiment consists of 14.5k words extracted from the frequently used words in set-top box, music, and point-of-interest (POI) domains. First, pronunciations of loanwords in Korean are obtained by manual transcription and are used as target pronunciations. The target pronunciations are compared with the standard pronunciation using confusion matrices to analyze the pronunciation variation patterns of loanwords. Based on the confusion matrices, three salient pronunciation variations of loanwords are identified: tensification of the fricative [s] and derounding of the rounded vowels [ɥi] and [wɛ]. In addition, a syllable-based segmentation method that takes phonological knowledge into account is proposed for loanword pronunciation modeling. Performance of the baseline and the proposed method is measured using phone error rate (PER)/word error rate (WER) and F-score at various context spans. Experimental results show that the proposed method outperforms the baseline. We also observe that performance degrades when training and test sets come from different domains, which implies that loanword pronunciations are influenced by data domains. It is noteworthy that pronunciation modeling for loanwords is enhanced by reflecting phonological knowledge. The loanword pronunciation modeling in Korean proposed in this paper can be used for automatic speech recognition in application interfaces such as navigation systems and set-top boxes, and for computer-assisted pronunciation training for Korean learners of English.

Generating Pronunciation Lexicon for Continuous Speech Recognition Based on Observation Frequencies of Phonetic Rules (음소변동규칙의 발견빈도에 기반한 음성인식 발음사전 구성)

  • Na, Min-Soo;Chung, Min-Hwa
    • MALSORI
    • /
    • no.64
    • /
    • pp.137-153
    • /
    • 2007
  • The pronunciation lexicon of a continuous speech recognition system should contain enough pronunciation variations to build a search space large enough to contain a correct path, whereas the size of the pronunciation lexicon needs to be constrained for effective decoding and lower perplexities. This paper describes a procedure for selecting pronunciation variations to be included in the lexicon based on the frequencies of the corresponding phonetic rules observed in the training corpus. The likelihood of a phonetic rule's application is estimated from the observation frequency of the rule and is used to control the construction of a pronunciation lexicon. Experiments with various pronunciation lexica show that the proposed method helps improve speech recognition performance.
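
The selection procedure summarized in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `RuleStats` record, the relative-frequency estimate (applications divided by opportunities), the `threshold` value, and the toy rule names are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RuleStats:
    """Hypothetical corpus counts for one phonetic rule."""
    applied: int      # times the rule was observed to apply
    applicable: int   # times the rule's context occurred at all


def rule_likelihood(stats: RuleStats) -> float:
    # Assumed estimator: relative frequency of application.
    if stats.applicable == 0:
        return 0.0
    return stats.applied / stats.applicable


def select_variants(candidates, rule_stats, threshold=0.1):
    """Keep canonical forms and variants whose rule is frequent enough.

    candidates: dict mapping a word to a list of (pronunciation, rule_name)
    pairs, where rule_name is None for the canonical pronunciation.
    """
    lexicon = {}
    for word, variants in candidates.items():
        kept = []
        for pronunciation, rule in variants:
            if rule is None or rule_likelihood(rule_stats[rule]) >= threshold:
                kept.append(pronunciation)
        lexicon[word] = kept
    return lexicon


# Toy data, invented for this example only.
rule_stats = {
    "nasalization": RuleStats(applied=90, applicable=100),
    "rare_rule": RuleStats(applied=1, applicable=100),
}
candidates = {
    "gukmul": [
        ("k u k m u l", None),
        ("k u ng m u l", "nasalization"),
        ("k u k m u r", "rare_rule"),
    ],
}
lexicon = select_variants(candidates, rule_stats, threshold=0.1)
```

Raising `threshold` shrinks the lexicon (a smaller search space), while lowering it admits more variants; the abstract describes this trade-off but does not state the values used.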

Teaching Pronunciation for English as an International Language (국제어로서의 영어 발음교육 : 과제와 방향)

  • Park, Joo-Kyung
    • Proceedings of the KSPS conference
    • /
    • 2000.03a
    • /
    • pp.103-104
    • /
    • 2000
  • As the role and status of English as an international language (EIL) have been widely discussed, studies need to be done to identify new issues and concerns related to teaching EIL in Korea. This presentation will review the changes in teaching English in Korea, teaching pronunciation in particular, focusing on its goal and major instructional approaches. Suggestions will be made on developing a learner-centered communicative model for teaching English pronunciation and on training both Korean and foreign teachers of English to teach English pronunciation.

The use of audio-visual aids and hyper-pronunciation method in teaching English consonants to Japanese college students

  • Todaka, Yuichi
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.149-154
    • /
    • 1996
  • Since the 1980s, a number of professionals in the ESL/EFL field have investigated the role of pronunciation in the ESL/EFL curriculum. Applying insights gained from second language acquisition research, these efforts have focused on integrating pronunciation teaching and learning into the communicative curriculum, with a shift towards overall intelligibility as the primary goal of pronunciation teaching and learning. The present study reports on the efficacy of audio-visual aids and a hyper-pronunciation training method in teaching the production of English consonants to Japanese college students. The talk will focus on the implications of the present study, and the presenter will make suggestions for teaching pronunciation to Japanese learners.

Digital enhancement of pronunciation assessment: Automated speech recognition and human raters

  • Kim, Miran
    • Phonetics and Speech Sciences
    • /
    • v.15 no.2
    • /
    • pp.13-20
    • /
    • 2023
  • This study explores the potential of automated speech recognition (ASR) in assessing English learners' pronunciation. We employed ASR technology, acknowledged for its impartiality and consistent results, to analyze speech audio files, including synthesized speech, both native-like English and Korean-accented English, and speech recordings from a native English speaker. Through this analysis, we establish baseline values for the word error rate (WER). These were then compared with those obtained for human raters in perception experiments that assessed the speech productions of 30 first-year college students before and after taking a pronunciation course. Our sub-group analyses revealed positive training effects for Whisper, an ASR tool, and human raters, and identified distinct human rater strategies in different assessment aspects, such as proficiency, intelligibility, accuracy, and comprehensibility, that were not observed in ASR. Despite such challenges as recognizing accented speech traits, our findings suggest that digital tools such as ASR can streamline the pronunciation assessment process. With ongoing advancements in ASR technology, its potential as not only an assessment aid but also a self-directed learning tool for pronunciation feedback merits further exploration.
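
The word error rate (WER) used as the baseline measure in this abstract is conventionally the word-level edit distance between a reference transcript and the ASR output, divided by the number of reference words. The sketch below shows that standard definition only; the lowercasing and whitespace tokenization are assumptions, and the study's own text normalization is not described in the abstract.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token sequences."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def word_error_rate(reference: str, hypothesis: str) -> float:
    # Assumed normalization: lowercase and split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")
    return edit_distance(ref, hyp) / len(ref)


# One substitution in a six-word reference.
wer = word_error_rate("the cat sat on the mat", "the cat sat on a mat")
```

The same function yields a phone error rate (PER) when the inputs are space-separated phone sequences instead of words.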

A Case Study on Voice Training Supporters' Training Course Management for Multicultural Family Members: Focus on B University's Governmental Support Policy (다문화가족 구성원 대상 보이스트레이닝 서포터스 양성과정 운영 사례 연구 -B대학교 정부 지원 사업을 중심으로-)

  • Lee, Younghee;Cho, Wisu
    • Journal of Korean language education
    • /
    • v.28 no.4
    • /
    • pp.121-147
    • /
    • 2017
  • This study reports the current management status and results of the voice training supporters' preparation course run by B University's multicultural creative-HR team as part of a local funding project at the university. To this end, the concept of voice training and the educational content for multicultural family members are first drawn from several documents. The management of B University's voice training supporters' education course is then described in terms of its goals, operating body, implementation process, and preliminary education content. To analyze the management results, in-depth interviews were conducted with the supporters and a semi-structured survey was conducted with the main instructors of the voice academy. Work result reports, supporters' work journals, and similar materials were also used in the analysis. The analysis shows that, in terms of education, the preliminary education content and the teaching practicum were not organically connected, and that a more detailed curriculum covering the practical skills needed to manage a classroom is required. In terms of management, the preparatory stage and the practice stage of the voice training course were not linked, so more cooperation with the main instructors is required. Although the results are limited, the voice training supporters' training course has several implications. First, the supporters receive education in Korean pronunciation and intonation, which can facilitate learner-centered education. Second, the course demonstrates empirically that a class specializing in Korean pronunciation and intonation can be administered. Finally, it can provide teaching practice and field experience for students majoring in Korean education.

Automatic pronunciation assessment of English produced by Korean learners using articulatory features (조음자질을 이용한 한국인 학습자의 영어 발화 자동 발음 평가)

  • Ryu, Hyuksu;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.8 no.4
    • /
    • pp.103-113
    • /
    • 2016
  • This paper proposes articulatory features as novel predictors for automatic pronunciation assessment of English produced by Korean learners. Based on distinctive feature theory, where phonemes are represented as a set of articulatory/phonetic properties, we propose articulatory Goodness-Of-Pronunciation (aGOP) features in terms of the corresponding articulatory attributes, such as nasal, sonorant, and anterior. An English speech corpus spoken by Korean learners is used in the assessment modeling. In our system, learners' speech is force-aligned and recognized using acoustic and pronunciation models derived from the WSJ corpus (native North American speech) and the CMU pronouncing dictionary, respectively. In order to compute aGOP features, articulatory models are trained for the corresponding articulatory attributes. In addition to the proposed features, various features divided into four categories (RATE, SEGMENT, SILENCE, and GOP) are applied as a baseline. In order to enhance the assessment modeling performance and investigate the weights of the salient features, relevant features are extracted using Best Subset Selection (BSS). The results show that the proposed model using aGOP features outperforms the baseline. In addition, analysis of the relevant features extracted by BSS reveals that the selected aGOP features represent the salient variations of Korean learners of English. The results are also expected to be effective for automatic pronunciation error detection.

Automatic Generation of Pronunciation Variants for Korean Continuous Speech Recognition (한국어 연속음성 인식을 위한 발음열 자동 생성)

  • 이경님;전재훈;정민화
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.2
    • /
    • pp.35-43
    • /
    • 2001
  • Many speech recognition systems use a pronunciation lexicon with possibly multiple phonetic transcriptions for each word. The pronunciation lexicon is often created manually. This process requires a lot of time and effort, and furthermore, it is very difficult to maintain the consistency of the lexicon. To handle these problems, we present a model based on morphophonological analysis for automatically generating Korean pronunciation variants. By analyzing phonological variations frequently found in spoken Korean, we have derived about 700 phonemic contexts that trigger the multilevel application of the corresponding phonological process, which consists of phonemic and allophonic rules. In generating pronunciation variants, morphological analysis is performed first to handle variations of phonological words. According to the morphological category, a set of tables reflecting phonemic context is looked up to generate pronunciation variants. Our experiments show that the proposed model produces mostly correct pronunciation variants of phonological words. We then evaluated the usefulness of the pronunciation lexicon and the training phonetic transcriptions generated by the proposed system.

Multicriteria-Based Computer-Aided Pronunciation Quality Evaluation of Sentences

  • Yoma, Nestor Becerra;Berrios, Leopoldo Benavides;Sepulveda, Jorge Wuth;Torres, Hiram Vivanco
    • ETRI Journal
    • /
    • v.35 no.1
    • /
    • pp.89-99
    • /
    • 2013
  • The problem of the sentence-based pronunciation evaluation task is defined in the context of subjective criteria. Three subjective criteria (that is, the minimum subjective word score, the mean subjective word score, and first impression) are proposed and modeled with the combination of word-based assessment. Then, the subjective criteria are approximated with objective sentence pronunciation scores obtained with the combination of word-based metrics. No a priori studies of common mistakes are required, and class-based language models are used to incorporate incorrect and correct pronunciations. Incorrect pronunciations are automatically incorporated by making use of a competitive lexicon and the phonetic rules of students' mother and target languages. This procedure is applicable to any second language learning context, and subjective-objective sentence score correlations greater than or equal to 0.5 can be achieved when the proposed sentence-based pronunciation criteria are approximated with combinations of word-based scores. Finally, the subjective-objective sentence score correlations reported here are very comparable with those published elsewhere resulting from methods that require a priori studies of pronunciation errors.
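
Two ideas in this abstract lend themselves to a small sketch: combining word-level scores into a sentence-level score (minimum and mean), and measuring subjective-objective agreement with a correlation coefficient. The code below is an illustration under stated assumptions, not the paper's method: the function names are hypothetical, Pearson correlation is assumed as the agreement measure, and the "first impression" criterion is omitted because the abstract does not define it.

```python
from math import sqrt


def sentence_scores(word_scores):
    """Combine word-level scores into two sentence-level criteria."""
    if not word_scores:
        raise ValueError("a sentence needs at least one word score")
    return {
        "min": min(word_scores),
        "mean": sum(word_scores) / len(word_scores),
    }


def pearson(xs, ys):
    """Pearson correlation between two equal-length score lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# Hypothetical word-level scores for one sentence.
scores = sentence_scores([3, 5, 4])
```

In use, `pearson` would be applied to a list of human sentence ratings and the matching list of objective sentence scores, one pair per sentence.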

A Study on Reexamination of the syllable errors of nasal consonant ending for Chinese learners in the Korean language study (중국인 학습자 비음 종성 /ㄴ/, /ㅇ/ 음절의 발음 오류 재고 -한·중 음절 유형을 통하여-)

  • Zhang, Jian
    • Journal of Korean language education
    • /
    • v.28 no.1
    • /
    • pp.251-268
    • /
    • 2017
  • This study is based on differences in syllable types between Korean and Chinese pronunciation. For example, the nasal consonant endings 【n】 and 【ŋ】 exist in both Korean and Chinese. However, in experiential training, Chinese learners make errors in pronouncing Korean syllables with the nasal consonant endings 【n】 and 【ŋ】. In previous research, the analysis of pronunciation errors was often based on the phonological system and phoneme combination rules. In this study, however, the analysis is based on the differences between Korean and Chinese syllable categories in order to identify the cause of the pronunciation errors. The main findings indicate that, in Chinese pronunciation, the nasal consonant syllable rime and its 【back】 tongue vowel are combined with each other. However, this rule does not apply in Korean pronunciation. Therefore, the Korean syllabic types like "앤, 응, 옹, 앵, 은, 온, 언" also exist in the Chinese language. When Chinese learners pronounce these types of syllables, the rule combining the vowel and the nasal syllable rime is applied, which results in pronunciation errors.