• Title/Summary/Keyword: Transcriptions

Search Result 95, Processing Time 0.03 seconds

Study on Efficient Generation of Dictionary for Korean Vocabulary Recognition (한국어 음성인식을 위한 효율적인 사전 구성에 관한 연구)

  • Lee Sang-Bok;Choi Dae-Lim;Kim Chong-Kyo
    • Proceedings of the KSPS conference
    • /
    • 2002.11a
    • /
    • pp.41-44
    • /
    • 2002
  • This paper is related to the enhancement of speech recognition rate using enhanced pronunciation dictionary. Modern large vocabulary, continuous speech recognition systems have pronunciation dictionaries. A pronunciation dictionary provides pronunciation information for each word in the vocabulary in phonemic units, which are modeled in detail by the acoustic models. But in most speech recognition system based on Hidden Markov Model, actual pronunciation variations are disregarded. Without the pronunciation variations in the speech recognition system, the phonetic transcriptions in the dictionary do not match the actual occurrences in the database. In this paper, we proposed the unvoiced rule of semivowel in allophone rules to pronunciation dictionary. Experimental results on speech recognition system give higher performance than existing pronunciation dictionaries.

  • PDF

Review and Interpretations of Plimpton 322 (고대 바빌로니아 Plimpton 322의 역사적 고찰)

  • Kim, Min-Kyeong
    • Journal for History of Mathematics
    • /
    • v.20 no.1
    • /
    • pp.45-56
    • /
    • 2007
  • The aims of the study were to review the transcriptions of the famous cuneiform tablet 'Plimpton 322' and interpret the meanings of the numbers. Since the tablet was found, many scholars tried to interpretate the relation among numbers. Neugebauer & Sacks, Buck, and Robson's finding are reviewed. This tablet must be the most well known and taken as an important role to complete a proof of the Pytagoras' theorem before the development of Greek Mathematics.

  • PDF

Consonant Confusions Matrices in Adults with Dysarthria Associated with Cerebral Palsy (뇌성마비로 인한 마비말장애 성인의 자음 오류 분석)

  • Lee, Youngmee;Sung, JeeEun;Sim, HyunSub
    • Phonetics and Speech Sciences
    • /
    • v.5 no.1
    • /
    • pp.47-54
    • /
    • 2013
  • The aim of this study was to analyze consonant articulation errors produced by 90 speakers with cerebral palsy (CP). Phonetic transcriptions were made for 37 single-word utterances containing 70 phonemes: 48 initial consonants and 22 final consonants. Errors of substitution, omission, and distortion were analyzed using a confusion matrix paradigm showing the visualization of error patterns. Results showed that substitution errors in initial and final consonants were most frequent, followed by omission and distortion. Consonant omission occurred more frequently on final consonants. In both initial and final consonants, the within-place errors were more prominent than the within-manner errors. The current results suggest that consonant confusion matrices for dysarthric speech may provide useful information for evaluating speech intelligibility and developing automatic speech recognition system of adults with CP associated dysarthria.

Corpus-based evaluation of French text normalization (코퍼스 기반 프랑스어 텍스트 정규화 평가)

  • Kim, Sunhee
    • Phonetics and Speech Sciences
    • /
    • v.10 no.3
    • /
    • pp.31-39
    • /
    • 2018
  • This paper aims to present a taxonomy of non-standard words (NSW) for developing a French text normalization system and to propose a method for evaluating this system based on a corpus. The proposed taxonomy of French NSWs consists of 13 categories, including 2 types of letter-based categories and 9 types of number-based categories. In order to evaluate the text normalization system, a representative test set including NSWs from various text domains, such as news, literature, non-fiction, social-networking services (SNSs), and transcriptions, is constructed, and an evaluation equation is proposed reflecting the distribution of the NSW categories of the target domain to which the system is applied. The error rate of the test set is 1.64%, while the error rate of the whole corpus is 2.08%, reflecting the NSW distribution in the corpus. The results show that the literature and SNS domains are assessed as having higher error rates compared to the test set.

A Diachronic Study of Japanese Dakuon - through the Analysis of Korean Source-Materials in the 15-18th Centuries - (일본어 탁음의 비음성의 변천 과정 - 15-18세기의 일본어 전사 자료를 이용하여 -)

  • Jin Nam-Taek
    • MALSORI
    • /
    • no.48
    • /
    • pp.35-55
    • /
    • 2003
  • The aim of this study is to clarify the process of the sound changes of Japanese consonants (Dakuons) in the analysis of the transcriptions of Korean Source-Materials (i.e. Japanese textbooks for Korean and the records of travel in Japan) written in the 15-l8th centuries with the Korean writing system. Especially these records of travel in Japan are meaningful in that the process of change of Dakuon is shown in detail. The results are as follows. 1) In the 15th century, all Dakuons /g d z b/ had nasality. 2) The nasality of /z/ and /b/ disappeared in the 16th century. 3) The nasality of /d/ disappeared in the late 17th century.

  • PDF

An ESL Teacher's Perspective on Recasts: A Qualitative Exploration of "When" and "How"?

  • Byun, Ji-Hyun;Kayi-Aydar, Hayriye
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.1-18
    • /
    • 2010
  • Recasts, which are defined as implicit types of corrective feedback, have been the focus of numerous SLA researchers for more than a decade. A range of classroom-based observational and experimental research studies have explored how and when language teachers provide recasts to learners' ill-formed utterances and aimed to understand the role of recasts in language acquisition or learning. On the basis of previous studies on recasts, our study investigated when an ESL teacher provided recasts and how recasts were provided in his class. The research questions were as follows: (1) When does an ESL teacher provide recasts? (2) How does the teacher provide recasts? The data came from observations of one ESL classroom as well as consecutive-semi structured interviews with the teacher. The data analysis included transcriptions of teacher-student interactions in the target setting and categories of recasts according to the linguistic phenomena, which prompted recasting. Based on the findings, practical suggestions for ESL teachers were provided. [156 words].

  • PDF

Korean LVCSR for Broadcast News Speech

  • Lee, Gang-Seong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.2E
    • /
    • pp.3-8
    • /
    • 2001
  • In this paper, we will examine a Korean large vocabulary continuous speech recognition (LVCSR) system for broadcast news speech. The combined vowel and implosive unit is included in a phone set together with other short phone units in order to obtain a longer unit acoustic model. The effect of this unit is compared with conventional phone units. The dictionary units for language processing are automatically extracted from eojeols appearing in transcriptions. Triphone models are used for acoustic modeling and a trigram model is used for language modeling. Among three major speaker groups in news broadcasts-anchors, journalists and people (those other than anchors or journalists, who are being interviewed), the speech of anchors and journalists, which has a lot of noise, was used for testing and recognition.

  • PDF

The characteristics of phonology in hearing impaired children depending age and hearing threshold (연령과 청력역치에 따른 청각장애아동의 음운적 특성)

  • Seo, Yeong-Ran;Kim, Jin-Sook;Ko, Do-Heung
    • Speech Sciences
    • /
    • v.8 no.2
    • /
    • pp.7-22
    • /
    • 2001
  • This research with 40 hearing impaired children was performed to find out the characteristics of the phonological processes for each age, hearing loss degree and aided threshold degree through the transcriptions. The phonological processes of hearing impaired children are similar to those of normal children with a peculiar type of patterns in phonological processes. The results show that: (1) Between 5 and 6 year old groups the phonological processes were significantly different in palatal backing, glottal replacement and frication; between 6 and 7 year old groups the phonological processes were significantly different in velar fronting, labialization, alveolization and labial assimilation; between 7 and 8 year old groups the phonological processes, labialization, alveolization and alveolar assimilation, however 8 year old group showed more phonological processes than 7 year old group. (2) Between moderately-severe and severe hearing impaired groups, phonological processes were significantly different in the omissions of postvocalic, nasal and velar, stopping and stop assimilation. The differences of severe and profound groups were not found at all. (3) Aided hearing thresholds did not show any significant difference.

  • PDF

The relationship between segmental production by Japanese learners of Korean and pronunciation evaluation (일본인 한국어 학습자의 분절음 실현과 발음 평가의 상관성)

  • Hong, Hyejin;Ryu, Hyuksu;Chung, Minhwa
    • Phonetics and Speech Sciences
    • /
    • v.6 no.4
    • /
    • pp.101-108
    • /
    • 2014
  • This study investigates the effects of Japanese learners' Korean segmental production on pronunciation evaluation by Korean native raters. Read speech from 24 learners whose native language is Japanese are transcribed at the phonemic level, and confusion matrices are generated based on the phonemic transcriptions. The deviance from the canonical pronunciation found in the learners' speech is analyzed in terms of phoneme substitutions, vowel insertions, and consonant deletions. Each learner's pronunciation is rated impressionistically by 5 Korean native raters. The result shows that the deviance from the canonical pronunciation is strongly correlated with the pronunciation evaluation scores. Especially, the rates of phoneme substitutions and vowel insertions which are very strongly correlated with the pronunciation evaluation scores.

Production of English final stops by Korean speakers

  • Kim, Jungyeon
    • Phonetics and Speech Sciences
    • /
    • v.10 no.4
    • /
    • pp.11-17
    • /
    • 2018
  • This study reports on a production experiment designed to investigate how Korean speaking learners of English produce English forms ending in stops. In a repetition experiment, Korean participants listened to English nonce words ending in a stop and repeated what they heard. English speakers were recruited for the same task as a control group. The experimental result indicated that the transcriptions of the Korean productions by English native speakers showed vowel insertion in only 3% of productions although the pronunciation of English final stops showed that noise intervals after the closure of final stops were significantly longer for Korean speakers than for English speakers. This finding is inconsistent with the loanword data where 49% of words showed vowel insertion. It is also not compatible with the perceptual similarity approach, which predicts that because Korean speakers accurately perceive an English final stop as a final consonant, they will insert a vowel to make the English sound more similar to the Korean sound.