• Title/Summary/Keyword: Out-Of-Vocabulary

Search Result 153, Processing Time 0.022 seconds

Efficient Vocabulary Optimization Management using VCOR (VCOR를 이용한 효율적인 어휘 최적화 관리)

  • Oh, Sang-Yeob
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.10
    • /
    • pp.1436-1443
    • /
    • 2010
  • In vocabulary recognition system has it's bad points of processing vocabulary unseen triphone and then no got distribution of confidence measure by cannot normalization. According to this problem to improve suggested VCOR(Version Control for Out-of Rejection) system by out-of vocabulary rejection algorithm use vocabulary management optimization and then phone data search support. In VCOR system to provide vocabulary information efficiently offering for user's vocabulary information using extend facet classification that improved for vocabulary measure management function offering accuracy of recognition for vocabulary. In this paper proposed system performance as a result of represent vocabulary dependence recognition rate of 97.56%, vocabulary independence recognition rate of 96.23%.

Korean broadcast news transcription system with out-of-vocabulary(OOV) update module (한국어 방송 뉴스 인식 시스템을 위한 OOV update module)

  • Jung Eui-Jung;Yun Seung
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • spring
    • /
    • pp.33-36
    • /
    • 2002
  • We implemented a robust Korean broadcast news transcription system for out-of-vocabulary (OOV), tested its performance. The occurrence of OOV words in the input speech is inevitable in large vocabulary continuous speech recognition (LVCSR). The known vocabulary will never be complete due to the existence of for instance neologisms, proper names, and compounds in some languages. The fixed vocabulary and language model of LVCSR system directly face with these OOV words. Therefore our Broadcast news recognition system has an offline OOV update module of language model and vocabulary to solve OOV problem and selects morpheme-based recognition unit (so called, pseudo-morpheme) for OOV robustness.

  • PDF

The Analysis of Relevance of Vocabulary Used in the 'Water' unit of Chemistry I Textbook (화학 I 교과서의 "물"단원에 사용된 어휘의 적절성 분석)

  • Kim, Ji-Young;Cho, Mi-Ju;Goo, Mi-Na;Park, Jong-Seok
    • Journal of the Korean Chemical Society
    • /
    • v.54 no.4
    • /
    • pp.471-478
    • /
    • 2010
  • This study analyzed the vocabulary level in the 'Water' unit of chemistry I textbook. It also analyzed its relevance to the 11th graders' vocabulary level. The main tool for analyzing vocabulary level was SWA(Science Word Analysis) program which was referenced the Standard Korean Dictionary and Graduated Vocabulary of Korean Language Education. The results in this study turned out to be as follows: The distribution of scientific vocabulary level increased from Level-1 to Level-3 and showed a tendency to decrease from Level-3 until Level-5. The average percentage of Out of level is the largest as 37%. The highest percentage of non-scientific vocabulary was Level-1. The distribution of non-scientific vocabulary level decreased progressively. The Level-5 and Out of level are used 18% averagely. So, there are 6 vocabularies of Level-5 and 82 vocabularies of extra-level inappropriate in scientific vocabularies. And there are 53 vocabularies of Level-5 and 145 vocabularies of Out of level inappropriate in non-scientific vocabularies. Therefore, the overall state of textbooks for grade 11 students are reasonable. But there are a great many vocabularies inappropriate for them. Those should be used minimum, and to be changed to the 1-4 of level vocabulary as stated in the student's level of understanding of appropriate vocabulary.

A Study on Out-of-Vocabulary Rejection Algorithms using Variable Confidence Thresholds (가변 신뢰도 문턱치를 사용한 미등록어 거절 알고리즘에 대한 연구)

  • Bhang, Ki-Duck;Kang, Chul-Ho
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.11
    • /
    • pp.1471-1479
    • /
    • 2008
  • In this paper, we propose a technique to improve Out-Of-Vocabulary(OOV) rejection algorithms in variable vocabulary recognition system which is much used in ASR(Automatic Speech Recognition). The rejection system can be classified into two categories by their implementation method, keyword spotting method and utterance verification method. The utterance verification method uses the likelihood ratio of each phoneme Viterbi score relative to anti-phoneme score for deciding OOV. In this paper, we add speaker verification system before utterance verification and calculate an speaker verification probability. The obtained speaker verification probability is applied for determining the proposed variable-confidence threshold. Using the proposed method, we achieve the significant performance improvement; CA(Correctly Accepted for keyword) 94.23%, CR(Correctly Rejected for out-of-vocabulary) 95.11% in office environment, and CA 91.14%, CR 92.74% in noisy environment.

  • PDF

A Study on the Korean Language Acquisition of Children from Chinese Families in Korea: Focusing on Pronunciation and Vocabulary (재한중국인 가정 아동의 한국어 습득에 관한 연구 -발음과 어휘를 중심으로-)

  • Li, Yin
    • Journal of Korean language education
    • /
    • v.29 no.3
    • /
    • pp.165-196
    • /
    • 2018
  • This study examined the language acquisition of children from Chinese families in Korea under the bilingual background of Korean and Chinese, focusing on pronunciation and vocabulary. First, in the analysis of pronunciation acquisition, children correctly realized the lenition, aspirated sound, glottalization, palatalization, nasalization, and liquidization while the realization of pronunciation rules for unfamiliar words was low. There were also errors caused when the application principles of pronunciation were not accurately understood or they were not partially acquired. Second, in the analysis of vocabulary acquisition, the acquisition of receptive vocabulary was in the order of verb, noun, and adjective while they clearly understood vocabulary used in the actual relationships with school, family, and peers. In the acquisition of productive vocabulary, they showed the 'meaning-centered principle' of learning the meaning of vocabulary first and then learning its form afterwards. The amount of study and exposure to Korean language had effects on the improvement of vocabulary. Even though this study focused on the errors and characteristics in the acquisition process of Korean pronunciation and vocabulary for children from Chinese families in Korea, it could not clearly find out which one would have greater effects on the acquisition of Korean language. However, lots of exchanges and experiences with surrounding environment and peer group had great effects on the language acquisition and language acquisition transfer of children.

Vocabulary assessment based on construct definition in task-based language learning (과제 중심 학습에서 어휘 능력의 구성요소와 평가)

  • Kim, Yeon-Jin
    • English Language & Literature Teaching
    • /
    • v.12 no.3
    • /
    • pp.123-145
    • /
    • 2006
  • The purpose of this study is to propose an efficient vocabulary assessment model in task-based language learning and to verify the viability of this assessment model. Bachman and Palmer (1996) pointed out the fact that many language tests focus on just one of the areas of language knowledge. However, researchers suggested that it is necessary to acknowledge the needs of several analytic scales, which can provide separate ratings for different components of the language ability to be tested. Although there were many studies which tried to evaluate the various aspects of vocabulary ability, most of them measured only one or two factors. Based on previous research, this study proposed an assessment model of general construct of vocabulary ability and tried to measure vocabulary ability in four separate areas. The subjects were two classes of university level Korean EFL students. They participated in small group discussion via synchronous CMC. One class used a lexically focused task, which was proposed by Kim and Jeong (2006) and the other class used a non-lexically focused task. The results showed that the students with a lexically focused task significantly outperformed those with a non-lexically focused task in overall vocabulary ability as well as four subdivisions of vocabulary ability. In conclusion, the assessment model of separate ratings is a viable measure of vocabulary ability and this can provide elaborate interpretation of vocabulary ability.

  • PDF

The effects of corpus-based vocabulary tasks on high school students' English vocabulary learning and attitude (코퍼스를 기반으로 한 어휘 과제가 고등학생의 영어 어휘 학습과 태도에 미치는 영향)

  • Lee, Hyun Jin;Lee, Eun-Joo
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.239-265
    • /
    • 2010
  • This study investigates the effects of corpus-based vocabulary tasks on the acquisition of English vocabulary in an attempt to explore the influence of corpus use on EFL pedagogy. For this to be realized, a total of 40 Korean high school students participated in the study over a 4-week period. An experimental group used a set of corpus-based tasks for vocabulary learning, whereas a control group carried out a traditional task (i.e., the L1-L2 translation) for vocabulary learning. To assess learning gains, the students were asked to complete the pre- and post-treatment tests measuring the word form, meaning, and use aspects of target lexical items. Results of the study indicate that in the experimental group the corpus-based vocabulary tasks were beneficial for the learning of word forms and use. In particular, corpus-based benefits were greatest in the low-proficiency EFL learners' collocational aspects of vocabulary use. On the other hand, in the control group, the traditional vocabulary tasks benefited the meaning aspects of target vocabulary items the most. In addition, survey results revealed that most students were positive about the corpus-based learning experience although some expressed reservations about the heavy cognitive load and the time-consuming nature of the analysis of corpus data primarily due to learners' lack of language proficiency.

  • PDF

Reliability measure improvement of Phoneme character extract In Out-of-Vocabulary Rejection Algorithm (미등록어 거절 알고리즘에서 음소 특성 추출의 신뢰도 측정 개선)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence
    • /
    • v.10 no.6
    • /
    • pp.219-224
    • /
    • 2012
  • In the communication mobile terminal, Vocabulary recognition system has low recognition rates, because this problems are due to phoneme feature extract from inaccurate vocabulary. Therefore they are not recognize the phoneme and similar phoneme misunderstanding error. To solve this problem, this paper propose the system model, which based on the two step process. First, input phoneme is represent by number which measure the distance of phonemes through phoneme likelihood process. next step is recognize the result through the reliability measure. By this process, we minimize the phoneme misunderstanding error caused by inaccurate vocabulary and perform error correction rate for error provrd vocabulary using phoneme likelihood and reliability. System performance comparison as a result of recognition improve represent 2.7% by method using error pattern learning and semantic pattern.

Error Correction Methode Improve System using Out-of Vocabulary Rejection (미등록어 거절을 이용한 오류 보정 방법 개선 시스템)

  • Ahn, Chan-Shik;Oh, Sang-Yeob
    • Journal of Digital Convergence
    • /
    • v.10 no.8
    • /
    • pp.173-178
    • /
    • 2012
  • In the generated model for the recognition vocabulary, tri-phones which is not make preparations are produced. Therefore this model does not generate an initial estimate of parameter words, and the system can not configure the model appear as disadvantages. As a result, the sophistication of the Gaussian model is fall will degrade recognition. In this system, we propose the error correction system using out-of vocabulary rejection algorithm. When the systems are creating a vocabulary recognition model, recognition rates are improved to refuse the vocabulary which is not registered. In addition, this system is seized the lexical analysis and meaning using probability distributions, and this system deactivates the string before phoneme change was applied. System analysis determine the rate of error correction using phoneme similarity rate and reliability, system performance comparison as a result of error correction rate improve represent 2.8% by method using error patterns, fault patterns, meaning patterns.

Architectural Vocabulary Study of SeoGwolYeongGeonDoGamUiGwe(西闕營建都監儀軌) (1) - It is an attaching importance to with a timber transcriptions of Korean - ("서궐영건도감의궤"의 목재류 어휘 분석 연구)

  • Oh, Chang-Myong;Son, Hui-Ha;Cheon, Deuk-Youm
    • Journal of architectural history
    • /
    • v.16 no.1
    • /
    • pp.29-48
    • /
    • 2007
  • This study is an investigation that explication of the reading method of transcription marking, and explains what it means and where to use transcription marking of architectural vocabulary. focusing on timber class vocabulary from on SeoGwolYeongGeonDoGamUiGwe(${\ulcorner}$西闕營建都監儀軌(1831)${\lrcorner}$). This study is reviewed sequentially classifying items according to timber class, stone class, iron class architectural vocabulary. This treatise, named 'Architectural Vocabulary Study (1)', is mainly focused on making out a transcription marking of timber class vocabulary on the documents. But, because of the properties of the UiGwe(儀軌), there are some different expressions used for the same vocabulary. Therefore, the different expressions are referenced according to vocabulary used from the 17th century to the 20c century. This study lists timber class architectural vocabulary of transcription marking on SeoGwolYeongGeonDoGamUiGwe(${\ulcorner}$西闕營建都監儀軌(1831)$\lrcorner$). This study also explicates and explains the meaning of them. In advance, this study compares them with the vocabulary of national language in the middle ages and modern times. If we could explicate the transcription marking of the documents as it is, we can't only recover plenty of vocabulary related with characteristic architecture in the age of later Chosun Dynasty, but also correct wrongly used vocabulary nowadays. With the results, we can standardize and adjust vocabulary use of Korean traditional architecture. In advance, we can correct errors of spelling and mistaken explanation in the Korean Encyclopedia.

  • PDF