• Title/Summary/Keyword: Syllable Number

Search Result 84, Processing Time 0.024 seconds

Context Based Real-time Korean Writing Correction for Foreigners (외국인 학습자를 위한 문맥 기반 실시간 국어 문장 교정)

  • Park, Young-Keun;Kim, Jae-Min;Lee, Seong-Dong;Lee, Hyun Ah
    • Journal of KIISE
    • /
    • v.44 no.10
    • /
    • pp.1087-1093
    • /
    • 2017
  • Educating foreigners in Korean language is attracting increasing attention with the growing number of foreigners who want to learn Korean or want to reside in Korea. Existing spell checkers mostly focus on native Korean speakers, so they are inappropriate for foreigners. In this paper, we propose a correction method for the Korean language that reflects the contextual characteristics of Korean and writing characteristics of foreigners. Our method can extract frequently used expressions by Koreans by constructing syllable reverse-index for eojeol bi-gram extracted from corpus as correction candidates, and generate ranked Korean corrections for foreigners with upgraded edit distance calculation. Our system provides a user interface based on keyboard hooking, so a user can easily use the correction system along with other applications. Our system improves the detection rate for foreign language users by about 45% compared to other systems in foreign language writing environments. This will help foreign users to judge and correct their own writing errors.

Hangul Encoding Standard based on Unicode (유니코드의 한글 인코딩 표준안)

  • Ahn, Dae-Hyuk;Park, Young-Bae
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.12
    • /
    • pp.1083-1092
    • /
    • 2007
  • In Unicode, two types of Hangul encoding schemes are currently in use, namely, the "precomposed modern Hangul syllables" model and the "conjoining Hangul characters" model. The current Unicode Hangul conjoining rules allow a precomposed Hangul syllable to be a member of a syllable which includes conjoining Hangul characters; this has resulted in a number of different Hangul encoding implementations. This unfortunate problem stems from an incomplete understanding of the Hangul writing system when the normalization and encoding schemes were originally designed. In particular, the extended use of old Hangul was not taken into consideration. As a result, there are different ways to represent Hangul syllables, and this cause problem in the processing of Hangul text, for instance in searching, comparison and sorting functions. In this paper, we discuss the problems with the normalization of current Hangul encodings, and suggest a single efficient rule to correctly process the Hangul encoding in Unicode.

Effect of syllable complexity on the visual span of Korean Hangul reading and its relation to reading abilities (한글 글자 유형이 시각 폭과 읽기 능력에 미치는 영향)

  • Choi, Youngon;Kim, Tae Hoon
    • Korean Journal of Cognitive Science
    • /
    • v.27 no.2
    • /
    • pp.325-353
    • /
    • 2016
  • The visual span refers to the number of letters that can be accurately recognized without moving one's eyes. The size of the visual span is affected by sensory factors such as perimetric complexity, crowding, and mislocation of letters. Korean Hangul utilizes rather unique alphabetic-syllabary writing system, quite different from English and Chinese writing systems. Due to this combinatorial nature of the script, the visual span for Hangul characters can also be affected by the letter type (e.g., CV vs CVCC). The present study examined the effect of syllable complexity on the visual span for Hangul by comparing letter recognition accuracy across four letter type conditions (C only, CV, CVC, and CVCC). We also aimed to determine the meaningful letter type(s) that is associated with differences in reading abilities in Korean. Using a trigram presentation method, we found that overall recognition accuracy declined as syllable complexity increased. However, the visual span for CVC type was greater than that for CV type, suggesting that the effect is not necessarily linear, and that there might be other factors affecting the visual span for these types of letters. C and CV type showed fairly strong positive correlations with reading comprehension, suggesting that these might be the meaningful units for measuring visual span in relating to reading abilities.

Statistical Information of Korean Dictionary to Construct an Enormous Electronic Dictionary (대용량 전자사전 구축을 위한 국어 대사전의 통계 정보)

  • Kim, Cheol-Su;Kim, Yang-Beom
    • The Journal of the Korea Contents Association
    • /
    • v.7 no.6
    • /
    • pp.60-68
    • /
    • 2007
  • There are various application areas of Language information processing such as information retrieval, morphological analysis, spell checker, voice recognition, character recognition, etc. In these language information processing areas, an electronic dictionary is essential. This thesis made researches on basic statistical information on the Korean dictionary and on the construction of electronic dictionary. The targets of analysis were the number of registered word in Korea dictionary, the entry number of registered word in electronic dictionary, the number of used syllables, the number of different syllables, the average length of entry, the distribution of part of speech and the number of used nodes to construct electronic dictionary using Trie, except for words including a archaic word or incomplete syllables. Total entry number of electronic dictionary is 361,980, the number of used syllables is 1,289,659, the average length of entries is 3.56 and the number of different syllables is 2,463. Theses informations would play a beneficial role in constructing an electronic dictionary and in processing Korean information.

A Study on Word Recognition using sub-model based Hidden Markov Model (HMM 부모델을 이용한 단어 인식에 관한 연구)

  • 신원호
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06c
    • /
    • pp.395-398
    • /
    • 1994
  • In this paper the word recognition using sub-model based Hidden Markov Model was studied. Phoneme models were composed of 61 phonemes in therms of Korean language pronunciation characteristic. Using this, word model was maded by serial concatenation. But, in case of this phoneme concatenation, the second and the third phoneme of syllable are overlapped in distribution at the same time. So considering this, the method that combines the second and the third phoneme to one model was proposed. And to prevent the increase in number of model, similar phonemes were combined to one, and finially, 57 models were created. In experiment proper model structure of sub-model was searched for, and recognition results were compared. So similar recognition results were maded, and overall recognition rates were increased in case of using parameter tying method.

  • PDF

The Korean Word Length Effect on AudWord Recognition (청각단어 재인에서 나타난 한국어 단어 길이 효과)

  • Choi Wonil;Nam Kichun
    • MALSORI
    • /
    • no.44
    • /
    • pp.33-46
    • /
    • 2002
  • This study was conducted to examine the effect of word length on auditory word recognition. Word length can be defined by several sublexical units, such as letters, phonemes, syllables, etc. To find out which sublexical units are influential in auditory word recognition, the auditory lexical decision task was used. In Experiment 1, we examined the partial correlation between the speed of reaction time and the number of sublexical units, and in Experiment 2, we executed ANOVA to find out which sublexical length variable was an influential unit. Through these two experiment, we concluded syllable length was the most important variable on auditory word recognition.

  • PDF

A Study af Speech Rate and Fluency in Narmal Speakers (정상 성인의 말속도 및 유창성 연구)

  • Shin, Moon-Ja;Han, Sook-Ja
    • Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.159-168
    • /
    • 2003
  • The purpose of this study was to assess the speech rate, fluency and the type of dysfluencies of normal adults in order to provide a basic data of normal speaking. The number of subjects of this study were 30(14 females and 16 males), and their ages ranged 17 to 36. The rate was measured as syllables per minute (SPM). The speech rates in reading ranged 273-426 with a mean of 348 SPM and in speaking ranges 118-409 (mean=265). The average of their fluencies was 99.1% in reading and 96.9% in speaking. The rater reliability of speech rate in the data assessed by video was very high (r=0.98) and the rater reliability of speech fluency was moderately high (r=0.67). The disfluency types were also analysed from 150 disfluency episodes. Syllable repetition and word interjection were the most common disfluent types.

  • PDF

A Study of N-Insertion Preferences in Korean (선호도 조사를 통한 ㄴ첨가 현상의 실현 양상 연구)

  • Kook, Kyungnk-A;Kim, Ju-Won;Lee, Ho-Young
    • MALSORI
    • /
    • no.53
    • /
    • pp.37-60
    • /
    • 2005
  • A Study of N-Insertion Preferences in KoreanKyung-A Kook, Ju-Won Kim, Ho-Young LeeSince n-insertion is not an obligatory process in Korean, it is necessary to investigate what factors influence n-insertion preferences and whether n-insertion preferences have been changed over time. To find answers to these questions, an n-insertion preference test using a questionnaire was conducted. 183 words were selected for this test and 167 subjects participated in the test. The results of this test show that the n-insertion preferences were influenced by the speakers' age, the number and structure of the syllable, word class, phonetic environments, and familiarity. It is suggested that the results of this test should be incorporated into the Principles of Standard Pronunciation and in the Grand Dictionary of Standard Korean.

  • PDF

Errors of English stress by Korean speakers (한국인의 영어 강세 오류의 특징)

  • Park, Soon-Boak
    • English Language & Literature Teaching
    • /
    • v.10 no.3
    • /
    • pp.177-190
    • /
    • 2004
  • The purpose of this paper IS to investigate the aspects of errors of English stress by Korean students. In this experimental study, 17 students participated and read 120 words which are divided into three types-the beginning, the middle, and the advanced-according to the level of words. As a result of acoustical judgement, there were a greater number of errors In the advanced level of words, and the more syllables the words have, the more errors occurred, tins means Korean students who learn English as a second language have trouble realizing the right stress in words with larger numbers of syllables and the more advanced level. Furthermore it is interesting that Korean students imposed the primary stress on the second syllable when producing words with stress in the first, third and forth syllables.

  • PDF

Constraints of English Poetic Meter (영시 정형율의 제약들 - Iambic을 중심으로 -)

  • Sohn Ilkwon
    • MALSORI
    • /
    • no.42
    • /
    • pp.71-88
    • /
    • 2001
  • This study is on the constraints of English Poetic Meter. In English poems, the metrical pattern doesn't always match the linguistic stress on the lines. These mismatches are found differently among the poets. The peaks mismatched with the weak metrical position are divided into the two ways according as they are adjacent to the boundary of a phonological domain or not. PAF and $^*UV$] are suggested for the mismatched peak which are not adjacent to the boundary of a phonological domain ; $^*Peak$] and BT for the mismatched peak which are adjacent to the boundary of a phonological domain. For the lexical stress mismatched with the weak metrical position, $^*W{\;}{\Rightarrow}{\;}Strength$ is set up by the concept of the strong syllable. $MPS{\;}{\Rightarrow}{\;}\Phi_{max}$ for the metrical position size can replace the resolution which is used to control the number of syllables in English poems. These constraints show the different hierarchies among the poets.

  • PDF