• Title/Summary/Keyword: Chinese dictionary

Search Result 63, Processing Time 0.024 seconds

중국 코퍼스와 인터넷을 이용한 중한사전 표제어의 오류 연구 - F2-1을 중심으로

  • Baek, Jong-In
    • 중국학논총
    • /
    • no.63
    • /
    • pp.47-64
    • /
    • 2019
  • 当今在韩国流通的中韩词典收词颇多, 但词典里翻开哪已叶不难发现令人莫名其妙的词汇, 而且这些词汇当中有的甚至连汉语大词典里都找不到. 我们发现这些词汇里往往出现解释有误的问题. 本文主要探讨了这些解释有误词汇. 为此, 先从中韩词典里筛选出在现代汉语语料库中出现的次数少于十次的词汇. 我们认为此文里筛选出的这些词汇很可能不太正规或现在不怎幺使用. 为了使这种推测能得到更准确的印证, 作者在百度网上又检索了是否出现它们的用例, 之后, 就发现这些词汇确实存在各种问题, 需要校正这些解释有误的词汇. 本文以F2-1部分一千五百个词条为研究对象进行了适当性调查. 通过这次研究发现F2-1部分低频率词条有348个词, 其中45个词有各种问题. 值得探讨的是在汉韩词典里对这些低频率词条的说明出现不少错误, 许多词汇根本不适合被收录到词典里. 我们把这些带错误的词汇分成三各部分 : 1. 词汇解释有误, 2. 漏意味项, 3. 其他错误, 进行讨论. 我们将要继续研究其他项目的词条. 希望这些研究对中韩词典的编辑有所帮助.

Optimizing Multiple Pronunciation Dictionary Based on a Confusability Measure for Non-native Speech Recognition (타언어권 화자 음성 인식을 위한 혼잡도에 기반한 다중발음사전의 최적화 기법)

  • Kim, Min-A;Oh, Yoo-Rhee;Kim, Hong-Kook;Lee, Yeon-Woo;Cho, Sung-Eui;Lee, Seong-Ro
    • MALSORI
    • /
    • no.65
    • /
    • pp.93-103
    • /
    • 2008
  • In this paper, we propose a method for optimizing a multiple pronunciation dictionary used for modeling pronunciation variations of non-native speech. The proposed method removes some confusable pronunciation variants in the dictionary, resulting in a reduced dictionary size and less decoding time for automatic speech recognition (ASR). To this end, a confusability measure is first defined based on the Levenshtein distance between two different pronunciation variants. Then, the number of phonemes for each pronunciation variant is incorporated into the confusability measure to compensate for ASR errors due to words of a shorter length. We investigate the effect of the proposed method on ASR performance, where Korean is selected as the target language and Korean utterances spoken by Chinese native speakers are considered as non-native speech. It is shown from the experiments that an ASR system using the multiple pronunciation dictionary optimized by the proposed method can provide a relative average word error rate reduction of 6.25%, with 11.67% less ASR decoding time, as compared with that using a multiple pronunciation dictionary without the optimization.

  • PDF

A Comparative Study of the Trisyllabic Words with same form-morpheme and same meaning in Modern Chinese and the Trisyllabic Korean Words Written in Chinese Characters with same form-morpheme and same meaning (현대 중국어의 삼음사(三音詞)와 현용 한국 삼음절(三音節) 한자어(漢字語)의 동형(同形) 동소어(同素語) 비교 연구)

  • CHOE, GEUM DAN
    • Cross-Cultural Studies
    • /
    • v.25
    • /
    • pp.743-773
    • /
    • 2011
  • In this research, the writer has done a comparative analysis of 4,791 trisyllabic modern Chinese vocabularies from "a dictionary for trisyllabic modern Chinese word" and the corresponding Korean words written in Chinese characters out of 170,000 vocabularies hereupon that are collected in "new age new Korean dictionar y". Aa a result, we have the total 407 pairs of corresponding group with the following 3 types: 1) Chinese : Korean 3(2) : 3 syllable Chinese characters with completely same form-morpheme and same meaning, use, class (376pairs, 92.38% of 407), 2) Chinese : Korean 3 : 3 syllable Chinese characters with completely same form-morpheme and partly same meaning, use, class (18pairs, 4.42% of 407), 3)Chinese : Korean 3 : 3 syllable Chinese characters with completely same form-morpheme and different meaning, use, class (13pairs, 3.19% of 407).

A study on the Chinese characters originated in Japanese industrial standard (JIS * 0212) (일본공업규격 '정보교환용한자부호-보조한자'에 포함된 일본한자에 대한 연구)

  • 이춘택
    • Journal of Korean Library and Information Science Society
    • /
    • v.19
    • /
    • pp.59-81
    • /
    • 1992
  • This study investigates Japanese-made Chinese Characters in JIS X 0212-1990(Code of the Su n.0, pplementary Japanese Graphic Character Set for Information Interchange). As a results of detailed investigation, it is found that the number of Japanese-made Chinese Characters in su n.0, pplementary set reaches to 69 characters. Among them, 29 characters are not listed even in the best known chinese character dictionary [대한화사전]. 30 characters are found in the chinese character dictionaries published in Korea, while 39 characters are not found in any of those dictionaries. The distinctive characteristic of Japanese-made Chinese characters is that those chinese characters are made in order to name the things, such as fishes, birds, trees, which do not have Chinese-made Chinese Characters.

  • PDF

A note for Sino-Korean terminology of mathematics (수학에 쓰이는 한자말에 대한 소고)

  • Her, Min
    • Communications of Mathematical Education
    • /
    • v.30 no.2
    • /
    • pp.121-138
    • /
    • 2016
  • Most of elementary and secondary school mathematical terms in Korean are Sino-Korean words. We check Chinese characters relating to such Sino-Korean words by using Chinese dictionaries, and critically judge how much we can understand Sino-Korean words by Chinese characters. Through this search, we classify Sino-Korean words into three categories; words which can be understood by Chinese characters, words which can not be understood by Chinese characters, words which are misunderstood by Chinese characters.

On the pronunciation of Hanja based on Gujang Sansul Eumeui (구장산술음의에 비추어본 한자의 독음에 관한 논의)

  • Koh, Youngmee;Ree, Sangwook
    • Journal for History of Mathematics
    • /
    • v.29 no.3
    • /
    • pp.147-155
    • /
    • 2016
  • Ancient books from East Asia, especially, Korea, China and Japan, are all written in Chinese. Ancient mathematical books like 九章算術(Gujang Sansul in Korean sound, Jiuzhang Suanshu in Chinese) is not exceptional and also was written in Chinese. The book 九章算術音義(Gujang Sansul Eumeui in Korean, Jiuzhang Suanshu Yinyi in Chinese), a dictionary-like book on 九章算術was published by official 李籍(Lǐ Jí) of 唐(Tang) dynasty (AD 618-907). We discuss how to pronounce Chinese characters based on 九章算術音義. To do so, we compare the pronunciation of the characters used in the words which are explained in 九章算術音義, to those of the current Korean and Chinese. Surprisingly, the pronunciations of the Chinese characters are almost all accordant with those of both Korean and Chinese.

A Study on Error Analysis of Words Used in Shiji Liezhuan Presented in the Great Chinese-Korean Dictionary (『한한대사전(漢韓大辭典)』에 수록된 『사기(史記)·열전(列傳)』 관련어휘 오류연구(誤謬硏究))

  • Choi, Tae-Hoon
    • Cross-Cultural Studies
    • /
    • v.40
    • /
    • pp.213-238
    • /
    • 2015
  • This article attempts to correct errors in five words related to Shiji (The Grand Scribe's Records) Liezhuan (A Series of Biographies), which are presented in the Great Chinese-Korean Dictionary. The author analyses the problems with meaning interpretations of three words and additional meaning interpretations of two words. The main points of the study are presented in the following. First, in relation to the error correction in meaning interpretation, this study finds out that the explanations of "jiayu," "jiaochi," and "guancai" in the Great Chinese-Korean Dictionary are incorrect. Most of the cases include plausible interpretations of the words that are likely to cause readers to be confused with the meanings. Each of the words should be interpreted as "lend${\rightarrow}$give," "arrangement${\rightarrow}$new decoration, ornamentation, or embellishment after removing old one," and "accept something carefully or accept something after inspection${\rightarrow}$look over carefully or search for something." Second, as for the supplementary correction, this study points out that the explanations of "xiaoshi" and "shennian" are not sufficient. The following meanings for each word should be added, including "display skills" and " be trapped inside one's own mind." Furthermore, when comparing with the different translation versions by scholars at home and abroad, we can come to a following conclusion. The interpretations made by Zheng, Fan-Zhen are the most accurate for the "jiayu" item. With respect to the "jiaochi" item, the interpretations given by Piao, Yi- Feng; Wang, Li-Qi; Yang, Zhong-Xian; and Hao, Zhi-Da are relatively appropriate. The "guancai" item is adequately interpreted by Piao, Yi-Feng and Wang, Li-Qi. In the meaning interpretation of the "xiaoshi," Jin, Yuan- Zhong gave correct explanations. In addition, it is considered that Wang, Zhong provided the most ideal translations for the item "shennian."

A Study on Construction and Implementation of Web education System with Chinese conversion rule set (중국어 규칙변환 웹 교육시스템 설계 및 구현에 관한 연구)

  • Lee, Ji Hyun;Lee, Eun Ryoung
    • Journal of Digital Contents Society
    • /
    • v.17 no.4
    • /
    • pp.227-234
    • /
    • 2016
  • When Chinese character used in Korea, so did the characters' pronunciation, so many Korean Chinese characters today have similar pronunciation with Chinese, but since Korean and Chinese pronunciations were preserved and developed in different alphabets, the written letter of the pronunciation also differs. This study on Chinese education, has constructed and implemented an easy way to study Chinese pronunciations by creating conversion rule set between Chinese pronunciation, Chinese Hanyu latin Pinyin and Korean chinese character pronunciation consisting of an initial sound, a medial vowel, and a final consonant. This study has established web version and application version of this conversion rule set education system to enhance Chinese education.

Integrated Char-Word Embedding on Chinese NER using Transformer (트랜스포머를 이용한 중국어 NER 관련 문자와 단어 통합 임배딩)

  • Jin, ChunGuang;Joe, Inwhee
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.415-417
    • /
    • 2021
  • Since the words and words in Chinese sentences are continuous and the length of vocabulary is huge, Chinese NER(Named Entity Recognition) always based on character representation. In recent years, many Chinese research has been reconsidered how to integrate the word information into the Chinese NER model. However, the traditional sequence model has complex structure, the slow inference speed, and an additional dictionary information is needed, which is difficult to implement in the industry. The approach in this paper has the state of the art and parallelizable, which is integrated the char-word embeddings, so that the model learns word information. The proposed model is easy to implement, and outperforms traditional model in terms of speed and efficiency, which is improved f1-score on two dataset.

A Study on the Chinese Characters Originated in Japan in Japanes in Industrial Standard (일본공업규격 "정보교환용한자부호" 에 포함된 일본한자에 대한 연구)

  • Lee Choon-Tack
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.22
    • /
    • pp.219-257
    • /
    • 1992
  • Among the Chinese Characters originated in Japan, some of them are very ancient in their origin and others come to exist as different forms by being used widely in forged books in Chinese. These Characters can be divided into three groups. First, the Chinese Characters whose forms are different. Most of these are 'hoiui' (회의)character, being made by imitating the forms of the original Chinese Letters. These characters do have meaning but not pronunciation. This is one distinct feature of Chinese Characters originated in Japan. Second, the Chinese Characters whose meaning has been assigned by the Japanese people. These letters can be grouped into two. One is the letters whose meanings are entirely different from original Chinese Characters, and the other is the letters whose meanings are not known although their pronunciations are known. It can be explained that the letters with different forms are made because of the ignorance of letter's existence. Or, the letters were made on purpose in ordoer to be used in different meanings. Third, the Characters with a partial modification of original Chinese Characters. Among the Characters in three groups above, pure Japanese-made Chinese Characters are those in group one and three since those in group two are Chinese Letters whose meanings (or pronunciation) only are Japanese. As a results of detailed investigation of pure Japanese-made Chinese Character in JIS X 0208-1990, the followings are discovered: 1. Pure Japanese-made Chinese Characters are 147 in numbers. 2. The Characters which were originally Chinese but now considered to be Japanese-made are 5 in numbers. Among these letters, 39 Characters are not listed in TaeHanHwaSaJon(Whose fame is well known as the authoritative dictionary of Chinese Characters), 47 Characters are not found in the dictionaries of Chinese Characters compiled in Korea. 3. 14 Characters seem to be Japanese-made Chinese Characters although it cannot be said so with accuracy because of various meanings found in several dictionaries of Chinese Characrters.

  • PDF