• Title/Summary/Keyword: Phonetic Alphabet

Search Result 35, Processing Time 0.027 seconds

"한글음성문자(The Korean Phonetic Alphabet)"의 연원에 대한 고찰

  • 심소희
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.300-307
    • /
    • 1996
  • 한글은 세계 여느 문자처럼 자연 발생적으로 생긴 문자체계가 아니라 수천년간 인류의 어음에 대한 사유와 고도의 어음기술이 집약된 문자관에 의해 연역적 방법으로 창제된 문자체계이므로 다른 문자와는 차별적인 연구방법이 요구된다. 세종조에 편찬되었던 ${\ll}훈민정음{\gg}{;\}.{\;}{\ll}홍무정운역훈{\gg}{;\}.{\;}{\ll}동국정운{\gg}$는 제작 동기와 목적이 달라서 각각 상이한 어음체계를 가지고 있는데, 각 어음체계가 필요로 하는 글자꼴을 한글은 27개(자음16개, 모음11개)의 기본글자꼴에 발음부위 및 발음방법을 상형한 새로운 글자꼴을 제공하여 각각의 문자체계를 구성하였다. 유교적 언어관의 토양에서 세상 모든 음의 생성원리에 의해 창제된 한글은 바로 세상의 모든 음을 표기할 수 있는 문자체계인 것이다. 이러한 한글음성문자는 오늘날 이질어음체계의 외국어를 표음하는 데에도 예외 없이 적용될 수 있을 뿐만 아니라 글자꼴의 유연성.응용면에서 국제음성문자(I.P.A)보다 월등히 과학적이고 완정한 음성문자로서 기능할 수 있다.

  • PDF

Aparatus and Method for Inputting Chinese Based on Hunminjeongeum Using Korean Input Keyboard (기존 한글 키보드를 이용한 훈민정음 기반의 한글 병음 중국어 입력기 개발)

  • Sin, Eun-Joo;Choi, Ja-Ryoung;Lim, Soon-Bum
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.4
    • /
    • pp.549-557
    • /
    • 2020
  • Chinese is the most spoken language in the world. However, because Chinese is a hieroglyphic language, using Chinese in a digital environment is very inconvenient. Chinese users are using Pinyin for Chinese input, but the phonetic representation of the Latin alphabet is not good. Hunminjeongeum has an excellent phonetic representation which can improve Chinese usage in digital environment. Therefore, it is possible to improve the use of Chinese by Chinese users in digital environment and to help Korean users who are learning Chinese. Therefore, this paper proposes a Chinese input method using Hunminjeongeum. In addition, we develop an input software using this input method and verify its effectiveness by evaluating usability.

Heteronyms in modern Korean and their transcription in the IPA and the Roman alphabet (우리말 동철이음어(同綴異音語) IPA.로마자 표기 (사~섬))

  • Youe MahnGunn
    • MALSORI
    • /
    • no.37
    • /
    • pp.49-71
    • /
    • 1999
  • The Purpose of this paper is to gather pairs of heteronyms in modern Korean and transcribe them in the IPA and the Roman alphabet in order to propose that all of them should be differentiated in Hanngul orthography. More than a quarter of the whole Korean vocabulary consists of words with a long vowel and the number of minimal pairs distinguished only by the chroneme reaches nearly ten thousand (i.e. twenty thousand words). The letter h syllable-finally is used here to represent the long vowel in Romanization except the vowel '으‘[?:] which is transcribed by doubling the letter u (i.e. uu). Another factor bringing forth lots of heteronyms in Korean is the lack of full indication as to the non-automatic reinforcement in the initial consonant of a word (or a morpheme) when preceded by another within a phrase (or a word). These reinforced word-initial consonants are written with the letter c and an apostrophe (like c'g- , c'd- , c'b-, c's-, c'j-) in Romanization here. The reinforced morpheme-initial consonant within a word is written with the letters k t, p, ss and cz for ㄲ, ㄸ, ㅃ, ㅆ and ㅉ sounds respectively. The contrasted pronunciations of pairs of heteronyms beginning with ㅅ /s/sup h// and ㅆ /s/ sounds are transcribed here for exemplification.

  • PDF

Transliteration Correction Method using Korean Alphabet Viable Prefix (한국어 자모 Viable Prefix를 이용한 외래어 표기 교정 기법)

  • Kwon, Soon-Ho;Kwon, Hyuk-Chul
    • The KIPS Transactions:PartB
    • /
    • v.18B no.2
    • /
    • pp.87-92
    • /
    • 2011
  • In Korean documents, there are diverse spellings of transliterated foreign loanwords. This fact diminishes the performance of information retrieval systems in that a foreign word can be recognized differently, which is to say, as two or several different words. Thus, information retrieval systems require preprocessing to correct nonstandard loanword spellings prior to searching and recognizing corresponding equivalent words. This paper proposes a method that improves precision and processing efficiency using the Korean alphabet's viable prefix, which prunes a virtual tree from which candidate loanwords are created.

Proposed Methodology for Building Korean Machine Translation Data sets Considering Phonetic Features (단어의 음성학적 특징을 이용한 한국어 기계 번역 데이터 세트 구축 방안)

  • Zhang Qinghao;Yang Hongjian;Serin Kim;Hyuk-Chul Kwon
    • Annual Conference on Human and Language Technology
    • /
    • 2022.10a
    • /
    • pp.592-595
    • /
    • 2022
  • 한국어에서 한자어와 외래어가 차지하는 비중은 매우 높다. 일상어의 경우 한자어와 외래어의 비중이 약 53%, 전문어의 경우 약 92%에 달한다. 한자어나 외래어는 중국이나 다른 나라로부터 영향을 받아 한국에서 쓰이는 단어들이다. 한국어에서 사용되는 한자어와 외래어의 한글 표기과 원어 표기를 발음해보면, 발음이 상당히 유사하다는 것을 알 수 있다. 한자어인 도서관(图书馆)을 중국어로 발음해보면 thu.ʂu.kwan'로 해당 단어에 대한 한국 사람의 발음과 상당히 유사하다. 본 논문에서는 Source Length, Source IPA Length, Target Length, Target IPA Length, IPA Distance 등 총 5가지의 음성학적 특징을 고려한 한국어-중국어 한국어-영어 단어 기계번역 데이터 세트를 구축하고자 한다.

  • PDF

Annotation of a Non-native English Speech Database by Korean Speakers

  • Kim, Jong-Mi
    • Speech Sciences
    • /
    • v.9 no.1
    • /
    • pp.111-135
    • /
    • 2002
  • An annotation model of a non-native speech database has been devised, wherein English is the target language and Korean is the native language. The proposed annotation model features overt transcription of predictable linguistic information in native speech by the dictionary entry and several predefined types of error specification found in native language transfer. The proposed model is, in that sense, different from other previously explored annotation models in the literature, most of which are based on native speech. The validity of the newly proposed model is revealed in its consistent annotation of 1) salient linguistic features of English, 2) contrastive linguistic features of English and Korean, 3) actual errors reported in the literature, and 4) the newly collected data in this study. The annotation method in this model adopts the widely accepted conventions, Speech Assessment Methods Phonetic Alphabet (SAMPA) and the TOnes and Break Indices (ToBI). In the proposed annotation model, SAMPA is exclusively employed for segmental transcription and ToBI for prosodic transcription. The annotation of non-native speech is used to assess speaking ability for English as Foreign Language (EFL) learners.

  • PDF

Automatic Construction of Foreign Word Transliteration Dictionary from English-Korean Parallel Corpus (영-한 병렬 코퍼스로부터 외래어 표기 사전의 자동 구축)

  • Lee, Jae Sung
    • The Journal of Korean Association of Computer Education
    • /
    • v.6 no.2
    • /
    • pp.9-21
    • /
    • 2003
  • This paper proposes an automatic construction system for transliteration dictionary from English-Korean parallel corpus. The system works in 3 steps: it extracts all nouns from Korean documents as the first step, filters transliterated foreign word nouns out of them with the language identification method as the second step, and extracts the corresponding English words by using a probabilistic alignment method as the final step. Specially, the fact that there is a corresponding English word in most cases, is utilized to extract the purely transliterated part from a Koreans word phrase, which is usually used in combined forms with Korean endings(Eomi) or particles(Josa). Moreover, the direct phonetic comparison is done to the words in two different alphabet systems without converting them to the same alphabet system. The experiment showed that the performance was influenced by the first and the second preprocessing steps; the most efficient model among manually preprocessed ones showed 85.4% recall, 91.0% precision and the most efficient model among fully automated ones got 68.3% recall, 89.2% precision.

  • PDF

EEG based Vowel Feature Extraction for Speech Recognition System using International Phonetic Alphabet (EEG기반 언어 인식 시스템을 위한 국제음성기호를 이용한 모음 특징 추출 연구)

  • Lee, Tae-Ju;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.24 no.1
    • /
    • pp.90-95
    • /
    • 2014
  • The researchs using brain-computer interface, the new interface system which connect human to macine, have been maded to implement the user-assistance devices for control of wheelchairs or input the characters. In recent researches, there are several trials to implement the speech recognitions system based on the brain wave and attempt to silent communication. In this paper, we studied how to extract features of vowel based on international phonetic alphabet (IPA), as a foundation step for implementing of speech recognition system based on electroencephalogram (EEG). We conducted the 2 step experiments with three healthy male subjects, and first step was speaking imagery with single vowel and second step was imagery with successive two vowels. We selected 32 channels, which include frontal lobe related to thinking and temporal lobe related to speech function, among acquired 64 channels. Eigen value of the signal was used for feature vector and support vector machine (SVM) was used for classification. As a result of first step, we should use over than 10th order of feature vector to analyze the EEG signal of speech and if we used 11th order feature vector, the highest average classification rate was 95.63 % in classification between /a/ and /o/, the lowest average classification rate was 86.85 % with /a/ and /u/. In the second step of the experiments, we studied the difference of speech imaginary signals between single and successive two vowels.

Notes on Descriptions of the Prosodic System in French Grammars in the Age of Enlightenment & the Departure of the International Phonetic Alphabet (계몽주의 시대 프랑스 문법서에서 기술한 운율 현상과 국제음성기호의 출발에 대한 고찰)

  • Park, Moon-Kyou
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.4
    • /
    • pp.658-667
    • /
    • 2021
  • Our study aimed to analyze and reinterpret, by an acoustic approach, the descriptions of the 18th century prosody and introduce the figurative pronunciation system, which is the International Phonetic Alphabet pioneer. Our methodology compares and analyzes grammars and documents on the transcription system and restructures the prosodic structure. It is certain that the 18th century grammarians widely accepted the prosody theories made by Arnauld & Lancelot of the seventeenth century. In particular, grammar scholars accepted the dichotomous classification of the accent structures as prosodic and oratorical accents. The prosodic accent has a relation to intonation, and the oratorical accent has as its key elements intonation and intensity. Regarding the temporal structure, the lengthening of the final syllable was observed systematically by grammarians of the 18th century. This time structure is similar to that of today. Therefore, we can conclude that the final elongation, an essential characteristic of the modern French accent, has already played an imbued role in 18th century prosody. Despite this, the 18th century grammarians did not assign it the status of accent, as it was a stereotype that matches accent with intonation.

A Study on the Multilingual Speech Recognition for On-line International Game (온라인 다국적 게임을 위한 다국어 혼합 음성 인식에 관한 연구)

  • Kim, Suk-Dong;Kang, Heung-Soon;Woo, In-Sung;Shin, Chwa-Cheul;Yoon, Chun-Duk
    • Journal of Korea Game Society
    • /
    • v.8 no.4
    • /
    • pp.107-114
    • /
    • 2008
  • The requests for speech-recognition for multi-language in field of game and the necessity of multi-language system, which expresses one phonetic model from many different kind of language phonetics, has been increased in field of game industry. Here upon, the research regarding development of multi-national language system which can express speeches, that is consist of various different languages, into only one lexical model is needed. In this paper is basic research for establishing integrated system from multi-language lexical model, and it shows the system which recognize Korean and English speeches into IPA(International Phonetic Alphabet). We focused on finding the IPA model which is satisfied with Korean and English phoneme one simutaneously. As a result, we could get the 90.62% of Korean speech-recognition rate, also 91.71% of English speech-recognition rate.

  • PDF