• 제목/요약/키워드: consonant system

검색결과 89건 처리시간 0.022초

악리론으로 본 정음창제와 정음소 분절 알고리즘 (Ortho-phonic Alphabet Creation by the Musical Theory and its Segmental Algorithm)

  • 진용옥;안정근
    • 음성과학
    • /
    • 제8권2호
    • /
    • pp.49-59
    • /
    • 2001
  • The phoneme segmentation is a very difficult problem in speech sound processing because it has found out segmental algorithm in many kinds of allophone and coarticulation's trees. Thus system configuration for the speech recognition and voice retrieval processing has a complex system structure. To solve it, we discuss a possibility of new segmental algorithm, which is called the minus a thirds one or plus in tripartitioning(삼분손익) of twelve temporament(12 율려), first proposed by Prof. T. S. Han. It is close to oriental and western musical theory. He also has suggested a 3 consonant and 3 vowel phonemes in Hunminjungum(훈민정음) invented by the King Sejong in the 15th century. In this paper, we suggest to newly name it as ortho-phonic phoneme(OPP/정음소), which carries the meaning of 'the absoluteness and independency'. OPP also is acceptable to any other languages, for example IPA. Lastly we know that this algorithm is constantly applicable to the global language and is very useful to construct a voice recognition and retrieval structuring engineering.

  • PDF

Angle씨 II급 1류 부정교합아동의 발음에 관한 음향학적 연구 (AN ACOUSTIC ANALYSIS OF PRONUNCIATION IN CHILDREN WITH ANGLE'S CLASS II DIV. 1 MALOCCLUSION)

  • 박윤정;이상훈;손동수
    • 대한소아치과학회지
    • /
    • 제24권1호
    • /
    • pp.95-111
    • /
    • 1997
  • The human speech organ consists of respiration system (lung, larynx), phonation system (vocal cord), articulation system (esophagus, pharynx, uvula, teeth, gingiva, palate, tongue, lip) and resonating system(oral cavity, nasal cavity, paranasal sinus). Because teeth are components of the articulation system, it has been reported that the persons with abnormally positioned teeth generally have abnormal occlusion and pronunciation. In this study, using /ㅅ(s)/, the most commonly mispronunced consonant in children with malocclusion, and the seven single vowels, /사(sa), 서($s\delta$), 소(so), 수(su), 스($s\omega$), 시(si), 세(se)/ and / ㅏ(a), ㅓ($\delta$), ㅗ(o), ㅜ(u), ㅡ($\omega$), 1(i), ㅔ(e)/ were recorded and analyzed using speech analysis program on computer by measuring formants and compared them for investigating the differences in pronunciation in children with Angle's class I occlusions and those with Angle's class II div.1 malocclusion. The result were as follows: 1. In the Angle's Class II div.1 group, there were no significant differences in F1 of all recorded sounds as compared with Angle's Class I group(p>0.05). 2. In the consonants, there were significant differences in F2 of /스($s\omega$)/ and F2/F1 ratio of /사(sa), 서($s\delta$), 시(si)/ between the two group(p<0.05). 3. In the vowels, there were significant differences F2/F1 ratio of /ㅓ($\delta$)/(p<0.05) and no significant differences in F2/F1 ratio between two group(p>0.05). 4. In the consonants, there were significant differences in F2 and F2/F1 ratio when succeeding vowels were high or low, and F2/F1 ratio when front in accordance with tongue position (p<0.05). 5. In the vowels, there were no significant differences in formant in accordance with tongue position(p>0.05)

  • PDF

한글 언어 교습 시스템 (Korean language teaching system)

  • 정재원;이종원
    • 한국콘텐츠학회:학술대회논문집
    • /
    • 한국콘텐츠학회 2008년도 춘계 종합학술대회 논문집
    • /
    • pp.367-371
    • /
    • 2008
  • 이 시스템은 한국의 언어인 한글을 모르는 외국인뿐만 아니라 국내의 남녀노소 막론하고 불특정 다수를 위한 것이다. 앞서 말한 대상자들이 한글을 조력자 없이 혼자 배우는 것은 사실상 불가능 하다고 할 수 있다. 집안에서 혼자서도 문자를 이해하고 발음을 청취할 수 있는 시스템으로 한글의 자음과 모음이라는 특징을 활용한 AR환경에 입각한 시스템을 보여준다. 나아가 이 시스템을 이용한 단어학습 방법도 제시한다. 또한 현 수준은 데스크톱 기반 시스템이지만 PDA등의 hand-held 기반의 시스템으로의 발전을 기약할 수 있으며 적은 수의 마커를 사용하여 편리함을 도모하면서 인간과 컴퓨터 사이에 쉽게 상호작용하는 시스템을 선보인다.

  • PDF

민화 DB를 위한 분류체계 설계 (Designing a Classification System for Minhwa DB)

  • 최은진;이영숙
    • 한국멀티미디어학회논문지
    • /
    • 제25권1호
    • /
    • pp.135-143
    • /
    • 2022
  • In order to convert Korean folk paintings called Minhwa, a part of traditional Korean heritage, into DBs, it is necessary to design a classification system suitable for the characteristics of folk paintings. A classification system and the generating of unique codes are required to classify and save them. To realize this, a basic classification system was created by listing objects depicted in folk paintings, and keywords were extracted by reclassifying them for each object. In order to assign a unique code to each piece, we organize the English names of each Minhwa since the English names of the folk painting contain the names of objects. The code name is extracted by applying the order of nouns and consonant priority rules in English names and attaching five Arabic numerals. These codes are later assigned to each image file stored in the database and are input together with the keyword. The Minhwa DB constructed in this way enables storage and search centered on objects and keywords and the intuitive inferring of the type of object from the code name.

청음 음성학적 지식에 기반한 음가분류에 의한 핵심어 검출 시스템 구현 (The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification)

  • 김학진;김순협
    • 정보처리학회논문지B
    • /
    • 제10B권2호
    • /
    • pp.169-178
    • /
    • 2003
  • This study outlines two viewpoints the classification of phone likely unit (PLU) which is the foundation of korean large vocabulary speech recognition, and the effectiveness of Chiljongseong (7 Final Consonants) and Paljogseong (8 Final Consonants) of the korean language. The phone likely classifies the phoneme phonetically according to the location of and method of articulation, and about 50 phone-likely units are utilized in korean speech recognition. In this study auditory phonetical knowledge was applied to the classification of phone likely unit to present 45 phone likely unit. The vowels 'ㅔ, ㅐ'were classified as phone-likely of (ee) ; 'ㅒ, ㅖ' as [ye] ; and 'ㅚ, ㅙ, ㅞ' as [we]. Secondly, the Chiljongseong System of the draft for unified spelling system which is currently in use and the Paljongseonggajokyong of Korean script haerye were illustrated. The question on whether the phonetic value on 'ㄷ' and 'ㅅ' among the phonemes used in the final consonant of the korean fan guage is the same has been argued in the academic world for a long time. In this study, the transition stages of Korean consonants were investigated, and Ciljonseeng and Paljongseonggajokyong were utilized in speech recognition, and its effectiveness was verified. The experiment was divided into isolated word recognition and speech recognition, and in order to conduct the experiment PBW452 was used to test the isolated word recognition. The experiment was conducted on about 50 men and women - divided into 5 groups - and they vocalized 50 words each. As for the continuous speech recognition experiment to be utilized in the materialized stock exchange system, the sentence corpus of 71 stock exchange sentences and speech corpus vocalizing the sentences were collected and used 5 men and women each vocalized a sentence twice. As the result of the experiment, when the Paljongseonggajokyong was used as the consonant, the recognition performance elevated by an average of about 1.45% : and when phone likely unit with Paljongseonggajokyong and auditory phonetic applied simultaneously, was applied, the rate of recognition increased by an average of 1.5% to 2.02%. In the continuous speech recognition experiment, the recognition performance elevated by an average of about 1% to 2% than when the existing 49 or 56 phone likely units were utilized.

한글 문자의 전자계산조직에 적응하기 위한 특징추출에 관한 연구(I) (A Method For the Recognition of Printed Korean Characters)

  • 이주근
    • 대한전자공학회논문지
    • /
    • 제6권4호
    • /
    • pp.8-19
    • /
    • 1969
  • 우리 문자는 자모의 조합된 언어문자이기 때문에 그 수가 방대하여 수천개의 식별기구를 필요로 할 뿐만 dkl니라 재조가 복잡하며 대부분이 유이문자이기 때문에 Patttern 인식문제에 잇어서는 허다한 난점이 있다. 따라서 이들 재조상에서 오는 문제점을 분석, 평가하하여 최적조건을 결정하고, 특징추출에 노이적별함수의 적용은 다른 문자에서는 볼 수 없는 한글 문자에 관한 한 특수한 장점으로 나타난다는 것을 확인하여다. 이 특수점을 Systen의 서례에 최대한으로 적용하여 3분지 1이상의 System축소를 보았다. 인식방버버으로서는 표본 Pattern을 추출해 내서 Register에 기제한 다음 인식 Matrix에 의하여 식별하였다. 식별된 문자는 판정논리에 의하여 특수Parameter를 추출하였다. 이논적인 입증을 위한 몇가지의 실험적인 검사를 가하였으며 이 과정에서 얻어진 모든 자료들은 이 분양의 연구에 매우 유익한 기초자료를 제공할 것이로 보며, 한글문자의 Patterndlstlr에 관한 실마리가 잡혀졌다고 보겠다.

  • PDF

자연어 처리 기반 한국어 TTS 시스템 구현 (Implementation of Korean TTS System based on Natural Language Processing)

  • 김병창;이근배
    • 대한음성학회지:말소리
    • /
    • 제46호
    • /
    • pp.51-64
    • /
    • 2003
  • In order to produce high quality synthesized speech, it is very important to get an accurate grapheme-to-phoneme conversion and prosody model from texts using natural language processing. Robust preprocessing for non-Korean characters should also be required. In this paper, we analyzed Korean texts using a morphological analyzer, part-of-speech tagger and syntactic chunker. We present a new grapheme-to-phoneme conversion method for Korean using a hybrid method with a phonetic pattern dictionary and CCV (consonant vowel) LTS (letter to sound) rules, for unlimited vocabulary Korean TTS. We constructed a prosody model using a probabilistic method and decision tree-based method. The probabilistic method atone usually suffers from performance degradation due to inherent data sparseness problems. So we adopted tree-based error correction to overcome these training data limitations.

  • PDF

음성인식을 위한 청각신경 정보처리 모델링 (Auditory Neural Information Processing Modeling for Speech Recognition)

  • 이희규;이광형
    • 한국음향학회지
    • /
    • 제9권3호
    • /
    • pp.42-47
    • /
    • 1990
  • 음성처리 및 인식기기의 기능을 향상시키기 위해서는 생체공학적인 방법을 이용한 인체의 청각신경 정보처리 시스템의 연구가 중요하다. 그래서 본 논문에서는 와우각의 메카니즘을 분석한 기저막의 IIR 디지털 필터 모델링이 연구되었다. 특히 음소검출필터와 측징 추출을 위한 변별기능을 이용한 자음인식의 다층신경 모델을 구성한다. 이 모델은 자음인식에 있어서 90% 이상의 높은 감지율을 나타내고 있다.

  • PDF

실시간 음성타자 시스템 구현 (Development of Realtime Phonetic Typewriter)

  • 조우연;최두일
    • 대한전기학회:학술대회논문집
    • /
    • 대한전기학회 1999년도 추계학술대회 논문집 학회본부 B
    • /
    • pp.727-729
    • /
    • 1999
  • We have developed a realtime phonetic typewriter implemented on IBM PC with sound card based on Windows 95. In this system, analyzing of speech signal, learning of neural network, labeling of output neurons and visualizing of recognition results are performed on realtime. The developing environment for speech processing is established by adding various functions, such as editing, saving, loading of speech data and 3-D or gray level displaying of spectrogram. Recognition experimental using Korean phone had a 71.42% for 13 basic consonant and 90.01% for 7 basic vowel accuracy.

  • PDF

연속음성에서 천이구간의 탐색, 추출, 근사합성에 관한 연구 (A Study on a Searching, Extraction and Approximation-Synthesis of Transition Segment in Continuous Speech)

  • 이시우
    • 한국정보처리학회논문지
    • /
    • 제7권4호
    • /
    • pp.1299-1304
    • /
    • 2000
  • In a speed coding system using excitation source of voiced and unvoiced, it would be involved a distortion of speech quality in case coexist with a voiced and an unvoiced consonants in a frame. So, I propose TSIUVC(Transition Segment Including UnVoiced Consonant) searching, extraction ad approximation-synthesis method in order to uncoexistent with a voiced and unvoiced consonants in a frame. This method based on a zerocrossing rate and pitch detector using FIR-STREAK Digital Filter. As a result, the extraction rates of TSIUVC are 84.8% (plosive), 94.9%(fricative), 92.3%(affricative) in female voice, and 88%(plosive), 94.9%(fricative), 92.3%(affricative) in male voice respectively, Also, I obain a high quality approximation-synthesis waveforms within TSIUVC by using frequency information of 0.547kHz below and 2.813kHz above. This method has the capability of being applied to speech coding of low bit rate, speech analysis and speech synthesis.

  • PDF