Search | Korea Science

Text-to-Speech System Using Logatom (Logatom을 사용한 문서음성변환 시스템)

Cho Kwansun;Lee Chulhee
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.7-10
- /
- 1999
본 논문에서는 logatom 기반 무제한 한국어 TTS 시스템 구현을 제안한다. 이를 위하여 한국어를 대표할 만한 문서코퍼스를 선택하여 분석하고 이를 바탕으로 합성에 필요한 logatom을 설계한다. 일반적으로 음성코퍼스를 통해 음성세그먼트를 추출하여 접속에 기반한 TTS 시스템에서는 음성세그먼트를 의미있는 단어 또 는 어절로부터 추출한다. 하지만 음성세그먼트 추출시 고려되는 사항은 합성단위에 기초한 음소간의 결합형태이므로 본 논문에서는 음성세그먼트 추출을 위하여 무의미한 음소열인 logatom을 설계한다. Logatom은 문장 세그먼트의 어절내 위치와 문서코퍼스 분석 결과 얻어진 음소간의 결합형태를 기반으로 설계된다. 제안된 시스템의 합성음질을 평가하기 위하여 CVC 기반 logatom을 사용하여 임의의 문장을 합성해 본 결과 대부분의 음성세그먼트 접속이 자음에서 이루어지고 어절의 위치를 고려한 logatom 설계로 인하여 어절 내에서는 비교적 자연스러운 합성음을 얻을 수 있었다.
PDF

A Study on Phoneme-Based PSOLA Speech Synthesis Using LSP (LSP를 이용한 음소단위 PSOLA 음성합성에 관한 연구)

권혁제;조순계;김종교
- The Journal of the Acoustical Society of Korea
- /
- v.17 no.2
- /
- pp.3-10
- /
- 1998
본 논문에서는 음소단위 PSOLA 한국어 합성을 LSP line의 조절과 자모음 분석을 통해서 실시하였다. 음성합성에서 많이 사용하는 triphone, diphone, demisyllable등과 같은 합성단위들은 자연스러운 합성음을 위해 다양한 음운환경에서 수집된다. 그러나, 이런 방법 은 많은 시간과 메모리가 요구된다. 본 논문에서는 합성단위로서 자음17개, 모음 16개로 총 33개의 음소를 이용하였다. 자음은 후위모음/이/인 CV에서 segment되고, 모음은 단음절의 단모음과 이중모음을 1인의 화자로부터 합성데이터를 수집하였다. 또한, 10명의 화자가 발성 한 CV에서 각 모음에 따라 변하는 자음의 주파수를 분석하였고, CV+VC 또는 CV+CV에서 각 자음에 따라 변하는 모음의 포먼트변화를 분석하였다. 분석결과를 토대로 모음은 LSP line을 조절해서 PSOLA합성을 하고, 자음은 합성하려는 모음과 결합하였다. 그 결과 6개의 합성단어에 대한 청취율은 65%를 보였다.
PDF

An Implementation Method for The Phonemic and Syllabic Character Attributes of Hangul Character (한글 문자의 음소 및 음절 문자 특성의 구현 방안)

Byun, Jeong-Yong;Kang, Jin-Gon
- Annual Conference on Human and Language Technology
- /
- 1994.11a
- /
- pp.288-294
- /
- 1994
훈민정음 해례에 따르면 한글문자는 음소 및 음절 문자 특성을 가지고 있다. 이러한 특성들을 컴퓨터 시스템에서 구현함에 있어서 야기되어 왔던 각종 문제를 분석한 다음 이들 문제들에 대하여 한글문자의 특성을 제약함이 없이 컴퓨터에 대한 기술을 개발함으로써 해결책을 모색한다. 본 논문은 훈민정음 해례에서 밝힌 한글 문자의 음소 및 음절 문자 특성에 따라서 기존의 코드 체계를 평가하며, 그리고 이들에 대한 구현 방안을 제시하고자 한다. 또한 이러한 특성을 반영한 한글 입출력들인 '셔블'을 개발하고 이에 대한 검증을 시도하였다.
PDF

Phoneme-level Embedding based Korean Language Model (음소 단위 임베딩 기반 한국어 모델)

Choi, Woosung;Hyun, Kyungseok;Chung, Jaehwa;Jung, Soon Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.10a
- /
- pp.1026-1029
- /
- 2019
최근 제안되고 있는 Bert 등의 딥러닝 언어 모델 기반 pre-training 기법은 다양한 NLP 분야에서 활용되고 있다. 텍스트로 작성된 데이터 셋을 딥러닝 언어 모델이 학습하기 위해서는 토크나이징(tokenizing) 기술이 필요하다. 그러나 기존 토크나이징 방식은 한국어 및 한글이 가지는 고유한 특성(교착어적 특성과 모아쓰기 반영)을 반영하기 어렵다는 한계를 가지고 있다. 본 논문에서는 한국어와 한글이 가지는 고유한 특성을 고려하기 위하여 음소 단위의 임베딩 기법을 제안하며, 이를 기반으로 언어 모델을 설계 및 구현한다. 또한 음소 단위 임베딩 기반 한국어 모델이 실제 데이터 집합(구약성서)에서 나타나는 언어적 패턴을 학습할 수 있다는 것을 실험을 통하여 밝힌다.
https://doi.org/10.3745/PKIPS.y2019m10a.1026 인용 PDF

Sentimental Analysis using the Phoneme-level Embedding Model (음소 단위 임베딩 모형을 이용한 감성 분석)

Hyun, Kyeongseok;Choi, Woosung;Jung, Soon-young;Chung, Jaehwa
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.10a
- /
- pp.1030-1032
- /
- 2019
형태소 분석을 통하여 한국어 문장을 형태소 단위의 임베딩 및 학습 관련 연구가 되었으나 최근 비정형적인 텍스트 데이터의 증가에 따라 음소 단위의 임베딩을 통한 신경망 학습에 대한 요구가 높아지고 있다. 본 논문은 비정형적인 텍스트 감성 분석 성능 향상을 위해 음소 단위의 토큰을 생성하고 이를 CNN 모형을 기반으로 다차원 임베딩을 수행하고 감성분석을 위하여 양방향 순환신경망 모델을 사용하여 유튜브의 비정형 텍스트를 학습시켰다. 그 결과 텍스트의 긍정 부정 판별에 있어 90%의 정확도를 보였다.
https://doi.org/10.3745/PKIPS.y2019m10a.1030 인용 PDF

A quantitative study on the minimal pair of Korean phonemes: Focused on syllable-initial consonants (한국어 음소 최소대립쌍의 계량언어학적 연구: 초성 자음을 중심으로)

Jung, Jieun
- Phonetics and Speech Sciences
- /
- v.11 no.1
- /
- pp.29-40
- /
- 2019
The paper investigates the minimal pair of Korean phonemes quantitatively. To achieve this goal, I calculated the number of consonant minimal pairs in the syllable-initial position as both raw counts and relative counts, and analyzed the part of speech relations of the two words in the minimal pair. "Urimalsaem" was chosen as the object of this study because it was judged that the minimal pair analysis should be done through a dictionary and it is the largest among Korean dictionaries. The results of the study are summarized as follows. First, there were 153 types of minimal pairs out of 337,135 examples. The ranking of phoneme pairs from highest to lowest was 'ㅅ-ㅈ, ㄱ-ㅅ, ㄱ-ㅈ, ㄱ-ㅂ, ㄱ-ㅎ, ${\ldots}$, ㅆ-ㅋ, ㄸ-ㅋ, ㅉ-ㅋ, ㄹ-ㅃ, ㅃ-ㅋ'. The phonemes that played a major role in the formation of the minimal pair were /ㄱ, ㅅ, ㅈ, ㅂ, ㅊ/, in that order, which showed a high proportion of palatals. The correlation between the raw count of minimal pairs and the relative count of minimal pairs was found to be quite high r=0.937. Second, 87.91% of the minimal pairs shared the part of speech (same syntactic category). The most frequently observed type has been 'noun-noun' pair (70.25%), and 'vowel-vowel' pair (14.77%) was the next ranking. It can be indicated that the minimal pair could be grouped into similar categories in terms of semantics. The results of this study can be useful for various research in Korean linguistics, speech-language pathology, language education, language acquisition, speech synthesis, and artificial intelligence-machine learning as basic data related to Korean phonemes.
https://doi.org/10.13064/KSSS.2019.11.1.029 인용 PDF KSCI

Perception of English Vowels By Korean Learners: Comparisons between New and Similar L2 Vowel Categories (한국인 학습자의 영어 모음 인지: 새로운 L2 모음 범주와 비슷한 L2 모음 범주의 비교)

Lee, Kye-Youn;Cho, Mi-Hui
- The Journal of the Korea Contents Association
- /
- v.15 no.8
- /
- pp.579-587
- /
- 2015
The purpose of this study is to investigate how Korean learners perceive English vowels and further to test SLM which claims that new L2 vowel categories are more easily acquired than similar L2 vowel categories. Twenty Korean learners participated in English-to-Korean mapping test and English vowel identification test with target vowels /i, ɪ, u, ʊ, ɛ, æ/. The result revealed that Korean participants mapped the English pairs /i/-/ɪ/ and /u/-/ʊ/ onto single Korean vowel /i/ and /u/, respectively. in addition, both of English /ɛ/ and /æ/ were simultaneously mapped onto Korean /e/ and /ɛ/. This indicated that the Korean participants seemed to have perceptual difficulty for the pairs /i-ɪ/, /u-ʊ/, and /ɛ-æ/. The result of the forced-choice identification test showed that the accuracy of /ɪ, ʊ, æ/(ɪ: 81.3%, ʊ: 62.5%, æ: 60.0%) was significantly higher than that of /i, u, ɛ/(i: 28,8%, u: 28.8%, ɛ: 32.4%). Thus, the claim of SLM is confirmed given that /ɪ, ʊ, æ/ are new vowel categories whereas /i, u, ɛ/ are similar vowel categories. Further, the conspicuously low accuracy of the similar L2 vowel categories /i, u, ɛ/ was accounted for by over-generalization whereby the Korean participants excessively replaced L2 similar /i, u, ɛ/ with L2 new /ɪ, ʊ, æ/ as the participants were learning the L2 new vowel categories in the process of acquisition. Based on the findings this study, pedagogical suggestions are provided.
https://doi.org/10.5392/JKCA.2015.15.08.579 인용 PDF KSCI

Grapheme-to-Phoneme Conversion Regularity Effects among Late Korean-English Bilinguals (후기 한국어-영어 이중언어화자의 자소-음소 변환 규칙에 따른 영어 규칙성 효과)

Kim, Dahee;Baik, Yeonji;Ryu, Jaehee;Nam, Kichun
- Korean Journal of Cognitive Science
- /
- v.26 no.3
- /
- pp.323-355
- /
- 2015
This study examined grapheme-to-phoneme regularity effect among late Korean-English bilinguals by using whole word level task (lexical processing) and two meta-phonological tasks(sub-lexical processing): [1] English word naming task(whole word level), [2] rhyme judgement task(rhyme level), and [3] phoneme deletion task(phoneme level). Forty-three late Korean-English bilinguals participated in all three tasks. In these tasks, participants showed better performance in regular word conditions compared to irregular word conditions, demonstrating a clear English regularity effect. Post-hoc correlational analysis revealed strong correlation between word naming task and rhyme judgement task, which is different from the results reported with English monolinguals. The contradicting results might be due to the relevantly low English proficiency level among late Korean-English bilingual speakers. In conclusion, this study suggests that late Korean-English bilinguals make use of L2 grapheme-to-phoneme conversion (GPC) rule when reading L2 English words.
PDF KSCI

Parallel Speech Recognition on Distributed Memory Multiprocessors (분산 메모리 다중 프로세서 상에서의 병렬 음성인식)

윤지현;홍성태;정상화;김형순
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10a
- /
- pp.747-749
- /
- 1998
본 논문에서는 음성과 자연언어의 통합처리를 위한 효과적인 병렬 계산 모델을 제안한다. 음소모델은 continuous HMM에 기반을 둔 문맥종속형 음소를 사용하며, 언어모델은 knowledge-based approach를 사용한다. 또한 계층구조의 지식베이스상에서 다수의 가설을 처리하기 위해 memory-based parsing기술을 사용하였다. 본 연구의 병렬 음성인식 알고리즘은 분산메모리 MIMD 구조의 다중 Transputer 시스템을 이용하여 구현되었다. 실험을 통하여 음성인식 과정에서 발생하는 speech-specific problem의 해를 제공하고 음성인식 시스템의 병렬화를 통하여 실시간 음성인식의 가능성을 보여준다.
PDF

Automatic Phoneme Generator based on Standard Korean Pronunciation (표준어 규정에 따른 한국어 음소열 자동생성기)

이도관;강미영;윤근수;이교운;권혁철
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.04c
- /
- pp.528-530
- /
- 2003
우리말에서 띄어쓰기와 버금갈 정도로 어려운 것이 우리말의 발음이다. 이에 실생활에서 혼란스럽게 사용되는 발음법과 그로 인해 올바른 발음의 선택에 대한 어려움을 덜어낼 수 있도록 표준어 규정의 표준 발음법에 따른 한국어 음소열 자동 생성기를 구현하여 교육용으로 쓸 수 있도록 하는 것이 이 논문의 목적이다.
PDF

Search Result 529, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)