Search | Korea Science

A Study on MLP Neural Network Architecture and Feature Extraction for Korean Syllable Recognition (한국어 음절 인식을 위한 MLP 신경망 구조 및 특징 추출에 관한 연구)

금지수;이현수
- Proceedings of the IEEK Conference
- /
- 1999.11a
- /
- pp.672-675
- /
- 1999
In this paper, we propose a MLP neural network architecture and feature extraction for Korean syllable recognition. In the proposed syllable recognition system, firstly onset is classified by onset classification neural network. And the results information of onset classification neural network are used for feature selection of imput patterns vector. The feature extraction of Korean syllables is based on sonority. Using the threshold rate separate the syllable. The results of separation are used for feature of onset. nucleus and coda. ETRI's SAMDORI has been used by speech DB. The recognition rate is 96% in the speaker dependent and 93.3% in the speaker independent.
PDF

Pronunciation error types and sentence intelligibility of Korean EFL learners (영어 학습자의 발음 오류 유형과 발화 명료도의 관계 연구)

Kim, Hyun-Jin
- English Language & Literature Teaching
- /
- v.10 no.3
- /
- pp.159-175
- /
- 2004
This paper investigated the types of errors on English pronunciation and intelligibility of Korean EFL students, and the relationship between the pronunciation accuracy and intelligibility. Thirty one students were evaluated by six English native speakers in terms of overall intelligibility and accuracy In five areas such as nuclear stress, word stress, syllable structure, consonants and vowels. According to the findings of the study, pronunciation errors were made by the subjects more frequently In word stress than any other area of pronunciation accuracy. The Pearson correlation analysis showed that intelligibility was related with word stress, syllable structure, consonants and vowels, and the stepwise multiple regression analysis indicated that, among the above five areas of pronunciation accuracy, word stress best accounted for the intelligibility of a given sentence. In the conclusion, the importance of teaching pronunciation of in those five areas with a special focus on word stress was emphasized m terms of intelligibility.
PDF

IP generating factors and rules of read speech and dialogue in Korean (대화체와 낭독체의 억양구 형성에 관한 연구)

Park Jihye
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.285-288
- /
- 2002
본 논문에서는 발화 유형을 대화체와 낭독체의 두 가지로 구분하여 각 발화 유형에서 억양구를 형성하는 특징을 살펴보았다. 실험 결과, 한 문장 내에 두 개 이상의 억양구가 생성되는 경우와 접속문의 경우에는 낭독체에서 더 많은 억양구가 형성되었다. 대화체에서 더 많은 억양구가 형성되는 경우는 주로 주어 다음에 억양구가 형성되는 경우이며, 대화체 발화에서는 한 문장내에 두 개 이상의 억양구가 형성된 경우는 존재하지 않았다. 이러한 실험 결과를 바탕으로 억양구의 형성이 음절수뿐만 아니라 문장의 구조에 영향을 받으며, 이 두 가지 요인이 발화 유형에 따라 다르게 적용된다는 운율적 특징을 파악할 수 있다.
PDF

On vowel and syllable duration related to prosodic structure in Korean (한국어 운율구조와 관련한 모음 및 음절 길이)

Lee Sook-hyang
- MALSORI
- /
- no.35_36
- /
- pp.13-24
- /
- 1998
This study aims at examining the relationship between tonal events and their related vowel and syllable duration in Korean. Two things were investigated: one is to see if there is a hierarchical relationship in prosodic unit-final-lengthening and the other is to see if accentual phrase initial high tone syllable gets lengthened. Generally, higher prosodic units show larger degree of lengthening of the final vowel and also final syllable duration than the lower ones except for accentual phrase: Mean duration of utterance-final or intonational-phrase-final syllable(and its vowels) was longer than that of accentual-phrase-final or word-final syllable(and its vowels). However, mean duration of accentual phrase final syllable was shorter than that of word final syllable. Mean vowel duration of accentual phrase initial high tone syllable was shorter than that of any other prosodic unit. Its mean syllable duration, however, was longer than that of accentual-phrase-final or word-final syllable, indicating that strong consonants(fortis and aspirated) frequently appear in the accentual phrase initial position and this position is a prosodically strong position showing longer duration as well as high tone.
PDF

Hybrid CTC-Attention Based End-to-End Speech Recognition Using Korean Grapheme Unit (한국어 자소 기반 Hybrid CTC-Attention End-to-End 음성 인식)

Park, Hosung;Lee, Donghyun;Lim, Minkyu;Kang, Yoseb;Oh, Junseok;Seo, Soonshin;Rim, Daniel;Kim, Ji-Hwan
- Annual Conference on Human and Language Technology
- /
- 2018.10a
- /
- pp.453-458
- /
- 2018
본 논문은 한국어 자소를 인식 단위로 사용한 hybrid CTC-Attention 모델 기반 end-to-end speech recognition을 제안한다. End-to-end speech recognition은 기존에 사용된 DNN-HMM 기반 음향 모델과 N-gram 기반 언어 모델, WFST를 이용한 decoding network라는 여러 개의 모듈로 이루어진 과정을 하나의 DNN network를 통해 처리하는 방법을 말한다. 본 논문에서는 end-to-end 모델의 출력을 추정하기 위해 자소 단위의 출력구조를 사용한다. 자소 기반으로 네트워크를 구성하는 경우, 추정해야 하는 출력 파라미터의 개수가 11,172개에서 49개로 줄어들어 보다 효율적인 학습이 가능하다. 이를 구현하기 위해, end-to-end 학습에 주로 사용되는 DNN 네트워크 구조인 CTC와 Attention network 모델을 조합하여 end-to-end 모델을 구성하였다. 실험 결과, 음절 오류율 기준 10.05%의 성능을 보였다.
PDF

A Study on Speech Recognition using Recurrent Neural Networks (회귀신경망을 이용한 음성인식에 관한 연구)

한학용;김주성;허강인
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.3
- /
- pp.62-67
- /
- 1999
In this paper, we investigates a reliable model of the Predictive Recurrent Neural Network for the speech recognition. Predictive Neural Networks are modeled by syllable units. For the given input syllable, then a model which gives the minimum prediction error is taken as the recognition result. The Predictive Neural Network which has the structure of recurrent network was composed to give the dynamic feature of the speech pattern into the network. We have compared with the recognition ability of the Recurrent Network proposed by Elman and Jordan. ETRI's SAMDORI has been used for the speech DB. In order to find a reliable model of neural networks, the changes of two recognition rates were compared one another in conditions of: (1) changing prediction order and the number of hidden units: and (2) accumulating previous values with self-loop coefficient in its context. The result shows that the optimum prediction order, the number of hidden units, and self-loop coefficient have differently responded according to the structure of neural network used. However, in general, the Jordan's recurrent network shows relatively higher recognition rate than Elman's. The effects of recognition rate on the self-loop coefficient were variable according to the structures of neural network and their values.
PDF

End-to-end Korean Document Summarization using Copy Mechanism and Input-feeding (복사 방법론과 입력 추가 구조를 이용한 End-to-End 한국어 문서요약)

Choi, Kyoung-Ho;Lee, Changki
- Journal of KIISE
- /
- v.44 no.5
- /
- pp.503-509
- /
- 2017
In this paper, the copy mechanism and input feeding are applied to recurrent neural network(RNN)-search model in a Korean-document summarization in an end-to-end manner. In addition, the performances of the document summarizations are compared according to the model and the tokenization format; accordingly, the syllable-unit, morpheme-unit, and hybrid-unit tokenization formats are compared. For the experiments, Internet newspaper articles were collected to construct a Korean-document summary data set (train set: 30291 documents; development set: 3786 documents; test set: 3705 documents). When the format was tokenized as the morpheme-unit, the models with the input feeding and the copy mechanism showed the highest performances of ROUGE-1 35.92, ROUGE-2 15.37, and ROUGE-L 29.45.
https://doi.org/10.5626/JOK.2017.44.5.503 인용 KSCI

Frequency Related Information and Syllable Structure Constraints on Sino-Korean (한국 한자음의 빈도 관련 정보 및 음절 구조 제약)

Shin, Ji-Young
- Phonetics and Speech Sciences
- /
- v.1 no.2
- /
- pp.129-140
- /
- 2009
The purpose of the present study is to investigate frequency related information and syllable structure constraints on Sino-Korean. Previous studies on Sino-Korean have mostly investigated the historical change of sounds and reviewed archaic features of Chinese language in Sino-Korean. Unfortunately, there is little study on the sounds of contemporary Sino-Korean in terms of syllable structure constraints. For the purpose of the present study, sounds of 7,742 Chinese characters used in Sino-Korean (7,795 syllables) were investigated and syllable matrices made based on the results of frequency related information. As a result, 483 syllable types were observed and the most frequently observed syllables were as follows: /ku/ (103) > /ki/ (100) > /ju/ (87) > /pi/ (86). Only 16 out of 19 consonants are used for Sino-Korean. /$t^{\ast}$/ and /$p^{\ast}$/ are never used in Sino-Korean and /kh, $s^{\ast}$, $k^{\ast}$/ occur only a few times (3, 2, 1 respectively). /k/ (17.5%) shows the highest frequency and /n, ${\eta}$, 1, tc, m/ occupied the next rankings. Among 20 vowel types, /a/ showed the highest frequency and /o, u, i, $j{\Lambda}$, ${\Lambda}$/ occupied the next rankings. Based on the syllable matrices, gaps were observed and classified into accidental or systematic ones. Onset and nucleus, nucleus and coda, onset and coda, and other syllable structure constraints of Sino-Korean were listed.
PDF

A Production-Based Study of English Syllables with Weak-Strong Pattern in the Case of Korean Leaners with Low English Proficiency (초급 영어 학습자의 약강구조 영어 단어에서의 강약음절 산출)

Kim, Hee-Sung;Seo, Mi-Sun;Shin, Ji-Young;Kim, Kee-Ho
- Speech Sciences
- /
- v.12 no.3
- /
- pp.175-183
- /
- 2005
In this study, realization of strong and weak syllables in English by Korean leaners with low English proficiency was examined through experiment. The aspects of three acoustic characteristics-duration, pitch, amplitude-were measured and compared with native speakers of English. It was assumed that production of duration, pitch and amplitude of strong and weak syllable by Korean learners would be different from that of English native speakers. According to the production experiments, English native speakers produced strong syllable longer, higher and louder than weak syllable. However, Korean leaners produced strong syllable higher and louder than weak syllable, but not longer enough. Specifically, weak syllable by Korean leaners was longer and strong syllable shorter than native speakers. Furthermore, the difference in duration of syllables between Korean leaners and English native speakers is more significant than pitch and amplitude. As a result, the duration was more important cue for the realization of stress than pitch and amplitude. However, Korean leaners did not produce duration of stressed syllables as English native speakers did, even though they produce the pitch and amplitude of stressed syllable in a similar way to native speakers. The reasons for those were considered, too.
PDF

A Korean Part-of-Speech Tagger using Simplified Eojeol-based unit (단순화된 어절을 단위로 하는 한국어 품사 태거)

Lee, Eui-Hyeon;Kim, Young-Gil;Shin, Jaehun;Kwon, Hong-Seok;Lee, Jong-Hyeok
- 한국어정보학회:학술대회논문집
- /
- 2016.10a
- /
- pp.268-272
- /
- 2016
영어권 언어가 어절 단위로 품사를 부여하는 반면, 한국어는 굴절이 많이 일어나는 교착어로서 데이터부족 문제를 피하기 위해 형태소 단위로 품사를 부여한다. 이러한 구조적 차이 안에서 한국어에 적합한 품사 태깅 단위는 지속적으로 논의되어 왔으며 지금까지 음절, 형태소, 어절, 구가 제안되었다. 본 연구는 어절 단위로 태깅함으로써 야기되는 복잡한 품사 태그와 데이터부족 문제를 해소하기 위해 어절에서 주요 실질 형태소와 주요 형식 형태소만을 뽑아 새로운 어절을 생성하고, 생성된 단순한 어절에 대해 CRF 태깅을 수행하였다. 실험결과 평가 말뭉치에서 미등록 어절 등장 비율은 9.22%에서 5.63%로 38.95% 감소시키고, 어절단위 정확도를 85.04%에서 90.81%로 6.79% 향상시켰다.
PDF

Search Result 76, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)