Search | Korea Science

Speech Recognition of the Korean Vowel 'ㅜ' Based on Time Domain Bulk Indicators (시간 영역 벌크 지표에 기반한 한국어 모음 'ㅜ'의 음성 인식)

Lee, Jae Won
- KIISE Transactions on Computing Practices
- /
- v.22 no.11
- /
- pp.591-600
- /
- 2016
Computing technologies are increasingly applied to most casual human environment networks, as computing technologies are further developed. In addition, the rapidly increasing interest in IoT has led to the wide acceptance of speech recognition as a means of HCI. In this study, we present a novel method for recognizing the Korean vowel 'ㅜ', as a part of a phoneme based Korean speech recognition system. The proposed method involves analyses of bulk indicators calculated in the time domain instead of analysis in the frequency domain, with consequent reduction in the computational cost. Four elementary algorithms for detecting typical waveform patterns of 'ㅜ' using bulk indicators are presented and combined to make final decisions. The experimental results show that the proposed method can achieve 90.1% recognition accuracy, and recognition speed of 0.68 msec per syllable.
https://doi.org/10.5626/KTCP.2016.22.11.591 인용 KSCI

Speech Recognition of the Korean Vowel 'ㅡ' based on Neural Network Learning of Bulk Indicators (벌크 지표의 신경망 학습에 기반한 한국어 모음 'ㅡ'의 음성 인식)

Lee, Jae Won
- KIISE Transactions on Computing Practices
- /
- v.23 no.11
- /
- pp.617-624
- /
- 2017
Speech recognition is now one of the most widely used technologies in HCI. Many applications where speech recognition may be used (such as home automation, automatic speech translation, and car navigation) are now under active development. In addition, the demand for speech recognition systems in mobile environments is rapidly increasing. This paper is intended to present a method for instant recognition of the Korean vowel 'ㅡ', as a part of a Korean speech recognition system. The proposed method uses bulk indicators (which are calculated in the time domain) instead of the frequency domain and consequently, the computational cost for the recognition can be reduced. The bulk indicators representing predominant sequence patterns of the vowel 'ㅡ' are learned by neural networks and final recognition decisions are made by those trained neural networks. The results of the experiment show that the proposed method can achieve 88.7% recognition accuracy, and recognition speed of 0.74 msec per syllable.
https://doi.org/10.5626/KTCP.2017.23.11.617 인용 KSCI

Word Recognition using Fuzzy Inference based on LPC (선형예측계수에 기초한 퍼지추론 단어 인식)

Choi, Seung-Ho;Kim, Hyeong-Geun
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.1
- /
- pp.32-41
- /
- 1994
To solve the frequency variation of speech patterns which consist of LPC sequences, new membership function view from LPC, spectrum and the relations between the order of LPC and spectrum is proposed. To solve the time variation, multi-secation equi-segmentation method which equally divide the speech section into several section are applied. False recognition mainly occur at time when the same syllable is placed at the same utterance. To reduce the error, fuzzy inference is executed using the proposed membership function and weights are assigned into sectional certainty and then the decision method for recognized the section up to the third candidate. To testify the validation of this method, we experimented the recognition test of 28 DDD area names. The recognition rate of the fuzzy inference by the triangle membership function is $92\%$. That of the combined method of the fuzzy inference and the dicision method is $92.9\%$ and that of fuzzy inference by the proposed membership funtion is $93.8\%$.
PDF

The Acoustic Characteristics of Focus Associated with the Korean Particle' -man' (한국어 특수조사 ‘-만’에 연계된 초점의 음향음성학적 특성)

Choe, J.W.;Jeon, Y.S.;C., Y.;Park, S.B.;Kim, K.H.
- Speech Sciences
- /
- v.5 no.2
- /
- pp.77-91
- /
- 1999
The purpose of this paper is to investigate the phonetic characteristics of the 'focus' phrases associated with the particle '-man' in Korean. The particle '-man' is a bound morpheme which, like other postpositions such as the subject marker '-ka' and the object marker '-lil', the so-called 'case markers' in Korean, typically attaches to a noun (phrase). The semantics of '-man' roughly corresponds to that of only, its counterpart in English, and is thus classified as a 'delimiter' (Yang 1973). It is assumed in this paper that '-man', like only in English, should have a 'focus' associated with it (von Stechow 1991, Rooth 1992). In general, '-man' attached phrases get the focus, but sometimes the association is not clear-cut, especially in the cases of emphatic use of '-man' or when the context strongly favors other phrase as the focus (Choe 1996). In this paper, we compare the phonetic characteristics of the '-man' marked phrases with those to which '-ka'/'-lil' is attached, and conclude that the focused '-man' phrases show higher fundamental frequencies than their equally focused 'case' -marked counterparts. However, when the context clearly forces the focus to fall on phrases other than the '-man' or '-ka'/'-lil' attached ones, there is no meaningful difference in fundamental frequency between the '-man' and '-ka'/'-lil' attached phrases. We also compare the phonetic characteristics of the regular use of '-man' with those of the emphatic '-man'. According to our experiments, the emphatic '-man' does not bring forth its phonetic effects, namely, higher fundamental frequencies, on the' -man' attached words or phrases but rather in various other ways such as higher fundamental frequencies in '-man', lengthening of the following word-initial syllable, or the inclusion of the following word in the same accentual phrase. Finally, it is claimed that '-man' associated focus phenomena, especially the emphatic use of '-man', show some typical acoustic characteristics of the other well-known focus phenomena, namely, wh-interrogatives.
PDF

Linguistic Characteristics of Domestic Men's Formal Wear Brand Names

Kwon, Hae-Sook
- Journal of Fashion Business
- /
- v.14 no.6
- /
- pp.11-22
- /
- 2010
The main purpose of this research was to examine the linguistic characteristics of domestic men's formal wear brand name. Four linguistic characteristics of language type, combined structure type of language, word class, length of brand name were investigated in this research and also examined the difference between brand type. For sample selection, the 209 men's fashion brands were selected from '2009 Korea Fashion Yearbook' and then, 25 brands which could not collect proper informations about the brand name or naming were excluded. Among total 184 men's brand names, 66 men's formal wear brands were selected and studied. For data analysis, quantitative evaluation of the frequency and qualitative evaluation have been used. The result as follows.; (1) Seven language types were found in domestic men's formal wear brand names. English has been used the most, then followed by Italian and French. (2) For combined structure type of brand name language, the single word used the most, followed by separately combined word type, artificially combined word, and unified word type. (3) The most frequently used the type of word class was noun, and followed by phrase, adjective, and verb. In the noun type, 6 different types which expressed a person, concrete & abstract entity, place, acronym, and neologic were found. For phrase, only noun type was appeared, however, 6 out of 20 phrases were abbreviated type. All eight adjective brand names implied an attributive character of the brand such as 'Dainty' or 'Solus(Solo)'. (4) The long name used most and then followed by normal and short length of brand name. Looking by the number of syllable, 4 syllables appeared the most and then followed by 3, 5, 6, 2 & 7 showed the same rate, and 8 syllables. (5) The result which compared the difference according to each brand type showed a difference in its language type, language combined style, word class, but length of brand name.
PDF KSCI

EVALUATION OF THE SYNTHETIC SPEECH QUALITY BY THE TD-PCULI METHOD

Kang, Chan-Hee;Shin, Yong-Jo;Kim, Yun-Seok;Kwon, Ki-Hyung;Chin, Yong-Ohk
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1994.06a
- /
- pp.977-983
- /
- 1994
In this paper we have evaluated the synthetic speech quality by the proposed TD-PCULI speech synthesis method. For the synthesis we have extracted parameters from the Korean monosyllables through the analysis of speech waveforms in the time domain. We have constructed the Korean data format dictionary for the synthesis-by-rule depending upon the frequencies of the Korean pronunciation large vocabulary dictionary, in which V type syllables are 19, CV type's are 80, VC type's are 30 and CVC type's are 100. And using them we have synthesized various Korean monosyllables, words and sentences. We have tested each 10 syllables selected according to the 4 Korean syllable types with the objective MOS(Mean Opinion Score) evluation method about the 4 items i.e., intelligibility, clearness, loudness, and naturality after selecting random group without the knowledge of them. And also we have tested the possibility to modify a duration and F0 into another forms with changing a duration (i.e., 150msec, 300msec, 500msec, 700msec and 1sec) and a central fundamental frequency(i.e., 80Hz, 118Hz, 140Hz, 170Hz, and 200Hz). As the results of experiments the noises occurred in the course of synthesizing the speech by the rules are removed to be a very clear level and we can find that the prosodic elements can be controled as a good condition.
PDF

A Study on Hangeul Orthography Guidelines for Foreigners (외국인을 위한 한글맞춤법 시안 연구)

Han, Jae young
- Journal of Korean language education
- /
- v.28 no.4
- /
- pp.273-296
- /
- 2017
This study focuses on a review of Hangeul orthography guidelines in Korean language regulations. It is indispensable to revise the guidelines thoroughly because it has been more than 80 years since a unified plan of Korean orthography was established in 1933, which the current orthography is based on. Also, it has been approximately 30 years since 1989, when the current guidelines were issued and promulgated. The viewpoint towards this review reflects the requirements by education fields of Korean as a foreign language and modern Korean users. Hangeul orthography consists of six clauses, along with an appendix regarding punctuation marks: 1) general rules, 2) consonants and vowels, 3) related to sounds, 4) about forms, 5) spacing between words, and 6) miscellaneous. This paper examined individual clauses and specific usages of the clauses, in terms of Korean as a foreign language. Based on the review, this paper suggests the following tasks in order to establish a draft of Hangeul orthography for foreigners. A. Among the individual clauses, some clauses that embody vocabulary education aspects should be addressed in a Korean dictionary, and deleted in Hangeul orthography guidelines. B. The clauses of Hangeul orthography guidelines should be edited for revision and substitution where necessary. C. The usage of individual clauses should be replaced with more appropriate examples aligned with everyday conversation. D. In order to establish 'Hangeul orthography for foreigners', linguists should continuously review several chapters and the appendix of Hangeul orthography, such as components about forms, spacing between words, miscellaneous, and punctuation marks. The purpose of this review is to pursue the simplicity of Hangeul orthography guidelines and the practicality in terms of reflecting more realistic examples. This review contributes to facilitate Korean language usage not only for non-native learners, but also native users.

Sound-Field Speech Evoked Auditory Brainstem Response in Cochlear-Implant Recipients

Jarollahi, Farnoush;Valadbeigi, Ayub;Jalaei, Bahram;Maarefvand, Mohammad;Zarandy, Masoud Motasaddi;Haghani, Hamid;Shirzhiyan, Zahra
- Korean Journal of Audiology
- /
- v.24 no.2
- /
- pp.71-78
- /
- 2020
Background and Objectives: Currently limited information is available on speech stimuli processing at the subcortical level in the recipients of cochlear implant (CI). Speech processing in the brainstem level is measured using speech-auditory brainstem response (S-ABR). The purpose of the present study was to measure the S-ABR components in the sound-field presentation in CI recipients, and compare with normal hearing (NH) children. Subjects and Methods: In this descriptive-analytical study, participants were divided in two groups: patients with CIs; and NH group. The CI group consisted of 20 prelingual hearing impairment children (mean age=8.90±0.79 years), with ipsilateral CIs (right side). The control group consisted of 20 healthy NH children, with comparable age and sex distribution. The S-ABR was evoked by the 40-ms synthesized /da/ syllable stimulus that was indicated in the sound-field presentation. Results: Sound-field S-ABR measured in the CI recipients indicated statistically significant delayed latencies, than in the NH group. In addition, these results demonstrated that the frequency following response peak amplitude was significantly higher in CI recipients, than in the NH counterparts (p<0.05). Finally, the neural phase locking were significantly lower in CI recipients (p<0.05). Conclusions: The findings of sound-field S-ABR demonstrated that CI recipients have neural encoding deficits in temporal and spectral domains at the brainstem level; therefore, the sound-field S-ABR can be considered an efficient clinical procedure to assess the speech process in CI recipients.
https://doi.org/10.7874/jao.2019.00353 인용

Sound-Field Speech Evoked Auditory Brainstem Response in Cochlear-Implant Recipients

Jarollahi, Farnoush;Valadbeigi, Ayub;Jalaei, Bahram;Maarefvand, Mohammad;Zarandy, Masoud Motasaddi;Haghani, Hamid;Shirzhiyan, Zahra
- Journal of Audiology & Otology
- /
- v.24 no.2
- /
- pp.71-78
- /
- 2020
Background and Objectives: Currently limited information is available on speech stimuli processing at the subcortical level in the recipients of cochlear implant (CI). Speech processing in the brainstem level is measured using speech-auditory brainstem response (S-ABR). The purpose of the present study was to measure the S-ABR components in the sound-field presentation in CI recipients, and compare with normal hearing (NH) children. Subjects and Methods: In this descriptive-analytical study, participants were divided in two groups: patients with CIs; and NH group. The CI group consisted of 20 prelingual hearing impairment children (mean age=8.90±0.79 years), with ipsilateral CIs (right side). The control group consisted of 20 healthy NH children, with comparable age and sex distribution. The S-ABR was evoked by the 40-ms synthesized /da/ syllable stimulus that was indicated in the sound-field presentation. Results: Sound-field S-ABR measured in the CI recipients indicated statistically significant delayed latencies, than in the NH group. In addition, these results demonstrated that the frequency following response peak amplitude was significantly higher in CI recipients, than in the NH counterparts (p<0.05). Finally, the neural phase locking were significantly lower in CI recipients (p<0.05). Conclusions: The findings of sound-field S-ABR demonstrated that CI recipients have neural encoding deficits in temporal and spectral domains at the brainstem level; therefore, the sound-field S-ABR can be considered an efficient clinical procedure to assess the speech process in CI recipients.
https://doi.org/10.7874/jao.2019.00353 인용

A Study on the Hangul Syllables of Unicode System considering Data Transmission Efficiency (데이터전송효율을 고려한 유니코드의 한글글자마디에 대한 연구)

Hong, Wan-Pyo
- The Journal of the Korea institute of electronic communication sciences
- /
- v.10 no.1
- /
- pp.39-46
- /
- 2015
The paper studied possibility of improvement of efficient of data processing in the line coder when Hangul syllables in Unicode system is used for the source code. The scrambling in the line coder is to solve the problem happened due to the source code. The study is based on the HDB-3 scrambling method in ITU-T standards that is applied to AMI line coder. The referred data of Hangul syllables and its use frequency which are required to analysis was used the data extracted from the source data of the National Korean Language Institute. According to the analysis, the average 24% scrambling was generated in source code of Hangul syllables in Unicode system. When the referred Hangul syllables was applied to Unicode system, the average 27% scrambling was producted. Total 8,924ea Hangul syllables in 11,172ea Hangul syllables in Unicode system were not scrambled. Therefore the referred Hangul syllables 1,540ea were accepted in the unscrambled code areas. As a result, the existing Unicode Hangul syllable codes can't prevent the scrambling, but it is possible to completely remove the 27% scrambling with new source coding system. And then, it can be improved the data processing efficient upto minimum 27% in line coder by software in presentation layer instead of physical layer.
https://doi.org/10.13067/JKIECS.2015.10.1.39 인용 PDF KSCI

Search Result 92, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)