Search | Korea Science

A Study on the Spoken KOrean-Digit Recognition Using the Neural Netwok (神經網을 利用한 韓國語數字音認識에 관한 硏究)

Park, Hyun-Hwa;Gahang, Hae Dong;Bae, Keun Sung
- The Journal of the Acoustical Society of Korea
- /
- v.11 no.3
- /
- pp.5-13
- /
- 1992
Taking devantage of the property that Korean digit is a mono-syllable word, we proposed a spoken Korean-digit recognition scheme using the multi-layer perceptron. The spoken Korean-digit is divided into three segments (initial sound, medial vowel, and final consonant) based on the voice starting / ending points and a peak point in the middle of vowel sound. The feature vectors such as cepstrum, reflection coefficients, ${\Delta}$cepstrum and ${\Delta}$energy are extracted from each segment. It has been shown that cepstrum, as an input vector to the neural network, gives higher recognition rate than reflection coefficients. Regression coefficients of cepstrum did not affect as much as we expected on the recognition rate. That is because, it is believed, we extracted features from the selected stationary segments of the input speech signal. With 150 ceptral coefficients obtained from each spoken digit, we achieved correct recognition rate of 97.8%.
PDF

A Study on Speechreading about the Korean 8 Vowels (한국어 8모음 자동 독화에 관한 연구)

Lee, Kyong-Ho;Yang, Ryong;Kim, Sun-Ok
- Journal of the Korea Society of Computer and Information
- /
- v.14 no.3
- /
- pp.173-182
- /
- 2009
In this paper, we studied about the extraction of the parameter and implementation of speechreading system to recognize the Korean 8 vowel. Face features are detected by amplifying, reducing the image value and making a comparison between the image value which is represented for various value in various color space. The eyes position, the nose position, the inner boundary of lip, the outer boundary of upper lip and the outer line of the tooth is found to the feature and using the analysis the area of inner lip, the hight and width of inner lip, the outer line length of the tooth rate about a inner mouth area and the distance between the nose and outer boundary of upper lip are used for the parameter. 2400 data are gathered and analyzed. Based on this analysis, the neural net is constructed and the recognition experiments are performed. In the experiment, 5 normal persons were sampled. The observational error between samples was corrected using normalization method. The experiment show very encouraging result about the usefulness of the parameter.
https://doi.org/10.9708/jksci.2009.14.3.173 인용 PDF

Perceptual Structure of Korean Consonants in High Vowel Contexts (고설 모음 환경에서 한국어 자음의 지각적 구조)

Bae, Moon-Jung
- Phonetics and Speech Sciences
- /
- v.1 no.2
- /
- pp.95-103
- /
- 2009
We investigated the perceptual structure of Korean consonants by analyzing the confusion among consonants in various vowel contexts. The 36 CV syllable types combined by 18 consonants and 2 vowels (/i/ and /u/) were presented with masking noises or in degraded intensity. The confusion data were analyzed by the INDSCAL (Individual Difference Scaling), ADCLUS (Additive Clustering) and the probability of the transmitted information. The results were compared with those of a previous study with /a/ vowel context (Bae and Kim, 2002). The overall results showed that the laryngeal features-aspiration, lax and tense-are the most salient features in the perception of Korean consonant regardless of vowel contexts, but the perceptual saliency of place features varies across vowel conditions. In high vowel (front and back vowel) contexts, sibilant consonants were perceptually salient compared to in low vowel contexts. In back vowel contexts, grave (labial and velar) consonants were perceptually salient. These findings imply that place features and vowel features strongly interact in speech perception as well as in speech production. All statistical measures from our confusion data ensured that the perceptual structure of Korean consonants correspond to the hierarchical structure suggested in the feature geometry (Clements, 1991). We discuss the link between speech perception and production as the basis of phonology.
PDF

Nasal Consonants Recognition Based on the Perceptual Representation (지각적 표현에 기초한 비음 인식에 관한 연구)

Kim, Ki-Chul;Cho, Jung-Wan
- Annual Conference on Human and Language Technology
- /
- 1989.10a
- /
- pp.120-125
- /
- 1989
음성 신호에는 언어정보이외에 여러 요인에 의한 정보가 포함되어 있어서, 문자와 일대일로 대응되는 분절을 정확하게 검출하기가 어렵다. 본 연구에서는 선형 예측계수 (LPC) 스펙트럼의 첨두 부분을 강조한 이진 (binary) 스펙트럼을 제안하고, 이를 바탕으로 음의 안정영역과 천이영역을 통합하여 음향특징을 추출하고자 한다. 각 영역의 특징은 이진 스펙트럼을 누적하여 구하며, 통합적인 특징은 각 영역의 특징을 결합한 관계적 특징으로 나타낸다. 제 2 차 포르만트 주파수의 궤적을 관계적 특징으로 하여, 양순 비음과 치조 비음을 구별한 결과, 모음의 문맥과 화자에 비교적 독립적인 인식결과를 얻을 수 있었다. 또한 이진 스펙트럼이 원래의 스펙트럼에 포함된 정보를 유지하는지 검토하기 위해, 같은 거리척도 (distance measure) 에 의해 인식 실험한 결과 이진 스펙트럼의 성능이 오히려 우수하게 나타났으며, 관계적 이진 스펙트럼의 경우 화자에 따른 변화가 더욱 적었다. 음성에 백색 잡음 (Gaussian white noise)을 더하여 잡음음성 (noisy speech) 을 만든 뒤, 같은 방법으로 실험한 결과도 유사한 인식결과를 얻을 수 있어 제안된 이진 스펙트럼의 유효성을 확인하였다.
PDF

The Influence of L1 on L2 -Perception of Korean Monophthongs by Polish Speakers- (외국어 습득에 모국어가 미치는 영향에 대하여 -폴란드어 화자의 한국어 단순 모음 청취에 대한 연구-)

Paradowska Anna IBabella
- MALSORI
- /
- no.39
- /
- pp.73-86
- /
- 2000
This paper aims to research the influence of mother tongue (Polish) on the perception of a foreign language (Korean) i.e. how vowel sounds that are totally unfamiliar to the listeners are perceived, how the similar sounds are perceived and whether the perception differs according to the phonetic values of the neighbouring sounds. As a result, the degree of the influence of Ll on the vowels of L2 is different in each case and mostly depends on the familiarity of the vowel in question and on the articulatory similarities between the vowels in both languages. The results are as follows; The best perception was observed with Korean /i/ and /a/ (very similar places of articulation in both languages). The worst degree of perception was Korean /(equation omitted)/ that is very unfamiliar to Polish subjects. Vowels that are not so different from the Ll sounds were perceived fairly well. Another important result is that Polish listeners seem to be more sensitive to lip rounding than to the height of the tongue. The role of the neighbouring sounds seems to be of a considerable importance, Depending on the preceding vowel, a sudden drop or rise in the degree of the perception was observed.
PDF

Boolean Formulation of Korean Natural Language Queries Using Syntactic Analysis (구문 분석에 기반한 자연어 질의로부터의 불리언 질의 생성)

Park, Mi-Hwa;Won, Hyung-Suk;Lee, Won-Il;Lee, Geun-Bae
- Annual Conference on Human and Language Technology
- /
- 1998.10c
- /
- pp.73-80
- /
- 1998
본 연구는 자연어 질의의 형태 및 구문 정보를 바탕으로 불리언 질의를 생성하는데 그 목적을 둔다. 일반적으로 대부분의 상용정보검색시스템은 입력형식을 검색성능이 종은 불리언 형태로 하고 있으나, 일반 사용자는 자신이 원하는 정보를 불리언 형태로 표현하는데 익숙하지 않다. 그러므로 본 정보검색시스템은 자연어 질의를 기본 입력형태로 하여 사용자의 편의성을 높이고, 이 질의를 범주문법에 기반한 구문분석 결과에 의해 복합명사를 고려한 불리언 형태로 변환하여 검색을 수행함으로써 시스템의 검색 성능의 향상을 도모하였다. 정보검색 실험용 데이터 모음인 KTSET2.0으로 실험한 결과 본 논문에서 제안한 자연어 질의로부터 자동 생성된 불리언 질의의 검객성능이 KTSET2.0에서 제공하는 수동으로 추출한 불리언 질의보다 8% 더 우수한 성능을 보였고, 기존 자연어질의 시스템이 수용해온 방법인 형태소 분석을 거쳐 불용어를 제거한 후 Vector 모델을 적용하여 검색을 수행한 경우보다는 23% 더 나은 성능을 보였다.
PDF

Perceptual Vowel Space and Mental Representation of Korean Monophthongs (한국어 단모음의 지각적 모음공간과 심적 표상)

Choi, Yang-Gyu
- Speech Sciences
- /
- v.10 no.2
- /
- pp.287-301
- /
- 2003
The purpose of this study was to examine whether the same vowel sounds are perceived differently by the two local dialect speakers, Seoul dialect speakers (SDS) and Kyungnam dialect speakers (KDS), whose vowel systems differ each other. In the first experiment SDS and KDS heard vowels synthesized in vowel space with F1 by F2 and categorized them into one of 10 Korean monophthongs. The results showed that SDS and KDS perceived the synthesized vowels differently. For example, /$\varepsilon$ versus /e/ contrast, ${\o}$/, and /y/ are differentiated by SDS, whereas they are perceptually confused by KDS. We also observed that /i/ could not be perceived unless the vowel synthesis included F3 or higher formants. In the second experiment SDS and KDS performed the similarity rating task of 10 synthesized Korean monophthongs. Two-dimensional MDS solution based on the similarity rating scores was obtained for each dialect group. The first dimension can be named 'vowel advancement' and the second 'vowel height'. The comparison of the two MDS solutions showed that the overall psychological distances among the vowels are shorter in KDS than SDS and that especially the distance between /$\Lambda$/ and /i/ is shorter in KDS than SDS. The result suggested that perception or mental representation of vowels depends on the vowel system of the listener's dialect or language. Further research problems were discussed in the final section.
PDF

Perceptual Boundary on a Synthesized Korean Vowel /o/-/u/ Continuum by Chinese Learners of Korean Language (/오/-/우/ 합성모음 연속체에 대한 중국인 한국어 학습자의 청지각적 경계)

Yun, Jihyeon;Kim, EunKyung;Seong, Cheoljae
- Phonetics and Speech Sciences
- /
- v.7 no.4
- /
- pp.111-121
- /
- 2015
The present study examines the auditory boundary between Korean /o/ and /u/ on a synthesized vowel continuum by Chinese learners of Korean language. Preceding researches reported that the Chinese learners have difficulty pronouncing Korean monophthongs /o/ and /u/. In this experiment, a nine-step continuum was resynthesized using Praat from a vowel token from a recording of a male announcer who produced it in isolated form. F1 and F2 were synchronously shifted in equal steps in qtone (quarter tone), while F3 and F4 values were held constant for the entire stimuli. A forced choice identification task was performed by the advanced learners who speak Mandarin Chinese as their native language. Their experiment data were compared to a Korean native group. ROC (Receiver Operating Characteristic) analysis and logistic regression were performed to estimate the perceptual boundary. The result indicated the learner group has a different auditory criterion on the continuum from the Korean native group. This suggests that more importance should be placed on hearing and listening training in order to acquire the phoneme categories of the two vowels.
https://doi.org/10.13064/KSSS.2015.7.4.111 인용 PDF KSCI

Effects of F1/F2 Manipulation on the Perception of Korean Vowels /o/ and /u/ (F1/F2의 변화가 한국어 /오/, /우/ 모음의 지각판별에 미치는 영향)

Yun, Jihyeon;Seong, Cheoljae
- Phonetics and Speech Sciences
- /
- v.5 no.3
- /
- pp.39-46
- /
- 2013
This study examined the perception of two Korean vowels using F1/F2 manipulated synthetic vowels. Previous studies indicated that there is an overlap between the acoustic spaces of Korean /o/ and /u/ in terms of the first two formants. A continuum of eleven synthetic vowels were used as stimuli. The experiment consisted of three tasks: an /o/ identification task (Yes-no), an /u/ identification task (Yes-no), and a forced choice identification task (/o/-/u/). ROC(Receiver Operating Characteristic) analysis and logistic regression were performed to calculate the boundary criterion of the two vowels along the stimulus continuum, and to predict the perceptual judgment on F1 and F2. The result indicated that the location between stimulus no.5 (F1 = 342Hz, F2 = 691Hz) and no.6 (F1 = 336Hz, F2 = 700Hz) was estimated as a perceptual boundary region between /o/ and /u/, while stimulus no.0 (F1=405Hz, F2=666Hz) and no.10 (F1=321Hz, F2=743Hz) were at opposite ends of the continuum. The influence of F2 was predominant over F1 on the perception of the vowel categories.
https://doi.org/10.13064/KSSS.2013.5.3.039 인용 PDF

Cross-Generational Differences of /o/ and /u/ in Informal Text Reading (편지글 읽기에 나타난 한국어 모음 /오/-/우/의 세대간 차이)

Han, Jeong-Im;Kang, Hyunsook;Kim, Joo-Yeon
- Phonetics and Speech Sciences
- /
- v.5 no.4
- /
- pp.201-207
- /
- 2013
This study is a follow-up study of Han and Kang (2013) and Kang and Han (2013) which examined cross-generational changes in the Korean vowels /o/ and /u/ using acoustic analyses of the vowel formants of these two vowels, their Euclidean distances and the overlap fraction values generated in SOAM 2D (Wassink, 2006). Their results showed an on-going approximation of /o/ and /u/, more evident in female speakers and non-initial vowels. However, these studies employed non-words in a frame sentence. To see the extent to which these two vowels are merged in real words in spontaneous speech, we conducted an acoustic analysis of the formants of /o/ and /u/ produced by two age groups of female speakers while reading a letter sample. The results demonstrate that 1) the younger speakers employed mostly F2 but not F1 differences in the production of /o/ and /u/; 2) the Euclidean distance of these two vowels was shorter in non-initial than initial position, but there was no difference in Euclidean distance between the two age groups (20's vs. 40-50's); 3) overall, /o/ and /u/ were more overlapped in non-initial than initial position, but in non-initial position, younger speakers showed more congested distribution of the vowels than in older speakers.
https://doi.org/10.13064/KSSS.2013.5.4.201 인용 PDF

Search Result 217, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)