• Title/Summary/Keyword: 자음인식

Search Result 106, Processing Time 0.024 seconds

A Study on the Printed Korean and Chinese Character Recognition (인쇄체 한글 및 한자의 인식에 관한 연구)

  • 김정우;이세행
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.17 no.11
    • /
    • pp.1175-1184
    • /
    • 1992
  • A new classification method and recognition algorithms for printed Korean and Chinese character is studied for Korean text which contains both Korean and Chinese characters. The proposed method utilizes structural features of the vertical and horizontal vowel in Korean character. Korean characters are classified into 6 groups. Vowel and consonant are separated by means of different vowel extraction methods applied to each group. Time consuming thinning process is excluded. A modified crossing distance feature is measured to recognize extracted consonant. For Chinese character, an average of stroke crossing number is calculated on every characters, which allows the characters to be classified into several groups. A recognition process is then followed in terms of the stroke crossing number and the black dot rate of character. Classification between Korean and Chinese character was at the rate of 90.5%, and classification rate of Ming-style 2512 Korean characters was 90.0%. The recognition algorithm was applied on 1278 characters. The recognition rate was 92.2%. The densest class after classification of 4585 Chinese characters was found to contain only 124 characters, only 1/40 of total numbers. The recognition rate was 89.2%.

  • PDF

Correlation between tonal events and their acoustic duration (한국어 성조 이벤트와 음향적 길이)

  • 이숙향
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1998.06c
    • /
    • pp.383-386
    • /
    • 1998
  • 한국어의 운율구조는 발화문장(utterance), 억양구(intonational phrase), 악센트구(accentual phrase), 음운적 어절(phonological word), 음절(syllable) 순의 계층적 구조를 가지고 있다. 본 연구에서는 운율구조의 각 층에서 성조 이벤트가 얹혀지는 음절이나 또는 각 층의 운율단위말의 음절의 음향적 길이를 측정함으로써 첫째, 운율단위말의 음절의 음향적 길이 또한 계층적 순위를 보이는지 둘째, 성조 이벤트(tonal event)와 음향적 길이 사이에 높은 상관관계를 보이는지 보고자 한다. 즉, 두 가지 측면에서 길이비교가 수행되었는데 하나는 언어 보편적 현상으로 알려진 구말 장음화 현상으로써 각 층 운율적 단위의 마지막 음절의 모음 길이 비교이며 다른 하나는 억양구초 고성조가 실현되는 음절의 모음과 어절 내 모음, 그리고 고성조가 실현되는 억양구말 음절의 모음간의 길이 비교이다. 남녀 각각 200문장의 각 분절음과 운율분석을 한 후 길이에 대한 일원분산분석 실시 결과 억양구말은 악센트구말 보다 길었으나 악센트구말은 어절말과 차이를 보이지 않거나 남자 화자의 경우 오히려 짧게 나타났다. 그리고 남자화자의 경우 악센트구초 고성자가 얹혀지는 음절의 길이는 어절 내 어절말 음절을 제외한 그 외 음절과 화자에 따라 큰 차이를 보이지 않거나 그보다 조금 짧게 실현되는 것으로 나타났다. 위의 결과는 첫째, 단위말 음절 모음의 장음화는 운율적 구조의 층위에 일대일 대응을 보이지 않는 것으로 해석되며 둘째, 성조 이벤트와 그것이 실현되는 분절음의 음향적 길이와는 큰 상관관계를 보이지 않는 것으로 해석될 수 있겠다. 그러나 이러한 일반화에 대한 충분한 근거 제공을 위해서는 해당음절의 모음 길이 뿐만 아니라 초성자음의 길이간의 비교와 음절자체의 길이 비교 또한 필요한 것이며 모음길이에 대한 선행자음의 분절음적 영향 고려가 수반되어야 할 것으로 보인다. 다음 내용을 정리해 보고자 한다.리해 보고자 한다.rc$ 구입할 때 중점적으로 살펴보는 사항은 신선도와 순수재래종 여부, 위생상태였다. 한편 소비자가 언제나 구입할 수 없다는 의견이 85.2%나 되어 원활한 공급과 시장조성이 아직 정착되지 않고 있었다. $\bigcirc$ 현재 유통되고 있는 재래종닭은 소비자 대부분이 잡종으로 인식하고 있었으며, 재래종과 일반육계와의 구별은 깃털색, 피부색, 정강이색등 외관상으로 구별하고 있었다. 체중에 대한 반응은 너무 작다는 의견이었고, 식품으로의 인식도는 비교적 고급식품으로 인식하고 있다. $\bigcirc$ 재래종닭고기의 브랜드화에 대한 견해는 젊고 소득이 높은 계층에서 브랜드화의 필요성을 강조하고 있다. $\bigcirc$ 재래종달걀의 소비형태는 대부분의 소비자가 좋아하였으나 아직 먹어보지 못한 응답자가 많았다. 재래종달걀의 맛에 대해서는 고소하고 독특하여 차별성을 느끼고 있었다. $\bigcirc$ 재래종달걀의 구입장소는 계란판매점(축협.농협), 슈퍼, 백화점, 재래닭 사육 농장등 다양하였으며 포장단위는 10개를 가장 선호하였고, 포장재료는 종이, 플라스틱, 짚의 순으로 좋아하였다. $\bigcirc$ 달걀의 가격은 200원정도를 적정하다고 하였으며, 크기는 (평균 52g)는 가장 적당하다고 인식하고 있으며, 난각색은 대부분의 응답자가 갈색을 선호하였다. $\bigcirc$ 재래종달걀의 구입시 애로사항은 믿을수 없고, 구입장소를 몰라서, 값이 싸다 등이었고, 앞으로 신뢰할 수 있고 위생적인 생산 및 유통체계가 확립될 경우 더 많이 소비하겠다는 의견이었다. $\bigcirc$ 재래닭 판매업소(식당)의 판매형태는 66.7%인 대부분의 업소가 잡종과 개량종 유색닭을 판매하고 있었으며, 1개 업소에서 1일 판

  • PDF

Statistical Analysis of Korean Phonological Variations Using a Grapheme-to-phoneme System (발음열 자동 생성기를 이용한 한국어 음운 변화 현상의 통계적 분석)

  • 이경님;정민화
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.7
    • /
    • pp.656-664
    • /
    • 2002
  • We present a statistical analysis of Korean phonological variations using a Grapheme-to-Phoneme (GPT) system. The GTP system used for experiments generates pronunciation variants by applying rules modeling obligatory and optional phonemic changes and allophonic changes. These rules are derived form morphophonological analysis and government standard pronunciation rules. The GTP system is optimized for continuous speech recognition by generating phonetic transcriptions for training and constructing a pronunciation dictionary for recognition. In this paper, we describe Korean phonological variations by analyzing the statistics of phonemic change rule applications for the 60,000 sentences in the Samsung PBS Speech DB. Our results show that the most frequently happening obligatory phonemic variations are in the order of liaison, tensification, aspirationalization, and nasalization of obstruent, and that the most frequently happening optional phonemic variations are in the order of initial consonant h-deletion, insertion of final consonant with the same place of articulation as the next consonants, and deletion of final consonant with the same place of articulation as the next consonant's, These statistics can be used for improving the performance of speech recognition systems.

Synthesis of Multiplexed MACE Filter for Optical Korean Character Recognition (인쇄체 한글의 광학적 인식을 위한 다중 MACE 필터의 합성)

  • 김정우;김철수;배장근;도양회;김수중
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.12
    • /
    • pp.2364-2375
    • /
    • 1994
  • For the efficient recognition of printed Korean characters, a multiplexed minimum average correlation energy(MMACE) filter is proposed. Proposed method solved the disadvantages of the tree structure algorithm which recognition system is very huge and recognition method is sophisticated. Using only one consonant MMACE filter and one vowel one, we recognized the full Korean character. Each MMACE filter is multiplexed by 4 K-tuple MACE filters which are synthesized by 24 consonants and vowels. Hence the proposed MMACE filter and the correlation distribution plane are divided by 4 subregion. We obtained the binary codes for the Korean character recognition from each correlation distribution subplane. And the obtained codes are compared with the truth table for consonants and vowels in computer. We can recognize the full Korean characters when substitute the corresponded consonant or vowel font of the consistent code to the correlation peak place in the output correlation plane. The computer simulation and optical experiment results show that the proposed compact Korean character recognition system using the MMACE filters has high discrimination capability.

  • PDF

Signal analysis of Hangul shaped Chipless RFID Tag (한글형 Chipless RFID tag 신호의 분석)

  • Ryu, Beongju;Lee, Jehun;Koh, Jinhwan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.38A no.12
    • /
    • pp.983-990
    • /
    • 2013
  • In this paper, we proposed a Hangul type chipless RFID tag, which has better legibility than the conventional chipless RFID tag not only to a computer but also to a human. We made consonant model, vowel model and whole character model by WIPL tool and checked the applicability of Hangul type chipless RFID tag. We obtain the RCS pattern of each character by simulation. Finally, We classify the character from input data in noisy environment using a variance of the data.

Effects of Articulator-distance and Tense in Phonological Awareness in Korean: The case of Korean Infants and Toddlers (한국어 음운인식에서의 조음거리와 긴장성 자질의 특성 연구: 영·유아를 중심으로)

  • Kim, Choong-Myung
    • The Journal of the Korea Contents Association
    • /
    • v.15 no.8
    • /
    • pp.424-433
    • /
    • 2015
  • This study tried to investigate the differences between auditory preferences for a discrimination study of minimal pairs with the different onset and the same nucleus of a syllable on the basis of articulator-distance in case of Korean infants and toddlers. As a result we found a main effect for articulator-distance and age but not an effect according to the types of phonation especially in terms of tense. Former results are line with the previous studies having reported the order of consonants acquisition based on the places of articulation suggesting that more sensitive responses for the contiguous and different phonemes may lead earlier acquisition for the same place of articulation of the speech sounds. Specifically, bilabial soudns are followed by alveolar and palatal sounds in order. The latter results also showed that tense consonants got a high rate of recognition beside lax consonants according to the age and sex.

Hierarchical Hidden Markov Model for Finger Language Recognition (지화 인식을 위한 계층적 은닉 마코프 모델)

  • Kwon, Jae-Hong;Kim, Tae-Yong
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.9
    • /
    • pp.77-85
    • /
    • 2015
  • The finger language is the part of the sign language, which is a language system that expresses vowels and consonants with hand gestures. Korean finger language has 31 gestures and each of them needs a lot of learning models for accurate recognition. If there exist mass learning models, it spends a lot of time to search. So a real-time awareness system concentrates on how to reduce search spaces. For solving these problems, this paper suggest a hierarchy HMM structure that reduces the exploration space effectively without decreasing recognition rate. The Korean finger language is divided into 3 categories according to the direction of a wrist, and a model can be searched within these categories. Pre-classification can discern a similar finger Korean language. And it makes a search space to be managed effectively. Therefore the proposed method can be applied on the real-time recognition system. Experimental results demonstrate that the proposed method can reduce the time about three times than general HMM recognition method.

Speech Recognition on Korean Monosyllable using Phoneme Discriminant Filters (음소판별필터를 이용한 한국어 단음절 음성인식)

  • Hur, Sung-Phil;Chung, Hyun-Yeol;Kim, Kyung-Tae
    • The Journal of the Acoustical Society of Korea
    • /
    • v.14 no.1
    • /
    • pp.31-39
    • /
    • 1995
  • In this paper, we have constructed phoneme discriminant filters [PDF] according to the linear discriminant function. These discriminant filters do not follow the heuristic rules by the experts but the mathematical methods in iterative learning. Proposed system. is based on the piecewise linear classifier and error correction learning method. The segmentation of speech and the classification of phoneme are carried out simutaneously by the PDF. Because each of them operates independently, some speech intervals may have multiple outputs. Therefore, we introduce the unified coefficients by the output unification process. But sometimes the output has a region which shows no response, or insensitive. So we propose time windows and median filters to remove such problems. We have trained this system with the 549 monosyllables uttered 3 times by 3 male speakers. After we detect the endpoint of speech signal using threshold value and zero crossing rate, the vowels and consonants are separated by the PDF, and then selected phoneme passes through the following PDF. Finally this system unifies the outputs for competitive region or insensitive area using time window and median filter.

  • PDF

Recognition of Various Printed Hangul Images by using the Boundary Tracing Technique (경계선 기울기 방법을 이용한 다양한 인쇄체 한글의 인식)

  • Baek, Seung-Bok;Kang, Soon-Dae;Sohn, Young-Sun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.1
    • /
    • pp.1-5
    • /
    • 2003
  • In this paper, we realized a system that converts the character images of the printed Korean alphabet (Hangul) to the editable text documents by using the black and white CCD camera, We were able to abstract the contours information of the character which is based on the structural character by using the boundary tracing technique that is strong to the noise on the character recognition. By using the contours information, we recognized the horizontal vowels and vertical vowels of the character image and classify the character into the six patterns. After that, the character is divided to the unit of the consonant and vowel. The vowels are recognized by using the maximum length projection. The separated consonants are recognized by comparing the inputted pattern with the standard pattern that has the phase information of the boundary line change. We realized a system that the recognized characters are inputted to the word editor with the editable KS Hangul completion type code.

Vector Quantizer Based Speaker Normalization for Continuos Speech Recognition (연속음성 인식기를 위한 벡터양자화기 기반의 화자정규화)

  • Shin Ok-keun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.8
    • /
    • pp.583-589
    • /
    • 2004
  • Proposed is a speaker normalization method based on vector quantizer for continuous speech recognition (CSR) system in which no acoustic information is made use of. The proposed method, which is an improvement of the previously reported speaker normalization scheme for a simple digit recognizer, builds up a canonical codebook by iteratively training the codebook while the size of codebook is increased after each iteration from a relatively small initial size. Once the codebook established, the warp factors of speakers are estimated by comparing exhaustively the warped versions of each speaker's utterance with the codebook. Two sets of phones are used to estimate the warp factors: one, a set of vowels only. and the other, a set composed of all the Phonemes. A Piecewise linear warping function which corresponds to the estimated warp factor is adopted to warp the power spectrum of the utterance. Then the warped feature vectors are extracted to be used to train and to test the speech recognizer. The effectiveness of the proposed method is investigated by a set of recognition experiments using the TIMIT corpus and HTK speech recognition tool kit. The experimental results showed comparable recognition rate improvement with the formant based warping method.