• Title/Summary/Keyword: Recognition ratio

Search Result 622, Processing Time 0.027 seconds

A Study on Formants of Vowels for Speaker Recognition (화자 인식을 위한 모음의 포만트 연구)

  • Ahn Byoung-seob;Shin Jiyoung;Kang Sunmee
    • MALSORI
    • /
    • no.51
    • /
    • pp.1-16
    • /
    • 2004
  • The aim of this paper is to analyze vowels in voice imitation and disguised voice, and to find the invariable phonetic features of the speaker. In this paper we examined the formants of monophthongs /a, u, i, o, {$\omega},{\;}{\varepsilon},{\;}{\Lambda}$/. The results of the present are as follows : $\circled1$ Speakers change their vocal tract features. $\circled2$ Vowels /a, ${\varepsilon}$, i/ appear to be proper for speaker recognition since they show invariable acoustic feature during voice modulation. $\circled3$ F1 does not change easily compared to higher formants. $\circled4$ F3-F2 appears to be constituent for a speaker identification in vowel /a/ and /$\varepsilon$/, and F4-F2 in vowel /i/. $\circled5$ Resulting of F-ratio, differences of each formants were more useful than individual formant of a vowel to speaker recognition.

  • PDF

New Postprocessing Methods for Rejectin Out-of-Vocabulary Words

  • Song, Myung-Gyu
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.3E
    • /
    • pp.19-23
    • /
    • 1997
  • The goal of postprocessing in automatic speech recognition is to improve recognition performance by utterance verification at the output of recognition stage. It is focused on the effective rejection of out-of vocabulary words based on the confidence score of hypothesized candidate word. We present two methods for computing confidence scores. Both methods are based on the distance between each observation vector and the representative code vector, which is defined by the most likely code vector at each state. While the first method employs simple time normalization, the second one uses a normalization technique based on the concept of on-line garbage mode[1]. According to the speaker independent isolated words recognition experiment with discrete density HMM, the second method outperforms both the first one and conventional likelihood ratio scoring method[2].

  • PDF

Development of Infants Music Education Application Using Augmented Reality

  • Yeon, Seunguk;Seo, Sukyong
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.1
    • /
    • pp.69-76
    • /
    • 2018
  • Augmented Reality (AR) technology has rapidly been applied to various application areas including e-learning and e-education. Focusing on the design and development of android tablet application, this study targeted to develop infant music education using AR technology. We used a tablet instead of personal computer because it is more easily accessible and more convenient. Our system allows infant users to play with teaching aids like blocks or puzzles to mimic musical play like game. The user sets the puzzle piece on the playground in front of the tablet and presses the play button. Then, the system extracts a region of interest among the images acquired by internal camera and separates the foreground image from the background image. The block recognition software analyzes, recognizes and shows the result using AR technology. In order to have reasonably working recognition ratio, we did experiments with more than 5,000 frames of actual playing scenarios. We found that the recognition rate can be secured up to 95%, when the threshold values are selected well using various condition parameters.

An Analysis of Formants Extracted from Emotional Speech and Acoustical Implications for the Emotion Recognition System and Speech Recognition System (독일어 감정음성에서 추출한 포먼트의 분석 및 감정인식 시스템과 음성인식 시스템에 대한 음향적 의미)

  • Yi, So-Pae
    • Phonetics and Speech Sciences
    • /
    • v.3 no.1
    • /
    • pp.45-50
    • /
    • 2011
  • Formant structure of speech associated with five different emotions (anger, fear, happiness, neutral, sadness) was analysed. Acoustic separability of vowels (or emotions) associated with a specific emotion (or vowel) was estimated using F-ratio. According to the results, neutral showed the highest separability of vowels followed by anger, happiness, fear, and sadness in descending order. Vowel /A/ showed the highest separability of emotions followed by /U/, /O/, /I/ and /E/ in descending order. The acoustic results were interpreted and explained in the context of previous articulatory and perceptual studies. Suggestions for the performance improvement of an automatic emotion recognition system and automatic speech recognition system were made.

  • PDF

A Contour Descriptors-Based Generalized Scheme for Handwritten Odia Numerals Recognition

  • Mishra, Tusar Kanti;Majhi, Banshidhar;Dash, Ratnakar
    • Journal of Information Processing Systems
    • /
    • v.13 no.1
    • /
    • pp.174-183
    • /
    • 2017
  • In this paper, we propose a novel feature for recognizing handwritten Odia numerals. By using polygonal approximation, each numeral is segmented into segments of equal pixel counts where the centroid of the character is kept as the origin. Three primitive contour features namely, distance (l), angle (${\theta}$), and arc-tochord ratio (r), are extracted from these segments. These features are used in a neural classifier so that the numerals are recognized. Other existing features are also considered for being recognized in the neural classifier, in order to perform a comparative analysis. We carried out a simulation on a large data set and conducted a comparative analysis with other features with respect to recognition accuracy and time requirements. Furthermore, we also applied the feature to the numeral recognition of two other languages-Bangla and English. In general, we observed that our proposed contour features outperform other schemes.

A Strategy for Integrated Target Recognition and High Quality Compression (목표물 탐지를 고려한 통합 이미지 압축에 관한 연구)

  • 남진우
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2000.08a
    • /
    • pp.257-260
    • /
    • 2000
  • In modern battlefield situation, radar and infrared sensors may be located on aircraft having limited computational resources available for real-time computer processing. Hence sensor images are transmitted typically to central stations for processing and automatic target recognition/detection. Owing to the limited bandwidth channels that are typically available between the aircraft and processing stations, images are compressed prior to transmission to facilitate rapid transfer. In this paper we examine the problem of compressing sensor data for transmission, given that target recognition is the end goal. Performance result shows that the front-end target recognition system achieves a relatively high level of performance as well as a high compression ratio.

  • PDF

Robot vision system for face recognition using fuzzy inference from color-image (로봇의 시각시스템을 위한 칼라영상에서 퍼지추론을 이용한 얼굴인식)

  • Lee, Joo-shin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.7 no.2
    • /
    • pp.106-110
    • /
    • 2014
  • This paper proposed the face recognition method which can be effectively applied to the robot's vision system. The proposed algorithm is recognition using hue extraction and feature point. hue extraction was using difference of skin color, pupil color, lips color. Features information were extraction from eye, nose and mouth using feature parameters of the difference between the feature point, distance ratio, angle, area. Feature parameters fuzzified data with the data generated by membership function, then evaluate the degree of similarity was the face recognition. The result of experiment are conducted with frontal color images of face as input images the received recognition rate of 96%.

Synthesis of Novel H8-Binaphthol-based Chiral Receptors and Their Applications in Enantioselective Recognition of 1,2-Amino alcohols and Chirality Conversion of L-Amino acids to D-Amino acids

  • Jung, Hye-In;Nandhakumar, Raju;Yoon, Hoe-Jin;Lee, Sang-Gi;Kim, Kwan-Mook
    • Bulletin of the Korean Chemical Society
    • /
    • v.31 no.5
    • /
    • pp.1289-1294
    • /
    • 2010
  • Novel $H_8$-binaphthol-based chiral receptors appended with an uryl moiety (2a) and a guanidinium moiety (2b) have been designed and synthesized for the enantioselective recognition of 1,2-amino alcohols via reversible imine formation. The selectivities ($K_R/K_S$ = 9.8 ~ 19.4) of 2b in imine formation with 1,2-amino alcohols are higher than those of 2a ($K_R/K_S$ = 1.8 ~ 4.5). Similar efficiency trend have been observed in the conversion of L-amino acids to D-amino acids, i.e., the efficiency of the receptor 2b (D/L ratio: 4.3 ~ 10.1) is superior to 2a (D/L ratio: 4.0 ~ 8.7).

Eojeol-Block Bidirectional Algorithm for Automatic Word Spacing of Hangul Sentences (한글 문장의 자동 띄어쓰기를 위한 어절 블록 양방향 알고리즘)

  • Kang, Seung-Shik
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.4
    • /
    • pp.441-447
    • /
    • 2000
  • Automatic word spacing is needed to solve the automatic indexing problem of the non-spaced documents and the space-insertion problem of the character recognition system at the end of a line. We propose a word spacing algorithm that automatically finds out word spacing positions. It is based on the recognition of Eojeol components by using the sentence partition and bidirectional longest-match algorithm. The sentence partition utilizes an extraction of Eojeol-block where the Eojeol boundary is relatively clear, and a Korean morphological analyzer is applied bidirectionally to the recognition of Eojeol components. We tested the algorithm on two sentence groups of about 4,500 Eojeols. The space-level recall ratio was 97.3% and the Eojeol-level recall ratio was 93.2%.

  • PDF

The Transition Invariant Feature Extraction of the Character using the Spherical Coordinate System (구 좌표계를 이용한 위치 불변 문자 특징 추출)

  • Seo, Choon-Weon
    • 전자공학회논문지 IE
    • /
    • v.46 no.3
    • /
    • pp.19-25
    • /
    • 2009
  • In this paper, I suggested the character recognition methods which are used the centroid method and included the spherical transform from the rectangle coordination for the character recognition system and obtained the results of the above 78.14% average differential ratio for the character features. The character feature extraction system using the spherical transform method is suggested in this paper, and the possibilities of the method which is get the invariant feature for the character transition using the centroid are suggested through the differential ratio results.