Search | Korea Science

Improved Bimodal Speech Recognition Study Based on Product Hidden Markov Model

Xi, Su Mei;Cho, Young Im
- International Journal of Fuzzy Logic and Intelligent Systems / v.13 no.3 / pp.164-170 / 2013
In recent years, demand has grown for automatic speech recognition (ASR) systems that operate robustly in acoustically noisy environments. This paper proposes an improved product hidden Markov model (HMM) for bimodal speech recognition. A two-dimensional training model is built based on dependently trained audio-HMM and visual-HMM, reflecting the asynchronous characteristics of the audio and video streams. A weight coefficient is introduced to adjust the weights of the video and audio streams automatically according to the noise environment. Experimental results show that this approach achieves better speech recognition performance than other bimodal speech recognition approaches.
https://doi.org/10.5391/IJFIS.2013.13.3.164
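
The stream weighting described in the abstract above can be illustrated with a common formulation in which per-word log-likelihoods from the two streams are combined as a weighted sum. This is only a minimal sketch under that assumption, not the paper's product-HMM implementation; the function names, the fixed `audio_weight` parameter, and the example scores are all hypothetical.

```python
def fuse_stream_scores(audio_loglik, visual_loglik, audio_weight):
    """Combine per-word log-likelihoods from two streams as a weighted sum.

    audio_weight is an assumed stream weight in [0, 1]; the visual stream
    receives the remaining weight (1 - audio_weight).
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    visual_weight = 1.0 - audio_weight
    return {
        word: audio_weight * audio_loglik[word] + visual_weight * visual_loglik[word]
        for word in audio_loglik
    }


def recognize(audio_loglik, visual_loglik, audio_weight):
    """Return the word with the highest fused score."""
    fused = fuse_stream_scores(audio_loglik, visual_loglik, audio_weight)
    return max(fused, key=fused.get)


# Hypothetical scores: the audio stream favours "no", the visual stream "yes".
audio = {"yes": -12.0, "no": -10.0}
visual = {"yes": -4.0, "no": -9.0}

clean_result = recognize(audio, visual, audio_weight=0.9)  # trust audio more
noisy_result = recognize(audio, visual, audio_weight=0.2)  # trust video more
```

In this sketch the weight is passed in by hand; the abstract states that the proposed method adjusts it automatically according to the noise environment, which is not reproduced here.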

Korean Vowel Recognition using Peripheral Auditory Model (말초 청각 계통 모델을 이용한 한국어 모음 인식)

Yun, Tae-Seong;Baek, Seung-Hwa;Park, Sang-Hui
- Journal of Biomedical Engineering Research / v.9 no.1 / pp.1-10 / 1988
In this study, recognition experiments on Korean vowels are performed using a peripheral auditory model. For objective comparison, recognition experiments are also performed using LPC cepstrum coefficients extracted from the same speech data. The results are as follows. 1) The time and frequency responses of the auditory model show that important features of the input signal are contained in the responses of the inner ear and auditory nerve. 2) The recognition results for Korean vowels show that the recognition rate obtained with the auditory model output is higher than that obtained with LPC cepstrum coefficients. 3) The adaptation phenomenon of the auditory nerve provides useful characteristics for discriminating vowel signals.

A Study on EMG Signals Recognition using Time Delayed Counterpropagation Neural Network (시간 지연을 갖는 쌍전파 신경회로망을 이용한 근전도 신호인식에 관한 연구)

Kwon, Jangwoo;Jung, Inkil;Hong, Seunghong
- Journal of Biomedical Engineering Research / v.17 no.3 / pp.395-401 / 1996
In this paper, a new neural network model, the time delayed counterpropagation neural network (TDCPN), which has a high recognition rate and a short total learning time, is proposed for electromyogram (EMG) signal recognition. The proposed model increases the recognition rate by learning the regional temporal correlation of patterns using time-delay properties in the input layer, and decreases the learning time by using the winner-takes-all learning rule. The outstar learning rule is applied at the output layer so that an input pattern can be mapped to a desired output. We test the performance of this model with EMG signals collected from a normal subject. Experimental results show that the recognition rate of the proposed model is higher and its learning time is shorter than those of TDNN and CPN.
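
The two ingredients named in the abstract above, time-delayed inputs and winner-takes-all learning, can be sketched in isolation. The code below is a generic illustration under simple assumptions (squared Euclidean distance, a fixed learning rate, plain Python lists); it is not the TDCPN model itself, and every name and value in it is hypothetical.

```python
def delayed_windows(signal, delay):
    """Stack each sample with its `delay` predecessors (a time-delay input)."""
    return [signal[t - delay:t + 1] for t in range(delay, len(signal))]


def winner_index(weights, x):
    """Index of the unit whose weight vector is closest to x."""
    def squared_distance(w):
        return sum((wi - xi) ** 2 for wi, xi in zip(w, x))
    return min(range(len(weights)), key=lambda i: squared_distance(weights[i]))


def winner_takes_all_step(weights, x, learning_rate=0.5):
    """Move only the winning unit toward x; all other units stay unchanged."""
    k = winner_index(weights, x)
    weights[k] = [wi + learning_rate * (xi - wi) for wi, xi in zip(weights[k], x)]
    return k


# Hypothetical two-unit layer and one input pattern.
weights = [[0.0, 0.0], [1.0, 1.0]]
winner = winner_takes_all_step(weights, [0.8, 1.0])

# Hypothetical four-sample signal turned into overlapping windows of three samples.
windows = delayed_windows([1, 2, 3, 4], delay=2)
```

Because only one unit is updated per pattern, each step touches a single weight vector, which is one plausible reading of why the abstract associates this rule with a shorter learning time.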

Neural Network for Speech Recognition Using Signal Analysis Characteristics by ${\nabla}^2G$ Operator (${\nabla}^2G$ 연산자의 신호 분석 특성을 이용한 음성 인식 신경 회로망에 관한 연구)

이종혁;정용근;남기곤;윤태훈;김재창;박의열;이양성
- Journal of the Korean Institute of Telematics and Electronics B / v.29B no.10 / pp.90-99 / 1992
In this paper, we propose a neural network model for speech recognition. The model consists of a feature extraction part and a recognition part. An interconnection model based on the ${\nabla}^2G$ operator was used for frequency analysis. Two features, a global feature and a local feature, were extracted from this model. The recognition part consists of a global grouping stage and a local grouping stage. When the input pattern was coded by the slope method, the recognition rate for speakers A and B was 100%. When the test was performed with data from 9 speakers, a recognition rate of 91.4% was obtained.

Object Recognition Algorithm with Partial Information

Yoo, Suk Won
- International Journal of Advanced Culture Technology / v.7 no.4 / pp.229-235 / 2019
With today's advances in video and optical technology, video equipment is used in a variety of fields such as identification, security maintenance, and factory automation systems for manufacturing. In this paper, we investigate an algorithm that effectively recognizes an experimental object in an input image that is partially defective because of a mechanical problem in the imaging device. The proposed object recognition algorithm moves and rotates the vertices that make up the outline of the experimental object to the positions of the corresponding vertices that make up the outline of a DB model. Then, the discordance values between the moved and rotated experimental object and the corresponding DB model are calculated, and the minimum is selected. This minimum is the final discordance value between the experimental object and that DB model, and the DB model with the smallest final discordance value is selected as the recognition result for the experimental object. The proposed method obtains satisfactory recognition results using only partial information about the experimental object.
https://doi.org/10.17703/IJACT.2019.7.4.229

ADD-Net: Attention Based 3D Dense Network for Action Recognition

Man, Qiaoyue;Cho, Young Im
- Journal of the Korea Society of Computer and Information / v.24 no.6 / pp.21-28 / 2019
In recent years, with the development of artificial intelligence and the success of deep models, deep learning has been deployed across all fields of computer vision. Action recognition, an important branch of research on human perception and computer vision systems, has attracted growing attention. It is a challenging task because of the particular complexity of human movement; for example, the same movement may occur across multiple individuals. A human action appears in video as a sequence of consecutive image frames, so action recognition requires more computational power than processing static images, and a plain CNN cannot achieve the desired results. Recently, attention models have achieved good results in computer vision and natural language processing. In video action classification in particular, adding an attention model makes it more effective to focus on motion features and improves performance. Attention also shows intuitively which part of the input the model attends to when making a particular decision, which is very helpful in real applications. In this paper, we propose a 3D dense convolutional network based on an attention mechanism (ADD-Net) for recognizing human motion behavior in video.
https://doi.org/10.9708/jksci.2019.24.06.021

Vocabulary Recognition Model using a Convergence of Likelihood Principle Bayesian Method and Bhattacharyya Distance Measurement based on Vector Model (벡터모델 기반 바타챠랴 거리 측정 기법과 우도 원리 베이시안을 융합한 어휘 인식 모델)

Oh, Sang-Yeob
- Journal of Digital Convergence / v.13 no.11 / pp.165-170 / 2015
A vocabulary recognition system built on a standard vocabulary shows reduced recognition for words that fall outside the standard vocabulary or that are similar to one another. Existing systems recognize vocabulary using the vector values of a model created by configuring a database. A disadvantage is that a model formed during the search for a vocabulary item cannot be recognized when it is not configured in the database. In this paper, the vector model is formed and configured through search using a Bayesian model, vocabulary is recognized with a Bhattacharyya distance measurement based on the vector model, and a Wiener filter is applied to improve the recognition rate. Combining the two methods improved reliability in the distance measurement experiments. Compared with the conventional method, the proposed measurement achieved a performance of 98.2%.
https://doi.org/10.14400/JDC.2015.13.11.165
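
For readers unfamiliar with the Bhattacharyya distance mentioned in the abstract above, the standard definition for two discrete probability distributions p and q is D = -ln(sum_i sqrt(p_i * q_i)). The sketch below implements only that textbook definition, assuming both inputs are normalized histograms of equal length; it does not reproduce the paper's vector model, Bayesian step, or Wiener filtering, and the function name and example distributions are hypothetical.

```python
import math


def bhattacharyya_distance(p, q):
    """Bhattacharyya distance between two discrete distributions.

    Assumes p and q have equal length, are non-negative, and each sums to 1.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same length")
    # Bhattacharyya coefficient: overlap between the two distributions.
    coefficient = sum(math.sqrt(pi * qi) for pi, qi in zip(p, q))
    if coefficient == 0.0:
        return math.inf  # no overlap at all
    # Clamp to 1.0 so floating-point rounding cannot yield a negative distance.
    return -math.log(min(coefficient, 1.0))


# Hypothetical distributions: a close pair and a distant pair.
near = bhattacharyya_distance([0.6, 0.4], [0.4, 0.6])
far = bhattacharyya_distance([0.9, 0.1], [0.1, 0.9])
```

A smaller distance means greater overlap, so a recognizer built on this measure would typically pick the stored model with the smallest distance to the input.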

Decision Tree Learning Algorithms for Learning Model Classification in the Vocabulary Recognition System (어휘 인식 시스템에서 학습 모델 분류를 위한 결정 트리 학습 알고리즘)

Oh, Sang-Yeob
- Journal of Digital Convergence / v.11 no.9 / pp.153-158 / 2013
When the target learning model is not recognized within its category, or cannot be classified clearly, vocabulary recognition performance is reduced. In addition, when the form of a classified learning model changes or a new learning model is added, the decision tree structure of the recognition model must be changed, which is a structural problem. To solve these problems, a decision tree learning algorithm for classifying learning models is proposed. A decision tree learning method was used to classify learning models, with a database configured to sufficiently reflect phonological phenomena. In experiments in an indoor environment, vocabulary-dependent recognition showed a performance of 98.3% and vocabulary-independent recognition showed a performance of 98.4%.
https://doi.org/10.14400/JDPM.2013.11.9.153

A Study On Three-dimensional Optimized Face Recognition Model : Comparative Studies and Analysis of Model Architectures (3차원 얼굴인식 모델에 관한 연구: 모델 구조 비교연구 및 해석)

Park, Chan-Jun;Oh, Sung-Kwun;Kim, Jin-Yul
- The Transactions of The Korean Institute of Electrical Engineers / v.64 no.6 / pp.900-911 / 2015
In this paper, a 3D face recognition model is designed using a polynomial-based RBFNN (Radial Basis Function Neural Network) and a PNN (Polynomial Neural Network), and its recognition rate is evaluated. In existing 2D face recognition models, the recognition rate may degrade under external conditions, for example when facial features depend on image brightness. 3D face recognition using a 3D scanner is therefore performed to overcome this disadvantage of 2D face recognition. In the preprocessing stage, the 3D face images obtained for each pose variation are converted to frontal images by pose compensation. The depth data of the face shape is extracted using the multiple point signature method, and depth information for the whole face area is obtained using the tip of the nose as a reference point. Parameters are optimized with both ABC (Artificial Bee Colony) and PSO (Particle Swarm Optimization) for effective training and recognition. The experimental data set consists of face images of students and researchers in the IC&CI Lab of Suwon University. Using the 3D face images acquired in the IC&CI Lab, the performance of 3D face recognition is evaluated and compared for the two types of models and for the point signature method based on two kinds of depth data.
https://doi.org/10.5370/KIEE.2015.64.6.900

Implementation of Connected-Digit Recognition System Using Tree Structured Lexicon Model (트리 구조 어휘 사전을 이용한 연결 숫자음 인식 시스템의 구현)

Yun Young-Sun;Chae Yi-Geun
- MALSORI / no.50 / pp.123-137 / 2004
In this paper, we consider the implementation of a connected-digit recognition system using a tree-structured lexicon model. To efficiently implement a fixed- or variable-length digit recognition system, a finite state network (FSN) is required. We merge the word network algorithm that implements the FSN with the lexical tree search algorithm used in general speech recognition systems for fast search and large vocabularies. To find an efficient model for digit recognition, we investigate how performance changes when the lexical tree search is applied.

Search Result 3,389, Processing Time 0.025 seconds
