통합 검색 | Korea Science

자동차 잡음 및 오디오 출력신호가 존재하는 자동차 실내 환경에서의 강인한 음성인식 (Robust Speech Recognition in the Car Interior Environment having Car Noise and Audio Output)

박철호;배재철;배건성
- 대한음성학회지:말소리
- /
- 제62호
- /
- pp.85-96
- /
- 2007
In this paper, we carried out recognition experiments for noisy speech having various levels of car noise and output of an audio system using the speech interface. The speech interface consists of three parts: pre-processing, acoustic echo canceller, post-processing. First, a high pass filter is employed as a pre-processing part to remove some engine noises. Then, an echo canceller implemented by using an FIR-type filter with an NLMS adaptive algorithm is used to remove the music or speech coming from the audio system in a car. As a last part, the MMSE-STSA based speech enhancement method is applied to the out of the echo canceller to remove the residual noise further. For recognition experiments, we generated test signals by adding music to the car noisy speech from Aurora 2 database. The HTK-based continuous HMM system is constructed for a recognition system. Experimental results show that the proposed speech interface is very promising for robust speech recognition in a noisy car environment.
PDF

신경망과 구문분석을 이용한 한국어 연결 숫자음 인식 (Connected Korean Digit Recognition Using Neural Networks and Lexical Analysis)

이종석;이상욱
- 전자공학회논문지B
- /
- 제30B권12호
- /
- pp.21-30
- /
- 1993
In this paper, we propose a connected Korean digit recohnition system employing neural networks and lexical constraints of the Korean digits. In the proposed recognition system, firstly, each frame of digit string is labelled by phoneme classification neural networks.which are trained with the reference phoneme segments extracted form an isolated digit based on the position information. And, the frame labels are combined with each other for constructing the phoneme segments. Then, these segments are combined to form a digit candidate using the digit combination rules. The digit candidate is decided based on the condition for digit decision. If the condition is not satisfied, the digit candidate is further recognized using the digit decision neural network in the next step. In our approach, the neural networks are trained with 10 isolated digits uttered by 5 male speakers. To investigate the performance of the proposed recognition system, an intensive computer simulation on the 30 connected digit strings uttered by 5 male speakers is performed. The simulation result indicates that 95.6% digit recognition rate and 82% digit string recognition rate are provided by the proposed Korean digit recognition system.
PDF

Design and Implementation of a Face Recognition System-on-a-Chip for Wearable/Mobile Applications

Lee, Bongkyu
- 한국멀티미디어학회논문지
- /
- 제18권2호
- /
- pp.244-252
- /
- 2015
This paper describes the design and implementation of a System-on-a-Chip (SoC) for face recognition to use in wearable/mobile products. The design flow starts from the system specification to implementation process on silicon. The entire process is carried out using a FPGA-based prototyping platform environment for design and verification of the target SoC. To ensure that the implemented face recognition SoC satisfies the required performances metrics, time analysis and recognition tests were performed. The motivation behind the work is a single chip implementation of face recognition system for target applications.
https://doi.org/10.9717/kmms.2015.18.2.244 인용 PDF KSCI KPUBS HTML

A Study on Design and Implementation of Embedded System for speech Recognition Process

Kim, Jung-Hoon;Kang, Sung-In;Ryu, Hong-Suk;Lee, Sang-Bae
- 한국지능시스템학회논문지
- /
- 제14권2호
- /
- pp.201-206
- /
- 2004
This study attempted to develop a speech recognition module applied to a wheelchair for the physically handicapped. In the proposed speech recognition module, TMS320C32 was used as a main processor and Mel-Cepstrum 12 Order was applied to the pro-processor step to increase the recognition rate in a noisy environment. DTW (Dynamic Time Warping) was used and proven to be excellent output for the speaker-dependent recognition part. In order to utilize this algorithm more effectively, the reference data was compressed to 1/12 using vector quantization so as to decrease memory. In this paper, the necessary diverse technology (End-point detection, DMA processing, etc.) was managed so as to utilize the speech recognition system in real time
https://doi.org/10.5391/JKIIS.2004.14.2.201 인용 PDF KSCI

License Plate Recognition System Using Artificial Neural Networks

Turkyilmaz, Ibrahim;Kacan, Kirami
- ETRI Journal
- /
- 제39권2호
- /
- pp.163-172
- /
- 2017
A high performance license plate recognition system (LPRS) is proposed in this work. The proposed LPRS is composed of the following three main stages: (i) plate region determination, (ii) character segmentation, and (iii) character recognition. During the plate region determination stage, the image is enhanced by image processing algorithms to increase system performance. The rectangular license plate region is obtained using edge-based image processing methods on the binarized image. With the help of skew correction, the plate region is prepared for the character segmentation stage. Characters are separated from each other using vertical projections on the plate region. Segmented characters are prepared for the character recognition stage by a thinning process. At the character recognition stage, a three-layer feedforward artificial neural network using a backpropagation learning algorithm is constructed and the characters are determined.
https://doi.org/10.4218/etrij.17.0115.0766 인용 PDF KSCI

단어사전과 다층 퍼셉트론을 이용한 고립단어 인식 알고리듬 (Isolated Word Recognition Algorithm Using Lexicon and Multi-layer Perceptron)

이기희;임인칠
- 전자공학회논문지B
- /
- 제32B권8호
- /
- pp.1110-1118
- /
- 1995
Over the past few years, a wide variety of techniques have been developed which make a reliable recognition of speech signal. Multi-layer perceptron(MLP) which has excellent pattern recognition properties is one of the most versatile networks in the area of speech recognition. This paper describes an automatic speech recognition system which use both MLP and lexicon. In this system., the recognition is performed by a network search algorithm which matches words in lexicon to MLP output scores. We also suggest a recognition algorithm which incorperat durational information of each phone, whose performance is comparable to that of conventional continuous HMM(CHMM). Performance of the system is evaluated on the database of 26 vocabulary size from 9 speakers. The experimental results show that the proposed algorithm achieves error rate of 7.3% which is 5.3% lower rate than 12.6% of CHMM.
PDF

A Study on the Recognition System of the Il-Pa Stenographic Character Images using EBP Algorithm

Kim, Sang-Keun;Park, Gwi-Tae
- KIEE International Transaction on Systems and Control
- /
- 제12D권1호
- /
- pp.27-32
- /
- 2002
In this paper, we would study the applicability of neural networks to the recognition process of Korean stenographic character image, applying the classification function, which is the greatest merit of those of neural networks applied to the various parts so far, to the stenographic character recognition, relatively simple classification work. Korean stenographic recognition algorithms, which recognize the characters by using some methods, have a quantitative problem that despite the simplicity of the structure, a lot of basic characters are impossible to classify into a type. They also have qualitative one that It Is not easy to classify characters fur the delicacy of the character farms. Even though this is the result of experiment under the limited environment of the basic characters, this shows the possibility that the stenographic characters can be recolonized effectively by neural network system. In this system, we got 90.86% recognition rate as an average.
PDF

A Recognition System for Multi-Form Korean Characters Based on Hierarchical Temporal Memory

Haibao, Nan;Bae, Sun-Gap;Bae, Jong-Min;Kang, Hyun-Syug
- 한국멀티미디어학회논문지
- /
- 제12권12호
- /
- pp.1718-1727
- /
- 2009
Traditional character recognition systems usually aim at characters with simple variation. With the development of multimedia technology, printed characters may appear more diversely. Existing recognition technologies can't deal with Hangul recognition effectively in diverse environments. This paper presents a recognition system for multi-form Korean characters called RSMFK, which is based on the model of Hierarchical Temporal Memory (HTM). Our system can effectively recognize the printed Korean characters of different fonts, scales, rotation, noise and background. HTM is a model which simulates the neocortex of human brain to recognize and memorize intelligently. Experimental results show that RSMFK performs a good recognition rate of 97.8% on average, which is proved to be obviously improved over the conventional methods.
PDF

Human Face Recognition Based on improved CNN Model with Multi-layers

Zhang, Ruyang;Lee, Eung-Joo
- 한국멀티미디어학회논문지
- /
- 제24권5호
- /
- pp.701-708
- /
- 2021
As one of the most widely used technology in the world right now, Face recognition has already received widespread attention by all the researcher and institutes. It has been used in many fields such as safety protection, surveillance system, crime control and even in our ordinary life such as home security and so on. This technology with today's technology has advantages such as high connectivity and real time transformation. But we still need to improve its recognition rate, reaction time and also reduce impact of different environmental status to the whole system. So in this paper we proposed a face recognition system model with improved CNN which combining the characteristics of flat network and residual network, integrated learning, simplify network structure and enhance portability and also improve the recognition accuracy. We also used AR and ORL database to do the experiment and result shows higher recognition rate, efficiency and robustness for different image conditions.
https://doi.org/10.9717/kmms.2021.24.5.701 인용 PDF KSCI HTML

Video Palmprint Recognition System Based on Modified Double-line-single-point Assisted Placement

Wu, Tengfei;Leng, Lu
- Journal of Multimedia Information System
- /
- 제8권1호
- /
- pp.23-30
- /
- 2021
Palmprint has become a popular biometric modality; however, palmprint recognition has not been conducted in video media. Video palmprint recognition (VPR) has some advantages that are absent in image palmprint recognition. In VPR, the registration and recognition can be automatically implemented without users' manual manipulation. A good-quality image can be selected from the video frames or generated from the fusion of multiple video frames. VPR in contactless mode overcomes several problems caused by contact mode; however, contactless mode, especially mobile mode, encounters with several revere challenges. Double-line-single-point (DLSP) assisted placement technique can overcome the challenges as well as effectively reduce the localization error and computation complexity. This paper modifies DLSP technique to reduce the invalid area in the frames. In addition, the valid frames, in which users place their hands correctly, are selected according to finger gap judgement, and then some key frames, which have good quality, are selected from the valid frames as the gallery samples that are matched with the query samples for authentication decision. The VPR algorithm is conducted on the system designed and developed on mobile device.
https://doi.org/10.33851/JMIS.2021.8.1.23 인용 PDF KSCI HTML

검색결과 6,995건 처리시간 0.034초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)