Search | Korea Science

A study on performance improvement of neural network using output probability of HMM (HMM의 출력확률을 이용한 신경회로망의 성능향상에 관한 연구)

표창수;김창근;허강인
- Proceedings of the Korea Institute of Convergence Signal Processing
- /
- 2000.08a
- /
- pp.157-160
- /
- 2000
본 논문은 HMM(Hidden Markov Model)을 이용하여 인식을 수행할 경우의 오류를 최소화 할 수 있는 후처리 과정으로 신경망을 결합시켜 HMM 단독으로 사용하였을 때 보다 높은 인식률을 얻을 수 있는 HMM과 신경망의 하이브리드시스템을 제안한다. HMM을 이용하여 학습한 후 학습에 참여하지 않은 데이터를 인식하였을 때 오인식 데이터를 정인식으로 인식하도록 HMM의 출력으로 얻은 각 출력확률을 후처리에 사용될 MLP(Multilayer Perceptrons)의 학습용으로 사용하여 MLP를 학습하여 HMM과 MLP을 결합한 하이브리드 모델을 만든다. 이와 같은 HMM과 신경망을 결합한 하이브리드 모델을 사용하여 단독 숫자음과 4연 숫자음 데이터에서 실험한 결과 HMM 단독으로 사용하였을 때 보다 각각 약 4.5%, 1.3%의 인식률 향상이 있었다. 기존의 하이브리드 시스템이 갖는 많은 학습시간이 소요되는 문제점과 실시간 음성인식시스템을 구현할 때의 학습데이터의 부족으로 인한 인식률 저하를 해결할 수 있는 방법임을 확인할 수 있었다.
PDF

The Analysis of Face Recognition Rate according to Distance and Interpolation using PCA in Surveillance System (감시카메라 시스템에서 PCA에 의한 보간법과 거리별 얼굴인식률 분석)

Moon, Hae-Min;Kwak, Keun-Chang;Pan, Sung-Bum
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.21 no.6
- /
- pp.153-160
- /
- 2011
Recently, the use of security surveillance system including CCTV is increasing due to the increase of terrors and crimes. At the same time, interest of face recognition at a distance using surveillance cameras has been increasing. Accordingly, we analyzed the performance of face recognition according to distance using PCA-based face recognition and interpolation. In this paper, we used Nearest, Bilinear, Bicubic, Lanczos3 interpolations to interpolate face image. As a result, we confirmed that existing interpolation have an few effect on performance of PCA-based face recognition and performance of PCA-based face recognition is improved by including face image according to distance in traning data.
https://doi.org/10.13089/JKIISC.2011.21.6.153 인용 PDF KSCI HTML

A Study on The recognition rate of Electronic Toll Collecting System for Using RFID (RFID를 이용한 ETCS 인식률에 관한 연구)

Jang, Seong-Won;Park, Byeong-Ho;Park, Chan-Hong;Sung, Hyeon-Kyeong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2010.04a
- /
- pp.615-618
- /
- 2010
본 논문은 900MHz 대역의 RFID를 이용하여 고속도로 자동 요금징수 시스템을 개발하고, 차량에 부착한 태그를 RFID 리더가 인식하는 효율을 높이기 위하여 최적의 태그부착위치와 안테나 설치위치에 대하여 연구하고 속도에 따른 인식률을 연구 하였다. 연구결과 RFID 리더의 높이는 170cm일 때 지표면과의 각도 $80^{\circ}$와 차량 진행 방향과 RFID 리더의 안테나 면과의 각도 $90^{\circ}$일 때 최적의 RFID 리더 설치 위치를 나타냈고, 태그의 위치는 차량 전면 유리 운전석 쪽 아래 모서리에서부터 가로 10cm와 세로 10cm에서 가장 좋게 나타났다. 차량과 RFID 리더간의 거리에 따른 차량의 속도별에 의한 인식률은 차량과 RFID 리더간의 수평 거리가 25cm 이하일 때 30km/h에서 80km/h까지 모든 경우에 100%의 인식률을 보임으로써 빠른 속도로 톨게이트를 통과하여도 충분히 인식할 수 있을 것으로 나타났다.
https://doi.org/10.3745/PKIPS.y2010m04a.615 인용 PDF

Pose-invariant Face Recognition using a Cylindrical Model and Stereo Camera (원통 모델과 스테레오 카메라를 이용한 포즈 변화에 강인한 얼굴인식)

노진우;홍정화;고한석
- Journal of KIISE:Software and Applications
- /
- v.31 no.7
- /
- pp.929-938
- /
- 2004
This paper proposes a pose-invariant face recognition method using cylindrical model and stereo camera. We divided this paper into two parts. One is single input image case, the other is stereo input image case. In single input image case, we normalized a face's yaw pose using cylindrical model, and in stereo input image case, we normalized a face's pitch pose using cylindrical model with previously estimated pitch pose angle by the stereo geometry. Also, since we have an advantage that we can utilize two images acquired at the same time, we can increase overall recognition performance by decision-level fusion. Through representative experiments, we achieved an increased recognition rate from 61.43% to 94.76% by the yaw pose transform, and the recognition rate with the proposed method achieves as good as that of the more complicated 3D face model. Also, by using stereo camera system we achieved an increased recognition rate 5.24% more for the case of upper face pose, and 3.34% more by decision-level fusion.
PDF KSCI

The Method for Face Recognition using Wavelet Coefficients and Hidden Markov Model (웨이블렛 계수와 Hidden Markov Model를 이용한 얼굴인식 기법)

이경아;이대종;박장환;전명근
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 2003.09b
- /
- pp.162-165
- /
- 2003
본 논문에서는 웨이블렛 계수와 Hidden Markov Model(HMM)이용한 얼굴인식 알고리즘을 제안한다. 입력 영상은 이산웨이블렛을 기반으로 한 다해상도 분석기법을 사용하여 데이터 수를 압축한 후, 각각의 해상도에서 얻어진 웨이블렛 계수를 특징벡터로 사용하여 HMM의 모델을 생성한다. 인식단계 에서는 웨이블렛 변환에 의해 생성된 개별대역의 인식값을 더하여 상호 보완함으로써 인식률을 높일 수 있었다. 제안된 알고리즘의 타당성을 검증하기 위하여 기본적 알고리즘인 벡터 양자화(VQ) 기법을 적용한 경우와 기존 얼굴인식에 제안된 DCT-HMM을 이용한 기법과의 인식률 비교를 한 결과, 제안된 방법이 우수한 성능을 보임을 알 수 있었다.
PDF

Improvement of Face Recognition Rate by Preprocessing Based on Elliptical Model (타원 모델기반의 전처리 기법에 의한 얼굴 인식률 개선)

Won, Chul-Ho
- Journal of Korea Society of Industrial Information Systems
- /
- v.13 no.4
- /
- pp.56-63
- /
- 2008
Image calibration at preprocessing step is very important for face recognition rate improvement, and background noise deletion affects accuracy of face recognition specially. In this paper, a method is proposed to remove background area utilizing elliptical model at preprocessing step for face recognition rate improvement. As human face has the shape of ellipse, a face contour can be easily detected by using the elliptical model in face images.
PDF

The recognition of word by continuous speech recognition technic (연속 음성 인식 기법을 이용한 단어 음성 인식)

조영훈
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.91-94
- /
- 1998
우리만은 영어와는 달리 단어를 공백으로만 구분할 수 없다. 그러므로 대용량 어휘를 갖는 연속 음성을 인식하기 위한 언어모델을 만들기가 매우 어렵다. N-gram의 언어 모델을 우리말 문장에 적용하기 위해 하나의 문장을 한 단어로 구성하여 처리하였다. 우리의 인식시스템을 평가하기 위하여 시스템 공학 연구소에서 제공한 음성을 대상으로 인식률을 계산하였다. 단어의 종류는 452개이며 한명이 이 단어들을 2번씩 발음하고 총70명이 발음한 총 63,280개의 단어에 대하여 92.8%의 인식률을 얻었다. 일간지 사설로부터 추출한 단어를 대상으로 발음 사전을 10K 크기로 만들었다. 음성 모델은 uniphone을 사용하였다.
PDF

The Continuous Speech Recognition with Limited word (제한된 단어를 갖는 우리말 연속 음성 인식)

김석동
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06c
- /
- pp.87-90
- /
- 1998
이 논문에서 우리는 대규모 어휘를 갖는 연속 음성 인식을 위한 방법을 제시한다. 우리말은 영어와 구조적으로 달라서 대용량 어휘를 갖는 연속 음성을 인식하기 위한 언어모델을 만들기가 매우 어렵다. 언어 모델을 우리말 문장에 적용하기 위해 신문의 사설을 3-gram을 이용하여 처리하였다. 우리의 인식 시스템을 평가하기 위하여 시스템 공학 연구소에서 제공한 낭독 음성을 대상으로 인식률을 계산하였다. 589개의 문장을 대상으로 총 20명이 발음한 3,156개의 문장에 대하여 남자 92.2%, 여자 87.9%의 인식률을 얻었다. 발음사전은 낭독음성과 신문 사설에서 추출한 10K 크기이며 uniphone의 음성모델을 사용하였다.
PDF

Merging Context Information and Recognition Result for Robust Speech Recognition in Noisy Environments (잡음 환경에서의 강인한 음성인식을 위한 문맥 정보와 음성인식 결과의 융합)

Song, Won-Moon;Kim, Eun-Ju;Kim, Myung-Won
- Proceedings of the Korean Information Science Society Conference
- /
- 2005.11b
- /
- pp.733-735
- /
- 2005
최근 음성인식 분야 에서는 잡음 환경에서 좀 더 신뢰도 높은 음성 인식 결과물 얻기 위하여 인식 결과 도출 단계에서 여러 가지 정보를 융합 하는 방법이나 인식결과를 후처리 하여 새로운 결과를 얻어 내는 방법들이 연구 되고 있다. 본 논문에서는 개인 모바일 기기에서의 음성 인식 환경에서 사용자의 발화 패턴 정보를 가지는 문맥 정보를 활용함으로서 잡음 환경에서의 음성 정보 손실에 따른 인식률 하락을 보완하는 방법을 제안한다. 먼저 사용자의 기기 사용 로그나 발화 로그 정보로부터 특정 명령어들의 순차적 발화 패턴을 마이닝하여 문맥 정보를 구성한다. 이 후 음성 발화시에 인식기의 최종 인식 결과에 대한 신뢰도가 떨어진다고 판단될 때 앞서 얻어진 문맥 정보의 신뢰도를 인식기의 각 후보단어들의 인식률과 융합하여 새로운 인식 결과를 도출해 낸다. 이러한 과정에서 인식기 결과에 대한 신뢰성을 판단하는 기준을 실험을 통하여 결정 하였으며 신뢰성이 기준 이하일 경우의 융합 과정을 위하여 후보 단어 인식률과 문맥정보를 적절히 융합할 수 있는 방법을 제안한다.
PDF

A Real-Time Embedded Speech Recognition System (실시간 임베디드 음성 인식 시스템)

남상엽;전은희;박인정
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.40 no.1
- /
- pp.74-81
- /
- 2003
In this study, we'd implemented a real time embedded speech recognition system that requires minimum memory size for speech recognition engine and DB. The word to be recognized consist of 40 commands used in a PCS phone and 10 digits. The speech data spoken by 15 male and 15 female speakers was recorded and analyzed by short time analysis method, which window size is 256. The LPC parameters of each frame were computed through Levinson-Burbin algorithm and they were transformed to Cepstrum parameters. Before the analysis, speech data should be processed by pre-emphasis that will remove the DC component in speech and emphasize high frequency band. Baum-Welch reestimation algorithm was used for the training of HMM. In test phone, we could get a recognition rate using likelihood method. We implemented an embedded system by porting the speech recognition engine on ARM core evaluation board. The overall recognition rate of this system was 95%, while the rate on 40 commands was 96% and that 10 digits was 94%.
PDF KSCI

Search Result 3,087, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)