Wireless Speech Recognition System using Psychoacoustic Model

Noh, Jin-Soo; Rhee, Kang-Hyeon

전자공학회논문지CI (Journal of the Institute of Electronics Engineers of Korea CI)

Vol. 43, No. 6 / Pages 110-116 / 2006 / 1229-6376 (pISSN)

대한전자공학회 (The Institute of Electronics and Information Engineers)

Noh, Jin-Soo (노진수), Department of Electronic Engineering, Chosun University;
Rhee, Kang-Hyeon (이강현), Department of Electronic Engineering, Chosun University

Published: 2006.11.25

Abstract

In this paper, we implement a speech recognition system that uses wireless audio sensors to support ubiquitous sensor network application services such as switch control and biometric authentication. The proposed system consists of a wireless audio sensor, a speech recognition algorithm based on a psychoacoustic model, and an LDPC (low-density parity-check) module for error correction. The speech recognition algorithm runs on the host PC so that the sensor's energy is used efficiently, and an FEC (forward error correction) scheme is used to improve recognition accuracy. We also optimized the simulation coefficients and the test environment to remove wireless channel noise and correct wireless channel errors effectively. As a result, when the distance between the sensor and the voice source is 1.0 m or less, the FAR and FRR are 0.126% and 7.5%, respectively.
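For readers unfamiliar with the two reported metrics, the following minimal sketch shows how a false acceptance rate (FAR) and a false rejection rate (FRR) are conventionally computed from trial counts. The function name `far_frr` and all counts in the usage lines are hypothetical illustrations, not the authors' code or data; the abstract does not state how many trials were run.

```python
def far_frr(impostor_accepted, impostor_total, genuine_rejected, genuine_total):
    """Return (FAR, FRR) as percentages.

    FAR: share of impostor (non-matching) attempts that were wrongly accepted.
    FRR: share of genuine (matching) attempts that were wrongly rejected.
    """
    if impostor_total <= 0 or genuine_total <= 0:
        raise ValueError("trial counts must be positive")
    far = 100.0 * impostor_accepted / impostor_total
    frr = 100.0 * genuine_rejected / genuine_total
    return far, frr


# Hypothetical counts chosen only to illustrate the arithmetic.
far, frr = far_frr(
    impostor_accepted=2, impostor_total=1000,
    genuine_rejected=10, genuine_total=200,
)
print(f"FAR = {far:.3f}%, FRR = {frr:.1f}%")
```

A low FAR paired with a higher FRR, as in the reported figures, usually indicates a decision threshold tuned to avoid accepting the wrong speaker or command at the cost of occasionally rejecting a valid one.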
