심리음향 모델을 이용한 무선 음성인식 시스템

Wireless Speech Recognition System using Psychoacoustic Model

  • Noh, Jin-Soo (Department of Electronic Engineering, Chosun University) ;
  • Rhee, Kang-Hyeon (Department of Electronic Engineering, Chosun University)
  • 발행 : 2006.11.25

초록

본 논문에서는 무선 음성 센서를 사용하여 스위치 제어나 생체신호 인증과 같은 유비쿼터스 센서 네트워크 응용 서비스를 지원하기 위한 음성인식 시스템을 구현하였다. 제안된 시스템은 무선 음성센서와 심리음향 모델을 이용한 음성인식 알고리즘과 에러정정을 위한 LDPC(Low Density Parity Check) 모듈로 구성된다. 제안된 음성인식 알고리즘은 센서의 소비 에너지를 효율적으로 사용하기 위하여 호스트 컴퓨터에 삽입되며, 음성인식의 정확도를 향상시키기 위하여 전방향 에러정정 알고리즘을 사용하였다. 또한, 효율적으로 무선채널의 잡음을 제거하고 무선채널 에러를 정정하기 위하여 실험 환경과 실험 계수를 최적화하였다. 결과적으로, 센서와 음원 사이의 거리가 1.0m 이하 일 때 FAR 0.126%와 FRR 7.5%를 얻었다.

In this paper, we implement a speech recognition system to support ubiquitous sensor network application services such as switch control, authentication, etc. using wireless audio sensors. The proposed system is consist of the wireless audio sensor, the speech recognition algorithm using psychoacoustic model and LDPC(low density parity check) for correcting errors. The proposed speech recognition system is inserted in a HOST PC to use the sensor energy effectively mil to improve the accuracy of speech recognition, a FEC(Forward Error Correction) system is used. Also, we optimized the simulation coefficient and test environment to effectively remove the wireless channel noises and correcting wireless channel errors. As a result, when the distance between sensor and the source of voice is less then 1.0m FAR and FRR are 0.126% and 7.5% respectively.

키워드

참고문헌

  1. H. Wang, D. Estrin, and L. Girod, 'Pre-processing in a tiered sensor network for habitat monitoring,' EURASIP JASP special issue of sensor networks, pp. 392-401, 2003
  2. S. Shukla, N. Bulusu, and S. Jha, 'Cane-toad monitoring in kakadu national park using wireless sensor networks,' in Proceedings of APAN, Cairns, Australia, July 2004
  3. Paramvir Bahl and Venkata Padmanabhan, 'RADAR: An in-building RF-based user location and tracking system,' Proc. of IEEE INFOCOM, vol. 2, pp. 775-784, March 2000 https://doi.org/10.1109/INFCOM.2000.832252
  4. Nissanka B., Priyantha, Anit Chakraborty, Hari Balakrishnan, 'The cricket location-support system,' Proc. of MOBICOM 2000, pp.32-43, Boston, MA, Aug. 2000, ACM, ACM Press.
  5. Ian D. Chakeres and Luke Klein-Berndt, 'AODVjr, AODV Simplified,' ACM SIGMOBILE Mobile Computing and Communications Review, vol. 6, no. 3, Jul. 2002, pp.100-101 https://doi.org/10.1145/581291.581309
  6. E. H. Callaway, 'Wireless Sensor Networks Architectures and Protocols,' Auebach, 2003
  7. MICA2, http://www.xcross.com
  8. TinyOS, http://webs.cs.berkeley.edu
  9. Intel, http://www.intel.com/research/exploratory/wireless-sensors.htm#sensornetwork
  10. Ustart-2400, http://www.huins.com
  11. 'ISO/IEC MPEG-2 Advanced Audio Coding 382(N-1)'-Presented at the 101st Convention 1996 November 8-11 Los Angeles, California, AN AUDIO ENGINEERING SOCIETY PREPRINT, 1996
  12. R.G. Gallager, 'Low-density parity-check codes,' IRE Trans. Inform. Theory, vol. IT-8, pp. 21-28, Jan. 1962 https://doi.org/10.1109/TIT.1962.1057683