http://dx.doi.org/10.9717/kmms.2018.21.11.1342

Voice Activity Detection Algorithm using Wavelet Band Entropy Ensemble Analysis in Car Noisy Environments  

Park, Joo Hyun (Dept of IT Engineering, Sookmyung Women's University)
Park, Seah (Dept of IT Engineering, Sookmyung Women's University)
Lee, Muneui (Dept of IT Engineering, Sookmyung Women's University)
Lim, Soon-Bum (Research Institute of ICT Convergence, Dept of IT Engineering, Sookmyung Women's University)
Abstract
Voice command systems are an important means of making digital devices accessible to people with disabilities and to users whose hands are not free. Interest in services that use speech recognition technology has been increasing. In this study, we developed a mobile writing application that uses voice recognition and voice command technology to help people create and edit documents easily. The application minimizes touch input on the screen and lets users write memos by voice. We designed a mode system that distinguishes voice writing from voice commands, so that dictation and command execution can be used together in a single voice interface. The application also provides shortcut functions for controlling the cursor by voice, which makes document editing more convenient. As a result, people can use a writing application by voice under both physical and environmental constraints.
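
The dual-mode idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not describe how modes are switched or which commands exist, so the trigger phrases (`command mode`, `writing mode`), the cursor commands, and the `DualModeEditor` class are all hypothetical. The sketch also assumes that speech has already been converted to text by a recognizer.

```python
from dataclasses import dataclass

# Hypothetical trigger phrases; the abstract does not name the real ones.
TO_COMMAND = "command mode"
TO_WRITING = "writing mode"


@dataclass
class DualModeEditor:
    text: str = ""
    cursor: int = 0
    mode: str = "writing"  # either "writing" or "command"

    def handle(self, utterance: str) -> None:
        """Route one already-recognized utterance according to the current mode."""
        phrase = utterance.strip().lower()
        if phrase == TO_COMMAND:
            self.mode = "command"
        elif phrase == TO_WRITING:
            self.mode = "writing"
        elif self.mode == "writing":
            self._insert(utterance)
        else:
            self._run_command(phrase)

    def _insert(self, words: str) -> None:
        # Dictated text is inserted at the cursor, which then moves past it.
        self.text = self.text[:self.cursor] + words + self.text[self.cursor:]
        self.cursor += len(words)

    def _run_command(self, phrase: str) -> None:
        # Illustrative cursor shortcuts; unknown phrases are ignored.
        if phrase == "cursor start":
            self.cursor = 0
        elif phrase == "cursor end":
            self.cursor = len(self.text)
        elif phrase == "cursor left":
            self.cursor = max(0, self.cursor - 1)
        elif phrase == "cursor right":
            self.cursor = min(len(self.text), self.cursor + 1)
        elif phrase == "delete" and self.cursor > 0:
            # Remove the character just before the cursor.
            self.text = self.text[:self.cursor - 1] + self.text[self.cursor:]
            self.cursor -= 1


editor = DualModeEditor()
for spoken in ["world", "command mode", "cursor start", "writing mode", "hello "]:
    editor.handle(spoken)
print(editor.text)
```

In this sketch the same input stream serves both purposes: an utterance is treated as dictation or as a command depending only on the current mode, which is one simple way to keep the two from being confused in a single voice interface.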
Keywords
Voice Recognition; Voice Command System; Voice Writing Application; Accessibility; Dual Voice Mode; Physical Disabled User; Mobile Accessibility
Citations & Related Records
Times Cited by KSCI: 3