Acknowledgement
이 논문은 정부(과학기술정보통신부)의 재원으로 정보통신기획평가원의 지원을 받아 수행된 지역지능화혁신인재양성사업임(IITP-2024-2020-0-01462).
References
- Z. Zhang, J. Geiger, J. Pohjalainen, A. E. D. Mousa, W. Jin, and B. Schuller, "Deep learning for environmentally robust speech recognition: An overview of recent developments," ACM Transactions on Intelligent Systems and Technology, Vol.9, No.49, pp.1-28. 2018. https://doi.org/10.1145/3178115
- C. Bregler and Y. Koing, ""Eigenlips" for robust speech recognition," in Proceedings of the ICASSP'94. IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, Vol.2, pp.669-672, 1994.
- U. Meier, R. Stiefelhagen, J. Yang, and A. Waibel, "Towards unrestricted lip reading," International Journal of Pattern Recognition and Artificial Intelligence, Vol.14, No.5, pp.571-585, 2000. https://doi.org/10.1142/S0218001400000374
- Y. G. Kim, "Feature selection method for speaker independent lip reading on noisy environments," Ph.D. dissertation, Chungbuk National University, Cheongju, Korea, 2019.
- 한민경, "독화에 청각적으로 제공된 기본 주파수(F0) 보완정보," Communication Sciences & Disorders, Vol.1, No.1, pp.150-177, 1996.
- D. G. Stork and M. E. Hennecke, "Speechreading by humans and machines: models, systems, and applications," Berlin: Springer Science & Business Media, pp.525-531, 1996.
- B. Martinez, P. Ma, S. Petridis, and M. Pantic, "Lipreading using temporal convolutional networks," in Proceedings of the ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing, Barcelona, pp.6319-6323, 2020.
- P. Ma, B. Martinez, and M. Pantic, "Towards practical lipreading with distilled and efficient models," in Proceedings of the ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing, Toronto, pp.7608-7612, 2021.
- K. Vayadande, T. Adsare, N. Agrawal, T. Dharmik, A. Patil, and S. Zod, "LipReadNet: A deep learning approach to lip reading," in Proceedings of the 2023 International Conference on Applied Intelligence and Sustainable Computing, Dharwad, pp.1-6, 2023.
- 최병문, "구화교육," 한국구화학교, 1970.
- M. Hao, M. Mamut, N. Yadikar, A.Aysa, and K. Ubul, "A survey of research on lipreading technology," IEEE Access, Vol.8, pp.204518-204544, 2020. https://doi.org/10.1109/ACCESS.2020.3036865
- 김민정, "임상중심 말소리장애." 1st ed, Seoul: 학지사, 2021.
- J. J. O'Neill and H. J. Oyer, "Visual communication for the hard of hearing: History, research, and methods," 2nd ed., New Jersey: Prentice Hall, 1981.
- S. H. Cho and C. D. Choi, "Viseme and its teaching strategy for speech-reading and language normalization of people with hearing loss," Audiology and Speech Research, Vol.14, No.4, pp.219-226, 2018. https://doi.org/10.21848/asr.2018.14.4.219
- G. Potamianos and C. Neti, "Improved ROI and within frame discriminant features for lipreading," in Proceedings of the 2001 International Conference on Image Processing, Thessaloniki, Vol.3, pp.250-253, 2001.
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, Las Vegas, pp.779-788, 2016.
- J. Redmon and A. Farhadi, "Yolov3: An Incremental Improvement," Computer Vision and Pattern Recognition, Vol.1804, pp.1-6, 2018.
- G. Jocher and A. Chaurasia, "Ultralytics YOLOv8 Docs" [Internet], https://docs.ultralytics.com/ko
- J. Luettin and N. A. Thacker, "Speechreading using probabilistic models," Computer Vision and Image Understanding, Vol.65, No.2, pp.163-178, 1997. https://doi.org/10.1006/cviu.1996.0570
- Y. Lan, B. J. Theobald, R. Harvey, E. J. Ong, and R. Bowden, "Improving visual features for lip-reading," in Proceedings of the Auditory-visual Speech Processing 2010, Hakone, paper S7-3, 2010.
- B. Sujatha and T. Santhanam, "A novel approach inter-grating geometric and Gabor wavelet approaches to improvise visual lip-reading," International Journal of Soft Computing (IJSC), Vol.5, pp.13-18, 2010. https://doi.org/10.3923/ijscomp.2010.13.18
- M. Z. Ibrahim and D. J. Mulvaney, "Robust geometrical-based lip-reading using Hidden Markov models," in Proceedings of the EUROCON 2013, Zagreb, pp.2011-2016, 2013.
- 박혜영, 이관용, "패턴 인식과 기계학습," 1st ed., Gyeonggi-do: 이한출판사. 2011.
- 박창순, 이광용, 이형석, 정호영, "생활 속의 임베디드 소프트웨어", 1st ed., Seoul: U-북, 2007.
- A. Koumparoulis, G. Potamianos, Y. Mroueh, and S. J. Rennie, "Exploring ROI size in deep learning based lipreading," in Proceedings of the Auditory-visual Speech Processing 2017, Stockholm, pp.64-69, 2017.