Browse > Article
http://dx.doi.org/10.9717/kmms.2016.19.2.469

Speech Synthesis System for Detected Objects by Smart Phone  

Kwon, Soon-Kak (Dept. of Computer Software Engineering, Dongeui University)
Publication Information
Abstract
This paper designs an application for detecting various objects using a smart phone with camera sensor, then implements the application that detects the number of faces in front of a user by using the Face API provided by android and generates a speech to the user. For implementing the application, the GoF strategy pattern is applied to design the application. It provides some advantages; first, the algorithm development schedule can separate the whole application development schedule; next, it makes easier to add the algorithm. For example, another detecting algorithm for the other objects (character, motion detection) that may be developed in the future, or it may be replaced by a more high-performance algorithm. With the propose method, a general smart phone can make some advantages that can provide information of various objects (such as moving people and objects, and detected character from signboards) to the person who is visually impaired.
Keywords
Camera; TTS; Face Detection; Strategy Pattern;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 character recognition, https://ko.wikipedia.org, (accessed Jan., 13, 2016).
2 G. Yoo, K. Jeong, and H. Moon, "Object Recognition Based on Speed Up Robust Feature Algorithm(SURF) for Smartphone Environment," Proceeding of The Institute of Electronics Engineers of Korea, pp. 544-545, 2010.
3 N. Kim, D. Kim, S. Kim, and S. Kwon, “Vocabulary Generation Method by Optical Character Recognition,” Journal of Korea Multimedia Society, Vol. 18, No. 8, pp. 943-949, 2015.   DOI
4 E. Go, Y. Ha, S. Choi, K. Lee, and Y. Park, “An Implementation of an Android Mobile System for Extracting and Retrieving Texts from Images,” Journal of Digital Contents Society, Vol. 12, No. 1, pp. 57-67, 2011.   DOI
5 M. Kim, “Individual Identification Using Ear Region Based on SIFT,” Journal of Korea Multimedia Society, Vol. 18, No. 1, pp. 1-8, 2015.   DOI
6 The development of face recognition technology and content services, http://www.kocca.kr, (accessed Apr., 10, 2014).
7 D. Kim, M. Sohn, and S. Lee, "A Study on Face Recognition Method Based on Binary Pattern Image under Varying Lighting Condition," Proceeding of The Institute of Electronics Engineers of Korea, pp. 61-74, 2012.
8 J. Kim, “Face Recognition by Fiducial Points Based Gabor and LBP Features,” The Journal of the Korea Contents Association, Vol. 13, No. 1, pp. 1-8, 2013.   DOI
9 M. Abdur Rahim, M. Najmul Hossain, T. Wahid, and M. Shafiul Azam, “Face Recognition using Local Binary Patterns (LBP),” Global Journal of Computer Science and Technology, Vol. 13, No. 4, pp. 1-8, 2013.
10 Object Recognition, http://www.haenaki.com/118, (accessed Jan., 11, 2016).
11 The 7 Best Android Text-To-Speech Engines, http://www.geoffsimons.com/2012/06/7-bestandroid-text-to-speech-engines.html, (accessed Jan., 05, 2016).
12 E. Gamma, R. Johnson, R. Johnson, and H. Vissides, Design Patterns, Elements of Reusable Object-Oriented Software, Addison-Wesley Publishing Company, Boston, 1995.
13 The Implementation of ths TTS in the Android, http://www.devblog.kr., (accessed Jan., 05, 2016).
14 Android Developer Text To Speech, http://developer.android.com/reference/android/speech/tts/TextToSpeech.html, (accessed Jan., 05, 2016).
15 Using the Google TTS(TextToSpeech), http://blog.daum.net/mailss/26, (accessed Jan., 05, 2016).
16 J. Yang, A Study on Syllable-level Concatenation for HMM-based Mixed-lingual Text-to-speech, Master's Thesis of Gwangju Institute of Science and Technology, 2010.
17 Demo Software: SIFT Keypoint Detector, http://www.cs.ubc.ca/-lowe/keypoints/ (accessed Jan., 05, 2016).
18 Character recognition, http://terms.naver.com, (accessed Jan., 13, 2016).