DOI QR코드

DOI QR Code

An Interactive Voice Web Browser Usable as a Multimodal Interface in Information Devices by Using VoiceXML

  • Jang, Min-Seok (Dept. of Computer Information Science, Kunsan National Univ.)
  • Published : 2004.10.01

Abstract

The present Web surroundings is mostly composed of HTML(Hypertext Mark-up Language) and thereby users obtain web informations mainly in GUI(Graphical User Interface) environment by clicking mouse in order to keep up with hyperlinked informations. However it is very inconvenient to work in this environment comparing with easily accessed one in which human`s voice is utilized for obtaining informations. Using VoiceXML, resulted from XML, for supplying the information through telephone on the basis of the contemporary matured technology of voice recognition/synthesis to work out the inconvenience problem, this paper presents the research results about VoiceXML VUI(Voice User Interface) Browser designed and implemented for realizing its technology and also the VoiceXML Dialog designed for the purpose of the browser's efficient use.

Keywords

References

  1. W3C Multimodal Interaction Activity, http://www.w3.org/2002/mmi/
  2. VoiceXML Forum, http://www.voicexml.org
  3. VoiceXML 2.0 Spec., http://www.w3.org/TR/voicexml20/
  4. Peter j. Danielsen, "The Promise of a Voice- Enabled Web", IEEE Computer, VOL.33, NO.3, pp.104-106, Aug. 2000 https://doi.org/10.1109/2.868708
  5. Snowshore Networks, http://www.snowshore.com
  6. BM Voice Toolkit, http://www-4.ibm.com/software/speech/enterprise/ vtoolkit.html
  7. Microsoft Speech, http://www.microsoft.com/speech/
  8. Nuance, http://www.nuance.com/prodserv/prodnuance.html
  9. SpeechWorks OpenSpeech Server, http://www.speechworks.com/products/speechrec/ index.cfm
  10. Microsoft Agent, http://www.microsoft.com/agent/
  11. R.Lau, G.Flammia, C.Pao, and V.Zue, "WebGALAXY: Beyond Point and Click a Conversational Interface To a Browser", in Proc. Sixth International World Wide Web Conference (M.R. Genesereth and A. Patterson, eds.), Santa Clara, CA, pp. 119-128, Apr 1997
  12. Chieko Asakawa et al, "Annotation Based Transcoding for Nonvisual Web Access", Proc. ASSET'00, pp.172-179, Nov. 2000
  13. IBM, http://www-4.ibm.com/software/webservers/appserv
  14. Stephen Breitenbach, et al, Early Adopter VoiceXML, Wrox Press Inc., p.300, Aug. 2001
  15. W3C DOM Requirements, http://www.w3.org/TR/DOM-Requirements