• Title/Summary/Keyword: voiceXML

Search Result 101, Processing Time 0.022 seconds

Strategy for Implementing A Voice Web Browser Based WIPI (WIPI기반 음성 웹브라우저 구현 방안)

  • Yu Se-Young;Kim Byung-Ki
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2006.05a
    • /
    • pp.501-504
    • /
    • 2006
  • 인터넷 및 휴대폰들이 일반화되고 음성처리 기술이 실용화 단계로 발전함에 따라 음성 응용분야가 새로운 이슈로 떠오르고 있다. 음성처리 기술은 사람의 말을 알아들을 수 있는 귀와 사람에게 말을 할 수 있는 입을 마련해주는 새로운 분야다. 그리고, 음성으로 웹의 컨텐츠를 개발하기 위한 표준 언어인 VoiceXML, SALT가 빠르게 보급되고 있다. 음성인식과 음성합성 기술이 꾸준히 발전하여 음성 포털 서비스나 자동 음성 안내 시스템 등에 음성인식과 음성합성 기술이 채택되는 등 상용화 수준에 이르렀다. 사람에게 가장 편리한 정보 습득 방법은 음성이고 이러한 음성을 적용한 음성 웹 브라우저를 현재 유선 상에서 사용하고 있다. 하지만 아직까지 무선 플랫폼에 적용하여 사용하는 브라우저는 개발되지 않고 있다. 사용자에게 친숙한 무선인터넷 환경을 제공하고자 무선 음성 웹 브라우저를 구현방안을 제시하고자 한다.

  • PDF

A Design for Mobile Contents Converting Using XML Parser Extraction (XML Parser추출에 의한 모바일 컨텐츠 변환 설계)

  • 김영선;장덕철
    • Journal of Korea Multimedia Society
    • /
    • v.6 no.2
    • /
    • pp.267-275
    • /
    • 2003
  • The development of the mobile communication has been enlarged gradually and has been changed voice centered services into data centered ones supporting Internet to provide various kinds of mobile internet services. Researches for data transmission have been achieved actively lot the effective approachment of web in the mobile terminal. The users using mobile communication are increasing for its easy handcarried and movement and the information in internet based upon wire communication has been changed into mobile communication fast. The development of mobile internet requires various mobile contents using mobile internet at any time and at any place due to rapid enhance of network capacity and mobile internet according to network expansion. To accept these needs, there are many difficult problems that contents must be remade according to various terminal features and the develoument costs much. To solve these problems, this paper has the pun)me that contents of Web documents are changed and processed through XML palter and design new mobile contents system, providing mobile services rapidly and obtaining the charge reduction result with the enhance of contents development to the shortening of a development Period.

  • PDF

Multimodal User Interfaces for Web Services (웹 서비스를 위한 멀티 모달 사용자 인터페이스)

  • Song Ki-Sub;Kim Yeon-Seok;Lee Kyong-Ho
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.06b
    • /
    • pp.46-48
    • /
    • 2006
  • 본 논문에서는 웹 서비스의 WSDL 문서로부터 멀티 모달 유저 인터페이스를 동적으로 생성하는 방법을 제안한다. 이를 위해 W3C에서 제안한 사용자 인터페이스 관련 기술인 XForms와 VoiceXML을 소개하고. XForms에 기반한 사용자 인터페이스 생성 알고리즘을 제안한다. 제안된 방법은 WSDL 문서의 구조를 분석하고. 스키마로부터 데이터의 타입에 따른 적합한 컨트롤을 매핑하여 최적의 멀티 모달 사용자 인터페이스를 구성한다.

  • PDF

A Study on the JAVA Beans Component Architecture in Speech Recognition Flight Information System Using (VoiceXML을 사용한 음성 인식 항공 정보 시스템에서의 JAVA Beans Component 구조에 관한 연구)

  • 장준식;윤재석;김국보
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2002.05c
    • /
    • pp.105-111
    • /
    • 2002
  • 최근까지 웹은 컴퓨터 상에서의 디스플레이, 키보드, 포인팅 장치들과 같은 비주얼 인터페이스를 통해서 정보 전달 및 서비스를 해오고 있다. 또한 이들은 일부의 모바일용 서비스를 제외하고 대부분이 익스플로어나 네스케이프 등의 웹브라우져를 지원하는 서비스를 해오고 있다. 이와 같은 시스템은 시간과 공간에 제약이 있으며 지원하는 브라우저가 있어야 하는 단점이 있다. 전화의 보급률은 컴퓨터나 기타 장치들에 비해 높고, 음성은 사람에게 쉽게 다가갈 수 있고 편하게 사용할 수 있는 인터페이스이다. 본 논문에서는 지금까지의 보는 것 중심의 웹 서비스를 듣고 말하는 웹 서비스로 음성 인식 항공 정보 시스템으로 설계ㆍ구현하였다.

  • PDF

A Multimodal Interface for Telematics based on Multimodal middleware (미들웨어 기반의 텔레매틱스용 멀티모달 인터페이스)

  • Park, Sung-Chan;Ahn, Se-Yeol;Park, Seong-Soo;Koo, Myoung-Wan
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.41-44
    • /
    • 2007
  • In this paper, we introduce a system in which car navigation scenario is plugged multimodal interface based on multimodal middleware. In map-based system, the combination of speech and pen input/output modalities can offer users better expressive power. To be able to achieve multimodal task in car environments, we have chosen SCXML(State Chart XML), a multimodal authoring language of W3C standard, to control modality components as XHTML, VoiceXML and GPS. In Network Manager, GPS signals from navigation software are converted to EMMA meta language, sent to MultiModal Interaction Runtime Framework(MMI). Not only does MMI handles GPS signals and a user's multimodal I/Os but also it combines them with information of device, user preference and reasoned RDF to give the user intelligent or personalized services. The self-simulation test has shown that middleware accomplish a navigational multimodal task over multiple users in car environments.

  • PDF

Design of Specialized User Interface for Mobile Ubiquitous Devices Based on Using Patterns (사용자의 사용 방식에 근거한 이동형 유비쿼터스 단말기의 사용자 인터페이스 환경 설계)

  • Na, SangYeob;Yoo, HeeYong
    • The Journal of Korean Association of Computer Education
    • /
    • v.9 no.6
    • /
    • pp.79-87
    • /
    • 2006
  • An ubiquitous environment has been developed in order to allow users to use information more easily. These environments are based on advanced development of mobile ubiquitous hardwares. Currently, a various user interfaces are developed for mobile ubiquitous devices using the graphic or voice. In this paper, propose a specialized graphical user interface which is based on analysis of a user profile. This user interface can provides suitable interface for individual users using XML information on the small screen of mobile ubiquitous devices.

  • PDF

Design of Gesture based Interfaces for Controlling GUI Applications (GUI 어플리케이션 제어를 위한 제스처 인터페이스 모델 설계)

  • Park, Ki-Chang;Seo, Seong-Chae;Jeong, Seung-Moon;Kang, Im-Cheol;Kim, Byung-Gi
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.1
    • /
    • pp.55-63
    • /
    • 2013
  • NUI(Natural User Interfaces) has been developed through CLI(Command Line Interfaces) and GUI(Graphical User Interfaces). NUI uses many different input modalities, including multi-touch, motion tracking, voice and stylus. In order to adopt NUI to legacy GUI applications, he/she must add device libraries, modify relevant source code and debug it. In this paper, we propose a gesture-based interface model that can be applied without modification of the existing event-based GUI applications and also present the XML schema for the specification of the model proposed. This paper shows a method of using the proposed model through a prototype.

Development of HTMLtoVTML Conversion Agent using Embedded Text and Priori Structural Knowledge (내장 문자와 사전 구조 지식을 이용한 HTMLtoVXML 변환 에이전트 개발)

  • Jang, Young-Gun
    • The KIPS Transactions:PartD
    • /
    • v.10D no.2
    • /
    • pp.343-350
    • /
    • 2003
  • This paper presents a new agent which convert HTML contents to VXML contents automatically for voice services via web. In this paper, I propose an interactive hybrid sequential contents selection method to select desired contents fast and robustly from known web pages. It uses real time structural features as well as embedded text and/or priori structural knowledge such as link symbol position. To verify its effectiveness, a full agent system is implemented and tested. The method reflects user intention more accurately than conventional selections using structural features and is more robust to variations of HTML programming techniques. The agent is fast and has less computational burden than methods use XML or XHTML conversion as intermediate stage.

A Study on the Multi-Modal Browsing System by Integration of Browsers Using lava RMI (자바 RMI를 이용한 브라우저 통합에 의한 멀티-모달 브라우징 시스템에 관한 연구)

  • Jang Joonsik;Yoon Jaeseog;Kim Gukboh
    • Journal of Internet Computing and Services
    • /
    • v.6 no.1
    • /
    • pp.95-103
    • /
    • 2005
  • Recently researches about multi-modal system has been studied widely and actively, Such multi-modal systems are enable to increase possibility of HCI(Human-computer Interaction) realization, enable to provide information in various ways and also enable to be applicable in e-business application, If ideal multi-modal system can be realized in future, eventually user can maximize interactive usability between information instrument and men in hands-free and eyes-free, In this paper, a new multi-modal browsing system using Java RMI as communication interface, which integrated by HTML browser and voice browser is suggested and also English-English dictionary search application system is implemented as example.

  • PDF

SMIL Authoring System for Multi-media synchronization and representation (멀티미디어 동기화 및 표현을 위한 SMIL 저작 시스템)

  • Ham, Jong-Wan;Jin, Du-Seok;Choi, Bong-Kyu;Cao, Ke-Rang;Park, Man-Seob;Jung, Hoe-Kyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2009.05a
    • /
    • pp.653-656
    • /
    • 2009
  • Currently with development of development and the hardware of the superhigh speed network about increase is spreading out at the rapid pace the many multimedia contents quite from internet. The production environment is growing about the multimedia contents because of such as circumstance, as well as multimedia contents will increase. However, Numerous voice, the picture, with text etc. the time of the same multimedia contents and problem of spatial synchronization occur, started. W3C(World Wide Web Consortium) solves like this problem point presented the method for. Does so, SMIL(Synchronized Multimedia Integration Language) where puts a base in XML(Extensible Markup Language) will be able to compose the expression of the multimedia contents which is various standard was proposed. SMIL the individual multimedia object of chain with time will be able to integrate with the multimedia presentation which is synchronized spatial in order. In this paper a variety of multimedia content and synchronization of the time and space, to be represented by integrating the design and implementation of SMIL authoring system.

  • PDF