• Title/Summary/Keyword: Voice interface

Search Result 298, Processing Time 0.023 seconds

A Fuzzy-Neural Network Based Human-Machine Interface for Voice Controlled Robots Trained by a Particle Swarm Optimization

  • Watanabe, Keigo;Chatterjee, Amitava;Pulasinghe, Koliya;Izumi, Kiyotaka;Kiguchi, Kazuo
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2003.09a
    • /
    • pp.411-414
    • /
    • 2003
  • Particle swarm optimization (PSO) is employed to train fuzzy-neural networks (FNN), which can be employed as an important building block in real life robot systems, controlled by voice-based commands. The FNN is also trained to capture the user spoken directive in the context of the present performance of the robot system. The system has been successfully employed in a real life situation for navigation of a mobile robot.

  • PDF

Development of an Embedded System for Ship′s Steering Gear using Voice Recognition Module (음성인식모듈을 이용한 선박조타용 임베디드 시스템 개발)

  • 서기열;홍태호;김화영;박계각
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 2004.04a
    • /
    • pp.144-148
    • /
    • 2004
  • Recently, various studies had been made for automatic control system of small ships, in order to improve maneuvering and to reduce labor and working on board. To achieve efficient operation of small ships, it had accomplished to rapid development of automatic technique, but the ship operation had been more complicated because of the need to handle various gauges and instruments. To solve these problems, there are examples to be applied to the speech information processing technologies which is one of the human interface methods in the system operation of ship, but the implementation of definite system is still incomplete. Therefore, the purpose of this paper is to implement the control system for ship steering using the voice recognition module.

  • PDF

Intelligent Retrieval System with Interactive Voice Support (대화형 음성 지원을 통한 지능형 검색 시스템)

  • Moon, K.J.;Yoo, Y.S.
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.9 no.1
    • /
    • pp.29-35
    • /
    • 2015
  • In this paper, we propose a intelligent retrieval system with interactive voice support. The developed system helps to find misrecognized words by using the relationship between lexical items in a sentence recognition and present the correct vocabulary. In this study, we implement a simulation system that can be proposed to determine the usefulness of the product search assistance system which offers applications. Experimental results were confirmed to correct the wrong speech recognition vocabulary in a simple user interface to help the product search.

  • PDF

The Study of Web-tool for Scholarly Discussion and Publishing : The Case of KIPS Cyber Forum (WWW에서의 학술토론과 출판에 관한 연구 - KIPS의 사례를 중심으로 -)

  • 김재관
    • Journal of Korea Technology Innovation Society
    • /
    • v.2 no.1
    • /
    • pp.44-57
    • /
    • 1999
  • KIPS is a net-world, cyberspace for scholars in Public Administration and Policy Sciences in WWW. All knowledge-intensive work has its core the publishing and debating of document. We have created a cyber forum for that work KIPS Cyber Forum has adapted ‘D3E’, the web-tool kit for non-technical users to easily debate and publish documents that exploit to the full networked interactive web media. And, for real-time communication, we added it the voice conferencing system. KIPS has opened Cyber Forum service in November 1998. The visitors on KWS Cyber Forum are increasingly growing, but the participants on the debate are a few. This means that the problems of Cyber Forum Service are not technical, but participation. The result imply that, at now, high participation of scholars on the debate is needed, at first, by the detailed guides for internet, www and relevant technical information. After that more expertly designed interface is to be important.

  • PDF

VoiceEPG: Speech Interface for Electronic Program Guide (전자프로그램 가이드를 위한 음성 인터페이스)

  • 김한수;황인준
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.10c
    • /
    • pp.589-591
    • /
    • 2003
  • 최근 디지털 TV 방송의 활성화에 힘입어 수많은 채널을 통한 TV 프로그램 방송이 가능하게 되었다. 이로 인해 디지털 TV 시청자들은 신문 또는 TV 가이드와 같은 기존 인쇄매체를 통해 자신이 원하는 TV 프로그램 스케줄을 얻기가 사실상 매우 어렵게 되었다. 이와 같은 문제점을 해결하기 위해 디지털 TV 환경에서는 전자 프로그램 가이드(EPG: Electronic Program Guide)를 제공한다. 현재 제공되고 있는 EPG 서비스들은 대개 디지털 TV 화면 또는 각 방송사 웹 사이트 그리고 이동 단말기 등을 통해서 서비스 되고 있다. 대부분의 기존 연구들은 EPG 정보를 화면상에 시각적으로 제공하는 측면에만 초점을 두고 있다. 하지만 실질적으로 사용자 입장에서는 원하는 방송 프로그램의 스케줄 정보를 찾기 위해서 수백 채널에 달하는 방송 프로그램에 대한 정보를 일일이 검색하는 것은 매우 힘든 일이다. 게다가 사용자가 원하는 키워드를 직접 입력하는 방식 또한 사용자를 매우 번거롭게 한다. 따라서 본 논문에서는 EPG 서비스 방식에 VoiceXML 관련 기술을 접목하여 이동 단말기상에서 간단한 음성입력을 통해 EPG 서비스를 제공받을 수 있는 음성 인터페이스를 제안한다.

  • PDF

System Performance and Traffic Control for the AAL Type 2 Traffic in IMT-2000 Networks (IMT-2000 망에서 AAL-2 구조의 트래픽 제어 및 시스템 성능)

  • Ryu, Byung-Han;Ahn, Jee-Hwan;Baek, Jang-Hyun
    • IE interfaces
    • /
    • v.13 no.2
    • /
    • pp.178-187
    • /
    • 2000
  • In this paper, we investigate the system performance when the voice traffic is constructed as the ATM Adaptation Layer type 2(AAL-2) and then it is transmitted to the Base Station Controller(BSC) from the Base Station Transceiver Subsystem(BTS) through El link in International Mobile Telecommunication-2000 (IMT-2000) network. For this purpose, we first briefly describe the architecture of the BTS and the BSC, and then model it as a queueing network. By simulation study, we present the required processing time at traffic control blocks and the timeout time which should be set for multiplexing the user packets in the LIU(Line Interface Unit). Further, we evaluate the performance of physical links and the timeout probability that user packets can not be multiplexed within the established timeout time, and the multiplexing gain. Finally, we present the number of voice users who can be simultaneously admitted on one El link and 99.9% value of the transmission delay from the Radio Channel Element(RCE) to the Selector & Transcoder Subsystem(STS).

  • PDF

Design of Metaverse for Two-Way Video Conferencing Platform Based on Virtual Reality

  • Yoon, Dongeon;Oh, Amsuk
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.3
    • /
    • pp.189-194
    • /
    • 2022
  • As non-face-to-face activities have become commonplace, online video conferencing platforms have become popular collaboration tools. However, existing video conferencing platforms have a structure in which one side unilaterally exchanges information, potentially increase the fatigue of meeting participants. In this study, we designed a video conferencing platform utilizing virtual reality (VR), a metaverse technology, to enable various interactions. A virtual conferencing space and realistic VR video conferencing content authoring tool support system were designed using Meta's Oculus Quest 2 hardware, the Unity engine, and 3D Max software. With the Photon software development kit, voice recognition was designed to perform automatic text translation with the Watson application programming interface, allowing the online video conferencing participants to communicate smoothly even if using different languages. It is expected that the proposed video conferencing platform will enable conference participants to interact and improve their work efficiency.

Usability Analysis and Improvement Plan for Intelligent Speakers in the 4th Industrial Revolution Environment

  • Seong-Hoon Lee;Dong-Woo Lee
    • International journal of advanced smart convergence
    • /
    • v.12 no.4
    • /
    • pp.119-125
    • /
    • 2023
  • Smart home in the 4th industrial revolution environment is where all devices in the home are connected to each other to provide the optimal living environment desired by the user. Artificial intelligence speakers are being used as a way to manage and control all devices used in this environment. The function of an artificial intelligence speaker ranges from simple music playback to serving as an interface that controls and manages all devices in a smart home space. In this study, we investigated and analyzed the usability of artificial intelligence speakers based on the current status of domestic and overseas markets and the survey contents of two organizations (Korea Consumer Agency and Korea Information and Communication Policy Institute (KISDI)). In addition, we investigated and analyzed the usability of artificial intelligence speakers. Based on the results of responses from users from two related organizations, major problems were derived, and major improvement measures, such as discovering new functions and improving voice recognition performance, were also described.

Development of Half-Mirror Interface System and Its Application for Ubiquitous Environment (유비쿼터스 환경을 위한 하프미러형 인터페이스 시스템 개발과 응용)

  • Kwon Young-Joon;Kim Dae-Jin;Lee Sang-Wan;Bien Zeungnam
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.12
    • /
    • pp.1020-1026
    • /
    • 2005
  • In the era of ubiquitous computing, human-friendly man-machine interface is getting more attention due to its possibility to offer convenient services. For this, in this paper, we introduce a 'Half-Mirror Interface System (HMIS)' as a novel type of human-friendly man-machine interfaces. Basically, HMIS consists of half-mirror, USB-Webcam, microphone, 2ch-speaker, and high-speed processing unit. In our HMIS, two principal operation modes are selected by the existence of the user in front of it. The first one, 'mirror-mode', is activated when the user's face is detected via USB-Webcam. In this mode, HMIS provides three basic functions such as 1) make-up assistance by magnifying an interested facial component and TTS (Text-To-Speech) guide for appropriate make-up, 2) Daily weather information provider via WWW service, 3) Health monitoring/diagnosis service using Chinese medicine knowledge. The second one, 'display-mode' is designed to show decorative pictures, family photos, art paintings and so on. This mode is activated when the user's face is not detected for a time being. In display-mode, we also added a 'healing-window' function and 'healing-music player' function for user's psychological comfort and/or relaxation. All these functions are accessible by commercially available voice synthesis/recognition package.

Design of Specialized User Interface for Mobile Ubiquitous Devices Based on Using Patterns (사용자의 사용 방식에 근거한 이동형 유비쿼터스 단말기의 사용자 인터페이스 환경 설계)

  • Na, SangYeob;Yoo, HeeYong
    • The Journal of Korean Association of Computer Education
    • /
    • v.9 no.6
    • /
    • pp.79-87
    • /
    • 2006
  • An ubiquitous environment has been developed in order to allow users to use information more easily. These environments are based on advanced development of mobile ubiquitous hardwares. Currently, a various user interfaces are developed for mobile ubiquitous devices using the graphic or voice. In this paper, propose a specialized graphical user interface which is based on analysis of a user profile. This user interface can provides suitable interface for individual users using XML information on the small screen of mobile ubiquitous devices.

  • PDF