Search | Korea Science

Voice Command Web Browser Using Variable Vocabulary Word Recognizer (가변어휘 단어 인식기를 사용한 음성 명령 웹 브라우저)

이항섭
- The Journal of the Acoustical Society of Korea / v.18 no.2 / pp.48-52 / 1999
In this paper, we describe a Voice Command Web Browser that uses a variable vocabulary word recognizer to allow Internet surfing through Korean speech recognition. The distinguishing feature of this browser is that the links and menus of the web browser can be operated by speech, so the speech interface can be used together with the mouse for web browsing. Because the set of recognition candidates changes dynamically with each Web page, we use a variable vocabulary word recognizer. The recognizer was trained on 3,848 POW (Phonetically Optimized Words) words so that it can recognize new words that did not appear in the training data. Preliminary test results showed a speaker-independent, vocabulary-independent recognition rate of 93.8% for 32 Korean words. The Voice Command Web Browser was developed on Windows 95/NT using Netscape Navigator and incorporates usability test results in order to offer an easy interface to users unfamiliar with speech interfaces. In an on-line experiment under speaker-independent and environment-independent conditions, the Voice Command Web Browser achieved a recognition accuracy of 90%.
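The idea of a recognition vocabulary that changes with each page can be illustrated with a minimal sketch. This is not the authors' recognizer: it assumes the speech recognizer already returns a text string, and the names `LinkCollector`, `build_vocabulary`, and `resolve_command`, the sample page, and the 0.8 similarity cutoff are all hypothetical. It only shows how the link labels of the current page could become the active command set.

```python
from html.parser import HTMLParser
from difflib import get_close_matches


class LinkCollector(HTMLParser):
    """Collect anchor text and href pairs from an HTML page."""

    def __init__(self):
        super().__init__()
        self.links = {}      # lowercased anchor text -> href
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            label = " ".join("".join(self._text).split()).lower()
            if label:
                self.links[label] = self._href
            self._href = None


def build_vocabulary(html):
    """Return the per-page command vocabulary: link labels mapped to targets."""
    collector = LinkCollector()
    collector.feed(html)
    return collector.links


def resolve_command(recognized, vocabulary):
    """Map a recognizer output string to a link target, or None if no match."""
    match = get_close_matches(recognized.lower(), list(vocabulary), n=1, cutoff=0.8)
    return vocabulary[match[0]] if match else None


# Hypothetical page; the vocabulary is rebuilt whenever a new page loads.
page = '<a href="/news">News</a> <a href="/mail">Mail</a>'
vocab = build_vocabulary(page)
print(resolve_command("news", vocab))
```

Rebuilding the vocabulary on every page load is what keeps the candidate list small, which is the property the abstract relies on for vocabulary-independent recognition.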

Implementation of Music Broadcasting Service System in the Shopping Center Using Text-To-Speech Technology (TTS를 이용한 매장 음악 방송 서비스 시스템 구현)

Chang, Moon-Soo;Kang, Sun-Mee
- Speech Sciences / v.14 no.4 / pp.169-178 / 2007
This paper describes the development of a service system for small shops that supports not only music broadcasting but also editing and generating voice announcements using TTS (Text-To-Speech) technology. The system is web-based so that it can be accessed easily whenever and wherever it is needed. It controls sound through a Silverlight media player built on ASP.NET 2.0, without any additional application software. Ajax controls are used so that multiple users can be served even under maximum load. TTS runs on the server side so that the service can be provided without relying on the user's computer. Because of the system's convenience and usefulness, businesses can provide better service to many shops. Additional functions such as statistical analysis should further help shop management provide the desired services.

Construction of Integration Management System of Various Speech Corpora (다양한 음성코퍼스의 통합 관리시스템 구축)

Rhyu, Kyeong-Taek;Jeong, Chang-Won;Kim, Do-Goan;Lee, Young-Ju
- Journal of the Korea Society of Computer and Information / v.11 no.1 s.39 / pp.259-271 / 2006
In this paper, we propose the design and implementation of an integrated management system for various speech corpora. The purpose is to manage, within a single system, the various kinds of speech corpora needed for speech research, including corpora constructed in different data formats. In addition, we consider ways to let users search effectively for speech corpora that meet their conditions and to add newly constructed corpora easily. To achieve this goal, we design a global schema that integrates newly added information without changing existing speech corpora, and we construct a web-based integrated management system based on that schema that can be accessed without temporal or spatial restrictions. Finally, we describe the web-based interface that results from running the service and show the efficiency of using an index view in the implementation of the integrated management system.

Intelligent Speech Web Considering User Inclination (사용자의 성향을 고려하는 지능형 음성 웹)

Kwon, Hyeong-Joon;Hong, Kwang-Seok
- The KIPS Transactions:PartB / v.15B no.4 / pp.347-354 / 2008
In this paper, we propose a method for making the speech Web personalized and intelligent. The proposed system records each piece of previously requested information as a transaction, mines association rules from those transactions, and discovers frequent itemsets from the requests. Based on the frequent itemsets, the method recommends relevant information to users whose inclinations are similar to those of previous users. By implementing the proposed system and running verification experiments, we confirmed that it can recommend frequently requested information as relevant information.
https://doi.org/10.3745/KIPSTB.2008.15-B.4.347
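The transaction-based recommendation described in this abstract can be sketched roughly as follows. The abstract does not say which mining algorithm the authors use, so this sketch substitutes plain co-occurrence counting with a support threshold (not Apriori or any specific rule miner). The function names, the sample request log, and the threshold are hypothetical.

```python
from itertools import combinations
from collections import Counter


def frequent_itemsets(transactions, min_support, max_size=2):
    """Count itemsets up to max_size and keep those meeting min_support."""
    counts = Counter()
    for items in transactions:
        unique = sorted(set(items))
        for size in range(1, max_size + 1):
            for combo in combinations(unique, size):
                counts[combo] += 1
    return {combo: n for combo, n in counts.items() if n >= min_support}


def recommend(requested, frequent, top_n=3):
    """Suggest items that frequently co-occur with the requested item."""
    scored = Counter()
    for combo, n in frequent.items():
        if len(combo) == 2 and requested in combo:
            other = combo[0] if combo[1] == requested else combo[1]
            scored[other] = n
    return [item for item, _ in scored.most_common(top_n)]


# Hypothetical request log: each inner list is one user's past session.
log = [
    ["weather", "news"],
    ["weather", "traffic"],
    ["weather", "news", "stocks"],
    ["news", "stocks"],
]
frequent = frequent_itemsets(log, min_support=2)
print(recommend("weather", frequent))
```

Enumerating every pair is only practical for small logs; a real system would likely prune candidates level by level, which is an assumption about scale rather than something the abstract states.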

Integrating Pronunciation into a Classroom and on the Web

Kim, He-Kyung
- Proceedings of the KSPS conference / 2000.07a / pp.271-282 / 2000
The aim of this presentation is to suggest a method of integrating pronunciation teaching into a typical communicative classroom and on the web. It does so by analyzing useful communicative expressions with a speech analyzer so that English learners can see the sound patterns of those expressions and say them correctly. It is hoped that this presentation will prompt teachers to understand the current role of pronunciation in communicative English programs and to see that the WWW can help students improve their pronunciation and develop their speaking and listening skills. It also suggests the need for a database of visualized communicative expressions.

Applying Mobile Agent for Internet-based Distributed Speech Recognition

Saaim, Emrul Hamide Md;Alias, Mohamad Ashari;Ahmad, Abdul Manan;Ahmad, Jamal Nasir
- Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2005.06a / pp.134-138 / 2005
Several applications have been developed for Internet-based speech recognition. Internet-based speech recognition is a distributed application, and various techniques and methods have been used for that purpose. Currently, the client-server paradigm is one of the popular techniques used for communication in web applications. However, there is a new paradigm with the same purpose: mobile agent technology. Mobile agent technology has several advantages when working in distributed Internet-based systems. This paper presents the application of mobile agent technology to Internet-based speech recognition based on a client-server processing architecture.

An Investigation for Design and Implementation of an Integrated Data Management System of Various Speech Corpora (다양한 음성코퍼스의 통합관리시스템의 설계 및 구현에 관한 검토)

Hwang Kyunghun;Jeong Changwon;Kim Youngil;Kim Bongwan;Lee Yongju
- Proceedings of the KSPS conference / 2003.10a / pp.69-72 / 2003
In this paper, we investigate various factors that are relevant to the design and implementation of an integrated management system for various speech corpora. The purpose is to manage, within a single system, the various kinds of speech corpora needed for speech research, including corpora constructed in different data formats. In addition, we consider ways to let users search effectively for speech corpora that meet their conditions and to add newly constructed corpora easily. To achieve this goal, we design a global schema that integrates newly added information without changing existing speech corpora, and we construct a web-based integrated management system based on that schema that can be accessed without temporal or spatial restrictions. We also show the steps by which these can be implemented and, examining the system, describe related topics for future study.

Automatic Generation of Voice Web Pages Based on SALT (SALT 기반 음성 웹 페이지의 자동 생성)

Ko, You-Jung;Kim, Yoon-Joong
- Journal of KIISE:Software and Applications / v.37 no.3 / pp.177-184 / 2010
With the introduction of voice browsers, voice dialog applications have become available in the Web environment. A voice dialog application consists of voice Web pages, which require the dialog scripts to be translated into SALT (Speech Application Language Tags). Current Web pages have been designed for visual interaction, but they are potentially capable of supporting voice dialog. This paper therefore proposes an automated voice Web generation method that finds the elements usable for voice dialog in HTML-based Web pages and converts them into SALT. The automatic generation system consists of a lexical analyzer and a syntactic analyzer that convert a Web page described in HTML into a voice Web page described in HTML+SALT. The converted voice Web page is designed to handle not only the existing mouse and keyboard input but also voice dialog.
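As a rough illustration of the HTML to HTML+SALT conversion, the sketch below scans a page for text input fields and appends a prompt/listen pair for each one. It is not the authors' analyzer. The `salt:prompt`, `salt:listen`, `salt:grammar`, and `salt:bind` element names and the `targetelement` and `value` attributes are written as recalled from the SALT 1.0 specification and should be checked against it; the grammar file names, the id scheme, and the function names are hypothetical.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr


class InputFinder(HTMLParser):
    """Lexical pass: collect the names of text <input> fields."""

    def __init__(self):
        super().__init__()
        self.fields = []

    def handle_starttag(self, tag, attrs):
        attr = dict(attrs)
        is_text = attr.get("type", "text") == "text"
        if tag == "input" and is_text and attr.get("name"):
            self.fields.append(attr["name"])


def find_fields(html):
    """Return the names of the text inputs that could take voice input."""
    finder = InputFinder()
    finder.feed(html)
    return finder.fields


def salt_for_field(name):
    """Emit a prompt/listen pair that binds a recognized value to the field."""
    return (
        f'<salt:prompt id={quoteattr("ask_" + name)}>'
        f'Please say {escape(name)}.</salt:prompt>\n'
        f'<salt:listen id={quoteattr("rec_" + name)}>\n'
        # Hypothetical grammar file, one per field.
        f'  <salt:grammar src={quoteattr(name + ".grxml")}/>\n'
        f'  <salt:bind targetelement={quoteattr(name)} '
        f'value={quoteattr("//" + name)}/>\n'
        f'</salt:listen>'
    )


def to_voice_page(html):
    """Append SALT blocks to the original HTML, leaving visual input intact."""
    blocks = [salt_for_field(name) for name in find_fields(html)]
    return "\n".join([html] + blocks)


page = '<form><input type="text" name="city"><input type="submit" value="Go"></form>'
print(to_voice_page(page))
```

Appending the SALT blocks rather than rewriting the form keeps the original mouse and keyboard path untouched, which matches the abstract's goal of supporting both input modes.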

Development of a Voice User Interface for Web Browser using VoiceXML (VoiceXML을 이용한 VUI 지원 웹브라우저 개발)

Yea SangHoo;Jang MinSeok
- Journal of KIISE:Computing Practices and Letters / v.11 no.2 / pp.101-111 / 2005
Web information today is mainly described in HTML, and users obtain it through input devices such as the mouse and keyboard. The existing GUI environment therefore does not support voice, the most natural human means of acquiring information. To solve this problem, several vendors are developing voice user interfaces. However, these products are deficient in man-machine interactivity and in their accommodation of the existing web environment. This paper presents a web browser supporting a VUI (Voice User Interface) that uses increasingly mature speech recognition technology and VoiceXML, a markup language derived from XML. It provides users with both interfaces, VUI as well as GUI. In addition, XML Island technology is applied to the browser so that VoiceXML fragments are nested in HTML documents to accommodate the existing web environment. For better interactivity, dialogue scenarios for menus, bulletins, and search engines are also suggested.

Text to Speech System from Web Images (웹상의 영상 내의 문자 인식과 음성 전환 시스템)

안희임;정기철
- Proceedings of the IEEK Conference / 2001.06c / pp.5-8 / 2001
Programs based on a graphical user interface (GUI) have become commonplace with the advance of computer technology. Nevertheless, programs for the visually impaired have remained at the level of TTS (text-to-speech) programs, which prevents many visually impaired people from enjoying the pleasure and convenience of the information age. Noting the importance of character recognition in images, this paper describes the configuration of a system that converts text in a user-selected image to speech by extracting the character regions and carrying out character recognition.

