Search | Korea Science

A Voice-Annotation Technique in Mobile E-book for Reading-disabled People (독서장애인용 디지털음성도서를 위한 음성 어노테이션 기법)

Lee, Kyung-Hee;Lee, Jong-Woo;Lim, Soon-Bum
- Journal of Digital Contents Society
- /
- v.12 no.3
- /
- pp.329-337
- /
- 2011
Digital talking book has been developed to enhance reading experiences for reading-disabled people. In the existing digital talking book, however, annotations can be created only through the screen interfaces. Screen annotation interfaces is of no use for reading-disabled people because they need reader's eyesight. In this paper, we suggest a voice annotation technique can create notes and highlights at any playing time by using hearing sense and voice command. We design a location determination technique that pinpoints where a voice annotation should be placed in the playing sentences. To verify the effectiveness of our voice annotation technique, we implement a prototype in an android platform. We can find out by the black-blindfolded users testing that our system can perfectly locate the exact position that a voice annotation should be placed into.
https://doi.org/10.9728/dcs.2011.12.3.329 인용 PDF KSCI

Voice-Data Capacity in a multimedia mobile System with Multi-rate Data Traffic. (멀티미디어 이동통신 시스템에서 통신 속도별 음성-데이터 상호 용량계산)

Kwon, Young Soo
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2009.05a
- /
- pp.891-894
- /
- 2009
In this paper, a scheme to evaluate the number of users in a WCDMA supporting multi-rate traffic is presented through the calculations of Erlang capacity from a derived blocking probability. It is observed that voice-data Erlang capacities have an inverse linear relationship. When the $E_b/N_o$ decreases from 4 dB to 3 dB within the outage probability of 2 % and at the voice rate of 8 kbps, the results show an increase of 8 Erlang at the data rates of 15 kbps, 4 Erlang at 30 kbps, 2 Erlang at 60 kbps, and 1 Erlang at 120 kbps respectively and an increase of the Erlang capacities through a gradual decrease of the data rates from 960 kbps to 15 kbps. So this is useful for optimizing as a reference of design for the capacity in a WCDMA system.
PDF

Implementation of Scenario-based AI Voice Chatbot System for Museum Guidance (박물관 안내를 위한 시나리오 기반의 AI 음성 챗봇 시스템 구현)

Sun-Woo Jung;Eun-Sung Choi;Seon-Gyu An;Young-Jin Kang;Seok-Chan Jeong
- The Journal of Bigdata
- /
- v.7 no.2
- /
- pp.91-102
- /
- 2022
As artificial intelligence develops, AI chatbot systems are actively taking place. For example, in public institutions, the use of chatbots is expanding to work assistance and professional knowledge services in civil complaints and administration, and private companies are using chatbots for interactive customer response services. In this study, we propose a scenario-based AI voice chatbot system to reduce museum operating costs and provide interactive guidance services to visitors. The implemented voice chatbot system consists of a watcher object that detects the user's voice by monitoring a specific directory in real-time, and an event handler object that outputs AI's response voice by performing inference by model sequentially when a voice file is created. And Including a function to prevent duplication using thread and a deque, GPU operations are not duplicated during inference in a single GPU environment.
https://doi.org/10.36498/kbigdt.2022.7.2.91 인용 PDF KSCI

Voice Verification System for m-Commerce on CDMA Network

Kyung, Youn-Jeong
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.4E
- /
- pp.176-182
- /
- 2003
As the needs for wireless Internet service is increasing, the needs for secure m-commerce is also increasing. Conventional security techniques are reinforced by biometric security technique. This paper utilized the voice as biometric security techniques. We developed speaker verification system for m-commerce (mobile commerce) via wireless internet and wireless application protocol (WAP). We named this system the mVprotek. We implemented the system as client-server architecture. The clients are mobile phone simulator and personal digital assistant (PDA). The verification results are obtained by integrating the mVprotek system with SK Telecom's code dimension multiple access (CDMA) system. Utilizing f-ratio weighting and virtual cohort model normalization showed much better performance than conventional background model normalization technique.
PDF KSCI

A Study on the Multi-Modal Browsing System by Integration of Browsers Using lava RMI (자바 RMI를 이용한 브라우저 통합에 의한 멀티-모달 브라우징 시스템에 관한 연구)

Jang Joonsik;Yoon Jaeseog;Kim Gukboh
- Journal of Internet Computing and Services
- /
- v.6 no.1
- /
- pp.95-103
- /
- 2005
Recently researches about multi-modal system has been studied widely and actively, Such multi-modal systems are enable to increase possibility of HCI(Human-computer Interaction) realization, enable to provide information in various ways and also enable to be applicable in e-business application, If ideal multi-modal system can be realized in future, eventually user can maximize interactive usability between information instrument and men in hands-free and eyes-free, In this paper, a new multi-modal browsing system using Java RMI as communication interface, which integrated by HTML browser and voice browser is suggested and also English-English dictionary search application system is implemented as example.
PDF

Study on QoE of the VoIP Service for QoS levels over LTE Mobile Communication System (LTE 이동통신 시스템에서 QoS 변화에 따른 VoIP 서비스의 사용자 체감 품질 변화에 대한 연구)

Kim, Beom-Joon
- The Journal of the Korea institute of electronic communication sciences
- /
- v.11 no.3
- /
- pp.309-316
- /
- 2016
Recently, the voice service over a mobile communication system tends to be provided based on the packet-based technology. Even though the sufficient transmission rate is supported by LTE mobile communication system, the quality of VoIP service that is experienced by the user can be degraded by the change in the transmission conditions and the terminal mobility. This paper has established an environment on which experiments are conducted for the different values of the major parameters that represent the transmission conditions. The result can contribute to the decision of the requirement that the mobile system should meet for maintaining the quality of VoIP service.
https://doi.org/10.13067/JKIECS.2016.11.3.309 인용 PDF KSCI

Development of a Read-time Voice Dialing System Using Discrete Hidden Markov Models (이산 HM을 이용한 실시간 음성인식 다이얼링 시스템 개발)

Lee, Se-Woong;Choi, Seung-Ho;Lee, Mi-Suk;Kim, Hong-Kook;Oh, Kwang-Cheol;Kim, Ki-Chul;Lee, Hwang-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.1E
- /
- pp.89-95
- /
- 1994
This paper describes development of a real-time voice dialing system which can recognize around one hundred word vocabularies in speaker independent mode. The voice recognition algorithm in this system is implemented on a DSP board with a telephone interface plugged in an IBM PC AT/486. In the DSP board, procedures for feature extraction, vector quantization(VQ), and end-point detection are performed simultaneously in every 10 msec frame interval to satisfy real-time constraints after detecting the word starting point. In addition, we optimize the VQ codebook size and the end-point detection procedure to reduce recognition time and memory requirement. The demonstration system has been displayed in MOBILAB of the Korean Mobile Telecom at the Taejon EXPO'93.
PDF

Development of tangible language content system based on voice recording (음성녹음 기반의 실감형 어학시스템 콘텐츠 개발)

Na, Jong-Won
- Journal of Advanced Navigation Technology
- /
- v.17 no.2
- /
- pp.234-239
- /
- 2013
Learning a lesson about poor concentration and problems of the existing content, the system of language which could not be determined, Many teachers' assessment decision was made. As a result, voice recording based on the combination of ubiquitous technology and virtual reality technology, and install the projector in a classroom Through the learning content corresponding grade English student ID card attached RFID reader in each classroom, and students of RFID tags attached. In reality of the virtual three-dimensional image content foreigners and question-and-answer using the voice recording technology at the same time check the pronunciation and intonation level passes or level failure judged. Student education data to a central server system is configured to do so after saving to the DB through a feedback process, which provides information. Analysis of the issues that can have a common language content in the present study and Problem for voice recording technology to solve the problem and did not solve the existing language in the content level based classes.
https://doi.org/10.12673/jkoni.2013.17.2.234 인용 PDF KSCI

Implementation of Smart Convergent Communication System of Satellite and Wireless for Monitoring in Closed Room of Vessel (선박의 밀폐된 선내에서 해상관제를 위한 스마트 위성·무선 복합통신시스템 구현)

Park, Heum;Lee, Chang Bum;An, Sung Mun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.19 no.8
- /
- pp.1853-1858
- /
- 2015
The existence communications of vessel focused on voice, FAX, ISDN, etc. using satellite on the existent most communications on the ships, and recently, the ships need high quality smart communication service environments. In the results of experiments using the existent system, it could access on board, but in the closed room of vessel, was impossible to access e-mail and smart apps except voice communication. In the present paper, we implemented a novel communication system that can access voice, text, e-mail, file, vessel monitoring apps, etc. It consists of a convergent communication terminal combined with satellite and communication for Smartphone, and smart communication environment on the closed room. As the results, we can access a variety of smart communication in anywhere on board.
https://doi.org/10.6109/jkiice.2015.19.8.1853 인용 PDF KSCI KPUBS HTML

Ambiguity Types of the Homonymic & Heterographic Units for Improving Korean Voice Recognition System - a Preliminary Research (한국어 음성인식 시스템 향상을 위한 동음이철 단위의 중의성 유형 분류)

Yoon, Ae-Sun;Kang, Mi-Young
- Speech Sciences
- /
- v.15 no.4
- /
- pp.67-81
- /
- 2008
The accuracy rate of P2G (Phoneme-to-Grapheme) is one of the important factors determining the quality of unlimited voice recognition (VR) systems. Few studies were, however, conducted to reduce ambiguities of a phoneme string which can be segmented into a variety of different linguistic units (i.e. morphemes, words, eo-jeols), thus be transformed into more than one grapheme string. This paper is a preliminary research for building a large knowledge base of those homonymic & heterographic units(HHUs), which will provide unlimited Korean VR systems with more accurate P2G information. This paper analyzes 2 main factors generating HHUs: (1) boundary determination of the prosodic unit; (2) its segmentation into linguistic units. In this paper, linguistic characteristics determining variable boundaries of a prosodic unit are investigated, and the ambiguity types of HHUs are classified in accordance with their morphological and syntactic structures as well as with the phonological rules governing them.
PDF

Search Result 118, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)