• Title/Summary/Keyword: Voice problem

Search Result 338, Processing Time 0.023 seconds

Design and Implementation of Web Interworking Learning System Using VoiceXML (VoiceXML을 이용한 Web 연동 학습 시스템 설계 및 구현)

  • Kim Dong-Hyun;Cho Chang-Su;Shin Jeong-Hoon;Hong Kwang-Seok
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.42 no.2 s.302
    • /
    • pp.21-30
    • /
    • 2005
  • Development of both multimedia technology and communication network technology has accomplished many changes through the field of learning system. For the construction of a more efficient and clever learning system there is a research being done by the use of the Web and the telephone network. But until now, the case of current implemented teaming system is single system and so it has each merits and demerits. That is to say, when we use the learning system through the Web, the demerit is only possible by the static states using computer. For those who do not use the computer, the demerit is that the user must learn the use of the new system. Also, the case of using telephone network has merits that one can use the system anyplace, anytime by the telephone. But it has the problem of not being able to transmit information very efficiently. From these, this paper proposes the learning system that can be used efficiently and conveniently anyplace, anytime by connecting both telephone network and web. Also, we propose a new algorithm of user ID, password and name registration function using teaming system using VoiceXML and individual learning progress save function using VoiceXML and web.

Design and Implementation of the Remote Image Transmission System using CDMA Communication Network (CDMA 통신망을 이용한 원격 영상 전송 시스템의 설계 및 구현)

  • 박성욱;황수철;박종욱
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.7 no.3
    • /
    • pp.54-61
    • /
    • 2002
  • A remote image transmission apparatus combines robot technology and image transmission there is safe problem or place that a person can not go. Recently, control apparatus that use wire and RF between web server and robot for remote control are developed. But there is problem that must install internet line and communication distance. Transmission distance problem call solve when using the equipment of RF, but price of RF router is problem that is very high cost. In this paper, we developed remote control system using the CDMA cellular phone communication network that can control image transmission and image transmission apparatus to solve these problem. Developed system could solve defects of methods that use existent RF and internet. And could transmit the most suitable image and voice under limited condition include current communication network.

  • PDF

A Study on the Apparatus for Image Transmission and Transmission Control using Cellular Phone Network (셀룰러폰 통신망을 이용한 영상전송 및 전송제어 장치에 관한 연구)

  • 박성욱;황수철;박종욱
    • Journal of Internet Computing and Services
    • /
    • v.3 no.1
    • /
    • pp.1-10
    • /
    • 2002
  • A remote image transmission apparatus combines robot technology and image transmission there is safe problem or place that a person can not go. Recently, control apparatus that use wire and RF between web server and robot for remote control are developed. But, there is problem that must install Internet line and communication distance. Transmission distance problem can solve when using the equipment of RF, but price of RF router is problem that is very high cost. In this paper, we developed remote control system using the cellular phone communication network that can control image transmission and image transmission apparatus to solve these problem. Developed system could solve defects of methods that use existent RF and internet. And could transmit the most suitable image and voice under limited condition include current communication network.

  • PDF

Improving Eye-gaze Mouse System Using Mouth Open Detection and Pop Up Menu (입 벌림 인식과 팝업 메뉴를 이용한 시선추적 마우스 시스템 성능 개선)

  • Byeon, Ju Yeong;Jung, Keechul
    • Journal of Korea Multimedia Society
    • /
    • v.23 no.12
    • /
    • pp.1454-1463
    • /
    • 2020
  • An important factor in eye-tracking PC interface for general paralyzed patients is the implementation of the mouse interface, for manipulating the GUI. With a successfully implemented mouse interface, users can generate mouse events exactly at the point of their choosing. However, it is difficult to define this interaction in the eye-tracking interface. This problem has been defined as the Midas touch problem and has been a major focus of eye-tracking research. There have been many attempts to solve this problem using blink, voice input, etc. However, it was not suitable for general paralyzed patients because some of them cannot wink or speak. In this paper, we propose a mouth-pop-up, eye-tracking mouse interface that solves the Midas touch problem as well as becoming a suitable interface for general paralyzed patients using a common RGB camera. The interface presented in this paper implements a mouse interface that detects the opening and closing of the mouth to activate a pop-up menu that the user can select the mouse event. After implementation, a performance experiment was conducted. As a result, we found that the number of malfunctions and the time to perform tasks were reduced compared to the existing method.

Positioning control of a redundant actuator

  • Sasaki, M.;Setta, M.;Satoh, K.;Fujisawa, F.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1994.10a
    • /
    • pp.605-610
    • /
    • 1994
  • This paper discusses the solution to the precise positioning control problem applied to a simple model of a dual stage or redundant positioner. The dual stage actuator presented here uses a VCM(Voice Coil Motor) as a coarse actuator and a piezoelectric actuator as a fine actuator. By adopting controllers with two-degree-of-freedom and by optimizing H$_{2}$ faster precise tracking can be realized. Experimental and numerical results are presented to demonstrate the control effects.

  • PDF

Distant-talking of Speech Interface for Humanoid Robots (휴머노이드 로봇을 위한 원거리 음성 인터페이스 기술 연구)

  • Lee, Hyub-Woo;Yook, Dong-Suk
    • Proceedings of the KSPS conference
    • /
    • 2007.05a
    • /
    • pp.39-40
    • /
    • 2007
  • For efficient interaction between human and robots, speech interface is a core problem especially in noisy and reverberant conditions. This paper analyzes main issues of spoken language interface for humanoid robots, such as sound source localization, voice activity detection, and speaker recognition.

  • PDF

Comparion of Noise Suppression Methods in Voice CODEC (음성코덱에서의 잡음제거 방식 비교)

  • Lee, Jin-Geol
    • The Journal of Engineering Research
    • /
    • v.3 no.1
    • /
    • pp.43-46
    • /
    • 1998
  • Considerable research in the last three decades has examined the problem of enhancement of speech degraded by additive background noise. We compare traditional methods such as spectral subtraction and Wiener filter, recently proposed psychoacoustic model based methods such as perceptual filter and noise suppression in EVRC in terms of performance and complexity.

  • PDF

A Study on the Isolated word Recognition Using One-Stage DMS/DP for the Implementation of Voice Dialing System

  • Seong-Kwon Lee
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • 1994.06a
    • /
    • pp.1039-1045
    • /
    • 1994
  • The speech recognition systems using VQ have usually the problem decreasing recognition rate, MSVQ assigning the dissimilar vectors to a segment. In this paper, applying One-stage DMS/DP algorithm to the recognition experiments, we can solve these problems to what degree. Recognition experiment is peformed for Korean DDD area names with DMS model of 20 sections and word unit template. We carried out the experiment in speaker dependent and speaker independent, and get a recognition rates of 97.7% and 81.7% respectively.

  • PDF

Research of Aesthetic Distance on the Cinematization of Novel (영화 <우리들의 일그러진 영웅>에 나타난 원작소설과의 미적 거리 연구)

  • Kim, Jong-Wan
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.6
    • /
    • pp.151-159
    • /
    • 2012
  • The purpose of this thesis is to figure out the mechanism that how can be shown the aesthetic distances of novel in the film. At discussion of the view point, novel can be told by two factors which are 'who is teller' and 'who is watcher' but in the film, novel's narration is divided into visual point and auditive point. And I will consider the phenomenon on the part of this difference. Next, I will argue about difference between novel and film from the Park Jongwon's aesthetic distances which interpreted Lee Munyeol's work. This thesis is going to observe that how the film adapted three types of view point and how that related the subject of the original novel. For this thesis, I tried to track 'the distances' between figure and identity, and reader and author. Also I did approach that how can be accepted the problem of 'aesthetic distance according to identity' based on this novel in the film and novel's text by reader. This study make a proposal or analysis to the differences between novels and films in terms of narrative point of view. Although it is shown by dividing into each chapter in novel and on connectivity in film, this paper finds out that both film and novel are shown the subject of reader's difference of the view point about 'author and director's identity'.

The role of voice onset time (VOT) and post-stop fundamental frequency (F0) in the perception of Tohoku Japanese stops (도호쿠 일본어의 폐쇄음 지각에 있어서 voice onset time(VOT)과 후속모음 fundamental frequency(F0)의 역할)

  • Hi-Gyung Byun
    • Phonetics and Speech Sciences
    • /
    • v.15 no.1
    • /
    • pp.35-45
    • /
    • 2023
  • Tohoku Japanese is known to have voiced stops without pre-voicing in word-initial position, whereas traditional or conservative Japanese has voiced stops with pre-voicing in the same position. One problem with this devoicing of voiced stops is that it affects the distinction between voiced and voiceless stops because their voice onset time (VOT) values overlap. Previous studies have confirmed that Tohoku speakers use post-stop fundamental frequency (F0) as an acoustic cue along with VOT to avoid overlap. However, the role of post-stop F0 as a perceptual cue in this region has barely been investigated. Therefore, this study explored the role of post-stop F0 in stop voicing perception along with VOT. Several perception tests were conducted using resynthesized stimuli, which were manipulated along a VOT continuum orthogonal to an F0 continuum. The results showed no significant regional difference (Tohoku vs. Chubu) for nonsense words (/ta-da/). However, for meaningful words (/pari/ 'Paris' vs. /bari/ 'Bali,' /piza/ 'pizza' vs. /biza/ 'visa'), a significant word effect was found, and it was confirmed that some listeners utilized the post-stop F0 more consistently and steadily than others. Based on these results, we discuss innovative listeners who may lead the change in the perception of stop voicing.