• 제목/요약/키워드: Dual Voice Mode

검색결과 11건 처리시간 0.024초

듀얼모드 통신 지원 임베디드 리눅스 기반의 모바일 이야기꾼 설계 및 구현 (Design and Implementation of Embedded Linux-based Mobile Teller which supports CDMA and WiBro networks)

  • 김도형;윤민홍;이경희;이철훈
    • 정보처리학회논문지D
    • /
    • 제15D권1호
    • /
    • pp.131-138
    • /
    • 2008
  • 본 논문에서는 음성통화를 위해 CDMA 네트워크와 데이터 통신을 위해 와이브로 네트워크를 동시에 사용하는 최초의 임베디드 리눅스 기반 듀얼모드 응용 서비스인 모바일 이야기꾼의 구현에 대해서 기술한다. 현재 와이브로 상용 서비스와 함께 두 개의 이종 네트워크를 지원하는 단말이 출시되었지만, 이들 네트워크를 효과적으로 사용하여 사용자에게 보다 나은 서비스를 제공할 수 있는 응용 서비스의 개발은 미비한 실정이다. 모바일 이야기꾼은 사용자가 듀얼모드 지원 단말에서 텍스트를 입력하면, 와이브로 네트워크를 통해 인터넷 상의 TTS 서버로 전달한다. TTS 서버는 전달된 텍스트를 음성으로 변환하고, 변환된 음성 데이터를 듀얼모드 지원 단말로 다시 전달한다. 듀얼모드 지원 단말은 수신된 음성 데이터를 CDMA 네트워크를 통해 수신자에게 전송하게 된다. 구현된 모바일 이야기꾼은 주위가 시끄러운 환경이나 언어 장애가 있는 사람도 CDMA를 통한 음성 통화를 가능하게 한다.

임베디드 리눅스 기반의 개인 오디오 레코더 서비스 구현 (The Implementation of Personal Audio Recorder Service based on Embedded Linux)

  • 김도형;이경희;이철훈
    • 정보처리학회논문지D
    • /
    • 제15D권2호
    • /
    • pp.257-262
    • /
    • 2008
  • 본 논문에서는 음성통화를 위해 CDMA 네트워크와 데이터 통신을 위해 와이브로 네트워크를 동시에 사용하는 임베디드 리눅스 기반의 듀얼모드 응용 서비스인 개인 오디오 레코더의 구현에 대해서 기술한다. 개인 오디오 레코더는 듀얼모드 지원 단말에 탑재된 클라이언트에서 음성 녹음을 시작하면, 송신자와 수신자의 CDMA 음성 데이터가 와이브로 네트워크를 통해 인터넷 상의 저장 서버로 전달된다. 개인 오디오 레코더 서버는 통화 번호 및 통화 시간을 기준으로 음성 데이터를 서버에 저장하게 된다. 구현된 개인 오디오 레코더는 단말의 저장공간이 부족한 환경에서도 음성 통화 내용을 저장할 수 있도록 한다. 그리고, 개인 오디오 레코더는 서버에 저장된 통화 목록을 검색하여, 특정 통화 내용을 재생할 수 있다.

문서 편집 접근성 향상을 위한 음성 명령 기반 모바일 어플리케이션 개발 (Voice Activity Detection Algorithm using Wavelet Band Entropy Ensemble Analysis in Car Noisy Environments)

  • 박주현;박세아;이무늬;임순범
    • 한국멀티미디어학회논문지
    • /
    • 제21권11호
    • /
    • pp.1342-1352
    • /
    • 2018
  • Voice Command systems are important means of ensuring accessibility to digital devices for use in situations where both hands are not free or for people with disabilities. Interests in services using speech recognition technology have been increasing. In this study, we developed a mobile writing application using voice recognition and voice command technology which helps people create and edit documents easily. This application is characterized by the minimization of the touch on the screen and the writing of memo by voice. We have systematically designed a mode to distinguish voice writing and voice command so that the writing and execution system can be used simultaneously in one voice interface. It provides a shortcut function that can control the cursor by voice, which makes document editing as convenient as possible. This allows people to conveniently access writing applications by voice under both physical and environmental constraints.

PZT를 이용한 광 정보저장기기용 엑츄에이터의 추적제어 (Track following control of optical pick-up actuator using PZT)

  • 이우철;양현석;박노철;박영필
    • 한국정밀공학회:학술대회논문집
    • /
    • 한국정밀공학회 2003년도 춘계학술대회 논문집
    • /
    • pp.664-669
    • /
    • 2003
  • This paper proposes a swing-arm type dual-stage actuator, which consists of a PZT actuator for fine motion and a VCM(Voice Coil Motor) for coarse motion, for SFF ODD(Small Form Factor Optical Disk Drive), in order to achieve fast access speed and precise track following control. We focus our attention on the design and control of the PZT actuator, because there have been a lot of previous researches related to the VCM and dual-stage actuators. Due to the dual cantilever structure, the PZT actuator can generate precise translational tracking motion at its tip where optical pickup is attached at, and the effect of hysteric behavior of the PZT element is reduced. The dynamic model of the PZT actuator is derived by using the Hamilton's principle, and verified by comparing with the experimental frequency response. The sliding mode control is designed in order to be robust against modeling uncertainties. Simulations and experimental results confirm the effectiveness of the suggested control scheme.

  • PDF

PZT를 이용한 초소형 광 픽업 엑츄에이터의 슬라이딩 모드 제어 (Sliding mode control of small form factor optical pick-up actuator using PZT)

  • Lee, Woo-Chul;Jung, Dong-Ha;Park, Tae-Wook;Park, No-Cheol;Yang, Hyun-Seok
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2003년도 추계학술대회논문집
    • /
    • pp.424-429
    • /
    • 2003
  • This paper proposes a swing-arm type dual-stage actuator, which consists of a PZT actuator for fine motion and a VCM(Voice Coil Motor) for coarse motion, for SFF ODD(Small Form Factor Optical Disk Drive), in order to achieve fast access speed and precise track following control. We focus our attention on the design and control of the PZT actuator, because there have been a lot of previous researches related to the VCM and dual-stage actuators. Due to the dual cantilever structure, the PZT actuator can generate precise translational tracking motion at its tip where optical pickup is attached at, and the effect of hysteric behavior of the PZT element is reduced. The dynamic model of the PZT actuator is derived by using the Hamilton's principle, and verified by comparing with the experimental frequency response. The sliding mode control is designed in order to be robust against modeling uncertainties. Simulations and experimental results confirm the effectiveness of the suggested control scheme.

  • PDF

PZT를 이용한 광 정보저장기기용 액추에이터의 트랙 추적제어 (Track-following Control of an Optical Pick-up Actuator Using PZT)

  • 정동하;박태욱;박노철;양현석;이우철
    • 한국소음진동공학회논문집
    • /
    • 제14권5호
    • /
    • pp.385-393
    • /
    • 2004
  • This paper proposes a swing-arm type dual-stage actuator, which consists of a PZT actuator for fine motion and a VCM(voice coil motor) for coarse motion, for an SFF ODD(small form factor optical disk drive), in order to achieve fast access speed and precise track-following control. Over the past few decades there have been a lot of researches related to the VCM and dual-stage actuator. In this paper, we focus our attention on the design and control of the PZT actuator. Due to the dual cantilever structure. the PZT actuator can generate precise translational tracking motion at its tip to which an optical pickup is attached. and the effect of hysteric behavior of the PZT element is reduced. The dynamic model of the PZT actuator is derived by using the Hamilton's principle, and verified by comparing it with the experimental frequency response. The sliding mode control is designed in order to be robust against modeling uncertainties. Simulations and experimental results confirm the effectiveness of the suggested control scheme.

The Effects of Multi-Modality on the Use of Smart Phones

  • Lee, Gaeun;Kim, Seongmin;Choe, Jaeho;Jung, Eui Seung
    • 대한인간공학회지
    • /
    • 제33권3호
    • /
    • pp.241-253
    • /
    • 2014
  • Objective: The objective of this study was to examine multi-modal interaction effects of input-mode switching on the use of smart phones. Background: Multi-modal is considered as an efficient alternative for input and output of information in mobile environments. However, there are various limitations in current mobile UI (User Interface) system that overlooks the transition between different modes or the usability of a combination of multi modal uses. Method: A pre-survey determined five representative tasks from smart phone tasks by their functions. The first experiment involved the use of a uni-mode for five single tasks; the second experiment involved the use of a multi-mode for three dual tasks. The dependent variables were user preference and task completion time. The independent variable in the first experiment was the type of modes (i.e., Touch, Pen, or Voice) while the variable in the second experiment was the type of tasks (i.e., internet searching, subway map, memo, gallery, and application store). Results: In the first experiment, there was no difference between the uses of pen and touch devices. However, a specific mode type was preferred depending on the functional characteristics of the tasks. In the second experiment, analysis of results showed that user preference depended on the order and combination of modes. Even with the transition of modes, users preferred the use of multi-modes including voice. Conclusion: The order of combination of modes may affect the usability of multi-modes. Therefore, when designing a multi-modal system, the fact that there are frequent transitions between various mobile contents in different modes should be properly considered. Application: It may be utilized as a user-centered design guideline for mobile multi modal UI system.

이중 구동 시스템을 위한 압전 밀리엑츄에이터의 제어기 설계 (Controller Design of Piezoelectric Milliactuator for Dual Stage System)

  • Hong, Eo-Jin;Yoon, Joon-Hyun;Park, No-Cheal;Yang, Hyun-Seok;Park, Young-Pil
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2001년도 추계학술대회논문집 I
    • /
    • pp.46-51
    • /
    • 2001
  • To reach high areal density, less track pitch is expected and more servo bandwidth is required. One approach to overcoming the problem is by using dual stage servo system. In this system, a voice coil motor (VCM) is used as the primary stage while a milliactuator is used as the secondary stage. We have suggested new milliactuator based on the shear mode of piezoelectric elements to drive the head suspension assembly. In this paper, we introduce controller design method, PQ method. PQ method reduces the controller design problem for DISO(dual-input/single-output) systems to two standard controller design problems for SISO(single-input/single-output) problems. The first part of PQ method directly address the issue of actuator output contribution, and the second part allows the use of traditional loop shaping to achieve the overall system performance. This paper shows how to employ the PQ method to meet aggressive close-loop performance specifications for a disk drive system with a VCM and piezoelectric milliactuator.

  • PDF

초소형 Sled-type 이중 서보 엑추에이터 설계 및 특성 분석 (The Design and Performance Test of Miniaturized Sled Type Dual-Servo Actuator)

  • 강동우;김기현;정재화;권대갑
    • 한국정밀공학회:학술대회논문집
    • /
    • 한국정밀공학회 2002년도 춘계학술대회 논문집
    • /
    • pp.357-360
    • /
    • 2002
  • Nowadays, the improvement and development of Multi-media, information and communication technology are rapidly processed. And many products, for example, digital camera, digital camcorder, and PDA, are used for them. They need large data storage capacity and small size, light storage system. Due to that, many studies and researches in data storage system have been carried out. Especially, micro drive system was presented by IBM.(1) However, its system is expensive and uneasy to be portable. In ODD technologies, 1 inch drive system is not yet or in processing status.(2) If to be possible and to be come up, it is cheap than HDD system and easy to transfer information. In this paper, a miniaturized actuator(about linch) is designed and tested for ODD system. Specially, it is adapted for NFR(Near-field Recoding) system using SIL(Solid Immersion Lens). It is the dual-servo actuator which consists of a coarse actuator and fine actuator. Its actuating force generation method is VCM(Voice Ceil Motor). The fine actuator has 4-wire suspensions and bobbin wrapped by coil and includes focusing motion as well as tracking motion. The coarse actuator has an actuating coil and V-grooved guide mechanism. Also, the characteristics of the designed actuator is estimated by sine-swept mode and LDV(Laser Doppler Vibro-meter).

  • PDF

Audio and Video Bimodal Emotion Recognition in Social Networks Based on Improved AlexNet Network and Attention Mechanism

  • Liu, Min;Tang, Jun
    • Journal of Information Processing Systems
    • /
    • 제17권4호
    • /
    • pp.754-771
    • /
    • 2021
  • In the task of continuous dimension emotion recognition, the parts that highlight the emotional expression are not the same in each mode, and the influences of different modes on the emotional state is also different. Therefore, this paper studies the fusion of the two most important modes in emotional recognition (voice and visual expression), and proposes a two-mode dual-modal emotion recognition method combined with the attention mechanism of the improved AlexNet network. After a simple preprocessing of the audio signal and the video signal, respectively, the first step is to use the prior knowledge to realize the extraction of audio characteristics. Then, facial expression features are extracted by the improved AlexNet network. Finally, the multimodal attention mechanism is used to fuse facial expression features and audio features, and the improved loss function is used to optimize the modal missing problem, so as to improve the robustness of the model and the performance of emotion recognition. The experimental results show that the concordance coefficient of the proposed model in the two dimensions of arousal and valence (concordance correlation coefficient) were 0.729 and 0.718, respectively, which are superior to several comparative algorithms.