• Title/Summary/Keyword: Voice recognition system

Search Result 334, Processing Time 0.029 seconds

Improving Student's Design Prototyping Skills using Interactive Prototyping Tool

  • Kim, Jongwan;Jeon, Jae-wook;Kim, Ki-yeon
    • Journal of Multimedia Information System
    • /
    • v.8 no.1
    • /
    • pp.75-78
    • /
    • 2021
  • This paper will explain the importance of using interactive prototyping tools in the HCI design process. The Future of HCI education project performed by ACM SIGCHI shows that students recognize that prototyping, especially paper prototyping and interactive prototyping, are both very important. Two widely-used prototyping tools in academy, Balsamiq and Oven, will be compared and rated by students according to their preferences. We choose the Balsamiq as our design tool because Oven can be designed on the web but applications cannot be designed directly on Mac or Windows. The Balsamiq tool will help you understand the task process of UI work and highlight the benefits of digital prototyping to test the execution of expected results in a fast fashion compared to high-level prototyping. We also present the outcome of this work through two case studies. In particular, the smart mirror project with voice recognition function shows the effectiveness of the proposed method as an example.

A Development of Speech Recognition System for Mobile Card Search (모바일 명함 검색을 위한 음성인식시스템 구현)

  • Hong, In-Suk;Ko, You-Jung;Kim, Yoon-Joong
    • Annual Conference of KIPS
    • /
    • 2009.04a
    • /
    • pp.138-141
    • /
    • 2009
  • 모바일 명함 관리 시스템은 간편하게 모바일 기기를 이용하여 명함을 등록하고 검색할 수 있으나 모바일 기기의 특징상 화면이 작고 정보를 이용하기 위해서는 펜을 이용하여 검색어를 입력해야하는 불편함이 있다. 이를 해결하기 위해 명령을 음성으로 처리하고자하는 VUI(Voice User Interface)의 필요성이 증가하였다. 또한 모바일 기기의 메모리 공간상의 제약으로 인한 음성인식엔진 탑재의 어려움이 있다. 이에 본 논문에서는 모바일 단말기로부터 음성을 입력받아 인식결과를 모바일 단말기로 되돌려 주는 음성인식 시스템을 구축하고 본 인식시스템과 모바일 클라이언트 시스템을 분산처리 가능한 웹서비스 환경으로 구성하였다.

Implementation of Home Appliance Control System with Speech Recognition based User Interfaces in Home Network Environments (홈 네트워크 환경에서 음성인식기반 사용자 인터페이스를 통한 가전기기 제어 시스템 구현)

  • Kim, Youn-Woo;Jang, Hyun-Su;Kim, Gu-Su;Eom, Young-Ik
    • Annual Conference of KIPS
    • /
    • 2007.05a
    • /
    • pp.735-738
    • /
    • 2007
  • 컴퓨팅 기술의 발전에 따라 유비쿼터스 시대로의 이행이 가속화되고 있다. 이에 따라 홈 네트워크 분야에 대한 연구와 상용화를 위한 노력이 활발해지고 있다. 이와 더불어 가전기기들의 종류는 다양해지고 복잡해지면서 사용자들의 가전기기 이용에 있어 사용법을 익혀야하는 어려움이 있다. 이러한 문제점을 해결하기 위한 일환으로 디지털 장치들을 편하게 사용하기 위한 멀티 모달 사용자 인터페이스가 요구되고 있다. 본 논문에서 네트워크 가전기기 제어가 가능한 홈 네트워크 미들웨어인 UPnP를 사용하여 VoiceXML을 통한 음성인식기반 사용자 인터페이스와 디지털 장치 제어 시스템을 제안하고 구현한 후 실험하였다.

Research on performance improvement of voice recognition-based customized local chatbot system using AutoRAG (AutoRAG를 이용한 음성인식 기반 맞춤형 로컬 챗봇 시스템의 성능 개선에 관한 연구)

  • Sung-jin Kim;Jae-hoon Lim;Sae-Hun Yeom
    • Annual Conference of KIPS
    • /
    • 2024.10a
    • /
    • pp.519-520
    • /
    • 2024
  • 본 논문은 오픈소스 LLM(Large Language Model)인 Llama3를 기반으로 음성 인터페이스를 갖춘 맞춤형 로컬 챗봇 시스템을 개발하였다. 이 시스템은 PEFT(Parameter Efficient Fine-Tuning)와 AutoRAG(Auto Retrieval-Augmented Generation)로 최적화된 RAG(Retrieval-Augmented Generation) 방식을 결합한 하이브리드 접근법을 통해 Llama3를 전이학습 하였다. Ollama를 사용하여 로컬 환경에서 챗봇을 구현하였으며, LangServe와 Ngrok을 활용해 배포하였다. Raspberry Pi 5에 구현하여 모바일 환경으로 동작 가능하게 하였고 음성인식 기능을 추가하여 사용자 편의성을 높였다. 연구한 모델의 성능 평가는 총 18 종류의 데이터셋에 대해 각 질문당 5회씩, 총 90회의 질문으로 정확도를 측정하였다. 실험결과, PEFT 학습 모델과 Advanced RAG를 결합한 시스템이 가장 우수한 성능을 나타냈다.

Determinants of Safety and Satisfaction with In-Vehicle Voice Interaction : With a Focus of Agent Persona and UX Components (자동차 음성인식 인터랙션의 안전감과 만족도 인식 영향 요인 : 에이전트 퍼소나와 사용자 경험 속성을 중심으로)

  • Kim, Ji-hyun;Lee, Ka-hyun;Choi, Jun-ho
    • The Journal of the Korea Contents Association
    • /
    • v.18 no.8
    • /
    • pp.573-585
    • /
    • 2018
  • Services for navigation and entertainment through AI-based voice user interface devices are becoming popular in the connected car system. Given the classification of VUI agent developers as IT companies and automakers, this study explores attributes of agent persona and user experience that impact the driver's perceived safety and satisfaction. Participants of a car simulator experiment performed entertainment and navigation tasks, and evaluated the perceived safety and satisfaction. Results of regression analysis showed that credibility of the agent developer, warmth and attractiveness of agent persona, and efficiency and care of the UX dimension showed significant impact on the perceived safety. The determinants of perceived satisfaction were unity of auto-agent makers and gender as predisposing factors, distance in the agent persona, and convenience, efficiency, ease of use, and care in the UX dimension. The contributions of this study lie in the discovery of the factors required for developing conversational VUI into the autonomous driving environment.

Pattern recognition and AI education system design for improving achievement of non-face-to-face (e-learning) education (비대면(이러닝) 교육 성취도 향상을 위한 패턴인식 및 AI교육 시스템 설계)

  • Lee, Hae-in;Kim, Eui-Jeong;Chung, Jong-In;Kim, Chang Suk;Kang, Shin-Cheon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.329-332
    • /
    • 2022
  • This study aims to identify problems with existing e-learning content and non-face-to-face class methods, improve students' concentration, improve class achievement and educational effectiveness, and propose an artificial intelligence class system design using a web server. By using the function of face and eye tracking using OpenCV to identify attendance and concentration, and by inducing feedback through voice or message to questions asked by the instructor in the middle of class, learners relieve boredom caused by online classes and test by runner If the score is not reached, we propose an artificial intelligence education program system design that can bridge the academic gap and improve academic achievement by providing educational materials and videos for the wrong problem.

  • PDF

Pattern Recognition and AI Education System Design Proposal for Improving the Achievement of Non-face-to-face (E-Learning) Education (비대면(이러닝) 교육 성취도 향상을 위한 패턴인식 및 AI교육 시스템 설계 구축)

  • Lee, Hae-in;Kim, Eui-Jeong;Chung, Jong-In;Kim, Chang Suk;Kang, Shin-Cheon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.280-283
    • /
    • 2022
  • This study aims to identify problems with existing e-learning content and non-face-to-face class methods, improve students' concentration, improve class achievement and educational effectiveness, and propose an artificial intelligence class system design using a web server. By using the function of face and eye tracking using OpenCV to identify attendance and concentration, and by inducing feedback through voice or message to questions asked by the instructor in the middle of class, learners relieve boredom caused by online classes and test by runner If the score is not reached, we propose an artificial intelligence education program system design that can bridge the academic gap and improve academic achievement by providing educational materials and videos for the wrong problem.

  • PDF

Development of Half-Mirror Interface System and Its Application for Ubiquitous Environment (유비쿼터스 환경을 위한 하프미러형 인터페이스 시스템 개발과 응용)

  • Kwon Young-Joon;Kim Dae-Jin;Lee Sang-Wan;Bien Zeungnam
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.11 no.12
    • /
    • pp.1020-1026
    • /
    • 2005
  • In the era of ubiquitous computing, human-friendly man-machine interface is getting more attention due to its possibility to offer convenient services. For this, in this paper, we introduce a 'Half-Mirror Interface System (HMIS)' as a novel type of human-friendly man-machine interfaces. Basically, HMIS consists of half-mirror, USB-Webcam, microphone, 2ch-speaker, and high-speed processing unit. In our HMIS, two principal operation modes are selected by the existence of the user in front of it. The first one, 'mirror-mode', is activated when the user's face is detected via USB-Webcam. In this mode, HMIS provides three basic functions such as 1) make-up assistance by magnifying an interested facial component and TTS (Text-To-Speech) guide for appropriate make-up, 2) Daily weather information provider via WWW service, 3) Health monitoring/diagnosis service using Chinese medicine knowledge. The second one, 'display-mode' is designed to show decorative pictures, family photos, art paintings and so on. This mode is activated when the user's face is not detected for a time being. In display-mode, we also added a 'healing-window' function and 'healing-music player' function for user's psychological comfort and/or relaxation. All these functions are accessible by commercially available voice synthesis/recognition package.

A Design of the Emergency-notification and Driver-response Confirmation System(EDCS) for an autonomous vehicle safety (자율차량 안전을 위한 긴급상황 알림 및 운전자 반응 확인 시스템 설계)

  • Son, Su-Rak;Jeong, Yi-Na
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.14 no.2
    • /
    • pp.134-139
    • /
    • 2021
  • Currently, the autonomous vehicle market is commercializing a level 3 autonomous vehicle, but it still requires the attention of the driver. After the level 3 autonomous driving, the most notable aspect of level 4 autonomous vehicles is vehicle stability. This is because, unlike Level 3, autonomous vehicles after level 4 must perform autonomous driving, including the driver's carelessness. Therefore, in this paper, we propose the Emergency-notification and Driver-response Confirmation System(EDCS) for an autonomousvehicle safety that notifies the driver of an emergency situation and recognizes the driver's reaction in a situation where the driver is careless. The EDCS uses the emergency situation delivery module to make the emergency situation to text and transmits it to the driver by voice, and the driver response confirmation module recognizes the driver's reaction to the emergency situation and gives the driver permission Decide whether to pass. As a result of the experiment, the HMM of the emergency delivery module learned speech at 25% faster than RNN and 42.86% faster than LSTM. The Tacotron2 of the driver's response confirmation module converted text to speech about 20ms faster than deep voice and 50ms faster than deep mind. Therefore, the emergency notification and driver response confirmation system can efficiently learn the neural network model and check the driver's response in real time.

Preprocessing Technique for Improvement of Speech Recognition in a Car (차량에서의 음성인식율 향상을 위한 전처리 기법)

  • Kim, Hyun-Tae;Park, Jang-Sik
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.1
    • /
    • pp.139-146
    • /
    • 2009
  • This paper addresses a modified spectral subtraction schemes which is suitable to speech recognition under low signal-to-noise ratio (SNR) noisy environment such as the automatic speech recognition (ASR) system in car. The conventional spectral subtraction schemes rely on the SNR such that attenuation is imposed on that part of the spectrum that appears to have low SNR, and accentuation is made on that part of high SNR. However, such postulation is adequate for high SNR environment, it is grossly inadequate for low SNR scenarios such as that of car environment. Proposed methods focused specifically to low SNR noisy environment by using weighting function for enhancing speech dominant region in speech spectrum. Experimental results by using voice commands for car show the superior performance of the proposed method over conventional methods.