• Title/Summary/Keyword: 시각 음성인식

Search Result 129, Processing Time 0.039 seconds

Effects of Situation Awareness and Decision Making on Safety, Workload and Trust in Autonomous Vehicle Take-over Situations (자율주행 자동차의 제어권 전환상황에서 상황인식 및 의사결정 정보 제공이 운전자에게 미치는 영향)

  • Kim, Jihyun;Lee, Kahyun;Byun, Youngsi
    • Journal of the HCI Society of Korea
    • /
    • v.14 no.2
    • /
    • pp.21-29
    • /
    • 2019
  • Take-over requests in semi-autonomous cars must be handled properly in the case of road obstacles or curved roads in order to avoid accidents. In these situations, situation awareness and appropriate decision making are essential for distracted drivers. This study used a driving simulator to investigate the components of auditory-visual information systems that affect safety, workload, and trust. Auditory information consisted of either voice guidance providing situation awareness for the take-over or a beep sound that only alerted the driver. Visual information consisted of either a screen showing how to maneuver the vehicle or only an icon indicating a take-over situation. By providing auditory information that increased situation awareness and visual information that aided decision making, trust and safety increased, while workload decreased. These results suggest that the levels of situation awareness and decision making ability affect trust, safety, and workload for drivers.

A Study on Spatio-temporal Features for Korean Vowel Lipreading (한국어 모음 입술독해를 위한 시공간적 특징에 관한 연구)

  • 오현화;김인철;김동수;진성일
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.1
    • /
    • pp.19-26
    • /
    • 2002
  • This paper defines the visual basic speech units, visemes and investigates various visual features of a lip for the effective Korean lipreading. First, we analyzed the visual characteristics of the Korean vowels from the database of the lip image sequences obtained from the multi-speakers, thereby giving a definition of seven Korean vowel visemes. Various spatio-temporal features of a lip are extracted from the feature points located on both inner and outer lip contours of image sequences and their classification performances are evaluated by using a hidden Markov model based classifier for effective lipreading. The experimental results for recognizing the Korean visemes have demonstrated that the feature victor containing the information of inner and outer lip contours can be effectively applied to lipreading and also the direction and magnitude of the movement of a lip feature point over time is quite useful for Korean lipreading.

Design and Implementation of the VoiceXML Interpreter for Voice Web-service (음성 웹서비스를 위한 VoiceXML 해석기의 설계 및 구현)

  • 신현경;강동남;염세훈;유재우
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.4
    • /
    • pp.42-47
    • /
    • 2001
  • In this paper, we propose an interpreter, which recognizes the VoiceXML markups, verifies the validation of the document, and interprets the VoiceXML documents using DI parser and the generated AST by the parser. The VoiceXML interpreter consists of DI parser and executor, and the DI parser uses recursive descent parsing technology, and the executor uses FIA (Form Interpretation Algorithm) proposed by VXML forum. This system uses the Java language in order to develop the runtime environment for VoiceXML efficiently, thus this system has portability.

  • PDF

Speech Recognition and Lip Shape Feature Extraction for English Vowel Pronunciation of the Hearing - Impaired Based on SVM Technique (SVM 기법에 기초한 청각장애인의 영어모음 발음을 위한 음성 인식 및 입술형태 특징 추출)

  • Lee, Kun-Min;Han, Kyung-Im;Park, Hye-Jung
    • Journal of rehabilitation welfare engineering & assistive technology
    • /
    • v.11 no.3
    • /
    • pp.247-252
    • /
    • 2017
  • The purpose of this study is to suggest the visual teaching method for the English vowel pronunciation, especially for the hearing-impaired who mostly rely on the visual aids, based on the SVM technique. By extracting phonetic features using the SVM technique from the sounds that are hard to hear by ear, the lip shapes for each vowel were refined. The lip shape refinement for vowels is advantageous in that language learners can easily see the movement of articulators by eye, and it is helpful for learning and teaching English vowels for the hearing-impaired.

Study of Information Transmission System for Visually Impaired (시각 장애인을 위한 정보전송 시스템 연구)

  • Lee, Seo-Young;Choi, Jong-Yeob;Ahn, Sang-Jun;Kim, Jeong-Hun;Park, Yong Wook
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.12 no.6
    • /
    • pp.1227-1232
    • /
    • 2017
  • In this study, we have studied a system to reduce the inconvenience and the accident rate at the intersection when the visually impaired use traffic information by using pressure sensor and ultrasonic sensor. By using the pressure sensor to light the strip LED, the driver recognizes the pedestrian, thereby reducing the accident rate. In addition to the pedestrian signal light information, the ultrasonic sensor and Bluetooth transmit the bus position information to the application so that the user can listen to the voice.

Implementation of Information Access Embedded System for the Blind People (시각 장애인을 위한 정보접근 임베디드 시스템의 구현)

  • Kim, Si-Woo;Lee, Jae-Kyun;Lee, Chae-Wook
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.33 no.2C
    • /
    • pp.167-172
    • /
    • 2008
  • Since a 2-dimensional (2D) bar code can retrieve data and information quickly, it is widely used and recognized as a useful tool for many industrial applications. However, the information capacity of the 2D bar code is still limited. Recently the analog-digital code (AD code), which has the largest storage capacity yet contained in a code, has been developed, thereby expanding the bar code's application range because it overcomes the limitation of data capacity. In this paper, we present the AD code and implement an effective embedded system which can transform text information into voice using the 2D AD code and Text To Speech (TTS). This voice information can also be transmitted to blind people as well as the old by capturing the AD code on paper or in books.

Comparison of Deep Learning Networks in Voice-Guided System for The Blind (시각장애인을 위한 음성안내 네비게이션 시스템의 심층신경망 성능 비교)

  • An, Ryun-Hui;Um, Sung-Ho;Yu, Yun Seop
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.175-177
    • /
    • 2022
  • This paper introduces a system that assists the blind to move to their destination, and compares the performance of 3-types of deep learning network (DNN) used in the system. The system is made up with a smartphone application that finds route from current location to destination using GPS and navigation API and a bus station installation module that recognizes and informs the bus (type and number) being about the board at bus stop using 3-types of DNN and bus information API. To make the module recognize bus number to get on, We adopted faster-RCNN, YOLOv4, YOLOv5s and YOLOv5s showed best performance in accuracy and speed.

  • PDF

A Survey on Awareness and Availability on Items of 2018 Assistive Devices Distribution Program for the Disabled in the Occupational Therapists (2018년도 장애인 보조기기 교부사업 품목에 대한 작업치료사의 인식도와 활용도 조사)

  • Kim, Jeong-Eun;Park, Je-Min;Bae, Su-Yeong;Jung, Nam-hae
    • Korean Journal of Occupational Therapy
    • /
    • v.26 no.4
    • /
    • pp.85-95
    • /
    • 2018
  • Objective : The purpose of this study was to investigate the awareness and availability on items of 2018 assistive devices distribution program for the disabled in the occupational therapists. Methods : A total of 132 occupational therapists participated in the survey from May 1 to May 31. Results : 96.2% of the occupational therapists responded that assistive device is helpful in lives of the disabled people. Especially, they responded that assistive device is the most helpful in 'movement and mobility'. Awareness on an angle spoon/fork with built-up handle and universal cuff was the highest, while a visual signaling indicator was the lowest. Availability on an air cushion was the highest, while a visual signaling indicator and a voice guidance system were the lowest. 67.4% responded that 'sometimes' they use the assistive device and 77.3% responded they will utilize the assistive device. To improve awareness and availability, 43.2% needed financial support, 32.6% needed to add insurance bill and 22.7% needed related education. Conclusion : In the future, this result will be available as a basic data for the education about assistive device for the occupational therapists.

The Design and Implementation of the Wireless Home Automation Control System using WAP (WAP을 이용한 무선 홈 자동화 제어 시스템 설계 및 구현)

  • Shim, Hyeon-Cheol;Jun, Hyung-Kook;Eom, Young-Ik
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2000.10b
    • /
    • pp.969-972
    • /
    • 2000
  • 기존의 홈 자동화 시스템은 유선 전화망을 이용하여 사용자는 집으로 전화를 걸어 안내방송에 따라 서비스 코드를 입력하는 방식이었다. 하지만 이러한 음성 제어 방식은 사용자가 서비스 코드를 인식해야 하는 경우 주위 환경이나 통화 상태에 따라 사용이 불편할 수 있고 신뢰도가 떨어질 수 있으며 유지 보수적인 면에서 확장성이 낮은 단점을 갖는다. 본 논문에서는 WAP을 이용한 무선 홈 자동화 제어 시스템을 소개하며 이 시스템은 WAP 서비스를 이용하여 집안의 가전기기를 제어하거나 기기의 상태 정보 등을 사용자의 무선 핸드폰으로 전달해 주는 시스템이다. 즉, 무선인터넷 프로토콜인 WAP을 이용하여 시각적으로 사용자에게 시스템 정보를 전달해 주도록 했으며, 운영체제가 포팅(porting)된 임베디드 시스템을 사용함으로써 홈 자동화 제어 시스템이 쉽게 확장 가능하도록 설계 및 구현하였다.

  • PDF

Navigation App for the Blind and Tactile guide stick (시각장애인을 위한 내비게이션 App과 촉각을 이용한 방향 안내 지팡이)

  • Han, Hyo-Byung;Lee, Gi-Hyuk;Park, Keun-Joon;Beom, Hyo-Won;Kim, Ung-Seop;Seong, Ji-Ae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.10a
    • /
    • pp.327-330
    • /
    • 2018
  • 우리는 본 연구를 통해 모바일을 통해 음성으로 목적지를 설정하고 사용자의 위치 정보를 바탕으로 경로 상의 다음 노드 방향을 효과적으로 계산하는 시스템을 설계하였다. 우리가 설계한 시스템은 손잡이에 달린 모터가 예상 경로방향을 가리키고 사용자는 모터 방향을 손가락의 촉각을 통해 인식함으로써 방향을 예측한다.