• Title/Summary/Keyword: 시각 음성인식 (visual speech recognition)

HunMinJeomUm: Text Extraction and Braille Conversion System for the Learning of the Blind (시각장애인의 학습을 위한 텍스트 추출 및 점자 변환 시스템)

  • Kim, Chae-Ri;Kim, Ji-An;Kim, Yong-Min;Lee, Ye-Ji;Kong, Ki-Sok
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.21 no.5 / pp.53-60 / 2021
  • The number of visually impaired and blind people is increasing, but braille textbooks for them remain insufficient, which infringes on their right to education regardless of their will to learn. To guarantee this right, this paper develops a learning system, HunMinJeomUm, that helps them access textbooks, documents, and photographs that are not available in braille, without the assistance of others. In our system, a smartphone app and web pages are designed to be accessible to blind users, and a braille kit is built using Arduino and braille modules. The system supports the following functions. First, users select the documents or pictures they want, and the system extracts the text using OCR. Second, the extracted text is converted into voice and braille. Third, a membership registration function is provided so that users can view the extracted text. Experiments confirmed that our system generates braille and audio output successfully and achieves high OCR recognition rates. The study also found that even completely blind users can easily use the smartphone app.

Effects of Situation Awareness and Decision Making on Safety, Workload and Trust in Autonomous Vehicle Take-over Situations (자율주행 자동차의 제어권 전환상황에서 상황인식 및 의사결정 정보 제공이 운전자에게 미치는 영향)

  • Kim, Jihyun;Lee, Kahyun;Byun, Youngsi
    • Journal of the HCI Society of Korea / v.14 no.2 / pp.21-29 / 2019
  • Take-over requests in semi-autonomous cars must be handled properly in the case of road obstacles or curved roads in order to avoid accidents. In these situations, situation awareness and appropriate decision making are essential for distracted drivers. This study used a driving simulator to investigate the components of auditory-visual information systems that affect safety, workload, and trust. Auditory information consisted of either voice guidance providing situation awareness for the take-over or a beep sound that only alerted the driver. Visual information consisted of either a screen showing how to maneuver the vehicle or only an icon indicating a take-over situation. By providing auditory information that increased situation awareness and visual information that aided decision making, trust and safety increased, while workload decreased. These results suggest that the levels of situation awareness and decision making ability affect trust, safety, and workload for drivers.

A Study on Spatio-temporal Features for Korean Vowel Lipreading (한국어 모음 입술독해를 위한 시공간적 특징에 관한 연구)

  • 오현화;김인철;김동수;진성일
    • The Journal of the Acoustical Society of Korea / v.21 no.1 / pp.19-26 / 2002
  • This paper defines the basic visual speech units, visemes, and investigates various visual features of the lips for effective Korean lipreading. First, we analyzed the visual characteristics of the Korean vowels using a database of lip image sequences obtained from multiple speakers, and on that basis defined seven Korean vowel visemes. Various spatio-temporal lip features are extracted from feature points located on both the inner and outer lip contours of the image sequences, and their classification performance is evaluated using a hidden Markov model based classifier. The experimental results for recognizing the Korean visemes demonstrate that a feature vector containing information from both the inner and outer lip contours can be effectively applied to lipreading, and that the direction and magnitude of the movement of a lip feature point over time are quite useful for Korean lipreading.
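
As a rough illustration of the motion feature mentioned in the abstract above, the following sketch computes the frame-to-frame magnitude and direction of a single lip feature point. The function name `motion_features`, the coordinate format, and the sample trajectory are assumptions made for illustration; they are not taken from the paper, which does not give its exact feature definitions here.

```python
import math
from typing import List, Tuple

Point = Tuple[float, float]  # (x, y) position of one lip feature point, in pixels


def motion_features(track: List[Point]) -> List[Tuple[float, float]]:
    """Return (magnitude, direction in radians) for each consecutive frame pair."""
    feats = []
    for (x0, y0), (x1, y1) in zip(track, track[1:]):
        dx, dy = x1 - x0, y1 - y0
        # hypot gives the displacement length; atan2 gives its angle.
        feats.append((math.hypot(dx, dy), math.atan2(dy, dx)))
    return feats


# Hypothetical trajectory of one lip-corner point over four frames.
track = [(0.0, 0.0), (3.0, 4.0), (3.0, 4.0), (3.0, 2.0)]
features = motion_features(track)
```

Each tuple could be appended to a per-frame feature vector before classification; a stationary point yields a magnitude of zero, so the direction value carries no information for that frame.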

Design and Implementation of the VoiceXML Interpreter for Voice Web-service (음성 웹서비스를 위한 VoiceXML 해석기의 설계 및 구현)

  • 신현경;강동남;염세훈;유재우
    • The Journal of the Acoustical Society of Korea / v.20 no.4 / pp.42-47 / 2001
  • In this paper, we propose an interpreter that recognizes VoiceXML markup, validates the document, and interprets VoiceXML documents using a DI parser and the AST generated by the parser. The VoiceXML interpreter consists of the DI parser and an executor: the DI parser uses recursive descent parsing, and the executor uses the FIA (Form Interpretation Algorithm) proposed by the VoiceXML Forum. The system is written in Java so that the VoiceXML runtime environment can be developed efficiently, which also makes the system portable.
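
To make the idea of recursive descent parsing into an AST concrete, here is a minimal sketch for a heavily simplified tag grammar (elements only, no attributes or text content). It is not the paper's DI parser, which is written in Java; the `Node`, `tokenize`, `Parser`, and `parse` names are hypothetical, and the sample input only borrows a few VoiceXML element names.

```python
import re
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Node:
    """One AST node: a tag name and its child elements."""
    tag: str
    children: List["Node"] = field(default_factory=list)


TOKEN = re.compile(r"<(/?)([A-Za-z]+)(/?)>")


def tokenize(src: str) -> List[Tuple[str, str]]:
    """Split the input into ('open' | 'close' | 'empty', name) tokens."""
    tokens, pos, src = [], 0, src.strip()
    while pos < len(src):
        m = TOKEN.match(src, pos)
        if m is None or (m.group(1) and m.group(3)):
            raise SyntaxError(f"unexpected input at offset {pos}")
        kind = "close" if m.group(1) else "empty" if m.group(3) else "open"
        tokens.append((kind, m.group(2)))
        pos = m.end()
        while pos < len(src) and src[pos].isspace():
            pos += 1  # skip whitespace between tags
    return tokens


class Parser:
    """Grammar: element := '<n/>' | '<n>' element* '</n>'."""

    def __init__(self, tokens: List[Tuple[str, str]]):
        self.tokens, self.pos = tokens, 0

    def parse_element(self) -> Node:
        if self.pos >= len(self.tokens):
            raise SyntaxError("unexpected end of input")
        kind, name = self.tokens[self.pos]
        self.pos += 1
        if kind == "empty":
            return Node(name)
        if kind != "open":
            raise SyntaxError(f"unexpected closing tag </{name}>")
        node = Node(name)
        # Recurse for each child until the next closing tag.
        while self.pos < len(self.tokens) and self.tokens[self.pos][0] != "close":
            node.children.append(self.parse_element())
        if self.pos >= len(self.tokens) or self.tokens[self.pos] != ("close", name):
            raise SyntaxError(f"missing </{name}>")
        self.pos += 1
        return node


def parse(src: str) -> Node:
    parser = Parser(tokenize(src))
    root = parser.parse_element()
    if parser.pos != len(parser.tokens):
        raise SyntaxError("trailing input after root element")
    return root


ast = parse("<vxml><form><field/><block/></form></vxml>")
```

An executor in the spirit of the abstract would then walk this tree; mismatched tags such as `<a><b></a>` raise `SyntaxError`, which stands in for the document validation step.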

Speech Recognition and Lip Shape Feature Extraction for English Vowel Pronunciation of the Hearing-Impaired Based on SVM Technique (SVM 기법에 기초한 청각장애인의 영어모음 발음을 위한 음성 인식 및 입술형태 특징 추출)

  • Lee, Kun-Min;Han, Kyung-Im;Park, Hye-Jung
    • Journal of rehabilitation welfare engineering & assistive technology / v.11 no.3 / pp.247-252 / 2017
  • The purpose of this study is to propose a visual teaching method for English vowel pronunciation based on the SVM technique, intended especially for hearing-impaired learners, who rely mostly on visual aids. By using the SVM technique to extract phonetic features from sounds that are hard to distinguish by ear, the lip shapes for each vowel were refined. This lip shape refinement is advantageous in that language learners can easily see the movement of the articulators, and it is helpful for teaching and learning English vowels for the hearing-impaired.

Study of Information Transmission System for Visually Impaired (시각 장애인을 위한 정보전송 시스템 연구)

  • Lee, Seo-Young;Choi, Jong-Yeob;Ahn, Sang-Jun;Kim, Jeong-Hun;Park, Yong Wook
    • The Journal of the Korea institute of electronic communication sciences / v.12 no.6 / pp.1227-1232 / 2017
  • In this study, we investigate a system that uses a pressure sensor and an ultrasonic sensor to reduce the inconvenience and the accident rate at intersections when visually impaired people use traffic information. The pressure sensor lights an LED strip so that drivers recognize the pedestrian, thereby reducing the accident rate. In addition to the pedestrian signal light information, the ultrasonic sensor and Bluetooth transmit bus position information to the application so that the user can hear it by voice.

Implementation of Information Access Embedded System for the Blind People (시각 장애인을 위한 정보접근 임베디드 시스템의 구현)

  • Kim, Si-Woo;Lee, Jae-Kyun;Lee, Chae-Wook
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.2C / pp.167-172 / 2008
  • Because a 2-dimensional (2D) bar code can retrieve data and information quickly, it is widely used and recognized as a useful tool in many industrial applications. However, the information capacity of the 2D bar code is still limited. Recently the analog-digital code (AD code), which has the largest storage capacity of any code to date, has been developed; by overcoming the data capacity limitation, it expands the range of bar code applications. In this paper, we present the AD code and implement an effective embedded system that transforms text information into voice using the 2D AD code and Text To Speech (TTS). This voice information can be delivered to blind people as well as the elderly by capturing the AD code printed on paper or in books.

Comparison of Deep Learning Networks in Voice-Guided System for The Blind (시각장애인을 위한 음성안내 네비게이션 시스템의 심층신경망 성능 비교)

  • An, Ryun-Hui;Um, Sung-Ho;Yu, Yun Seop
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.10a / pp.175-177 / 2022
  • This paper introduces a system that assists blind people in traveling to their destination and compares the performance of three types of deep learning networks (DNNs) used in the system. The system consists of a smartphone application that finds a route from the current location to the destination using GPS and a navigation API, and a module installed at bus stations that recognizes and announces the bus (type and number) about to be boarded at the bus stop, using the three DNNs and a bus information API. To make the module recognize the number of the bus to board, we adopted Faster R-CNN, YOLOv4, and YOLOv5s; YOLOv5s showed the best performance in accuracy and speed.

A Survey on Awareness and Availability on Items of 2018 Assistive Devices Distribution Program for the Disabled in the Occupational Therapists (2018년도 장애인 보조기기 교부사업 품목에 대한 작업치료사의 인식도와 활용도 조사)

  • Kim, Jeong-Eun;Park, Je-Min;Bae, Su-Yeong;Jung, Nam-hae
    • Korean Journal of Occupational Therapy / v.26 no.4 / pp.85-95 / 2018
  • Objective: The purpose of this study was to investigate occupational therapists' awareness of, and the availability (utilization) of, items in the 2018 assistive devices distribution program for the disabled. Methods: A total of 132 occupational therapists participated in the survey from May 1 to May 31. Results: 96.2% of the occupational therapists responded that assistive devices are helpful in the lives of disabled people. In particular, they responded that assistive devices are most helpful for 'movement and mobility'. Awareness was highest for the angled spoon/fork with built-up handle and the universal cuff, and lowest for the visual signaling indicator. Utilization was highest for the air cushion, and lowest for the visual signaling indicator and the voice guidance system. 67.4% responded that they 'sometimes' use assistive devices, and 77.3% responded that they will use assistive devices. To improve awareness and utilization, 43.2% cited the need for financial support, 32.6% the need to add insurance billing, and 22.7% the need for related education. Conclusion: These results can serve as basic data for educating occupational therapists about assistive devices.

The Design and Implementation of the Wireless Home Automation Control System using WAP (WAP을 이용한 무선 홈 자동화 제어 시스템 설계 및 구현)

  • Shim, Hyeon-Cheol;Jun, Hyung-Kook;Eom, Young-Ik
    • Proceedings of the Korea Information Processing Society Conference / 2000.10b / pp.969-972 / 2000
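  • Existing home automation systems relied on the wired telephone network: the user called home and entered service codes by following voice prompts. However, this voice-based control method can be inconvenient and unreliable, because recognizing the service codes depends on the surrounding environment and the call quality, and it offers little extensibility from a maintenance standpoint. This paper introduces a wireless home automation control system using WAP, which uses WAP services to control home appliances and to deliver device status information to the user's mobile phone. That is, the system uses WAP, a wireless Internet protocol, to present system information to the user visually, and it is designed and implemented on an embedded system with a ported operating system so that the home automation control system can be extended easily.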