• Title/Summary/Keyword: SpeechWeb

Effect of Virtual Reality Exposure and Web-based Cognitive Intervention Integrated Program on Social Anxiety Disorder (발표상황에 대한 가상현실노출과 웹기반 인지적 개입의 통합 프로그램 효과 검증)

  • Park, Ki-Woo;Yoon, Hyae-Yeong
    • Journal of the Korea Convergence Society / v.13 no.1 / pp.1-12 / 2022
  • This study confirmed the effect of a virtual reality (VR) exposure program integrated with web-based cognitive restructuring education on reducing social anxiety. The experimental group (n=12) received a 10 to 15 minute session of web-based cognitive intervention and a 20-minute session of virtual reality exposure therapy (VRET). The comparison group (n=15) received a 10 to 15 minute session of web-based speech education and a 20-minute session of VRET. After 4 weeks, the experimental group showed an increase in positive interpretation bias, a decrease in negative interpretation bias, and a decreased level of social anxiety. These results suggest that adding a self-help form of web-based cognitive intervention to the treatment of social anxiety disorder can improve the therapeutic effect of VRET.

Implementation of the Web Service Provider for the Speech Recognition Web Page (음성 인식용 웹페이지를 위한 웹서비스 제공자의 구현)

  • 오지영;김윤중
    • Proceedings of the Korea Multimedia Society Conference / 2003.11a / pp.257-260 / 2003
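  • This paper converts an ordinary web page into a speech-recognition-enabled web page and implements the web services that make such a page usable. The implemented system consists of a web service consumer and web service providers. The web service consumer calls the two web service providers described below, stores the restructured web page and an XML document, and searches the XML document for the URL that maps to the user's utterance. The web service providers are a web page conversion provider and a speech recognition provider. The web page conversion provider analyzes an ordinary web page and converts the necessary tags, and it extracts the URLs that are the hyperlink values. For the speech recognizer that analyzes and recognizes the user's speech, the speech recognition web service provider implemented in earlier research was used.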
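
The lookup described in this abstract (extract hyperlink URLs from a page, store them in an XML document, then find the URL that matches a recognized utterance) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the XML layout (`<links>` / `<link text=... url=...>`), the function names, and the exact-match rule on normalized link text are all assumptions, and the recognizer is assumed to have already returned plain text.

```python
from html.parser import HTMLParser
from typing import Optional
import xml.etree.ElementTree as ET


def normalize(text: str) -> str:
    """Collapse whitespace and lowercase so spoken and written labels compare equal."""
    return " ".join(text.split()).lower()


class LinkExtractor(HTMLParser):
    """Collect normalized link text -> URL for every <a href=...> in a page."""

    def __init__(self) -> None:
        super().__init__()
        self.links: dict = {}
        self._href: Optional[str] = None
        self._text: list = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        # Only collect text while inside an anchor that has an href.
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            label = normalize("".join(self._text))
            if label:
                self.links[label] = self._href
            self._href = None


def page_to_xml(html: str) -> ET.Element:
    """Hypothetical XML layout: <links><link text="..." url="..."/></links>."""
    parser = LinkExtractor()
    parser.feed(html)
    root = ET.Element("links")
    for label, url in parser.links.items():
        ET.SubElement(root, "link", {"text": label, "url": url})
    return root


def find_url(root: ET.Element, recognized_text: str) -> Optional[str]:
    """Return the URL whose link text matches the recognized utterance, if any."""
    key = normalize(recognized_text)
    for link in root.iter("link"):
        if link.get("text") == key:
            return link.get("url")
    return None
```

For example, `find_url(page_to_xml('<a href="/news">News</a>'), "news")` returns `"/news"`, and an utterance with no matching link returns `None`. A real system would likely need fuzzier matching than exact string equality.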

The Smart Learning System for English Language Using Hangeul (한글을 이용한 스마트 영어 학습 시스템)

  • Kwon, Seung-tag;Kim, Yong-seok
    • The Journal of Korean Institute of Communications and Information Sciences / v.40 no.6 / pp.1157-1163 / 2015
  • In this paper, we developed a web app that runs on mobile devices. We also designed and developed an electronic dictionary in which English words and sentences are expressed by their English pronunciation written in Hangeul. The system includes a database of English words, Hangeul codes with pictures, vocabulary definitions, speech sound files, and many sentences. We developed the English learning system using HTML5 and the m-Bizmaker software tools.

Voice Creator: A Vocal Customization Web Application Prototype (Voice Creator: 개인 맞춤형 목소리 생성 웹 어플리케이션 프로토타입)

  • Byeon, Hyeon Jeong;Yeo, Soohyun;Oh, Uran
    • Proceedings of the Korea Information Processing Society Conference / 2021.05a / pp.567-569 / 2021
  • Due to the important role of avatars in computer-mediated communication (CMC), a growing number of CMC-based services now support avatar customization options. However, in many cases, customization and personalization options are limited to visual features. In this paper, we propose and describe a prototype for a vocal customization web application. Titled Voice Creator, the app is designed for both able-bodied and speech- or hearing-impaired users who seek to communicate anonymously using digital voice identities.

Information Retrieval System Using Korean Speech Recognition on the Web Browser (웹 브라우저 상에서 한국어 음성인식을 이용한 정보검색 시스템)

  • 이항섭
    • Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.35-38 / 1998
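  • This paper describes an information retrieval system that uses Korean speech recognition in a web browser. The distinguishing feature of the system is that it can recognize the hypertext words displayed in the web browser, so an existing web browser can be operated by speech recognition instead of mouse clicks. To recognize the candidates shown in the web browser, which are not fixed and change continually, the variable-vocabulary recognizer developed in our laboratory was used. The system was developed for the Windows 95/NT environment, with usability in mind so that users can start using it immediately without learning a new interface. In environment-independent, speaker-independent experiments, the system showed an average recognition rate of about 90% on roughly 130 words.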

An Architecture for Mobile Instruction: Application to Mathematics Education through the Web

  • Kim, Steven H.;Kwon, Oh-Nam;Kim, Eun-Jung
    • Research in Mathematical Education / v.4 no.1 / pp.45-55 / 2000
  • The rapid proliferation of wireless networks provides a ubiquitous channel for delivering instructional materials at the convenience of the user. By delivering content through portable devices linked to the Internet, the full spectrum of multimedia capabilities is available for engaging the user's interest. This capability encompasses not only text but also images, video, speech generation, and voice recognition. Moreover, the incorporation of machine learning capabilities at the source provides the ability to tailor the material to the general level of expertise of the user as well as the immediate needs of the moment: for instance, a request for information regarding a particular city might be covered by a leisurely presentation if solicited from the home, but more tersely if the user happens to be driving a car. This paper presents a system architecture to support mobile instruction in conjunction with knowledge-based tutoring capabilities. For concreteness, the general concepts are examined in the context of a system for mathematics education on the Web.

Structural live load surveys by deep learning

  • Li, Yang;Chen, Jun
    • Smart Structures and Systems / v.30 no.2 / pp.145-157 / 2022
  • The design of safe and economical structures depends on reliable live load data obtained from load surveys. Live load surveys are traditionally conducted by randomly selecting rooms and weighing each item on-site, a method that suffers from low efficiency, high cost, and long cycle time. This paper proposes a deep learning-based method combined with Internet big data to perform live load surveys. The proposed survey method utilizes multi-source heterogeneous data, such as images, voice, and product identification, to obtain the live load without weighing each item, through object detection, web crawling, and speech recognition. Indoor object and face detection models are first developed by fine-tuning the YOLOv3 algorithm to detect target objects and to obtain the number of people in a room, respectively. Each detection model is evaluated using an independent testing set. Web crawler frameworks with keyword and image retrieval are then established to extract the weight information of detected objects from Internet big data. The live load in a room is derived by combining the weight and number of items and people. To verify the feasibility of the proposed survey method, a live load survey is carried out for a meeting room. The results show that, compared with the traditional method of sampling and weighing, the proposed method can perform efficient and convenient live load surveys and represents a new load research paradigm.
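
The final step in this abstract, combining the weight and number of detected items and people into a room's live load, might look roughly like the sketch below. It is not the authors' code: expressing the result per unit floor area in kN/m², the `DetectedItem` type, the function name, and the per-person weight passed in by the caller are all assumptions made for illustration, and the counts and unit weights are taken as already produced by the detection and crawling stages.

```python
from dataclasses import dataclass
from typing import Iterable

G = 9.81  # gravitational acceleration in m/s^2, used to convert kg to N


@dataclass
class DetectedItem:
    """One object class found in the room (hypothetical record type)."""
    name: str
    count: int             # number of instances reported by the detector
    unit_weight_kg: float  # weight per instance, e.g. retrieved from the web


def live_load_kn_per_m2(
    items: Iterable[DetectedItem],
    people_count: int,
    person_weight_kg: float,
    floor_area_m2: float,
) -> float:
    """Total weight of items and people, spread over the floor area, in kN/m^2."""
    if floor_area_m2 <= 0:
        raise ValueError("floor_area_m2 must be positive")
    item_mass = sum(item.count * item.unit_weight_kg for item in items)
    people_mass = people_count * person_weight_kg
    total_force_kn = (item_mass + people_mass) * G / 1000.0
    return total_force_kn / floor_area_m2
```

With ten 5 kg chairs, one 50 kg table, and five 70 kg people in a 45 m² room, the total mass is 450 kg, which this sketch converts to about 0.098 kN/m².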

Design and Implementation of Voice-based Interactive Service KIOSK (음성기반 대화형 서비스 키오스크 설계 및 구현)

  • Kim, Sang-woo;Choi, Dae-june;Song, Yun-Mi;Moon, Il-Young
    • Journal of Practical Engineering Education / v.14 no.1 / pp.99-108 / 2022
  • As the demand for kiosks increases, more users complain of inconvenience. Accordingly, this work builds a kiosk, delivered as a web application, that provides a voice-based interactive service for easy menu selection and ordering. Voice functions are implemented with the Annyang API and the SpeechSynthesis API, and the user's intention is interpreted through Dialogflow. The paper also discusses how to implement this process on top of a REST API. In addition, a recommendation system based on collaborative filtering is applied to improve the low consumer accessibility of existing kiosks, and, to prevent droplet-borne infection while the voice recognition service is in use, the kiosk checks that the user is wearing a mask before the service starts.

An Audio-Visual Teaching Aid (AVTA) with Scrolling Display and Speech to Text over the Internet

  • Davood Khalili;Chung, Wan-Young
    • Proceedings of the IEEK Conference / 2003.07c / pp.2649-2652 / 2003
  • In this paper, an Audio-Visual Teaching Aid (AVTA) for use in a classroom and over the Internet is presented. The system, which was designed and tested, consists of a wireless microphone system, Text to Speech conversion software, a noise filtering circuit, and a computer. An IBM-compatible PC with a sound card and a network interface card, a web browser, and a voice and text messenger service were used to provide slightly delayed text and voice over the Internet for remote learning, while providing scrolling text from a real-time lecture in a classroom. The motivation for designing this system was to aid Korean students who may have difficulty with listening comprehension while having fairly good reading ability. The application of this system is twofold. On the one hand, it helps the students in a class to view and listen to a lecture; on the other hand, it serves as a vehicle for remote access (audio and text) to a classroom lecture. The project provides a simple and low-cost solution for remote learning and also gives a student access to the classroom in emergency situations when the student cannot attend a class. In addition, such a system allows the student to capture a teacher's lecture in audio and text form without needing to be present in class or to take many notes. This system will therefore help students in many ways.

Phonetic Factors Conditioning the Release of English Sentence-Final Stops (영어 문장 말 폐쇄음의 파열 양상)

  • Kim, Da-Hee
    • MALSORI / no.53 / pp.1-16 / 2005
  • This experimental study tests the hypothesis that the occurrence of English sentence-final stop release is at least partly predictable from its phonetic context. Ten native speakers of American English (5 male and 5 female) recorded, in a sound-proof booth, sentences excerpted from novels and the natural documents on the World Wide Web. Judgements of the release of a sentence-final stop were made from the waveforms and spectrograms of the recorded sentences. If the aperiodic energy of a given final stop lasted more than 0.015 seconds, it was considered to be "released." The results reveal that English sentence-final stops tend to be released when they are 1) velar consonants, 2) preceded by tense vowels, and 3) coda consonants of content words. The phonetic environments in which final stops are often released can be characterized by articulatory comfort and by the need for release burst noise, without which the final stops may not be correctly perceived. From this examination of the release of English final stops, it is concluded that phonological events that had been considered to occur rather "randomly" in fact reflect a universal tendency of human speech: to minimize the effort of speakers and hearers.
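
The release criterion stated in this abstract (aperiodic energy lasting more than 0.015 seconds) amounts to a simple threshold rule. The sketch below is only an illustration of that rule; the function names and the idea of computing a release rate over a list of measured durations are assumptions, and measuring the durations from waveforms and spectrograms is outside its scope.

```python
from typing import Sequence

RELEASE_THRESHOLD_S = 0.015  # threshold stated in the abstract, in seconds


def is_released(aperiodic_duration_s: float) -> bool:
    """A final stop counts as released if its aperiodic energy exceeds the threshold."""
    return aperiodic_duration_s > RELEASE_THRESHOLD_S


def release_rate(durations_s: Sequence[float]) -> float:
    """Fraction of tokens judged released (hypothetical helper for comparing contexts)."""
    if not durations_s:
        raise ValueError("at least one token is required")
    return sum(is_released(d) for d in durations_s) / len(durations_s)
```

Computing `release_rate` separately for, say, velar and non-velar stops would give the kind of comparison the abstract reports, under the assumptions above.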