• Title/Summary/Keyword: TTS

VoiceXML Dialog System Based on RSS for Contents Syndication (콘텐츠 배급을 위한 RSS 기반의 VoiceXML 다이얼로그 시스템)

  • Kwon, Hyeong-Joon;Kim, Jung-Hyun;Lee, Hyon-Gu;Hong, Kwang-Seok
    • The KIPS Transactions:PartB
    • /
    • v.14B no.1 s.111
    • /
    • pp.51-58
    • /
    • 2007
  • This paper proposes a prototype dialog system that combines VXML (VoiceXML), the W3C standard XML format for specifying interactive voice dialogues between a human and a computer, with RSS (RDF Site Summary or Really Simple Syndication), a representative semantic web technology for syndicating and subscribing to updated web content. The proposed system has the following merits: 1) It is a new method that recognizes spoken content over wired and wireless telephone networks and delivers content to the user via STT (Speech-to-Text) and TTS (Text-to-Speech), instead of the traditional web-only method. 2) It retains the advantages of RSS, because subscribed updates are converted to VXML without modifying the existing way of providing an RSS service. 3) For users, it reduces the time and space restrictions on searching content provided by RSS, because it uses wired and wireless telephone networks rather than an internet environment. 4) For information providers, it requires no special component to syndicate the newest content using speech recognition and synthesis technology. We implemented a news service system using VXML and RSS to evaluate the performance of the proposed system. In the experiments, we measured the response time and the speech recognition rate when subscribing to and searching actual content, and confirmed that the proposed system can deliver content provided through an RSS feed.
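
As a rough illustration of merit 2 in the abstract above, the following minimal Python sketch converts RSS 2.0 items into a VoiceXML form whose prompts a TTS engine could read aloud. It is not the authors' implementation: the function name `rss_to_vxml`, the `max_items` parameter, the form id `news`, the VoiceXML version, and the choice of one `<block>` per item are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the W3C VoiceXML specification.
VXML_NS = "http://www.w3.org/2001/vxml"


def rss_to_vxml(rss_text: str, max_items: int = 5) -> str:
    """Turn the items of an RSS 2.0 feed into a VoiceXML form (illustrative only)."""
    channel = ET.fromstring(rss_text).find("channel")
    if channel is None:
        raise ValueError("not an RSS 2.0 document: <channel> is missing")

    vxml = ET.Element("vxml", {"version": "2.0", "xmlns": VXML_NS})
    form = ET.SubElement(vxml, "form", {"id": "news"})  # hypothetical form id

    for item in channel.findall("item")[:max_items]:
        title = (item.findtext("title") or "").strip()
        summary = (item.findtext("description") or "").strip()
        # One <block> per feed item; its <prompt> text is what TTS would speak.
        block = ET.SubElement(form, "block")
        prompt = ET.SubElement(block, "prompt")
        prompt.text = f"{title}. {summary}".strip()

    return ET.tostring(vxml, encoding="unicode")
```

Because the conversion only reads the feed, the RSS service itself stays unchanged, which is the property the abstract emphasizes. A real deployment would also need speech recognition grammars for searching, which this sketch omits.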

Endoscopic Balloon Dilatation in Children with Congenital and Acquired Esophageal Anomalies (소아의 선천성 및 후천성 식도 질환에서 내시경적 풍선 확장술)

  • Kwak, Ju Yuong;Park, Jae Hong
    • Pediatric Gastroenterology, Hepatology & Nutrition
    • /
    • v.8 no.2
    • /
    • pp.137-142
    • /
    • 2005
  • Purpose: To evaluate the safety, efficacy, and technical problems of endoscopic balloon dilatation of esophageal anomalies in children. Methods: The medical records of 8 children treated by endoscopic balloon dilatation for esophageal anomalies over a 10-year period at Pusan National University Hospital were reviewed retrospectively. The balloon catheter (Maxforce TTS or CRE, Boston Scientific Co., USA) was positioned across the area of narrowing by direct visualization. The balloon was slowly inflated with normal saline to the pressure specified for each balloon, maintained for 60 seconds, and then deflated. After a 60-second pause, the procedure was repeated with a larger balloon (increments of 1 mm for each subsequent dilation) until effective dilatation was confirmed by direct visualization without complications. Results: Three males and five females were included, with a mean age of 4.2 years. A total of 27 dilatations (average of 3.2 per patient) were performed. The underlying diseases were postoperative stricture of esophageal atresia in 3 cases, esophageal ring in 2 cases, and achalasia, corrosive esophagitis, and hypertensive LES in one case each. The size of the initial dilating balloon was chosen on the basis of the diameter of the narrowing as determined by endoscopy. The first dilation in patients with severe esophageal stricture was performed with a 6 mm balloon. The complications observed were esophageal perforation and respiratory holding during the procedure, in one case each. A successful outcome was seen in 6 patients (75%). Conclusion: Endoscopic balloon dilatation can provide a safe and effective means of treating esophageal anomalies in children and should be considered the treatment of choice in the initial management of such cases.

Development of Voice Information System for Safe Navigation in Marine Simulator (시뮬레이터 기반 음성을 이용한 항행정보 안내시스템의 개발)

  • Son N. S.;Kim S. Y.
    • Journal of the Korean Society for Marine Environment & Energy
    • /
    • v.5 no.3
    • /
    • pp.28-34
    • /
    • 2002
  • As Speech Recognition (SR) and Text-To-Speech (TTS) technology develops rapidly, a voice control and guidance system is thought to be very helpful for safe navigation. However, the Voice Control and Guidance System (VCGS) is not yet commonly included in the Navigation Supporting System (NSS). The main reason is that VCGS is so complicated and user-unfriendly that navigation officers hesitate to use it. Frequent operating errors caused by a low SR rate are another reason. To make VCGS more practical for safe navigation, we design a user-friendly VCGS. First, through interviews, we survey the functions and procedures that navigation officers want included in VCGS. Second, to raise the SR rate, we tune for the environmental noise on the bridge, and to reduce operating errors caused by a low SR rate, we design self-correction functions. We also apply a user-independent SR engine, so that a speaker training procedure is basically unnecessary. The functions and procedures of the user-friendly VCGS are evaluated in simulator experiments, and the evaluation results are fed back into the design. As a result, we can design a VCGS that is more helpful for safe navigation. In this paper, we describe the features of the user-friendly VCGS for safe navigation and discuss the results of the simulator experiments.

An emotional speech synthesis markup language processor for multi-speaker and emotional text-to-speech applications (다음색 감정 음성합성 응용을 위한 감정 SSML 처리기)

  • Ryu, Se-Hui;Cho, Hee;Lee, Ju-Hyun;Hong, Ki-Hyung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.40 no.5
    • /
    • pp.523-529
    • /
    • 2021
  • In this paper, we designed and developed an Emotional Speech Synthesis Markup Language (SSML) processor. Multi-speaker emotional speech synthesis technology that can express multiple voice colors and emotional expressions has been developed, and we designed Emotional SSML by extending SSML for multiple voice colors and emotional expressions. The Emotional SSML processor has a graphical user interface and consists of the following four components. First, a multi-speaker emotional text editor that makes it easy to mark specific voice colors and emotions at desired positions. Second, an Emotional SSML document generator that automatically creates an Emotional SSML document from the output of the multi-speaker emotional text editor. Third, an Emotional SSML parser that parses the Emotional SSML document. Last, a sequencer that controls a multi-speaker emotional Text-to-Speech (TTS) engine based on the output of the Emotional SSML parser. Because it is based on SSML, an open standard that is independent of programming language and platform, the Emotional SSML processor can easily integrate with various speech synthesis engines and facilitates the development of multi-speaker emotional text-to-speech applications.
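
The parser and sequencer components described above might be sketched as follows in Python. The abstract does not give the actual Emotional SSML tag set, so the `<emotion name="...">` element here is a hypothetical extension; `<voice name="...">` follows standard SSML. The function names, the default labels, and the `synthesize` callback signature are likewise assumptions, not the authors' API.

```python
import xml.etree.ElementTree as ET
from typing import Callable, List, Tuple

Segment = Tuple[str, str, str]  # (voice, emotion, text)


def parse_segments(ssml: str, default_voice: str = "default",
                   default_emotion: str = "neutral") -> List[Segment]:
    """Flatten an SSML-like document into (voice, emotion, text) segments."""
    segments: List[Segment] = []

    def walk(node: ET.Element, voice: str, emotion: str) -> None:
        if node.tag == "voice":
            voice = node.get("name", voice)
        elif node.tag == "emotion":  # hypothetical extension element
            emotion = node.get("name", emotion)
        if node.text and node.text.strip():
            segments.append((voice, emotion, node.text.strip()))
        for child in node:
            walk(child, voice, emotion)
            # Text after a child's closing tag belongs to the parent's context.
            if child.tail and child.tail.strip():
                segments.append((voice, emotion, child.tail.strip()))

    walk(ET.fromstring(ssml), default_voice, default_emotion)
    return segments


def run_sequencer(segments: List[Segment],
                  synthesize: Callable[[str, str, str], object]) -> list:
    """Call a TTS callback once per segment, in document order."""
    return [synthesize(text, voice, emotion) for voice, emotion, text in segments]
```

Passing the engine in as a callback keeps the parser independent of any particular TTS engine, in the spirit of the engine-independence the abstract claims for SSML.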

A Study on Verification of Back TranScription(BTS)-based Data Construction (Back TranScription(BTS)기반 데이터 구축 검증 연구)

  • Park, Chanjun;Seo, Jaehyung;Lee, Seolhwa;Moon, Hyeonseok;Eo, Sugyeong;Lim, Heuiseok
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.11
    • /
    • pp.109-117
    • /
    • 2021
  • Recently, speech-based interfaces are increasingly used as a means of human-computer interaction (HCI). Accordingly, interest in post-processors that correct errors in speech recognition results is also increasing. However, building a sequence-to-sequence (S2S) speech recognition post-processor requires a great deal of human labor for data construction. To alleviate this limitation of the existing construction methodology, a new data construction method called Back TranScription (BTS) was proposed. BTS is a technique that combines TTS and STT to create a pseudo-parallel corpus. This methodology eliminates the need for a phonetic transcriber and can automatically generate vast amounts of training data, reducing cost. By extending the existing BTS research, this paper verifies through experiments that data should be constructed with text style and domain in mind, rather than without any criteria.
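
The BTS idea described above (synthesize clean text with TTS, transcribe the audio back with STT, and pair the result with the original) can be sketched in a few lines of Python. This is a minimal sketch, not the authors' code: `build_bts_corpus` and its `tts` and `stt` parameters are hypothetical names, and the two engines are passed in as plain callables so that no real library API is assumed.

```python
from typing import Callable, Iterable, List, Tuple


def build_bts_corpus(
    sentences: Iterable[str],
    tts: Callable[[str], bytes],   # hypothetical: text -> synthesized audio
    stt: Callable[[bytes], str],   # hypothetical: audio -> recognized text
) -> List[Tuple[str, str]]:
    """Return (recognized_text, original_text) pairs for post-processor training."""
    pairs: List[Tuple[str, str]] = []
    for gold in sentences:
        audio = tts(gold)        # forward step: text to speech
        hypothesis = stt(audio)  # backward step: speech to (possibly noisy) text
        # Source side is the STT output, target side is the clean original.
        pairs.append((hypothesis, gold))
    return pairs
```

For a quick check without real engines, stand-ins such as `tts=lambda s: s.encode("utf-8")` and an `stt` that lowercases and strips punctuation mimic the kind of formatting loss a post-processor would learn to undo. The abstract's finding suggests choosing `sentences` by text style and domain rather than arbitrarily.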

A Literature Review of Tongue Movement and Measurement Tools for Dysphagia (연하장애 환자의 혀 운동 및 측정 도구에 대한 고찰)

  • Kim, Jin-Yeong;Son, Yeong-Soo;Hong, Deok-Gi
    • Therapeutic Science for Rehabilitation
    • /
    • v.11 no.4
    • /
    • pp.55-68
    • /
    • 2022
  • Objective: This review aimed to provide information for clinical application by confirming the principles and characteristics of tongue movement and measurement tools for patients with swallowing disorders. Results: We identified 15 tools used for tongue exercise and measurement in the field of dysphagia. According to their operating principle, the tools were classified as a bulb sensor, a resistive sensor sheet, a mouthpiece with a sensor, or other techniques. The bulb sensor was easy to use but had limitations in fixing the position when measuring tongue pressure. The resistive sensor sheet could measure at a more stable position than the bulb sensor. A mouthpiece with a sensor could be fitted to an individual's oral cavity so that the position was fixed when measuring tongue pressure. Other techniques had the advantage of being wireless and capable of sensing light. Conclusion: Based on this literature review, it is necessary to facilitate the selection of the best tool for quantitative tongue measurement in dysphagia. The review can also be used to develop a Korean tongue movement tool model for use in hospitals and community centers.