• Title/Summary/Keyword: Voice language

Design and Implementation of VoiceXML VUI Browser (VoiceXML VUI Browser 설계/구현)

  • 장민석;예상후
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference
    • /
    • 2002.11a
    • /
    • pp.788-791
    • /
    • 2002
  • Today's Web is built on HTML (Hypertext Markup Language), so users obtain web information mainly through a GUI (Graphical User Interface), clicking the mouse to follow hyperlinks. This is inconvenient compared with an environment in which information can be obtained simply by voice. VoiceXML, which is derived from XML, addresses this inconvenience by delivering information over the telephone on the basis of mature speech recognition and synthesis technology. This paper presents the design and implementation of a VoiceXML web browser that realizes this technology.

An Implementation of Speech DB Gathering System Using VoiceXML (VoiceXML을 이용한 음성 DB 수집 시스템 구현)

  • Kim Dong-Hyun;Roh Yong-Wan;Hong Kwang-Seok
    • Journal of Internet Computing and Services
    • /
    • v.6 no.1
    • /
    • pp.39-50
    • /
    • 2005
  • A speech DB is a basic requirement for research in phonetics, speech recognition, speech synthesis, and related fields. The quantity and quality of the speech DB determine the performance of the system being developed, so the speech DB is an extremely important factor. With the recent development of telephone service technologies such as voice portals, the need to collect telephone speech DBs has grown. Existing IVR-based telephone speech DB collection systems were written in C/C++ or with proprietary development tools. As a result, resources are hard to reuse across application services, and development takes considerable labor and time. VoiceXML, by contrast, is a tag-based language founded on XML with a simple, easy grammar. It can be written with little effort, which reduces labor and time. VoiceXML is also well suited to gathering varied telephone speech DBs because the DB contents can be changed easily. In this paper, we introduce a telephone speech DB gathering system, the most important factor in the development of speech information processing technology.

A Comparison study on the relationship between the Self-reported Voice Problem and Body Mass Index (자가 음성평가와 체질량지수의 특성 비교)

  • Lee, Inae;Hwang, Young-Jin
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.14 no.3
    • /
    • pp.1330-1334
    • /
    • 2013
  • The purpose of this study was to analyze the association between self-reported voice problems and body mass index. Data were collected from the 5th Korea National Health and Nutrition Examination Survey (2010) for 5,811 subjects (2,503 men and 3,308 women) aged 19 years and older. Chi-square tests, t-tests, and multinomial logistic regression analysis were used to compare self-reported voice problems with the variables (age, sex, height, weight, waist measurement, body mass index). Body mass index (OR=1.028, 95% CI: 1.003-1.056) was independently associated with self-reported voice problems (p<0.031). Overweight to stage 2 obesity (OR=1.765, 95% CI: 1.036-3.006) was also independently associated with self-reported voice problems (p<0.036). The results indicate that body mass index is a meaningful risk factor for self-reported voice problems and should be considered when voice is evaluated.

HUVOIS speech service solution based on VoiceXML (VoiceXML기반 HUVOIS 음성처리 솔루션)

  • KIM MOON-SIK
    • Proceedings of the Acoustical Society of Korea Conference
    • /
    • autumn
    • /
    • pp.33-34
    • /
    • 2004
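  • In a telephone service market dominated by call-centered functions, offering advanced value-added services that deliver diverse information requires elements such as Internet integration, speech recognition, speech synthesis, and voice recording, along with a method for developing service scenarios that accommodates the varied requirements of many customers. The HUVOIS solution incorporates a VoiceXML 2.0 interpreter engine conforming to the World Wide Web Consortium standard, a speech recognition engine, and a speech synthesis engine, and was developed to provide an environment in which new value-added services can be delivered easily and quickly. This paper describes the HUVOIS solution developed by KT and the services and businesses that use it.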

Development of voice pen-pal application of global communication system by voice message

  • Lau, Shuai
    • Korean Journal of Artificial Intelligence
    • /
    • v.2 no.1
    • /
    • pp.1-3
    • /
    • 2014
  • Interest in and demand for smart learning have increased rapidly, and video-based English lessons and mobile English-speaking services have become popular. This study presents a prototype application for exchanging voice messages with people around the world, introducing a new concept of voice pen-pal that goes beyond the exchange of text messages. With demand for smart learning growing rapidly, users can study a foreign language on a smartphone and communicate with foreigners by voice anytime and anywhere. The app enables global exchange for learning conversation. Recruiting initial users and establishing a profit model remain problems, and future development will address them.

Voice therapy for pitch problems following thyroidectomy without laryngeal nerve injury (신경학적 손상이 없는 갑상선 술 후 음도문제의 음성치료)

  • Ji-sung Kim;Mi-jin Kim
    • Phonetics and Speech Sciences
    • /
    • v.15 no.3
    • /
    • pp.53-58
    • /
    • 2023
  • After thyroidectomy, some patients who show normal vocal cord movement still complain of subjective voice problems, which could lead to a decrease in quality of life related to communication. This study aims to investigate the effectiveness of a newly designed voice therapy applying neck exercise and semi-occluded vocal tract exercise (SOVTE) to improve voice problems after thyroidectomy without neurological injury. For this purpose, voice therapy was randomly assigned to 10 women who received thyroidectomy. Acoustic analysis [fundamental frequency, jitter, shimmer, noise-to-harmonics ratio, min Voice Range Profile (VRP), max VRP, VRP] was performed before and after surgery and immediately after voice therapy to compare voice changes. The study showed a statistically significant increase in max VRP and VRP after voice therapy compared to before surgery. These results suggest that the voice therapy methods in this study effectively improve a major symptom of voice problems after thyroidectomy, specifically the reduction in the high-frequency range. However, this study was limited in the number of participants and did not control for the type of surgery. Therefore, further research utilizing larger sample sizes and controlled variables is needed to investigate the long-term effects of voice therapy.

Voice Synthesis Detection Using Language Model-Based Speech Feature Extraction (언어 모델 기반 음성 특징 추출을 활용한 생성 음성 탐지)

  • Seung-min Kim;So-hee Park;Dae-seon Choi
    • Journal of the Korea Institute of Information Security & Cryptology
    • /
    • v.34 no.3
    • /
    • pp.439-449
    • /
    • 2024
  • Recent rapid advancements in voice generation technology have enabled the natural synthesis of voices using text alone. However, this progress has led to an increase in malicious activities, such as voice phishing (vishing), where generated voices are exploited for criminal purposes. Numerous models have been developed to detect the presence of synthesized voices, typically by extracting features from the voice and using these features to determine the likelihood of voice generation. This paper proposes a new model for extracting voice features to address misuse cases arising from generated voices. It utilizes a deep learning-based audio codec model and the pre-trained natural language processing model BERT to extract novel voice features. To assess the suitability of the proposed voice feature extraction model for voice detection, four generated voice detection models were created using the extracted features, and performance evaluations were conducted. For performance comparison, three voice detection models based on Deepfeature proposed in previous studies were evaluated against other models in terms of accuracy and EER. The model proposed in this paper achieved an accuracy of 88.08% and a low EER of 11.79%, outperforming the existing models. These results confirm that the voice feature extraction method introduced in this paper can be an effective tool for distinguishing between generated and real voices.

How Well Did We Know About Our Communication? "Origins of Human Communication"

  • Jung-Woo Son
    • Journal of the Korean Academy of Child and Adolescent Psychiatry
    • /
    • v.34 no.1
    • /
    • pp.57-58
    • /
    • 2023
  • Through accurate observation and the results of experimental studies using great apes, the author tells us exactly what we have not known about human communication. The author persuasively conveys to the reader the grand history of development from great apes' gestures to human gestures, and then to human speech. Given that great ape and human gestures were the origin of human voice language, we have once again realized that our language is, after all, an "embodied language."

The Real-time Shopping System using Multipurpose Visual Language with Voice Recognize (음성인식시스템과 다목적 시각 언어를 연동한 실시간 쇼핑 시스템)

  • Kim, Young-Jong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.6
    • /
    • pp.4164-4169
    • /
    • 2015
  • This paper proposes a real-time shopping system that combines a Multipurpose Visual Language System (MVLS) with a voice recognition remote controller. The system's advantage is that existing online and offline shopping systems can be used with the addition of MVLS data, so it can be realized with little modification to an existing shopping system. From the customer's point of view, it has the advantage of using an easy device for shopping: instead of difficult devices such as a keyboard or mouse, customers use a voice recognition remote controller or a smartphone. In particular, information-disadvantaged groups such as the elderly, the infirm, and people with disabilities can easily buy products with this system. Sellers can also collect customer data more easily and use it in future sales strategy.