• Title/Summary/Keyword: human voice

Search results: 355

Probabilistic Neural Network Based Learning from Fuzzy Voice Commands for Controlling a Robot

  • Jayawardena, Chandimal;Watanabe, Keigo;Izumi, Kiyotaka
    • Institute of Control, Robotics and Systems: Conference Proceedings (제어로봇시스템학회:학술대회논문집) / 2004.08a / pp.2011-2016 / 2004
  • The study of human-robot communication is one of the most important research areas. Among the various communication media, any useful law found in voice communication between humans is also significant for human-robot interaction. The control strategy of most such systems available at present is on/off control: the robot activates a function if a particular word or phrase associated with that function is recognized in the user's utterance. Recently, there has been some research on controlling robots using information-rich fuzzy commands such as "go little slowly". However, although those works considered voice command interpretation, they did not treat learning from such commands. This paper studies learning from such information-rich voice commands for controlling a robot. New concepts of the coach-player model and the sub-coach are proposed and demonstrated on a PA-10 redundant manipulator.

Voice and Sasang Constitution: In terms of source functions (음성과 사상체질: 음원을 중심으로)

  • Moon Seung-Jae;Park Jong-ju;Hwang Hye-jeong
    • MALSORI / no.48 / pp.19-33 / 2003
  • Sasang Constitutional Medicine, a branch of traditional Korean medicine, holds that human health can be promoted by taking advantage of the fact that people have different constitutions. It uses characteristics of the human voice to diagnose a patient's constitution. This study aims to establish the relationship between Sasang constitutions and their corresponding voice characteristics by investigating source-related variables. Voice recordings were obtained from 23 patients of three different constitutions, whose constitutions had already been diagnosed by experts in the field. Fundamental-frequency-related variables (average pitch, maximum/minimum pitch, pitch range), phonation type, and speaking tempo were measured and analyzed for each group. Phonation type appeared to be a possible candidate variable for determining constitution. No statistically significant relationship was found between the other variables and constitutions. Despite its failure to firmly establish the relationship between voice and constitutions, the current study suggests that future research should include not only source-related variables

Divine Instrument : Voice (신이 주신 악기 : 목소리)

  • Kim, Han-Su
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.22 no.2 / pp.103-105 / 2011
  • The voice defines man. Voice and speech have built all human civilizations. People can communicate with each other by voice and enjoy their spare time by singing. Indeed, the voice is the first and most beautiful musical instrument in history. The aim of this review article is to consider the voice as a musical instrument.

Design and Implementation of IVR Server Using VoiceXML (VoiceXML을 이용한 IVR 서버 설계 및 구현)

  • Lee, Chang-Ho;Jang, Won-Jo;Kang, Sun-Mee
    • Speech Sciences / v.9 no.3 / pp.47-59 / 2002
  • New services that use the human voice and DTMF (Dual-Tone Multi-Frequency) signaling are now expected to make valuable information on the Internet easier to obtain. VoiceXML (Voice eXtensible Markup Language) makes such services possible. This paper describes the design and implementation of an IVR (Interactive Voice Response) server using VoiceXML, which connects the Internet and the IVR server efficiently. The IVR server is composed of two parts: VoiceXML document handling and VoiceXML execution. The scenario part of the IVR server corresponds to the VoiceXML document, and execution is performed by the VoiceXML execution part.

Emotion Detecting Method Based on Various Attributes of Human Voice

  • MIYAJI Yutaka;TOMIYAMA Ken
    • Science of Emotion and Sensibility / v.8 no.1 / pp.1-7 / 2005
  • This paper reports several emotion detection methods based on various attributes of the human voice. These methods were developed at our Engineering Systems Laboratory. In all of the proposed methods, only the prosodic information in the voice is used for emotion recognition; semantic information is not used. Different types of neural networks (NNs) are used for detection, depending on the type of voice parameters. Earlier approaches used linear prediction coefficients (LPCs) and time-series pitch data separately, but later studies combined them. The proposed methods are explained first, and then evaluation experiments on the individual methods are presented and their emotion detection performance is compared.

An Interactive Voice Web Browser Usable as a Multimodal Interface in Information Devices by Using VoiceXML

  • Jang, Min-Seok
    • Journal of the Korean Institute of Intelligent Systems / v.14 no.6 / pp.771-775 / 2004
  • The present Web environment is mostly composed of HTML (Hypertext Markup Language), so users obtain Web information mainly in a GUI (Graphical User Interface) environment, clicking a mouse to follow hyperlinked information. However, working in this environment is very inconvenient compared with one in which the human voice is used to obtain information. To address this inconvenience, VoiceXML, which is derived from XML, supplies information over the telephone on the basis of mature contemporary voice recognition and synthesis technology. This paper presents research results on a VoiceXML VUI (Voice User Interface) browser designed and implemented to realize this technology, and on a VoiceXML dialog designed for efficient use of the browser.

Convergence of the Image of the Professor in Human Resources of Small and Medium Enterprises to Self Image : Mediating effect of voice image (중소기업 인적자원의 교수자이미지가 자아이미지에 미치는 융합연구 : 교수자음성이미지의 매개효과)

  • Kim, Jeoung-Yeoul
    • Journal of Convergence for Information Technology / v.7 no.4 / pp.229-234 / 2017
  • The purpose of this study was to survey 188 university students at Seoul National University and to present self-image data on university students for the development of human resources for small and medium enterprises (SMEs). The results of the study are as follows. First, there were positive correlations between the image of the trainee perceived by university students and their self-image, between the image of the trainee perceived by university students and the voice image, and between the voice image and the self-image perceived by university students. Second, the study examined whether the voice image mediates the relationship between the image of the talent and the self-image perceived by university students. It is therefore confirmed that as the image level of the talent related to the human resources of SMEs increases, the level of the voice image increases and the self-image level improves accordingly.

Human Voice, This Mystery

  • Horiuchi, Terumichi
    • Proceedings of the KSPS conference / 1996.10a / pp.378-378 / 1996
  • Human beings and chimpanzees are very much alike, and scientists say there is only a 1% difference between them. Contrary to our expectations, the difference lies not in the brain but in the trachea (windpipe). Those of human beings are bigger and longer than those of chimpanzees. This means more air is inspired and expired as breath. There are interesting descriptions of breath in the Bible. Genesis says that God made a man out of soil and breathed life-giving breath into his nostrils, and the man began to live. Another part says that life exists between incoming breath and outgoing breath. Thus breath plays a key role in our life. In Hebrew and Greek, breath and spirit are the same word: in Hebrew it is ‘Luahf’ and in Greek, ‘Pneuma’. With breath and the organs of the mouth, human beings produced voice, and with heritage and through learning we train our voice to reach the level of language, which conveys our culture. My contention is that we should recognize the gift of voice and train it so that it can perform its proper function as a tool for conveying our thought and culture. This is a kind of speech practice, and it may be called speechology. It includes the following practical methods: 1. Try to read aloud. 2. Encourage recitation. 3. Do as much public speaking as possible. 4. Learn the theory of phonetics, such as pronunciation, accent, intonation, prominence, assimilation and so on.

AI Voice Agent and Users' Response (AI 음성 에이전트의 음성 특성에 대한 사용자 반응 연구)

  • Beak, Seung Ju;Jung, Yoon Hyuk
    • The Journal of Information Systems / v.31 no.2 / pp.137-158 / 2022
  • Purpose: As artificial intelligence voice agents (AIVA) have been widely adopted in services, diverse forms of their voices, which are the main interface with users, have been experimented with. The purpose of this study is to examine how users evaluate the vocal characteristics (gender, voice pitch, and voice pace) of AIVA, drawing on prior research on human voice attractiveness. Design/methodology/approach: This study employed an experimental survey in which 516 people participated. Each participant was randomly assigned to one of eight situations (e.g., male - higher pitch - faster pace) and listened to an AIVA voice sample that presented weather information. Each participant then rated three outcome factors (attractiveness, trust, and anthropomorphism). Findings: The results reveal that female AIVA voices were perceived as more attractive and trustworthy than male voices. As for voice pitch, lower-pitch voices were preferred among female voices, while higher-pitch voices were preferred among male voices. Finally, faster AIVA voices were more attractive than slower voices.

Personal Credit Evaluation System through Telephone Voice Analysis: By Support Vector Machine

  • Park, Hyungwoo
    • Journal of Internet Computing and Services / v.19 no.6 / pp.63-72 / 2018
  • The human voice is one of the easiest means of transmitting information between human beings. The characteristics of a voice vary from person to person and include the speed of speech, the form and function of the vocal organs, pitch, speech habits, and gender. The human voice is a key element of human communication. In the era of the Fourth Industrial Revolution, voice is also a major means of communication between humans, between humans and machines, and between machines. For that reason, people try to communicate their intentions to others clearly, and in the process the voice carries various additional information along with the linguistic information, such as emotional state, health status, degree of trust, the presence of a lie, and changes due to drinking. This linguistic and non-linguistic information appears in various parameters obtained through voice analysis and can be used as a means of evaluating an individual's creditworthiness. In particular, it can be obtained by analyzing the relationship between the characteristics of the fundamental frequency (basic tonality) of the vocal cords and the characteristics of the resonance frequency of the vocal tract. Previous research studied the necessity of various methods of credit evaluation and the change in voice characteristics according to the change in credit status. In this study, we propose a personal credit discriminator based on machine learning using parameters extracted from the voice.