• Title/Summary/Keyword: human voice

Search Result 355, Processing Time 0.025 seconds

Modular Fuzzy Neural Controller Driven by Voice Commands

  • Izumi, Kiyotaka;Lim, Young-Cheol
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.32.3-32
    • /
    • 2001
  • This paper proposes a layered protocol to interpret voice commands of the user´s own language to a machine, to control it in real time. The layers consist of speech signal capturing layer, lexical analysis layer, interpretation layer and finally activation layer, where each layer tries to mimic the human counterparts in command following. The contents of a continuous voice command are captured by using Hidden Markov Model based speech recognizer. Then the concepts of Artificial Neural Network are devised to classify the contents of the recognized voice command ...

  • PDF

Voice Analysis of Highest Falsetto and Lowest Modal Voice (가성구와 흉성구의 객관적인 음성분석)

  • 진성민;송윤경;권기환;이경철;반재호
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics
    • /
    • v.13 no.2
    • /
    • pp.151-154
    • /
    • 2002
  • Background and Objectives : The pitch range of the human voice is variable, extending from chest register to falsetto register. Although numerous studies have investigated after laryngeal mechanism description of falsetto tone, systematic and objective studies were lack. The purpose of this study was to systematically analyze and compare modal with falsetto voice. Materials and Methods : Seven adult baritones were selected from a larger population of volunteers at choir. Simultaneous measurements of acoustic, electroglottographic and aerodynamic study were made during /e/ sustained in two vocal registers, lowest modal and highest falsetto. Statistical analysis was performed using Wilkoxson signed rankes test. Results : In the acoustic analysis, shimmer was increased in flasetto voice(p<0.05). In the electroglottographic analysis, closed quotient(CQ), speed quotient(SQ) at the modal voice were higher than at the falsetto voice(p<0.05). In the aerodynamic analysis, and airflow rate(MFR) of falsetto voice was higher than modal voice(p<0.05). Conclusions : In the results of the study indicate that, falsetto register ineffective, inefficient, generally unpleasant because it was produced by incomplete clousure of true vocal cord. We anticipated that further study with large samples can provide an objective criteria for status and classification of singer's modal and falsetto voice.

  • PDF

An Analysis of the Vibration Characteristics through the Human Body (인체 내부에서의 진동 전달특성 분석)

  • 전종원;진용옥
    • The Journal of the Acoustical Society of Korea
    • /
    • v.19 no.7
    • /
    • pp.59-65
    • /
    • 2000
  • This paper describes the analysis of vibration characteristics through the human body as the research for voice therapy and diagnosis. The oscillation signal is not external forces but the self-voice to be pronounced the vowels ('a', 'e', 'i', 'o', 'u'). The experiment system consists of microphones, accelerometers and amplifiers. The input data are stored by the computer. At the same time, the voice is stored by the microphone and the vibration signal of the human body is stored by accelerometer. The 63 points are appointed in head, neck, trunk of human body. The positions and number of times are changeable by the purpose. The analysis parameters are amplitude, phase, fundamental. frequency, formant and the correlation of vibration signal and voice is measured by coherence function. The results show that the vibration signals have characteristic vibration in the positions of human body.

  • PDF

Design and Implementation of VoiceXML VUI Browser (VoiceXML VUI Browser 설계/구현)

  • 장민석;예상후
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2002.11a
    • /
    • pp.788-791
    • /
    • 2002
  • The present Web surroundings is composed of HTML(Hypertext Mark-up Language) and thereby users obtains web informations mainly in GUI(Graphical User Interface) environment by clicking mouse in order to keep up with hyperlinked informations. However it is very inconvenient to work in this environment comparing with easily accessed one in which human's voice is utilized for obtaining informations. Using VoiceXML, resulted from XML, for supplying the information through telephone on the basis of the contemporary matured technology of voice recognition/synthesis to work out the inconvenience problem, this paper presents the research results about VoiceXML Web Browser designed and implemented for realizing its technology.

  • PDF

On The Voice Training of Stage Speech in Acting Education - Yuri Vasiliev's Stage Speech Training Method - (연기 교육에서 무대 언어의 발성 훈련에 관하여 - 유리 바실리예프의 무대 언어 훈련방법 -)

  • Xu, Cheng-Kang
    • Journal of Korea Entertainment Industry Association
    • /
    • v.15 no.3
    • /
    • pp.203-210
    • /
    • 2021
  • Yuri Vasilyev - actor, director and drama teacher. Russian meritorious artist, winner of the stage "Medal of Friendship" awarded by Russian President Vladimir Putin; academician of the Petrovsky Academy of Sciences and Arts in Russia, professor of the Russian National Academy of Performing Arts, and professor of the Bavarian Academy of Drama in Munich, Germany. The physiological sense stimulation method based on the improvement of voice, language and motor function of drama actors. On the basis of a systematic understanding of performing arts, Yuri Vasiliev created a unique training method of speech expression and skills. From the complicated art training, we find out the most critical skills for focused training, which we call basic skills training. Throughout the whole training process, Professor Yuri made a clear request for the actor's lines: "action! This is the basis of actors' creation. So action is the key! Action and voice are closely linked. Actor's voice is human voice, human life, human feeling, human experience and disaster. It is also the foundation of creation that actors acquire their own voice. What we are engaged in is pronunciation, breathing, tone and intonation, speed and rhythm, expressiveness, sincerity, stage voice and movement, gesture, all of which are used to train the voice of actors according to the standard of drama. In short, Professor Yuri's training course is not only the training of stage performance and skills, but also contains a rich view of drama and performance. I think, in addition to learning from the means and methods of training, it is more important for us to understand the starting point and training objectives of Professor Yuri's use of these exercises.

Emotion Recognition Using Tone and Tempo Based on Voice for IoT (IoT를 위한 음성신호 기반의 톤, 템포 특징벡터를 이용한 감정인식)

  • Byun, Sung-Woo;Lee, Seok-Pil
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.65 no.1
    • /
    • pp.116-121
    • /
    • 2016
  • In Internet of things (IoT) area, researches on recognizing human emotion are increasing recently. Generally, multi-modal features like facial images, bio-signals and voice signals are used for the emotion recognition. Among the multi-modal features, voice signals are the most convenient for acquisition. This paper proposes an emotion recognition method using tone and tempo based on voice. For this, we make voice databases from broadcasting media contents. Emotion recognition tests are carried out by extracted tone and tempo features from the voice databases. The result shows noticeable improvement of accuracy in comparison to conventional methods using only pitch.

A Novel Computer Human Interface to Remotely Pick up Moving Human's Voice Clearly by Integrating ]Real-time Face Tracking and Microphones Array

  • Hiroshi Mizoguchi;Takaomi Shigehara;Yoshiyasu Goto;Hidai, Ken-ichi;Taketoshi Mishima
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1998.10a
    • /
    • pp.75-80
    • /
    • 1998
  • This paper proposes a novel computer human interface, named Virtual Wireless Microphone (VWM), which utilizes computer vision and signal processing. It integrates real-time face tracking and sound signal processing. VWM is intended to be used as a speech signal input method for human computer interaction, especially for autonomous intelligent agent that interacts with humans like as digital secretary. Utilizing VWM, the agent can clearly listen human master's voice remotely as if a wireless microphone was put just in front of the master.

  • PDF

The effect of the human voice that is consistent with context and the mechanical melody on user's subjective experience in mobile phones (휴대전화 상황에서 맥락과 일치하는 사람음과 단순 기계음이 사용자의 주관적 경험에 미치는 영향)

  • Cho, Yu-Suk;Eom, Ki-Min;Joo, Hyo-Min;Suk, Ji-He;Han, Kwang-Hee
    • Science of Emotion and Sensibility
    • /
    • v.12 no.4
    • /
    • pp.531-544
    • /
    • 2009
  • In the past, objective usability was one of the most important aspects when user used system. But nowadays user's subjective experiences are getting more critical element than objective usability in HCI(human-computer interaction). Most people own their mobile phone and use it frequently these days. It is especially important to make user's subjective experiences more positive when using devices like mobile phones people frequently carry and interact with. This study investigates whether the interfaces which express the emotion give more positive experiences to users. Researchers created mobile phone prototypes to compare the effect of mechanical melody feedback(the major auditory feedbacks on mobile phones) and emotional voice feedback(recorded human voice). Participants experienced four kinds of mobile phone prototypes(no feedback, mechanical melody feedback, emotional voice feedback and dual feedback) and evaluated their experienced usability, hedonic quality and preference. The result suggests that person's perceptional fun and hedonic quality were getting increased in the phone which gave the emotional voice feedback than the mechanical melody feedback. Nevertheless, the preference was evaluated lower in the emotional voice feedback condition than the others.

  • PDF

Design of Intelligent Emotion Recognition Model

  • Kim, Yi-gon
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.11 no.7
    • /
    • pp.611-614
    • /
    • 2001
  • Voice is one of the most efficient communication media and it includes several kinds of factors about speaker, context emotion and so on. Human emotion is expressed is expressed in the speech, the gesture, the physiological phenomena(the breath, the beating of the pulse, etc). In this paper, the emotion recognition method model using neuro-fuzzy in order to have cognizance of emotion from voice signal is presented and simulated.

  • PDF

Acoustic Analyses of Vocal Vibrato of Korean Singers

  • Yoo, Jae-Yeon;Jeong, Ok-Ran;Kwon, Do-Ha
    • Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.37-43
    • /
    • 2005
  • The phenomenon of vocal vibrato may be regarded as an acoustic representation of one of the most rapid and continuous changes in pitch and intensity that the human vocal mechanism is capable of producing. Singers are likely to use vibrato effectively to enrich their voice. The purpose of this study was to obtain acoustic measurements (vF0 and vAm) of 45 subjects (15 trot and 15 ballad singers and 15 non-singers) and to compare acoustic measurements of the vowel /a/ produced by 3 groups on 2 voice sampling conditions (prolongation and singing of /a/). Thirty singers of trot and ballad were selected by a producer and a concert director working for the KBS (Korean Broadcasting System). The MDVP was used to measure the acoustic parameters. A two-way MANOVA was used for statistical analyses. The results were as follows; Firstly, there was no significant difference among the 3 groups in vF0 and vAm in prolongation of /a/, but in singing voice, there was a significant difference among 3 groups in vF0 and vAm. Secondly, there was an interaction between music genre and voice sampling condition in vF0, and vAm. Finally, trot singers sing with more vibrato than ballad singers. It was concluded that it is very important to analyze singers' voice including various voice conditions (prolongation, reading, conversation, and singing) and to identify differences of singing voice characteristics among music genre.

  • PDF