• 제목/요약/키워드: Human voice

검색결과 353건 처리시간 0.024초

합성곱 신경망을 이용한 스마트 토이의 음성명령 학습에 관한 연구 (A Study on Voice Command Learning of Smart Toy using Convolutional Neural Network)

  • 이경민;박철원
    • 전기학회논문지
    • /
    • 제67권9호
    • /
    • pp.1210-1215
    • /
    • 2018
  • Recently, as the IoT(Internet of Things) and AI(Artificial Intelligence) technologies have developed, smart toys that can understand and act on the language of human beings are being studied. In this paper, we study voice learning using CNN(Convolutional Neural Network) by applying artificial intelligence based voice secretary technology to smart toy. When a human voice command gives, Smart Toy recognizes human voice, converts it into text, analyzes the morpheme, and conducts tagging and voice learning. As a result of test for the simulator program implemented using Python, no malfunction occurred in a single command. And satisfactory results were obtained within the selected simulation condition range.

Cannula-typed Silicone Voice Prosthesis(소망$\circledR$)의 개발 (Development of Cannula-typed Silicone Voice Prosthesis(So-Mang$\circledR$))

  • 최홍식;정은주;전희선;문인석;김영호;김광문
    • 대한후두음성언어의학회지
    • /
    • 제12권2호
    • /
    • pp.152-157
    • /
    • 2001
  • Background : Electrolarynx, Esophageal voice, and Silicone voice prosthesis with tracheoesophageal(T-E) fistula have been used as vocal rehabilitating methods for the post-laryngectomized patients. Prosthetic rehabilitation of voice after total laryngectomy has gained wide acceptance and has become a common practice in many clinics since the pioneering works of Singer and Blom In 1979. Since the introduction of tracheo-esophageal puncture and application of Blom Singer$\circledR$ voice prosthesis in 1980, several reliable voice prostheses have been developed and are successfully being used. Objectives : Even though quality of voice produced by Silicone voice prosthesis with T-E fistula is superior to other modalities, it still has some disadvantages. We devised a new cannulatyped silicone voice prosthesis. Methods : 1) Devising a new prototype of cannula-typed silicone voice prosthesis. 2) Application of the prototype using canine animal model(laryngectormized dog) and fitting trial on human patient whose previously inserted Silicone voice prosthesis is not functioning due to presumed fungal infection. Discussion : Final form of prototype was made after several times of major and minor modifications. Insertion of the newly developed Cannula-typed Silicone voice prosthesis on canine animal model and human trial were done without any difficulty. There were no serious leakage of saliva or food during swallowing. Conclusion : The newly developed Cannula-typed Silicone voice prosthesis(So-Mang$\circledR$) and the modified replacement method will further improve the results of post-laryngectomized prosthetic voice rehabilitation. Long-term animal study and human trial are planned in the near future.

  • PDF

인간-로봇 상호협력작업을 위한 모바일로봇의 지능제어에 관한 연구 (A Study on Intelligent Control of Mobile Robot for Human-Robot Cooperative Operation in Manufacturing Process)

  • 김두범;배호영;김상현;임오득;백영태;한성현
    • 한국산업융합학회 논문집
    • /
    • 제22권2호
    • /
    • pp.137-146
    • /
    • 2019
  • This study proposed a new technique to control of mobile robot based on voice command for (Human-Robot Cooperative operation in manufacturing precess). High performance voice recognition and control system was designed In this paper for smart factory. robust voice recognition is essential for a robot to communicate with people. One of the main problems with voice recognition robots is that robots inevitably effects real environment including with noises. The noise is captured with strong power by the microphones, because the noise sources are closed to the microphones. The signal-to-noise ratio of input voice becomes quite low. However, it is possible to estimate the noise by using information on the robot's own motions and postures, because a type of motion/gesture produces almost the same pattern of noise every time it is performed. In this paper, we describe an robust voice recognition system which can robustly recognize voice by adults and students in noisy environments. It is illustrated by experiments the voice recognition performance of mobile robot placed in a real noisy environment.

Mahalanobis Taguchi System을 이용한 파킨슨병 환자의 음성분석을 통한 진단에 관한 연구 (Diagnosis of Parkinson's Disease by Voice Disorder Using Mahalanobis Taguchi System)

  • 홍정의
    • 산업경영시스템학회지
    • /
    • 제32권4호
    • /
    • pp.215-222
    • /
    • 2009
  • Human voice reacts very sensitively to human's minute physical condition. For instance, human voice disorders affect patients profoundly especially in the case of Parkinson's disease. Acoustic tools such as MDVP, can function as an equipment that measures various voice in different objects. Many different approaches have been applied for analyzing the voice disorders for diagnosis of Parkinson's disease. According to the voice data of suspected Parkinson's patients from UCI Machine Learning Repository, it is reported to have 23 people with Parkinson's disease and 8 healthy people. Applying Mahalanobis Taguchi System (MTS) for diagnosis of Parkinson's disease, the correct diagnosis performance is compared to previous research results.

차량실내에서 음성출력장치의 소음비교특성에 관한 연구 (A Study on the Characteristics of Noise Comparison in Voice Warning System in the automobile indoors)

  • 한영출;김대열;오상기
    • 한국자동차공학회논문집
    • /
    • 제11권2호
    • /
    • pp.196-202
    • /
    • 2003
  • The object of this article is to study the plausibility of applying human voice warning system to automobiles. Human voice is considered the best tool for warning system in automobiles. For the purpose of comprehending the specific characteristics of relation between noises and properties of the automobiles indoors and voice warning system researcher performed FRF test in order to examine the characteristics of voice output, and FEM simulation to learn the specific properties of the car indoors. And furthermore, surveyed the quality of voice output, using the written inquiry to examine members. The result of the study shows that it is much possible to apply voice warning system to automobiles.

인간의 감정 인식을 위한 신경회로망 기반의 휴먼과 컴퓨터 인터페이스 구현 (Implementation of Human and Computer Interface for Detecting Human Emotion Using Neural Network)

  • 조기호;최호진;정슬
    • 제어로봇시스템학회논문지
    • /
    • 제13권9호
    • /
    • pp.825-831
    • /
    • 2007
  • In this paper, an interface between a human and a computer is presented. The human and computer interface(HCI) serves as another area of human and machine interfaces. Methods for the HCI we used are voice recognition and image recognition for detecting human's emotional feelings. The idea is that the computer can recognize the present emotional state of the human operator, and amuses him/her in various ways such as turning on musics, searching webs, and talking. For the image recognition process, the human face is captured, and eye and mouth are selected from the facial image for recognition. To train images of the mouth, we use the Hopfield Net. The results show 88%$\sim$92% recognition of the emotion. For the vocal recognition, neural network shows 80%$\sim$98% recognition of voice.

수화자(受話者) 구별을 위한 PAMD 구현 (Implement PAMD for discriminate human and ARS)

  • 서봉수
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2003년도 신호처리소사이어티 추계학술대회 논문집
    • /
    • pp.61-64
    • /
    • 2003
  • In this paper, we implement PAMD(Positive Answering Machine Detection) for discrimination human and ARS. We are used Grunt detection, Glitch Noise detection and Tone detection for PAMD. It distinguishes voice signals from ring-back tone and glitch noise respectively. And as a second step, it judges whether human responses or ARS responses after integrating pattern changes like initial response period, the number of voice data, each time of voice data period and glitch noise. The accuracy is about 9375 in ASR and about 98% in Mobile phone.

  • PDF

동굴관광용 고층수직이동 승강기의 긴급 음성구동 제어 (Voice Recognition Sensor Driven Elevator for High-rise Vertical Shift)

  • 최병섭;강태현;윤여훈;장훈규;소대화
    • 동굴
    • /
    • 제88호
    • /
    • pp.1-7
    • /
    • 2008
  • Recently, it is one of very interest technology of Human Computer Interaction(HCI). Nowadays, it is easy to find out that, for example, inside SF movies people has talking to computer. However, there are difference between CPU language and ours. So, we focus on connecting to CPU. For 30 years many scientists experienced in that technology. But it is really difficult. Our project goal is making that CPU could understand human voice. First of all the signal through a voice sensor will move to BCD (binary code). That elevator helps out people who wants to move up and down. This product's point is related with people's safety. Using a PWM for motor control by ATmega16, we choose a DC motor to drive it because of making a regular speed elevator. Furthermore, using a voice identification module the elevator driven by voice sensor could operate well up and down perfectly from 1st to 10th floor by PWM control with ATmega16. And, it will be clearly useful for high-rise vertical shift with voice recognition sensor driven.

음성모음과 신체의 상관관계 분석 (An Analysis of Correlation between Voice vowels and Human body)

  • 최인호;전종원
    • 한국항행학회논문지
    • /
    • 제14권3호
    • /
    • pp.375-383
    • /
    • 2010
  • 본 논문은 음성진단이나 음성치료를 위한 연구로서 음성과 신체의 상관관계를 분석한 것이다. 음성신호와 함께 신체의 머리와 가슴 그리고 복부에서 음성에 의한 진동파형을 측정하였으며, 이 때 사용한 음성은 모음 '아', '에', '이', '오', '우' 이다. 그 결과 모음에 따라 신체의 특징을 잘 나타내는 성분을 확인할 수 있었으며, 신체질량지수(BMI)와의 상관계수를 측정하여 음성에 의한 신체조건 진단의 활용방안을 제시하였다.

Greeting, Function, and Music: How Users Chat with Voice Assistants

  • Wang, Ji;Zhang, Han;Zhang, Cen;Xiao, Junjun;Lee, Seung Hee
    • 감성과학
    • /
    • 제23권2호
    • /
    • pp.61-74
    • /
    • 2020
  • Voice user interface has become a commercially viable and extensive interaction mechanism with the development of voice assistants. Despite the popularity of voice assistants, the academic community does not utterly understand about what, when, and how users chat with them. Chatting with a voice assistant is crucial as it defines how a user will seek the help of the assistant in the future. This study aims to cover the essence and construct of conversational AI, to develop a classification method to deal with user utterances, and, most importantly, to understand about what, when, and how Chinese users chat with voice assistants. We collected user utterances from the real conventional database of a commercial voice assistant, NetEase Sing in China. We also identified different utterance categories on the basis of previous studies and real usage conditions and annotated the utterances with 17 labels. Furthermore, we found that the three top reasons for the usage of voice assistants in China are the following: (1) greeting, (2) function, and (3) music. Chinese users like to interact with voice assistants at night from 7 PM to 10 PM, and they are polite toward the assistants. The whole percentage of negative feedback utterances is less than 6%, which is considerably low. These findings appear to be useful in voice interaction designs for intelligent hardware.