• 제목/요약/키워드: Human voice

검색결과 353건 처리시간 0.028초

음성명령기반 26관절 보행로봇 실시간 작업동작제어에 관한 연구 (A Study on Real-Time Walking Action Control of Biped Robot with Twenty Six Joints Based on Voice Command)

  • 조상영;김민성;양준석;구영목;정양근;한성현
    • 제어로봇시스템학회논문지
    • /
    • 제22권4호
    • /
    • pp.293-300
    • /
    • 2016
  • The Voice recognition is one of convenient methods to communicate between human and robots. This study proposes a speech recognition method using speech recognizers based on Hidden Markov Model (HMM) with a combination of techniques to enhance a biped robot control. In the past, Artificial Neural Networks (ANN) and Dynamic Time Wrapping (DTW) were used, however, currently they are less commonly applied to speech recognition systems. This Research confirms that the HMM, an accepted high-performance technique, can be successfully employed to model speech signals. High recognition accuracy can be obtained by using HMMs. Apart from speech modeling techniques, multiple feature extraction methods have been studied to find speech stresses caused by emotions and the environment to improve speech recognition rates. The procedure consisted of 2 parts: one is recognizing robot commands using multiple HMM recognizers, and the other is sending recognized commands to control a robot. In this paper, a practical voice recognition system which can recognize a lot of task commands is proposed. The proposed system consists of a general purpose microprocessor and a useful voice recognition processor which can recognize a limited number of voice patterns. By simulation and experiment, it was illustrated the reliability of voice recognition rates for application of the manufacturing process.

Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator

  • Park, Hyung Woo
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권2호
    • /
    • pp.962-977
    • /
    • 2019
  • The human voice is a convenient method of information transfer between different objects such as between men, men and machine, between machines. The development of information and communication technology, the voice has been able to transfer farther than before. The way to communicate, it is to convert the voice to another form, transmit it, and then reconvert it back to sound. In such a communication process, a vocoder is a method of converting and re-converting a voice and sound. The CELP (Code-Excited Linear Prediction) type vocoder, one of the voice codecs, is adapted as a standard codec since it provides high quality sound even though its transmission speed is relatively low. The EVRC (Enhanced Variable Rate CODEC) and QCELP (Qualcomm Code-Excited Linear Prediction), variable bit rate vocoders, are used for mobile phones in 3G environment. For the real-time implementation of a vocoder, the reduction of sound quality is a typical problem. To improve the sound quality, that is important to know the size and shape of noise. In the existing sound quality improvement method, the voice activated is detected or used, or statistical methods are used by the large mount of data. However, there is a disadvantage in that no noise can be detected, when there is a continuous signal or when a change in noise is large.This paper focused on finding a better way to decrease the reduction of sound quality in lower bit transmission environments. Based on simulation results, this study proposed a preprocessor application that estimates the SNR (Signal to Noise Ratio) using the spectral SNR estimation method. The SNR estimation method adopted the IMBE (Improved Multi-Band Excitation) instead of using the SNR, which is a continuous speech signal. Finally, this application improves the quality of the vocoder by enhancing sound quality adaptively.

음성기술을 이용한 정신피로 측정에 관한 타당성 연구 (A Validity Study on Measurement of Mental Fatigue Using Speech Technology)

  • 송승규;김종열;장준수;권철홍
    • 말소리와 음성과학
    • /
    • 제5권1호
    • /
    • pp.3-10
    • /
    • 2013
  • This study proposes a method to measure mental fatigue using speech technology, which has not been used in previous research and is easier than existing complex and difficult methods. It aims at establishing a relationship between the human voice and mental fatigue based on experiments to measure the influence of mental fatigue on the human voice. Two monotonous tasks of simple calculation such as finding the sum of three one digit numbers were used to measure the feeling of monotony and two sets of subjective questionnaires were used to measure mental fatigue. While thirty subjects perform the experiment, responses to the questionnaire and speech data were collected. Speech features related to speech source and the vocal tract filter were extracted from the speech data. According to the results, speech parameters deeply related to mental fatigue are a mean and standard deviation of fundamental frequency, jitter, and shimmer. This study shows that speech technology is a useful method for measuring mental fatigue.

시각장애인 유도로봇에서의 위치 설정 및 탐색에 대한 음성시스템의 설계 및 구현 (Design and Implementation of voice system about location set and search in the blind guidable robot)

  • 박승우;신동범;이응혁;홍승홍
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(5)
    • /
    • pp.125-128
    • /
    • 2002
  • One of ultimate purpose that performance to information society been going recently festinately intends is in human's welfare improvement. Also, research about assist for disabled person that belong on category that is disabled persons' cloth elevation estranged in the past according to disabled person population's increase and change of advanced human rights consciousness to ruins of industrial society and traffic civilization is afoot abuzz. Guidance robot of sight obstacle can speak as its part. This research is thing about voice system about location set and search in guidance robot that is embodying to make sight disabled person can visit schedule place smoothly.

  • PDF

음성의 특정 주파수 범위를 이용한 잡음환경에서의 감정인식 (Noise Robust Emotion Recognition Feature : Frequency Range of Meaningful Signal)

  • 김은호;현경학;곽윤근
    • 한국정밀공학회지
    • /
    • 제23권5호
    • /
    • pp.68-76
    • /
    • 2006
  • The ability to recognize human emotion is one of the hallmarks of human-robot interaction. Hence this paper describes the realization of emotion recognition. For emotion recognition from voice, we propose a new feature called frequency range of meaningful signal. With this feature, we reached average recognition rate of 76% in speaker-dependent. From the experimental results, we confirm the usefulness of the proposed feature. We also define the noise environment and conduct the noise-environment test. In contrast to other features, the proposed feature is robust in a noise-environment.

Indexing and Retrieval of Human Individuals on Video Data Using Face and Speaker Recognition

  • Y.Sugiyama;N.Ishikawa;M.Nishida;Y.Ariki
    • 한국방송∙미디어공학회:학술대회논문집
    • /
    • 한국방송공학회 1998년도 Proceedings of International Workshop on Advanced Image Technology
    • /
    • pp.122-127
    • /
    • 1998
  • In this paper, we focus on the information retrieval of human individuals who are recorded on the video database. Our purpose is to index persons by their faces or voice and to retrieve their existing time sections on the video data. The database system can track as well as extract a face or voice of a certain person and construct a model of the individual person in self-organization mode. If he appears again at different time, the system can put the mark of the same person to the associated frames. In this way, the same person can be retrieved even if the system does not know his exact name. As the face and speaker modeling, a subspace method is employed to improve the indexing accuracy.

  • PDF

전투기용 음성명령 시스템에 대한 연구 (A Study on Cockpit Voice Command System for Fighter Aircraft)

  • 김성우;서민기;오영환;김봉규
    • 한국항공우주학회지
    • /
    • 제41권12호
    • /
    • pp.1011-1017
    • /
    • 2013
  • 음성은 사람의 가장 자연스러운 정보 전달 수단이며, 음성인식 기술은 사람이 기계를 사용하는데 있어 편의성을 높이기 위해 필요성이 점차 증대되고 있다. 현대 전투기의 조종석은 디지털 기술의 발달로 인하여 항공전자 장비의 기능이 다양하고 복잡해지고 있으며, 전투기를 조종하여 공격 임무를 수행해야 하는 조종사에게 항공전자 장비의 운용으로 인한 임무 부하량이 증대되기 마련이다. 따라서 음성인식 기술을 이용하여 항공전자장비를 운용하게 되면, 조종사는 공격 임무에 더 많은 시간과 노력을 할애할 수 있게 된다. 본 연구는 전투기 조종석에 적용 가능한 음성명령 시스템을 개발하고, 검증환경을 구축하여 음성명령 시스템의 기능 및 성능을 검증한 것이다.

Interactive Adaptation of Fuzzy Neural Networks in Voice-Controlled Systems

  • Pulasinghe, Koliya;Watanabe, Keigo;Izumi, Kiyotaka;Kiguchi, Kazuo
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 제어로봇시스템학회 2002년도 ICCAS
    • /
    • pp.42.3-42
    • /
    • 2002
  • Fuzzy Neural Network (FNN) is a compulsory element in a voice-controlled machine due to its inherent capability of interpreting imprecise natural language commands. To control such a machine, user's perception of imprecise words is very important because the words' meaning is highly subjective. This paper presents a voice based controller centered on an adaptable FNN to capture the user's perception of imprecise words. Conversational interface of the machine facilitates the learning through interaction. The system consists of a dialog manager (DM), the conversational interface, a Knowledge base, which absorbs user's perception and acts as a replica of human understanding of imprecise words,...

  • PDF

고속 음성 문서 검색을 위한 Expected Matching Score 기반의 문서 확장 기법 (Expected Matching Score Based Document Expansion for Fast Spoken Document Retrieval)

  • 서민구;정규준;오영환
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2006년도 추계학술대회 발표논문집
    • /
    • pp.71-74
    • /
    • 2006
  • Many works have been done in the field of retrieving audio segments that contain human speeches without captions. To retrieve newly coined words and proper nouns, subwords were commonly used as indexing units in conjunction with query or document expansion. Among them, document expansion with subwords has serious drawback of large computation overhead. Therefore, in this paper, we propose Expected Matching Score based document expansion that effectively reduces computational overhead without much loss in retrieval precisions. Experiments have shown 13.9 times of speed up at the loss of 0.2% in the retrieval precision.

  • PDF

A Mobile Stress Management System utilizing Variable Voice Information According to the Wearing Area

  • Kang, Byeongsoo;Vannroath, Ky;Kang, Hyun-syug
    • 한국컴퓨터정보학회논문지
    • /
    • 제22권6호
    • /
    • pp.95-100
    • /
    • 2017
  • Recently, as stress has become a major threat to people's health, there is a growing interest in wearable stress management services for stress relief. In this paper, we developed a wearable device(Care-on) capable of extracting changeable human voice information at each site and a Healthcare App(S-Manager) that enables stress management in real time using the wearable device. It collects and analyzes variable real-time voice information for each part of the person's body. And It also provides the ability to monitor stress conditions in a mobile environment and provide feedback on the analysis results in step by step in the mobile environment. We tested the developed wearable devices and app in a mobile environment and analyzed the results to confirm their usefulness.