• Title/Summary/Keyword: Voice Problem


Interactive content development of voice pattern recognition (음성패턴인식 인터랙티브 콘텐츠 개발)

  • Na, Jong-Won
    • Journal of Advanced Navigation Technology / v.16 no.5 / pp.864-870 / 2012
  • This paper applies voice pattern recognition technology to problems common to existing language-learning content. The first problem is learner attentiveness in online learning: when a lesson opens a game or another web page, students' concentration falls. The second is that there has been no way to determine whether the learner actually read aloud during speaking practice. The third is the gap between the mechanical progress tracking done by a learning management system and the teacher's own evaluation of each student's learning progress. Finally, the biggest problem is that all of the above must be solved while keeping existing content intact. Against this background, a dedicated speaking-learning program uses voice pattern recognition technology to provide speech recognition within the learning process itself. During recognition, the learner's utterance is converted into an audio file and transferred to a specified location on a server or to a SQL server. The feature can therefore be inserted easily into any system or program and applied to content that has already been created without damaging its existing components. The paper contributes to a shift toward more interactive teaching methods that encourage active participation in class.

Voice Recognition Performance Improvement using a convergence of Voice Energy Distribution Process and Parameter (음성 에너지 분포 처리와 에너지 파라미터를 융합한 음성 인식 성능 향상)

  • Oh, Sang-Yeob
    • Journal of Digital Convergence / v.13 no.10 / pp.313-318 / 2015
  • Traditional speech enhancement methods distort the sound spectrum when they estimate residual noise, or they estimate the noise incorrectly, and either outcome lowers speech recognition performance. In this paper, we propose a speech detection method that combines sound energy distribution processing with sound energy parameters. The proposed method reduces the influence of noise in order to maximize voice energy. In addition, for intervals where the feature parameters of the speech signal have small values, the log energy is raised relative to regions with large energy so that it resembles the log energy of a noisy speech signal, which reduces the mismatch between the training and recognition environments. Recognition experiments confirmed improved performance compared with the conventional method. In a car-noise environment, the pause hit rate showed accuracies of 97.1% and 97.3% in the low-SNR region (0 dB and 5 dB) and 98.3% and 98.6% in the high-SNR region (10 dB and 15 dB).
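
The log energy features and SNR levels mentioned in this abstract can be illustrated with a generic energy-threshold detector. This is a minimal sketch, not the authors' method: the function names, the non-overlapping 160-sample frames, the energy floor, and the fixed threshold are all assumptions made for illustration.

```python
import math

def frame_log_energies(samples, frame_len=160, floor=1e-10):
    """Natural-log energy of each non-overlapping frame (assumed framing)."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        energy = sum(x * x for x in frame)
        # The floor avoids log(0) on silent frames.
        energies.append(math.log(max(energy, floor)))
    return energies

def snr_db(signal_power, noise_power):
    """Signal-to-noise ratio in decibels."""
    return 10.0 * math.log10(signal_power / noise_power)

def detect_speech(log_energies, threshold):
    """Mark a frame as speech when its log energy exceeds the threshold."""
    return [e > threshold for e in log_energies]

# One silent frame followed by one frame of constant amplitude 1.0.
samples = [0.0] * 160 + [1.0] * 160
log_e = frame_log_energies(samples)
flags = detect_speech(log_e, threshold=0.0)
```

A fixed threshold like this degrades quickly at low SNR, which is the regime (0 dB and 5 dB) where the abstract reports its results.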

Implementation of VoIP Service in Hybrid Fiber Coaxial Network (Hybrid Fiber Coaxial망에서 VoIP 서비스 구현)

  • Ju, Jae-han
    • Journal of Advanced Navigation Technology / v.21 no.1 / pp.113-118 / 2017
  • As interest in mobile devices and networks has increased recently, voice over internet protocol (VoIP) service, a technology for transmitting voice data over an existing internet protocol (IP) network, has spread rapidly, and cheap voice call service has become possible. As digital broadcasting service becomes popular, hybrid fiber coaxial (HFC) network technology, which uses a broadband cable network through the fusion of broadcasting and communication, utilizes existing communication systems and network equipment to provide various new services such as interactive broadcasting. If UGS-AD is applied to VoCM and RTPS is applied to MTA to guarantee the quality of voice data in an actual HFC Internet service network, voice data can be transmitted smoothly in the narrow upstream band, which is a problem in commercial HFC networks. We also propose a method to improve VoIP service by improving the QoS of voice data in the HFC Internet service network.

A policy study for the voice recognition technology based on elderly health care (음성인식기술의 노인간병 적용을 위한 정책연구)

  • Cho, Byung-Chul;Cheon, Sooyoung;Kim, Kab-Nyun;Yuk, Hyun-Seung
    • Journal of Digital Convergence / v.16 no.2 / pp.9-17 / 2018
  • The purpose of this study is to find out how voice recognition technology can be used to address the problems of Korea's rapidly aging population. Public support services and private nursing services for the elderly are expected to expand in Korea. In this setting, voice recognition technology can be used in various ways for elderly people who are not familiar with media interfaces. To this end, our researchers visited Japan and examined what voice recognition technology has achieved in elderly care there. In particular, caregivers greatly reduced their working hours by replacing handwritten reports with reports written using voice recognition technology. This method can be easily implemented in Korea. In addition, the social cost of supporting the elderly can be gradually reduced through the development of robots equipped with voice recognition technology. Consequently, we find that combining voice recognition technology with artificial intelligence programs offers various emotion recognition functions as well as various policy possibilities.

The research on the MEMS device improvement which is necessary for the noise environment in the speech recognition rate improvement (잡음 환경에서 음성 인식률 향상에 필요한 MEMS 장치 개발에 관한 연구)

  • Yang, Ki-Woong;Lee, Hyung-keun
    • Journal of the Korea Institute of Information and Communication Engineering / v.22 no.12 / pp.1659-1666 / 2018
  • When the input mixes voice with other sound, the voice recognition rate falls because of the noise. This study improves the speech recognition rate by improving the MEMS device, a hardware (H/W) component, in order to overcome the limits of software (S/W) processing. The MEMS microphone is a device for voice input that is implemented and used in various shapes. Conventional MEMS microphones generally perform well, but in special environments such as noisy ones, processing performance deteriorates because voice and other sound are mixed. To overcome these problems, we developed a newly designed MEMS device that can detect voice characteristics at the initial input device.

Development of Voice Guide Service for Pharmaceutical Information based on Ontology

  • Lee, Kyung Min;Kang, Min Soo;Jung, Yong Gyu
    • International Journal of Advanced Culture Technology / v.6 no.1 / pp.50-59 / 2018
  • In general, people with disabilities tend to have poorer health and lower income levels, so their need for health care is higher than that of non-disabled people. Although the number of persons with disabilities increases every year, the medical services and support available to them are still limited. The same problem applies to access to medical information. Conventional medical information is usually printed and handed to the patient, but visually impaired people have difficulty accessing printed information. Many visually impaired people cannot read braille either, let alone printed text, because a high proportion of them acquired their impairment later in life. This paper therefore tries to solve the problem by delivering medicine information by voice using RFID. In addition, an ontology was used to select more accurate drug information. Drug information sites are currently provided by the Ministry of Health and Welfare. However, because duplicate information is scattered across these sites, the ontology was used to build the database.

Extraction of voice signal embedded in 1/f noise using wavelet

  • Toyama, Naoki;Sasaya, Takashi;Akizuki, Kageo
    • 제어로봇시스템학회:학술대회논문집 (Institute of Control, Robotics and Systems: Conference Proceedings) / 1997.10a / pp.564-567 / 1997
  • This paper deals with the problem of extracting a voice signal embedded in 1/f noise. We propose an extraction method using wavelets. The method is based on Wornell's modelling, which can construct a 1/f process in terms of uncorrelated variables and is well suited to treating 1/f processes. Finally, we further describe our method through simulation.
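
A wavelet-domain method of this kind starts from a multilevel wavelet decomposition of the observed signal. The following is a minimal sketch of such a decomposition, not the paper's algorithm: the choice of the Haar wavelet, the function names, and the requirement that the signal length be divisible by 2 to the power of the number of levels are all assumptions made for illustration.

```python
import math

def haar_dwt(signal, levels):
    """Multilevel orthonormal Haar transform.

    Returns the final approximation and a list of detail bands,
    finest scale first. Assumes len(signal) is divisible by 2**levels.
    """
    scale = math.sqrt(2.0)
    approx = list(signal)
    details = []
    for _ in range(levels):
        half = len(approx) // 2
        next_approx = [(approx[2 * i] + approx[2 * i + 1]) / scale for i in range(half)]
        detail = [(approx[2 * i] - approx[2 * i + 1]) / scale for i in range(half)]
        details.append(detail)
        approx = next_approx
    return approx, details

def haar_idwt(approx, details):
    """Invert haar_dwt, rebuilding the signal from coarsest to finest scale."""
    scale = math.sqrt(2.0)
    current = list(approx)
    for detail in reversed(details):
        rebuilt = []
        for a, d in zip(current, detail):
            rebuilt.append((a + d) / scale)
            rebuilt.append((a - d) / scale)
        current = rebuilt
    return current

x = [4.0, 6.0, 10.0, 12.0, 8.0, 6.0, 5.0, 5.0]
approx, details = haar_dwt(x, levels=3)
reconstructed = haar_idwt(approx, details)
```

Because the transform is orthonormal, each detail band can be processed separately, for example scaled or thresholded per level, before inverting. Any such per-scale rule is a further assumption and is not taken from the abstract.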


Aerodynamic Features and Voice Therapy Interventions of Functional Voice Disorder after Thyroidectomy (갑상선 절제 술 후 기능적 음성장애의 공기역학적 특징과 음성치료 중재)

  • Lee, Chang-Yoon;An, Soo-Youn;Chang, Hyun;Jeong, Hee Seok;Son, Hee Young
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.26 no.1 / pp.25-33 / 2015
  • Background and Objectives: The objective of this study was to investigate the features of post-thyroidectomy subjective voice disorder, as measured by the Voice Handicap Index (VHI) and Voice Symptom Scale (VOISS), through aerodynamic analysis, and to investigate appropriate voice therapy interventions. Materials and Methods: Twenty post-thyroidectomy patients in whom laryngeal stroboscopy showed no recurrent laryngeal nerve paralysis were enrolled in this study. Acoustic and aerodynamic evaluations were performed before the operation and at 2 weeks and 3 months after the operation. Subjective voice evaluation was performed with VHI and VOISS. Aerodynamic evaluation compared and analysed maximum phonation time (MPT), phonation threshold pressure (PTP), mean air flow rate (MFR), etc. To evaluate patients' symptoms related to functional voice disorder, scores on the physical domain of VHI and on selected VOISS items were compared across sessions. Results: The 10 of 20 participants who complained of voice symptoms showed no significant difference from pre-operation in acoustic evaluation, but all showed higher scores at 2 weeks and 3 months after the operation than before it in the VHI physical domain and the selected VOISS items. They showed reduced MPT and increased PTP simultaneously. Laryngeal massage and breathing training were given to them together, and 5 participants showed improvement in MPT and PTP compared with pre-treatment. Conclusion: Patients who complained of voice change with no organic damage after thyroidectomy all showed reduced MPT, with increased PTP in some, on aerodynamic evaluation. Reduced MPT may imply a problem in air flow beneath the glottis. Increased PTP suggests that vocalization requires much more effort than before the operation. Comparing aerodynamic evaluations after thyroidectomy may provide information on behavioral interventions. Additionally, a study of combined laryngeal massage and breathing training for patients with this voice disorder needs to be conducted with a larger number of participants.


An Implementation of Speech DB Gathering System Using VoiceXML (VoiceXML을 이용한 음성 DB 수집 시스템 구현)

  • Kim Dong-Hyun;Roh Yong-Wan;Hong Kwang-Seok
    • Journal of Internet Computing and Services / v.6 no.1 / pp.39-50 / 2005
  • A speech DB is a basic requirement for research in phonetics, speech recognition, speech synthesis, and related fields. The quantity and quality of the speech DB determine the efficiency of the system being developed, so the speech DB is an extremely important factor. With the recent development of various telephone service technologies such as voice portals, collecting telephone speech DBs has become necessary. Existing IVR-based telephone speech DB collection systems were written in C/C++ or with proprietary development tools, which makes it difficult to reuse resources across application services and costs considerable labor and time. VoiceXML, by contrast, is a tag-based language built on XML with an easy and simple grammar. It can be written with little effort, which reduces labor and time. VoiceXML is also well suited to gathering varied telephone speech DBs because the contents of the DB can be changed easily. In this paper, we introduce a telephone speech DB gathering system, which is the most important factor in developing speech information processing technology.


Mobile Voice Note File Management Service For Improving Accessibility of the Blind (전맹인의 접근성 향상을 위한 모바일 음성 메모 파일 관리 서비스)

  • Lim, Soon-Bum;Lee, Mi Ji;Choi, Yoo Jin;Yook, Juhye;Park, Joo Hyun;Lee, Jongwoo
    • Journal of Korea Multimedia Society / v.22 no.11 / pp.1215-1222 / 2019
  • Recently, people with disabilities also search for and collect information from the web through smart devices, and save the collected information on their devices or take notes. For non-disabled people, various memo applications are available on the market, so they can conveniently choose one according to their preference. However, existing memo services are of limited use to blind people because they rely heavily on visual information. The problem for blind people using smart devices is that they cannot perceive the screen, so they cannot tell where an application's menus are located. In addition, entering and manipulating text is difficult, and systematic file management and control are not possible. In this paper, we therefore propose the development of a voice memo service that blind people can use with only voice and auditory information and whose menus can be operated with a Bluetooth remote controller. We will develop a system that includes comprehensive voice file management functions for storing, searching, playing, and deleting files, rather than one that simply stores voice files.