Search | Korea Science

Development of Voice Activated Universal Remote Control System using the Speaker Adaptation (화자적응을 이용한 음성인식 제어시스템 개발)

Kim Yong-Pyo;Yoon Dong-Han;Choi Un-Ha
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.10 no.4
- /
- pp.739-743
- /
- 2006
In this paper, development of voice activated Universal Remote Control using the Neural Networks. A speaker dependent system is developed to operate for a single speaker. These systems are usually easier to develop, cheaper to buy and more accurate, but not as flexible as speaker adaptive or speaker independent systems. A speaker independent system is developed to operate for any speaker of a particular type (e.g. American English). These systems are the most difficult to develop, most expensive and accuracy is lower than speaker dependent systems. However, they are more flexible. A speaker adaptive system is developed to adapt its operation to the characteristics of new speakers. It's difficulty lies somewhere between speaker independent and speaker dependent systems. This paper is developed Speaker Adaptation using the Neural Networks.
PDF KSCI

Large Scale Voice Dialling using Speaker Adaptation (화자 적응을 이용한 대용량 음성 다이얼링)

Kim, Weon-Goo
- Journal of Institute of Control, Robotics and Systems
- /
- v.16 no.4
- /
- pp.335-338
- /
- 2010
A new method that improves the performance of large scale voice dialling system is presented using speaker adaptation. Since SI (Speaker Independent) based speech recognition system with phoneme HMM uses only the phoneme string of the input sentence, the storage space could be reduced greatly. However, the performance of the system is worse than that of the speaker dependent system due to the mismatch between the input utterance and the SI models. A new method that estimates the phonetic string and adaptation vectors iteratively is presented to reduce the mismatch between the training utterances and a set of SI models using speaker adaptation techniques. For speaker adaptation the stochastic matching methods are used to estimate the adaptation vectors. The experiments performed over actual telephone line shows that proposed method shows better performance as compared to the conventional method. with the SI phonetic recognizer.
https://doi.org/10.5302/J.ICROS.2010.16.4.335 인용 PDF KSCI

A Study on Color Code Control Connected with Sound Source and Sensitivity of PA Speaker facility attachable LED Patch (PA스피커 시설물 부착형 LED패치의 음원감성 연계형 컬러코드 제어에 관한 연구)

Kim, Youngmin;Shin, Jaekwon;Cha, Jaesang
- Journal of Satellite, Information and Communications
- /
- v.10 no.3
- /
- pp.22-25
- /
- 2015
This paper performs Color Code Control Connected with Sound Source and Sensitivity of PA Speaker facility attachable LED patch. PA speaker delivers the technology to control the color code of LED patch along the present PA speakers for the facility-attached, LED the development of the patch. PA speakers facility attachable color code control technology of LED patch detects the sound from the PA speaker using a check, and if the analog signal source is detected (sound source)by converting the digital signal passes to the main controller can control the color and pattern of LED patches. In this paper, based on the PA speakers LED color control system, sound emotional linkage-type, and follow the lead of the PA speakers through the feelings can effectively channel LED linked to the source type and proceed to experiment with color and emotion control, whether or not they offer via the color control technology LED patch availability. PA speaker facility attachble color code control technology of LED patch connected with the source and future research directions in the field, and as the application is expected to be able to be widely utilized.
PDF KSCI

Active Noise Control in a Duct With Reflected Wave (반사파가 있는 관내의 능동 소음제어)

오상헌;김양한
- Journal of KSNVE
- /
- v.4 no.2
- /
- pp.187-198
- /
- 1994
This study is to describe the effects of the duct termination conditions conditions upon the active noise attenuation system. The adaptive filtering algorithm using FIR filter is implemented for duct noise attenuation. To avoid the instability caused by the acoustic feedback, two methods are considered. One is to use a compensating FIR filter. The other is to utilize uni-directional detecting microphone and uni-directional control speaker. Experimental results show that the reflections of sound from duct terminations greatly reduce the performance of ANC system. The directionality of detecting microphone and control speaker is a major factor to decide ANC performance. When there are some reflections from both duct terminations, the noise attenuation using finite FIR filter is not enough to model the duct plant. Especially, the reflection from the upstream termination reduces the noise attenuation in the frequencies related to the distance between control speaker and upstream termination. The performance of the noise attenuation is found to be largely enhanced by using uni-directional coupler, both on detecting microphone and control speaker, even if the duct system has an arbitrary termination conditions.
PDF

Development of the hybrid-type ultrasound speaker (하이브리드형 초음파 스피커 개발)

Lee, Hyoung-Sang;Kim, Bok-Kyu
- The Journal of the Acoustical Society of Korea
- /
- v.40 no.3
- /
- pp.247-253
- /
- 2021
Directional ultrasonic speakers that are used to hear sound only in a specific area have been continuously researched on various improvements in terms of sound quality and cost compared to general speakers. In this paper, we propose a DSP based hybrid-type ultrasonic speaker that can be heard at the same time as a general speaker in order to compensate for the sound in the low-band range, considering that it is difficult to hear the low-band sound below 500 Hz due to the sensor characteristics of the ultrasonic speaker. In the case of the system that is implemented by simply connecting a general speaker and an ultrasonic speaker, there are issues of high cost and difficulties of control as two amplifiers are used to playback ultrasonic and general sound sources. In addition, sound quality deteriorates due to the difference in playback time between ultrasonic and general sound sources. In order to improve issues of cost, control and sound quality, we developed hybrid-type ultrasonic speaker with a DSP based amplifier that can simultaneously playback by synchronizing the general sound source with the regenerated ultrasonic sound source, in addition to implement the existing CODEC functions such as Dynamic Range Control (DRC) and Equalizer (EQ).
https://doi.org/10.7776/ASK.2021.40.3.247 인용 PDF KSCI

An Implementation of Real-Time Speaker Verification System on Telephone Voices Using DSP Board (DSP보드를 이용한 전화음성용 실시간 화자인증 시스템의 구현에 관한 연구)

Lee Hyeon Seung;Choi Hong Sub
- MALSORI
- /
- no.49
- /
- pp.145-158
- /
- 2004
This paper is aiming at implementation of real-time speaker verification system using DSP board. Dialog/4, which is based on microprocessor and DSP processor, is selected to easily control telephone signals and to process audio/voice signals. Speaker verification system performs signal processing and feature extraction after receiving voice and its ID. Then through computing the likelihood ratio of claimed speaker model to the background model, it makes real-time decision on acceptance or rejection. For the verification experiments, total 15 speaker models and 6 background models are adopted. The experimental results show that verification accuracy rates are 99.5% for using telephone speech-based speaker models.
PDF

Dysarthric speaker identification with different degrees of dysarthria severity using deep belief networks

Farhadipour, Aref;Veisi, Hadi;Asgari, Mohammad;Keyvanrad, Mohammad Ali
- ETRI Journal
- /
- v.40 no.5
- /
- pp.643-652
- /
- 2018
Dysarthria is a degenerative disorder of the central nervous system that affects the control of articulation and pitch; therefore, it affects the uniqueness of sound produced by the speaker. Hence, dysarthric speaker recognition is a challenging task. In this paper, a feature-extraction method based on deep belief networks is presented for the task of identifying a speaker suffering from dysarthria. The effectiveness of the proposed method is demonstrated and compared with well-known Mel-frequency cepstral coefficient features. For classification purposes, the use of a multi-layer perceptron neural network is proposed with two structures. Our evaluations using the universal access speech database produced promising results and outperformed other baseline methods. In addition, speaker identification under both text-dependent and text-independent conditions are explored. The highest accuracy achieved using the proposed system is 97.3%.
https://doi.org/10.4218/etrij.2017-0260 인용 PDF KSCI

Implementation of Real-time Wheel Order Recognition System Based on the Predictive Parameters for Speaker's Intention

Moon, Serng-Bae;Jun, Seung-Hwan
- Journal of Navigation and Port Research
- /
- v.35 no.7
- /
- pp.551-556
- /
- 2011
In this paper new enhanced post-process predicting the speaker's intention was suggested to implement the real-time control module for ship's autopilot using speech recognition algorithm. The parameter was developed to predict the likeliest wheel order based on the previous order and expected to increase the recognition rate more than pre-recognition process depending on the universal speech recognition algorithms. The values of parameter were assessed by five certified deck officers being good at conning vessel. And the entire wheel order recognition process were programmed to TMS320C5416 DSP so that the system could recognize the speaker's orders and control the autopilot in real-time. We conducted some experiments to verify the usefulness of suggested module. As a result, we have confirmed that the post-recognition process module could make good enough accuracy in recognition capabilities to realize the autopilot being operated by the speech recognition system.
https://doi.org/10.5394/KINPR.2011.35.7.551 인용 PDF KSCI

Development of a Work Management System Based on Speech and Speaker Recognition

Gaybulayev, Abdulaziz;Yunusov, Jahongir;Kim, Tae-Hyong
- IEMEK Journal of Embedded Systems and Applications
- /
- v.16 no.3
- /
- pp.89-97
- /
- 2021
Voice interface can not only make daily life more convenient through artificial intelligence speakers but also improve the working environment of the factory. This paper presents a voice-assisted work management system that supports both speech and speaker recognition. This system is able to provide machine control and authorized worker authentication by voice at the same time. We applied two speech recognition methods, Google's Speech application programming interface (API) service, and DeepSpeech speech-to-text engine. For worker identification, the SincNet architecture for speaker recognition was adopted. We implemented a prototype of the work management system that provides voice control with 26 commands and identifies 100 workers by voice. Worker identification using our model was almost perfect, and the command recognition accuracy was 97.0% in Google API after post- processing and 92.0% in our DeepSpeech model.
https://doi.org/10.14372/IEMEK.2021.16.3.89 인용 PDF KSCI

Implementation of a Speaker-independent Speech Recognizer Using the TMS320F28335 DSP (TMS320F28335 DSP를 이용한 화자독립 음성인식기 구현)

Chung, Ik-Joo
- Journal of Industrial Technology
- /
- v.29 no.A
- /
- pp.95-100
- /
- 2009
In this paper, we implemented a speaker-independent speech recognizer using the TMS320F28335 DSP which is optimized for control applications. For this implementation, we used a small-sized commercial DSP module and developed a peripheral board including a codec, signal conditioning circuits and I/O interfaces. The speech signal digitized by the TLV320AIC23 codec is analyzed based on MFCC feature extraction methed and recognized using the continuous-density HMM. Thanks to the internal SRAM and flash memory on the TMS320F28335 DSP, we did not need any external memory devices. The internal flash memory contains ADPCM data for voice response as well as HMM data. Since the TMS320F28335 DSP is optimized for control applications, the recognizer may play a good role in the voice-activated control areas in aspect that it can integrate speech recognition capability and inherent control functions into the single DSP.
PDF

Search Result 163, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)