Search | Korea Science

Study of Speech Recognition System Operation for Voice-driven UAV Control (음성 기반 무인 항공기 제어를 위한 음성인식 시스템 운용 체계 연구)

Park, Jeong-Sik
- Journal of the Korean Society for Aeronautical & Space Sciences / v.47 no.3 / pp.212-219 / 2019
As unmanned aerial vehicles (UAVs) have come into use for military operations, efficient ways of controlling them have become necessary. In particular, speech recognition based UAV control, rather than the conventional console-based approach, is essential in military environments that require rapid command operation. However, this approach has not yet been actively studied. In this study, we introduce efficient ways of operating a speech recognition system for voice-driven UAV control, focusing on mission command control from a manned aircraft rather than from a ground control center. We propose an efficient system operation scheme for UAV control in which the aircraft and the UAV cooperate, and verify its efficiency through a speech recognition experiment.
https://doi.org/10.5139/JKSAS.2019.47.3.212

Implementation of an Efficient Voice Transmission System in Bluetooth Network Environments (블루투스 네트워크 환경에서의 효율적인 음성전송 시스템 구현)

Kim, Myung-Jong;Park, Ji-Hun;Kim, Hong-Kook
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2008.02a / pp.125-128 / 2008
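With the commercialization of IPTV, interactive services based on information exchange between the user and the TV are being offered, and speech recognition in particular is emerging as one of the key technologies for realizing such services. Performing speech recognition on a TV requires a short-range wireless communication method that can efficiently transmit the user's voice to the TV within a confined space such as a home. In particular, it must be implementable in a low-power system environment such as a remote controller. Hence there is a need for a voice transmission system that achieves optimal performance under these constraints. In this paper, we implement in real time a voice transmission system needed for speech recognition in a Bluetooth environment. G.711 is used as the base codec for efficient voice transmission, and the G.711 packet loss concealment algorithm is applied to the voice transmission system to reduce the degradation in voice quality caused by packet loss during transmission. Specifically, to run the G.711 packet loss concealment algorithm, the RTP protocol is applied at the application layer of the Bluetooth protocol stack to check whether packets have been lost, and when loss occurs the packet loss concealment algorithm reduces the degradation in voice quality. In a performance evaluation of the implemented system, applying the G.711 packet loss concealment algorithm yielded a 14.7% improvement in voice quality under packet loss rates of 2 to 10%.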
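The loss-detection step described in this abstract can be illustrated with a minimal Python sketch. It assumes decoded PCM frames keyed by an RTP-style sequence number, and it fills each gap with an attenuated copy of the previous frame. This is a deliberately simple stand-in, not the G.711 packet loss concealment algorithm used in the paper; the function name `conceal_stream`, the 80-sample frame size, and the attenuation factor are all hypothetical, and sequence-number wraparound is ignored.

```python
from typing import Dict, List

FRAME_SAMPLES = 80  # assumed frame size: 10 ms at 8 kHz


def conceal_stream(packets: Dict[int, List[int]], first_seq: int, last_seq: int,
                   attenuation: float = 0.5) -> List[List[int]]:
    """Rebuild a frame sequence, filling gaps in the sequence numbers.

    `packets` maps a sequence number to decoded PCM samples. A missing
    number is treated as a lost packet and replaced by an attenuated copy
    of the previous output frame (silence if there is no previous frame).
    """
    output: List[List[int]] = []
    previous = [0] * FRAME_SAMPLES
    for seq in range(first_seq, last_seq + 1):
        if seq in packets:
            frame = packets[seq]
        else:
            # Lost packet: repeat the last frame at reduced amplitude so that
            # consecutive losses fade out rather than repeat at full level.
            frame = [int(s * attenuation) for s in previous]
        output.append(frame)
        previous = frame
    return output
```

Calling `conceal_stream({1: a, 3: c}, 1, 4)` returns four frames, with frames 2 and 4 synthesized from their predecessors.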

Design of Voice Activity Detection Algorithm for Variable Rate Speech Coders (가변전송률 음성부호화기 적용을 위한 음성활성도 측정 알고리즘 설계)

김재원
- The Journal of Korean Institute of Communications and Information Sciences / v.26 no.9A / pp.1451-1458 / 2001
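The ultimate goal of voice service, the most frequently used service in digital mobile communication systems, is to provide good speech quality and high spectral efficiency. Speech can be represented as a repetition of short, intermittent bursts of speech energy separated by silent intervals; in an actual voice call, active speech is present about 40% of the time, and the remaining 60% is silence or time spent listening to the other party. Exploiting these silent intervals efficiently yields spectral gain for the system. In this paper, we design a voice activity detection scheme that operates robustly even in widely varying background noise environments, such as those of digital mobile communication systems, and that is applicable to speech coders with a 10 msec frame size. The designed algorithm uses speech energy, spectral distribution, zero-crossing rate, and a peakiness measure of the LPC residual signal.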
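As a rough illustration of frame-based voice activity detection, the following Python sketch classifies 10 msec frames using only two of the four features named in the abstract: short-time energy and zero-crossing rate. The fixed thresholds, the 8 kHz sampling rate implied by the 80-sample frame, and the function names are assumptions made for this example. The paper's algorithm additionally uses spectral distribution and LPC residual peakiness, and is designed to adapt to changing noise, which this sketch does not attempt.

```python
from typing import List, Sequence


def frame_energy(frame: Sequence[float]) -> float:
    """Mean squared amplitude of one frame."""
    return sum(s * s for s in frame) / len(frame)


def zero_crossing_rate(frame: Sequence[float]) -> float:
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def detect_voice(samples: Sequence[float], frame_len: int = 80,
                 energy_threshold: float = 1.0e4,
                 zcr_threshold: float = 0.35) -> List[bool]:
    """Return one active/inactive decision per complete frame.

    A frame counts as active when its energy is high and its zero-crossing
    rate is low, a crude rule that favors voiced speech. Both thresholds
    are illustrative values, not taken from the paper.
    """
    decisions: List[bool] = []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        active = (frame_energy(frame) > energy_threshold
                  and zero_crossing_rate(frame) < zcr_threshold)
        decisions.append(active)
    return decisions
```

A coder built on such decisions would transmit at a reduced rate, or not at all, for frames marked inactive.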

Service Mechanism for Enhanced Voice Traffic (음성 트래픽 향상을 위한 서비스 메커니즘)

김성태;강현국
- Proceedings of the Korean Information Science Society Conference / 2001.10c / pp.757-759 / 2001
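With the spread and increasing speed of the Internet, the number of users is growing rapidly, and a variety of Internet-based multimedia services are being rolled out accordingly. Voice communication, formerly centered on the PSTN, is also rapidly shifting to Internet-based voice communication, and various standards are emerging to interwork and control it efficiently. In this paper, we examine SIP, the signaling protocol standard for session control among the rapidly developing Internet telephony technologies, review the existing signaling mechanism that uses RSVP to improve quality of service for better voice communication, and propose the most efficient new mechanism for improving the quality of service of voice traffic.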

The Effect of Voice Therapy on Patients with Mutational Dysphonia (변성발성장애 환자에 대한 음성치료의 효과)

표화영
- Proceedings of the KSLP Conference / 1998.11a / pp.197-197 / 1998
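Mutational dysphonia is a voice disorder caused by failure to acquire the appropriate lowering of pitch even after puberty has passed; it can be treated either surgically, through type III thyroplasty, or with voice therapy. This paper focuses on voice therapy and examines its effectiveness in treating mutational dysphonia. (abridged)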

Speech Enhancement Based on Soft Decision for Effective Noise Suppression (효율적인 잡음억제를 위한 Soft Decision 기반의 음성향상 기법)

Lim Hyoung-Keun;Kim Yu-Jin;Chung Jae-Ho
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.47-50 / 2000
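Among methods for obtaining enhanced speech from speech corrupted by uncorrelated additive noise, speech enhancement based on soft decision is known to perform very well. Soft decision processes the noise added to speech in the frequency domain and depends on prior information about the noise environment. In this study, based on soft decision, we apply nonlinear processing to the noise signal added to the speech so that the noise contained in the speech is estimated effectively, and propose a method that suppresses noise efficiently without prior information about the noise environment. The proposed speech enhancement technique outperformed existing methods in subjective quality evaluation.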

Utterance display system for speech data acquisition (음성데이터 수집을 위한 발성내용 제시시스팀)

김경태;이용주;정유현
- The Journal of the Acoustical Society of Korea / v.12 no.1 / pp.5-11 / 1993
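This paper describes the implementation of an utterance display system for collecting natural speech data from speakers. Such a system is essential for collecting and processing large amounts of speech information, because the measured performance of speech information processing depends on the speech data and the manner of speaking, so objective results can be obtained only by evaluating with natural speech as used in real environments. We therefore describe an utterance display system as a way to collect such speech data efficiently. In particular, this paper covers the requirements for presenting the data to be uttered, the system's functions, and its implementation on a PC. The system was implemented with consideration for the convenience not only of the speech collection stage but also of post-collection editing. It was applied to collecting 63,840 words, including four-connected-digit strings, spoken by 96 speakers, and in the collection process it elicited far more efficient and natural utterances than the conventional method of reading from a list.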

Voice Packet Processing Scheme for Voice Quality and Bandwidth Efficiency in VoIP (VoIP의 음성품질/대역효율 개선을 위한 음성패킷 처리)

Kim, Jae-Won;Sohn, Dong-Chul
- Journal of Korea Multimedia Society / v.7 no.7 / pp.896-904 / 2004
In this paper, we present an efficient variable rate speech coder for spectral efficiency and a packet processing technique for packet loss compensation, for a voice codec with a 10 msec frame in VoIP service. By disconnecting users from the spectral resource during silence intervals, which make up about 60% of a call, a variable rate voice coder based on voice activity detection (VAD) can double the spectral gain. The performance of the method was analyzed in terms of the variation of the detected voice activity factor and the degraded speech frame ratio under various background noise levels, and compared with that of G.729B, the ITU-T 8 kbps standard speech codec. The method for compensating lost packets adds recovery data to the main stream and uses an error concealment scheme for speech quality enhancement; its performance is verified by the reconstructed speech quality. The proposed scheme can either double the spectral gain or enhance speech quality by 3 dB using the bandwidth reserved by VAD. Therefore, the proposed method can enhance the spectral efficiency or the speech quality of VoIP.
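One common way to add recovery data to a main stream is to piggyback a copy of the previous frame on each packet, so that a single lost packet can be rebuilt from its successor. The Python sketch below illustrates that idea only. The abstract does not specify the format of the paper's recovery data, so the packet layout and the names `packetize` and `reconstruct` are assumptions made for this example.

```python
from typing import List, Optional, Tuple

# Assumed packet layout: (sequence number, current frame, copy of previous frame)
Packet = Tuple[int, bytes, Optional[bytes]]


def packetize(frames: List[bytes]) -> List[Packet]:
    """Attach to each frame a redundant copy of the frame sent before it."""
    packets: List[Packet] = []
    for seq, frame in enumerate(frames):
        previous = frames[seq - 1] if seq > 0 else None
        packets.append((seq, frame, previous))
    return packets


def reconstruct(received: List[Packet], total: int) -> List[Optional[bytes]]:
    """Rebuild the frame list; None marks a frame that could not be recovered."""
    frames: List[Optional[bytes]] = [None] * total
    for seq, frame, previous in received:
        frames[seq] = frame
        # Use the piggybacked copy only if the original packet never arrived.
        if seq > 0 and previous is not None and frames[seq - 1] is None:
            frames[seq - 1] = previous
    return frames
```

With this layout an isolated loss is fully recovered, while a burst of two or more consecutive losses leaves gaps that would still need error concealment.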

Analysis of VoLTE Charge Reduction under VoLTE Growth (VoLTE 활성화에 따른 요금 인하 여력 분석)

Lee, Sang-Woo;Jeong, Seon-Hwa
- The Journal of Korean Institute of Communications and Information Sciences / v.41 no.1 / pp.92-100 / 2016
Voice over LTE (VoLTE), which delivers voice and messaging over IP networks, is reported to offer better economies of scale than legacy voice service on 2G/3G circuit-switched networks because of its technological and cost efficiency. In addition, because voice and data services run on a single LTE network, VoLTE also offers greater economies of scope. However, no study has examined how much more technologically efficient VoLTE is than circuit-based voice service, or how much voice charges can be reduced as VoLTE grows. This paper empirically analyzes the cost efficiency of VoLTE relative to circuit-based voice service and quantifies the reduction in voice charges as 2G/3G voice traffic shifts to VoLTE. The results show, first, that assuming fixed voice traffic, the average cost of total voice traffic rises briefly just after the investment in the LTE network needed to provide VoLTE, but capacity to reduce charges soon becomes available owing to VoLTE's outstanding cost efficiency; and second, that charges can be cut to 60% of the current rate if all voice traffic moves to VoLTE. The latter partially supports the validity of data-centric pricing plans. Our results are expected to serve as basic data for network operators establishing pricing strategies and for policy makers seeking to induce price cuts.
https://doi.org/10.7840/kics.2015.41.1.92

An Efficient Approach for Noise Robust Speech Recognition by Using the Deterministic Noise Model (결정적 잡음 모델을 이용한 효율적인 잡음음성 인식 접근 방법)

정용주
- The Journal of the Acoustical Society of Korea / v.21 no.6 / pp.559-565 / 2002
In this paper, we propose an efficient method for estimating the HMM (Hidden Markov Model) parameters of noisy speech. In previous methods, noisy speech HMM parameters are usually obtained analytically from assumed noise statistics. However, because these methods rely on simplifying assumptions, it is difficult for them to approximate the real statistics of noisy speech closely. Instead of relying on such simplifications, we use statistics from the clean speech HMMs and employ a deterministic noise model. We found that the new scheme gives improved results at reduced computational cost.
