통합 검색 | Korea Science

디지털 통신 시스템에서의 음성 인식 성능 향상을 위한 전처리 기술 (Pre-Processing for Performance Enhancement of Speech Recognition in Digital Communication Systems)

서진호;박호종
- 한국음향학회지
- /
- 제24권7호
- /
- pp.416-422
- /
- 2005
디지털 통신 시스템에서의 음성 인식은 음성 부호화기에 의한 음성 신호의 왜곡으로 인하여 성능이 크게 저하된다. 본 논문에서는 음성 부호화기에 의한 스펙트럼 왜곡을 분석하고 왜곡된 주파수 정보를 보상하는 전처리 과정을 통하여 음성 인식 성능을 향상시키는 방법을 제안한다. 현재 널리 사용되는 표준 음성 부호화기인 IS-127 EVRC, ITU G.729 CS-ACELP. IS-96 QCELP를 사용하여 부호화에 의한 왜곡을 분석하고, 모든 음성 부호화기에 공통으로 적용하여 왜곡을 보상할 수 있는 전처리 방법을 개발하였다. 본 논문에서 제안하는 왜곡 보상 방법을 세 종류의 음성부호화기에 각각 적용하였으며, 왜곡된 음성 신호에 대한 음성 인식률에 비하여 최대 $15.6\%$의 인식률 향상을 얻을 수 있었다.
PDF KSCI

다채널 영상 감시 시스템을 위한 다중 포맷 동영상 저장 DirectShow Filter설계 및 구현 (MultiFormat motion picture storage subsystem using DirectShow Filters for a Mutichannel Visual Monitoring System)

정연권;하상석;정선태
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
- /
- pp.113-116
- /
- 2002
Windows provides Directshow for efficient multimedia streaming processings such as multimedia capture, storage, display and etc. Presently, many motion picture codecs and audio codecs are made to be used in Directshow framework and Windows also supports many codecs (MPEG4, H,263, WMV, WMA, ASF, etc.) in addition to a lot of useful tools for multimedia streaming processing. Therefore, Directshow can be effectively utilized for developing windows-based multimedia streaming applications such as visual monitoring systems which needs to store real-time video data for later retrieval. In this paper, we present our efforts for developing a Directshow Filter System supporting storage of motion pictures in various motion picture codecs. Our Directshow Filter system also provides an additional functionality of motion detection.
PDF

Symmetric Balance Incomplete Block Design Code와 Arrayed-Waveguide Grating을 이용한 Optical CDMA Network Codecs (Optical CDMA Network Codecs with Symmetric Balance Incomplete Block Design Code and Arrayed-Waveguide Grating)

지윤규
- 대한전자공학회논문지SD
- /
- 제49권5호
- /
- pp.22-29
- /
- 2012
본 논문은 symmetric balance incomplete block design(BIBD) code와 arrayed-waveguide grating(AWG) router의 주기적인 특성을 이용하여 optical CDMA network을 위한 coder-decoder(codec)을 구성하였다. 기존의 M-sequence code를 이용한 경우보다 다양한 구성을 할 수 있고 이 시스템의 잡음인 phase-induced intensity noise(PIIN)와 thermal noise를 분석하여 BER을 계산한 결과 향상된 성능을 보임을 알 수 있었다.
PDF KSCI

G.729A에서 EVRC로의 상호부호화 (A Transcoding Algorithm from G.729A to EVRC)

곽영진;정지민;권구락;임정석;황인호;이경훈;고성제
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2003년도 하계종합학술대회 논문집 Ⅳ
- /
- pp.2248-2251
- /
- 2003
Communication between speech networks employing different speech codecs requires interoperability. The cascade connection of two different codecs, called tandem coding, not only degrades speech quality, but also produces high computational loads. These Problems can be solved by using the transcoding algorithm. This paper presents an effective algorithm for transcoding from G.729A to EVRC and its simulation results.
PDF

Review on codec-agnostic approach for MPEG V-PCC

Tianyu, Dong;Jang, Euee S.
- 한국방송∙미디어공학회:학술대회논문집
- /
- 한국방송∙미디어공학회 2020년도 하계학술대회
- /
- pp.510-512
- /
- 2020
In this paper, we reviewed the method on the codec-agnostic design of MPEG V-PCC. The codec-agnostic approach designed V-PCC can use any video codec for compression. Thus, adoption with other codecs beside HEVC has not been systematically discussed. Through the analysis of the design issues that related to MPEG EVC and JPEG. We provided a strategy on choosing and targeting different video codecs for V-PCC.
PDF

Reed-Solomon 부호화/복호화를 위한 DSP 명령어 및 하드웨어 설계 (Design of DSP Instructions and their Hardware Architecture for Reed-Solomon Codecs)

이재성;선우명훈
- 한국통신학회논문지
- /
- 제28권6A호
- /
- pp.405-413
- /
- 2003
본 논문은 오류 정정을 위해 가장 많이 쓰이는 알고리즘 중 하나인 RS (Reed- Solomon) 부호화 및 복호화를 DSP (Digital Signal Processor) 칩에서 효율적으로 구현할 수 있는 새로운 명령어 및 하드웨어 구조를 제안한다. 제안한 구조는 원시 다항식의 변경에 따라 하드웨어를 재 설계할 필요가 없이 DSP 상에서 프로그램으로 변경이 가능하여 다양한 원시 다항식을 구현할 수 있다. 새로운 명령어 및 하드웨어 구조는 유한체 곱셈기 및 가산기를 이용하여 유한체 연산을 수행한다. 따라서, 제안한 DSP 구조는 기존 DSP 칩과 비교하여 복호화 속도를 향상시킬 수 있다. 본 하드웨어 구조는 130MHz 동작 주파수를 갖는 DSP 칩에서 228.1 Mbps의 RS 복호화 성능을 갖는다.
PDF KSCI

Performance Comparison on Speech Codecs for Digital Watermarking Applications

Mamongkol, Y.;Amornraksa, T.
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2002년도 ITC-CSCC -1
- /
- pp.466-469
- /
- 2002
Using intelligent information contained within the speech to identify the specific hidden data in the watermarked multimedia data is considered to be an efficient method to achieve the speech digital watermarking. This paper presents the performance comparison between various types of speech codec in order to determine an appropriate one to be used in digital watermarking applications. In the experiments, the speech signal encoded by four different types of speech codec, namely CELP, GSM, SBC and G.723.1codecs is embedded into a grayscale image, and theirs performance in term of speech recognition are compared. The method for embedding the speech signal into the host data is borrowed from a watermarking method based on the zerotrees of wavelet packet coefficients. To evaluate efficiency of the speech codec used in watermarking applications, the speech signal after being extracted from the attacked watermarked image will be played back to the listeners, and then be justified whether its content is intelligible or not.
PDF

기계를 위한 비디오 부호화 표준화 동향 (Standardization Trends in Video Coding for Machines)

권형진;정세윤;최진수;이태진;서정일
- 전자통신동향분석
- /
- 제35권5호
- /
- pp.102-111
- /
- 2020
An increase in high-quality video service continually leads to the standardization of high-performance video codecs such as the versatile video coding standard. Although such codecs have improved coding efficiency in terms of high fidelity, a tremendous increase in the amount of video data is required for more efficient compression, especially for efficiently recognizing and analyzing the target within the millions of objects/events captured every day, such as those by surveillance systems. Therefore, newly established MPEG standardization efforts have studied the new generation of video compression standards for machine vision-oriented video. This paper presents the standardization trends in video coding for machines and discusses further directions for improvement.
https://doi.org/10.22648/ETRI.2020.J.350509 인용 PDF

Dual-Domain Connection Scheme for HE-AAC and MPEG Surround

Pang, Hee-Suk
- The Journal of the Acoustical Society of Korea
- /
- 제28권1E호
- /
- pp.29-34
- /
- 2009
MPEG4 High Efficiency Advanced Audio Coding (HE-AAC) and MPEG Surround are one of the most efficient combinations for low bit rate multi-channel audio coding. Based on the fact that these two codecs have identical quadrature mirror filter (QMF) analysis and synthesis structures, we propose a dual-domain connection scheme for the codecs. Specifically two time-domain connection methods are analyzed and compared to the QMF subband-domain connection method. Experimental results show that both the time-domain connection methods cause no subjective sound quality degradation compared to the QMF subband-domain connection method, which verifies that one can select either of them depending on application scenarios.
PDF KSCI

13kbps QCELP에서 8kbps QCELP로의 음성 패킷 변환 기술 (Voice Packet Conversion from 13kbps QCELP to 8kbps QCELP Speech Codecs)

박호종;권상철
- 한국음향학회지
- /
- 제18권6호
- /
- pp.71-76
- /
- 1999
디지털 이동 통신 시스템에서 서로 다른 음성 압축기를 사용하는 단말기 사이의 통신은 음성 신호를 두 번의 압축/복원 과정을 거쳐 전달하므로 음질 저하, 계산량 증가, 전달 지연 증가 등의 문제를 발생시킨다. 본 논문에서는 이와 같은 단말기 사이의 통신에서의 문제점을 해결하기 위하여 음성 패킷 변환 방법을 제안하고, 13kbps QCELP 패킷을 8kbps QCELP 패킷으로 변환하는 방법을 개발한다. 여러 음성 신호를 이용한 모의 실험 결과, 본 논문에서 개발된 패킷 변환기가 짧은 음성전달 지연과 약 33%의 계산량으로 일반적인 이중 압축 방법과 동등한 음질의 음성 신호를 합성하는 것을 확인하였다.
PDF

검색결과 114건 처리시간 0.027초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)