통합 검색 | Korea Science

WDCT(Warped Discrete Cosine Transform)를 이용한 영상 압축 알고리듬 (An Image Compression Algorithm Using the WDCT (Warped Discrete Cosine Transform))

- 한국통신학회논문지
- /
- 제24권12B호
- /
- pp.2407-2414
- /
- 1999
본 논문에서는 WDCT(Warped Discrete Cosine Transform)의 개념에 대해서 소개하고 이의 응용분야로서 WDCT를 이용한 영상 압축 알고리듬을 제시한다. WDCT는 기존의 일반적인 DCT와 주파수 특성이 하나의 파라미터로 조절되는 IIR(infinte impulse response) 전대역 통과 필터(all-pass filter)를 직렬로 연결한 변환이다. 제시된 영상 압축 알고리듬에서는 필터의파라미터가 미리 정의된 범위 내에서 조절되도록 한다. 각 영상의 블록에 대해서 주어진 범위 내에서 가장 좋은 파라미터가 선정되면 이를 이용한 WDCT의 결과와 이 파라미터를 디코더로 전송한다. 본 논문에서는 IIR 전대역 통과 필터링 과정을 하나의 행렬로 대체하거나 DCT를 필터뱅크로 보아 IIR 필터와 DCT의 결합을 일반적인 DCT와 마찬가지로 하나의 행렬로 표현하였다. 따라서 주어진 파라미터에 따라 각각 다른 새로운 WDCT 행렬을 정의할 수 있으므로 WDCT의 결과는 행렬과 벡터의 곱으로 얻어진다. WDCT를 이용한 영상 압축의 결과는 높은 비트율과 고주파 성분이 많은 영상에 대하여 DCT의 성능보다 우수함을 알 수 있었다.
PDF

Warped Common Acoustical Pole and Zero 방법을 이용한 효율적인 공간 등화 (Effective Room Equalization Using Warped Common Acoustical Pole and Zero)

이준호;박영철;윤대희;이석필
- 한국음향학회지
- /
- 제28권1호
- /
- pp.51-60
- /
- 2009
본 논문에서는 warped common acoustical pole and zero (WCAPZ) 모델링 방법을 이용한 새로운 공간 등화 방법을 제안한다. 제안한 방법은 저주파 영역의 등화 성능을 감소시키지 않으면서 등화 필터의 차수를 줄일 수 있는 장점을 가진다. 따라서 제안된 공간 등화 시스템은 기존의 블록 변환 방법에 비해 연산량은 비슷하면서도 적은 입출력 지연을 가지게 된다. 컴퓨터 모의실험을 통해 제안된 방법이 기존의 기법에 비해 저주파 영역에서 좋은 공간 등화 성능을 보임을 검증하였다.
https://doi.org/10.7776/ASK.2009.28.1.051 인용 PDF KSCI

On the Use of Various Resolution Filterbanks for Speaker Identification

Lee, Bong-Jin;Kang, Hong-Goo;Youn, Dae-Hee
- The Journal of the Acoustical Society of Korea
- /
- 제26권3E호
- /
- pp.80-86
- /
- 2007
In this paper, we utilize generalized warped filterbanks to improve the performance of speaker recognition systems. At first, the performance of speaker identification systems is analyzed by varying the type of warped filterbanks. Based on the results that the error pattern of recognition system is different depending on the type of filterbank used, we combine the likelihood values of the statistical models that consist of the features extracting from multiple warped filterbanks. Simulation results with TIMIT and NTIMIT database verify that the proposed system shows relative improvement of identification rate by 31.47% and 15.14% comparing it to the conventional system.
PDF KSCI

WFIR 구조를 이용한 바이노럴 필터 설계 (Binaural Filter Design using Warped FIR Structure)

김동현
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1998년도 학술발표대회 논문집 제17권 1호
- /
- pp.193-196
- /
- 1998
지금까지 바이노럴 필터 설계 방법들의 대부분은 linear frequency scale을 이용한 것이지만, 사람의 귀는 non-linear frequency scale을 가지며 critical band에 의한 청각정보를 인지한다. 따라서, 이와 같은 특징을 이용하여 좀 더 효율적으로 바이노럴 필터를 설계할 수 있다. 본 논문에서는 frequency warping을 이용해 non-linear frequency resolution을 갖는 바이노럴 필터를 계산한다. 또한, 종래의 설계방법에 의한 필터와 warped FIR 구조를 갖는 바이노럴 필터와의 비교청취를 통해 성능의 비교 평가를 수행 한다.
PDF

적응적인 선형 보간을 이용한 부화소 기반 영상 확대 (Sub-pixel Image Magnification Using Adaptive Linear Interpolation)

유훈
- 한국멀티미디어학회논문지
- /
- 제9권8호
- /
- pp.1000-1009
- /
- 2006
본 논문에서는 부화소 단위의 적응적인 선형 보간법을 제안한다. 보통의 선형 보간법에 화소 마다 매개변수가 도입되고 이 매개 변수를 최적으로 구하기 위해서 저역 필터와 MMSE (minimum mean square error) 방법을 이용한 일반적인 보간 구조를 제안한다. 또한 제안된 일반적인 적응 선형 보간 구조에서 복잡도를 최소화한 방법을 유도하여 간단한 닫힌 형태의 식으로 제시한다. 기존 방법인 보통의 선형 보간법, 3차 컨볼루션 보간법에 비교하여 주관적으로나 객관적으로 제안된 방법의 우수함을 실험 결과로 알 수 있을 뿐만 아니라 왜곡 거리 선형 보간법(warped distance linear interpolation), 이동 선형 보간법(shifted linear interpolation) 등의 최근 기술과 비교하여도 우수함을 실험결과는 보여준다.
PDF

Statistical Model-Based Noise Reduction Approach for Car Interior Applications to Speech Recognition

Lee, Sung-Joo;Kang, Byung-Ok;Jung, Ho-Young;Lee, Yun-Keun;Kim, Hyung-Soon
- ETRI Journal
- /
- 제32권5호
- /
- pp.801-809
- /
- 2010
This paper presents a statistical model-based noise suppression approach for voice recognition in a car environment. In order to alleviate the spectral whitening and signal distortion problem in the traditional decision-directed Wiener filter, we combine a decision-directed method with an original spectrum reconstruction method and develop a new two-stage noise reduction filter estimation scheme. When a tradeoff between the performance and computational efficiency under resource-constrained automotive devices is considered, ETSI standard advance distributed speech recognition font-end (ETSI-AFE) can be an effective solution, and ETSI-AFE is also based on the decision-directed Wiener filter. Thus, a series of voice recognition and computational complexity tests are conducted by comparing the proposed approach with ETSI-AFE. The experimental results show that the proposed approach is superior to the conventional method in terms of speech recognition accuracy, while the computational cost and frame latency are significantly reduced.
https://doi.org/10.4218/etrij.10.1510.0024 인용 PDF KSCI

AURORA 잡음 처리 알고리즘을 이용한 전화망 환경에서의 강인한 음성 검출 (Robust Speech Detection Using the AURORA Front-End Noise Reduction Algorithm under Telephone Channel Environments)

서영주;지미경;김회린
- 대한음성학회지:말소리
- /
- 제48호
- /
- pp.155-173
- /
- 2003
This paper proposes a noise reduction-based speech detection method under telephone channel environments. We adopt the AURORA front-end noise reduction algorithm based on the two-stage mel-warped Wiener filter approach as a preprocessor for the frequency domain speech detector. The speech detector utilizes mel filter-bank based useful band energies as its feature parameters. The preprocessor firstly removes the adverse noise components on the incoming noisy speech signals and the speech detector at the next stage detects proper speech regions for the noise-reduced speech signals. Experimental results show that the proposed noise reduction-based speech detection method is very effective in improving not only the performance of the speech detector but also that of the subsequent speech recognizer.
PDF

고해상도 스테레오 카메라와 저해상도 깊이 카메라를 이용한 다시점 영상 생성 (Multi-view Generation using High Resolution Stereoscopic Cameras and a Low Resolution Time-of-Flight Camera)

이천;송혁;최병호;호요성
- 한국통신학회논문지
- /
- 제37권4A호
- /
- pp.239-249
- /
- 2012
최근 자연스러운 3차원 영상의 재현을 위하여 깊이영상을 이용한 영상합성 방법이 널리 이용되고 있다. 깊이영상은 시청자의 눈에 보이지는 않지만 합성영상의 화질을 결정하는 중요한 정보이므로 정확한 깊이영상을 획득하는 것이 중요하다. 특히 적외선 센서를 이용한 깊이 카메라(time-of-flight camera)는 보다 정확한 깊이영상을 획득하는데 이용되고 있다. 깊이 카메라는 스테레오 정합(stereo matching)에 비해 정확하고 실시간으로 깊이정보를 추출할 수 있지만, 제공되는 해상도가 너무 낮다는 단점이 있다. 본 논문에서는 단시점의 깊이영상을 두 시점의 깊이영상으로 확장하고, 이를 이용하여 여러 시점의 중간영상을 생성하는 시스템을 제안한다. 특히 복잡도를 낮춰 빠른 속도로 다시점 영상을 생성하는 시스템을 제안한다. 고해상도의 컬러 영상을 획득하기 위하여 두 대의 컬러 카메라를 설치하고 중간에 깊이 카메라를 획득한다. 그리고 깊이 카메라에서 획득한 깊이영상을 3차원 워핑을 이용하여 양쪽의 컬러 카메라의 위치로 시점 이동한다. 깊이영상과 컬러영상간의 객체 불일치 문제는 깊이값의 신뢰 도를 기반으로 한 조인트 양방향 필터(joint bilateral filter)를 이용하여 보정한다. 이러한 과정을 통해 얻은 깊이영상은 다시점 영상 합성 방법을 이용하여 다시점 영상을 획득한다. 이와 같은 과정은 다중 스레드를 이용하여 빠르게 처리할 수 있도록 구현했다. 실험을 통해 두 시점의 컬러영상과 두 시점의 깊이영상이 실시간으로 획득했고, 약 7 fps의 프레임율로 10시점의 중간시점을 동시에 생성했다.
https://doi.org/10.7840/KICS.2012.37A.4.239 인용 PDF KSCI

검색결과 8건 처리시간 0.028초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)