Integrated Search | Korea Science

Efficient Motion Compensated Interpolation Technique Using Image Resizing

권혜경;이창우
- Journal of Broadcast Engineering / Vol. 18, No. 4 / pp. 599-608 / 2013
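Motion compensated interpolation is used not only to increase the frame rate of video but also to generate side information in distributed video coding systems. To improve the performance of motion compensated interpolation efficiently, this paper proposes a technique that enlarges the image to twice its size using the DCT (Discrete Cosine Transform) or LiftLT, performs motion compensated interpolation, and then reduces the resulting double-size image to produce an interpolated frame of the original size. We also analyze the performance obtained when fine, sub-pixel motion compensated interpolation is performed with an interpolation filter. Simulation results confirm that the proposed technique outperforms the conventional technique based on interpolation filters.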
https://doi.org/10.5909/JBE.2013.18.4.599
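
The resize step named in the abstract above can be illustrated with a minimal sketch. It assumes a 1-D signal and plain DCT zero-padding and truncation; the paper works on 2-D images and also considers LiftLT, neither of which is shown here. The function names `upscale_2x` and `downscale_2x` are hypothetical, and the motion compensated interpolation that would run between the two calls is omitted.

```python
import math


def dct(x):
    """Orthonormal DCT-II of a list of samples."""
    n = len(x)
    out = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        total = sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                    for i in range(n))
        out.append(scale * total)
    return out


def idct(coeffs):
    """Inverse of the orthonormal DCT-II above."""
    n = len(coeffs)
    out = []
    for i in range(n):
        total = 0.0
        for k in range(n):
            scale = math.sqrt((1 if k == 0 else 2) / n)
            total += scale * coeffs[k] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
        out.append(total)
    return out


def upscale_2x(x):
    """Double the length by zero-padding the DCT coefficients.

    The sqrt(2) factor keeps the mean level unchanged (assumed normalization).
    """
    coeffs = dct(x)
    padded = [math.sqrt(2) * c for c in coeffs] + [0.0] * len(x)
    return idct(padded)


def downscale_2x(y):
    """Halve the length by keeping only the low-frequency half of the DCT."""
    half = len(y) // 2
    coeffs = dct(y)
    return idct([c / math.sqrt(2) for c in coeffs[:half]])
```

With this normalization, `downscale_2x(upscale_2x(x))` returns `x` up to floating-point error, so any difference in the final frame would come only from the processing done at the doubled size.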

Effects of Variable Block Size Motion Estimation in Transform Domain Wyner-Ziv Coding

Kim, Do-Hyeong;Ko, Bong-Hyuck;Shim, Hiuk-Jae;Jeon, Byeung-Woo
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 2009 IWAIT / pp. 381-384 / 2009
In Wyner-Ziv coding, compression performance depends heavily on the quality of the side information, since better side information means less channel noise and fewer parity bits. However, because the decoder generates the side information without any knowledge of the current Wyner-Ziv frame, it has no optimal criterion for deciding which block is more advantageous for generating better side information. Hence, fixed block size motion estimation (ME) is generally performed when generating side information. With fixed block size ME, the best coding performance cannot be attained, since some blocks are better motion estimated with different block sizes. Therefore, if there is a way to find the appropriate ME block size for each block, the quality of the side information might be improved. In this paper, we investigate the effects of variable block sizes of ME in generating side information.
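
To make the role of the block size concrete, here is a minimal sketch of full-search block matching between two key frames using the sum of absolute differences (SAD). It is a generic illustration, not the method evaluated in the paper: the names `prev_key`, `next_key`, `sad`, and `best_motion_vector` are hypothetical, and frames are assumed to be lists of rows of integer pixel values.

```python
def sad(frame_a, frame_b, top_a, left_a, top_b, left_b, size):
    """Sum of absolute differences between two size x size blocks."""
    total = 0
    for r in range(size):
        for c in range(size):
            total += abs(frame_a[top_a + r][left_a + c]
                         - frame_b[top_b + r][left_b + c])
    return total


def best_motion_vector(prev_key, next_key, top, left, size, search_range):
    """Full search for the displacement (dy, dx) into prev_key that best
    matches the block of next_key at (top, left).

    Returns ((dy, dx), cost). Candidates falling outside the frame are skipped.
    """
    height, width = len(prev_key), len(prev_key[0])
    best = (0, 0)
    best_cost = sad(next_key, prev_key, top, left, top, left, size)
    for dy in range(-search_range, search_range + 1):
        for dx in range(-search_range, search_range + 1):
            ref_top, ref_left = top + dy, left + dx
            if (ref_top < 0 or ref_left < 0
                    or ref_top + size > height or ref_left + size > width):
                continue
            cost = sad(next_key, prev_key, top, left, ref_top, ref_left, size)
            if cost < best_cost:
                best, best_cost = (dy, dx), cost
    return best, best_cost
```

Calling `best_motion_vector` on the same position with different `size` values (for example 4 and 8) and comparing the resulting costs is one simple way to see why a single fixed block size may not suit every block.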

A Robust Audio Fingerprinting System with Predominant Pitch Extraction in Real-Noise Environment

Son, Woo-Ram;Yoon, Kyoung-Ro
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 2009 IWAIT / pp. 390-395 / 2009
The robustness of an audio fingerprinting system in a noisy environment is a principal challenge in content-based audio retrieval. The feature selected for the audio fingerprints must be robust to noise, and the computational complexity of the search algorithm must be low enough to run in real time. The audio fingerprint proposed by Philips uses expanded hash table lookup to compensate for errors introduced by noise. The expanded hash table lookup increases the search complexity by a factor of 33 times the degree of expansion defined by the Hamming distance. We propose a new method that improves the noise robustness of audio fingerprinting by using the predominant pitch, which reduces the bit errors in the generated hash values. In our approach, a sub-fingerprint is computed for each time frame of the audio. The time frame is transformed into the frequency domain using the FFT, and the resulting audio spectrum is divided into 33 critical bands. The 32-bit hash value is then computed from the energy differences between bands, and only the bits near the predominant pitch are stored. Predominant pitches are extracted in each time frame of the audio. The extraction process consists of harmonic enhancement, harmonic summation, and selection of a band among the critical bands.
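
The step from 33 band energies to a 32-bit hash can be sketched as follows. The bit rule used here is the one commonly described for the Philips fingerprint (sign of the energy difference between adjacent bands, differenced again against the previous frame); the abstract only says the hash comes from band energy differences, so the time difference is an assumption. The names `sub_fingerprint` and `hamming_distance` are hypothetical, and the predominant-pitch bit selection proposed in the paper is not implemented.

```python
def sub_fingerprint(prev_energies, cur_energies):
    """Build a 32-bit hash from the 33 band energies of two consecutive frames.

    Bit m is 1 when (E_cur[m] - E_cur[m+1]) - (E_prev[m] - E_prev[m+1]) > 0.
    Band 0 ends up in the most significant bit.
    """
    if len(prev_energies) != 33 or len(cur_energies) != 33:
        raise ValueError("expected 33 band energies per frame")
    bits = 0
    for m in range(32):
        diff = ((cur_energies[m] - cur_energies[m + 1])
                - (prev_energies[m] - prev_energies[m + 1]))
        bits = (bits << 1) | (1 if diff > 0 else 0)
    return bits


def hamming_distance(hash_a, hash_b):
    """Number of differing bits between two hash values."""
    return bin(hash_a ^ hash_b).count("1")
```

Noise flips individual bits of the hash, which is why matching is judged by Hamming distance rather than by exact equality.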

A Karaoke system based on the vocal characteristics

김유승;김인철
- Journal of Broadcast Engineering / Vol. 13, No. 3 / pp. 380-387 / 2008
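This paper presents a karaoke system that applies a vocal region detection algorithm based on vocal characteristics. In the proposed system, the input music is classified into vocal and accompaniment parts by the vocal region detection algorithm. The vocal removal technique is then applied only to the vocal regions. Vocal region detection performs the classification in the TICFT (twice iterated composite Fourier transform) domain, taking the characteristics of vocals into account. For vocal removal, the vocal component is extracted from the band-pass filtered vocal regions and subtracted from the original music, yielding music with the vocal component removed. The proposed technique is applied to four songs and its performance is evaluated.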
https://doi.org/10.5909/JBE.2008.13.3.380

Scene Change Detection Using MPEG Bitstream and Sectionally Decoded Video

나윤정;하명환;이상길
- Journal of Broadcast Engineering / Vol. 4, No. 2 / pp. 119-126 / 1999
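We designed a method for detecting scene change points in video quickly and accurately. The method consists of two stages. The first stage determines candidate intervals for scene changes from compressed-domain data extracted by temporally sampling the MPEG-compressed video. The second stage obtains the pixel values of each frame within those intervals and uses them to locate the exact scene change point. In the second stage, scene changes are detected by combining changes in brightness and in edges. We also studied a way to prevent camera flashes from being falsely detected as scene changes. By integrating these methods, we propose a structure that detects scene changes quickly and accurately.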
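
The coarse-to-fine structure of the two stages can be sketched as below. This is a heavy simplification under stated assumptions: a single scalar feature per frame stands in for both the compressed-domain data of the first stage and the brightness and edge measures of the second, and flash rejection is not modeled. The names `candidate_intervals`, `locate_changes`, `step`, and `threshold` are hypothetical.

```python
def candidate_intervals(features, step, threshold):
    """Stage 1: compare frames sampled every `step` frames and keep the
    intervals whose endpoints differ by more than `threshold`."""
    intervals = []
    for start in range(0, len(features) - step, step):
        end = start + step
        if abs(features[end] - features[start]) > threshold:
            intervals.append((start, end))
    return intervals


def locate_changes(features, step, threshold):
    """Stage 2: inside each candidate interval, pick the frame with the
    largest change from its predecessor as the scene change point."""
    changes = []
    for start, end in candidate_intervals(features, step, threshold):
        best = max(range(start + 1, end + 1),
                   key=lambda i: abs(features[i] - features[i - 1]))
        changes.append(best)
    return changes
```

Only the frames inside candidate intervals are examined one by one, which is where the speed advantage of a two-stage design comes from.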

Graph-based Moving Object Detection and Tracking in an H.264/SVC bitstream domain for Video Surveillance

호와리;김문철
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 2012 Summer Conference / pp. 298-301 / 2012
This paper presents a graph-based method for detecting and tracking moving objects in H.264/SVC bitstreams for video surveillance applications, making use of information from the spatial base and enhancement layers of the bitstreams. In the base layer, real moving objects are first segmented using a spatio-temporal graph, with falsely detected objects removed via graph pruning and graph projection. Graph matching then precisely identifies the real moving objects over time, even under occlusion. To detect and track moving objects in the enhancement layer accurately and reliably while reducing computational complexity, the block groups identified for the real moving objects in the base layer are mapped to the enhancement layer, which provides accurate and efficient object detection and tracking in the higher-resolution bitstreams. Experimental results show that the proposed method produces reliable results with low computational complexity in both spatial layers of H.264/SVC test bitstreams.

Luminance-Adaptation Effect Just-Noticeable-Distortion Modeling according to Frequency in The DCT Domain

배성호;김문철
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 2012 Summer Conference / pp. 95-98 / 2012
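This paper proposes an improved two-dimensional luminance adaptation (LA) JND model in the DCT domain that considers both background luminance and frequency. Conventional LA JND models take the form of a U-shaped one-dimensional function: the JND is low when the background luminance is near mid-gray and rises as the background becomes darker or brighter. However, because conventional LA JND models do not reflect the effect of frequency, they are inaccurate as JND models in a frequency domain such as the DCT. The LA JND values as a function of frequency were obtained through actual experiments. In the experiments, an $8 \times 8$ real-valued DCT was applied to uniform images with no spatial complexity at 9 different background luminance levels, and noise on coefficients at 15 different frequencies was then increased until a viewer perceived it, which gave the JND value. In the band below 4 cpd (cycles per degree), the results are similar to those of conventional LA JND models, but in the band above 4 cpd the JND instead decreases as the background luminance decreases. Based on these experimental results, we propose a two-dimensional LA JND model that incorporates frequency.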

Implementation of Data Storage Media Control and Command(DSM-CC) Core User-to-User Interface for MPEG-2 Bit Stream Transport

Park, Seong-Jong;Kim, Yong-Han;Kim, Jae-Woo;Lee, Ho-Jang;Shim, Jae-Kyu;Kim, Jae-D.;Koh, Jong-Seong
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 1998 Proceedings of International Workshop on Advanced Image Technology / pp. 79-84 / 1998
This paper describes an implementation of the core DSM-CC UU interface. It briefly describes the reference model for DSM-CC and the related standards that should be reviewed for the implementation. The Common Object Request Broker Architecture, Revision 2.0 (CORBA 2.0) is used as the remote procedure call (RPC) scheme for the UU interface. The entire system was implemented in C++ on Windows NT platforms. The implementation procedure has been decomposed into two tasks. The first task is to implement the Naming Service for service navigation. The Naming Service is one of the CORBA Services that extend the core CORBA specification. A client GUI is implemented for easy navigation among various services. The second task is to construct a multimedia server and client for a Video-on-Demand (VoD) system. The MPEG-2 Transport Stream is transported via ATM AAL5 using the Windows Socket 2.2 ATM extension API. A GUI enables the user to navigate the service domain and select a program. After the selection, the user can control the MPEG-2 stream with VCR-like buttons.

Three-dimensional beamforming techniques for LTE-A systems

지형주;심병효
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 2015 Fall Conference / pp. 43-44 / 2015
LTE-Advanced systems have been deployed with 2 and 4 transmission antennas (Tx), while the specification supports up to 8Tx. Because of deployment space, antenna dimensions, and complexity, operators have had little motivation to deploy 8Tx systems. Recently, three-dimensional (3D) beamforming with active antennas has attracted significant attention in the wireless industry. Incorporating a 2D active array into LTE-A systems gives the system freedom to control radiation in both the elevation and horizontal dimensions. When the number of antennas increases in a 2D arrangement, spatial separation can be realized simultaneously in the horizontal and elevation domains, and vertical beam-steering can increase the SINR of UEs on high floors. In this paper, we study the system operations and implementations needed to support 3D beamforming with 8Tx antennas. In our schemes, by reusing the conventional CSI feedback framework, the system can operate a 2D active array without harming backward compatibility. Evaluation results show that 3D beamforming boosts capacity over conventional 2D beamforming systems while keeping the same antenna structure.

EFFICIENT IMAGE SEGMENTATION FOR MANIFESTING VISUAL OBJECTS

Park, Hyun-Sang;Lim, Jung-Eun;Ra, Jong-Beom
- Korean Institute of Broadcast and Media Engineers: Conference Proceedings / Korean Society of Broadcast Engineers, 1999 KOBA Broadcasting Technology Workshop / pp. 159-164 / 1999
Homogeneous but distinct visual objects with low-contrast boundaries are usually merged by most segmentation algorithms. To alleviate this problem, an efficient image segmentation algorithm based on a bottom-up approach is proposed that uses spatial domain information only. For initial image segmentation, we adopt an efficient marker extraction algorithm conforming to the human visual system. Then, two region-merging algorithms are applied in succession so that homogeneous visual objects can be represented as simply as possible without destroying the low-contrast real boundaries among them. The resulting segmentation describes homogeneous visual objects with few regions while preserving semantic object shapes well. Finally, a size-based region decision procedure may be applied to represent complex visual objects more simply if their precise semantic contents are not necessary. Experimental results show that the proposed image segmentation algorithm represents homogeneous visual objects with a few regions and describes complex visual objects with a marginal number of regions while preserving semantic object shapes well.

