Search | Korea Science

피처레벨 비디오 분석과, 적응적 장면 선택을 이용한 비디오 캡셔닝 피처 생성

Lee, Ju-Hee;Kang, Je-Won
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.11a
- /
- pp.212-214
- /
- 2020
본 논문에서는 비디오의 피처레벨 분석을 통해 비디오의 장면 구성 특징을 파악하고, 그에 적응적으로 대표 프레임을 선택하는 방법을 제안한다. 제안된 방법으로 생성된 캡셔닝 피처는 비디오를 잘 요약하고, 이를 통해 효과적인 캡셔닝을 수행할 수 있다. 기존 비디오 캡셔닝 연구에서는 비디오의 장면 구성을 고려하지 않고 단순 등간격으로 프레임 추출을 통하여 비디오 캡셔닝을 수행하였다. 이는 다양한 장면의 모임으로 이루어진 비디오의 특성을 고려하지 않은 방법으로, 경우에 따라 주요 장면을 놓치거나, 불필요하게 중복된 프레임을 선택하는 문제가 발생한다. 본 논문에서는 비디오의 피처레벨 분석을 통해 비디오의 구성 특징을 파악하고, 이를 고려해 적응적으로 주요 프레임을 추출하여 이와 같은 문제를 해결하여 비디오 캡셔닝 에서의 성능향상을 보인다. 제안 알고리즘을 이용하여 생성된 피처는 비디오를 잘 요약하여 비디오 캡셔닝 수행 시, MSVD 데이터 셋에서 4 개의 평가지표에 대해 약 0.78%의 성능향상을 보였고, MSR-VTT 데이터 셋에서 약 0.6%의 성능향상을 보였다.
PDF

LCU-Level Rate Control for HEVC Considering Hierarchical Coding Structure (HEVC 의 계층적 부호화 구조를 고려한 LCU 단위의 비트율 제어 기법)

Park, Dong Il;Kim, Jae-Gon;Jeong, Dae-Gwon;Kim, Jongho;Kim, Hui-Yong;Choi, Jin Soo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2011.07a
- /
- pp.199-201
- /
- 2011
본 논문에서는 현재 표준화가 진행중인 HEVC 의 고정 비트율(CBR) 부호화를 위한 비트율 제어(rate control) 기법을 다룬다. HEVC 의 임의접근(Random Access: RA) 부호화 모드는 계층적-B 부호화 구조를 통해 높은 부호화 효율을 제공할 수 있다. 기존의 HEVC 를 위한 비트율 제어 방식으로는 2 차 비트율-왜곡 모델 기반의 시간계층 및 프레임 타입에 따른 비트율 특성을 반영한 프레임 레벨의 비트율 제어 기법이 제시되었다. 이 같은 기존의 프레임 레벨의 비트율 제어 기법은 임의접근 모드의 계층적-B 구조에서 동작성능이 확인되었으나, HEVC 의 기본적인 부호화 단위(Coding Unit: CU)의 특성이 반영되지 않아 비트율 제어의 정확성이 제한되었다. 본 논문에서는 기존의 계층적 부호화 구조를 고려한 프레임 레벨의 비트율 제어 기법을 확장한 CU 레벨에서의 비트율 제어 기법을 제시하고 모의실험을 통해 제시된 기법의 비트율 제어 성능을 확인한다.
PDF

Inter-Intra Motion Estimation in Wavelet based Codec (웨이블릿 코덱에서의 Inter-Intra 움직임 예측 기법)

이주경;김충길;강정구;정기동
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.04d
- /
- pp.187-189
- /
- 2003
웨이블릿 변환에 기반한 동영상 코덱에서의 움직임 예측 기법은 OCT 기반 코덱과 유사하게 이전 프레임과의 움직임 예측을 통하여 수행된다. 그러나, 현재 프레임이 이전 프레임을 참조하므로 네트워크상의 전송시 이전 프레임에 발생한 오류가 전달되는 오류 전파의 문제도 발생하게 된다. 본 논문에서는 웨이블릿 변환된 프레임의 특성을 이용하여 최상위 레벨의 LL 부대역만 이전 프레임과의 움직임 예측을 수행하고, 나머지 부대역에 대하여 프레임 내의 상위레벨의 부대역이 하위 부대역을 창조하여 예측 및 보상을 수행하여 오류전파의 가능성을 최소화하는 Inter-Intra ME 동영상 코덱을 제안한다 제안된 움직임 예측을 사용하여 MAD(Mean-Absolute Differences)를 측정한 결과, 프레임간 변화가 심한 경우에는 제안된 기법과 이전 프레임의 부대역을 참조한 기법 사이의 압축율은 유사하게 나타났으며, 변화가 적은 경우에는 이전 프레임을 참조하는 것의 압축율이 높게 나타났다. 그러나, 네트워크 전송시 발생하는 오류전파에는 제안된 기법의 성능이 우수한 것으로 나타났다.
PDF

Adaptive Quantization of Difference Wavelet Image for Close-Range Low-Bitrate Transmission (근거리 저전송률 통신을 위한 차영상 웨이브릿 적응 양자화)

Jeong Won-Kyo;Leef Kyeong-Hwan;Lee Yong-Doo
- Journal of Korea Multimedia Society
- /
- v.7 no.9
- /
- pp.1246-1254
- /
- 2004
This paper presents a image coding method that is well adaptive to close-range video transmission because of its low titrate and simple coding procedure. At first, it reduces temporal redundancies by performing image DPCM between previous frame and current frame, and makes wavelet transformed image of this difference image. Then, the coefficients are quantized selectively by using the coefficient values of base level and mid-frequency level because inter-level redundancies are widely exists in multi-resolution images. Finally quantized coefficients are made iron the function that implies the target bitrate, the average coefficient energy, and the value of the level. The proposed method shows the effective Performance in the experiments using the continuous motion images and transition images.
PDF

Adaptive Quantization of Image Sequence using Block Activity Level and Edge Feature Classification (블록의 활성 레벨과 에지 특성의 분류를 이용한 동영상의 적응 양자화)

안철준;공성곤
- Proceedings of the Korean Institute of Intelligent Systems Conference
- /
- 1997.11a
- /
- pp.191-194
- /
- 1997
본 논문에서는 2D-DCT 변환된 동영상 프레임 사이의 오차 블록들의 활성 레벨(atcivity level)과 에지의 특성을 분류하여 동영상의 적응적인 양자화를 제안한다. 각 블록에서는 활성 레벨이 각기 다르고, 같은 활성 레벨이라 할지라도 에지의 특성도 각기 다르게 나타난다. 적응적인 양자화를 위해서, 2D-DCT 변환된 영상 오차의 각 블록의 활성레벨 뿐만 아니라 AC 계수들의 분포에 따른 에지 특성을 분류하면, 블록의 활성 레벨만을 일률적으로 적용한 Sorting 방법의 경우보다 향상된 영상을 복원할 수 있다. 블록의 활성 레벨은 AC energy에 의해서 측정하고, 에지 특성은 AC 계수들의 분포에 의해 결정하게 된다.
PDF

Quantization Level Selection of Intra-Frame for MPEG-4 Video Encoder (MPEG-4 부호화기에서의 인트라 프레임 양자화 레벨 선정)

Kim Jeong Woo;Cho Seong Hwan
- Journal of Korea Multimedia Society
- /
- v.8 no.1
- /
- pp.9-18
- /
- 2005
This paper presents the method of calculating the quantization level of the intra-frame in MPEG-4 video encoder. The intra-frame is an essential part in that the quality of the whole GOP is affected by the quality of this frame since the intra-frame, which works as a reference frame within GOP, continuously propagates through other frames. This work proposes how to use bits assigned for gaining the quantization level of the intra-frame, complexity of input images, and GOP structures. The result shows that while existing approaches have the decline in efficiency by using fixed values or show different qualifies depending on the characteristics of the images, the current approach shows the steady results in various images. Comparing with Q2 algorithm obtained in MPEG-4 VM, the approach suggested in this paper gains the benefit of maximum 3.49dB with some variations depending on the characteristics of the images.
PDF

(A Study on an Adaptive Multimedia Synchronization Scheme for Media Stream Transmission) (미디어 스트림 전송을 위한 적응형 멀티미디어 동기화 기법에 관한 연구)

지정규
- Journal of the Korea Computer Industry Society
- /
- v.3 no.9
- /
- pp.1251-1260
- /
- 2002
Real-time application programs have synchronization constraints which need to be met between media-data. Synchronization method represents feedback method including virtual client-side buffer. This buffer is used in buffer level method. It is client-leading synchronization that is absorbing variable transmission delay time and that is synchronizing by feedback control. It is the important factor for playback rate and QoS if the buffer level is normal or not. To solve the problems, we can control the start of transmission in multimedia server by appling filtering, control and network evaluation function. Synchronization method is processing for smooth presentation without cut-off while media is playing out. When audio frame which is master media is in high threshold buffer level we decrease play out time gradually, otherwise we increase it slowly.
PDF

Graph-based High-level Motion Segmentation using Normalized Cuts (Normalized Cuts을 이용한 그래프 기반의 하이레벨 모션 분할)

Yun, Sung-Ju;Park, An-Jin;Jung, Kee-Chul
- Journal of KIISE:Software and Applications
- /
- v.35 no.11
- /
- pp.671-680
- /
- 2008
Motion capture devices have been utilized in producing several contents, such as movies and video games. However, since motion capture devices are expensive and inconvenient to use, motions segmented from captured data was recycled and synthesized to utilize it in another contents, but the motions were generally segmented by contents producers in manual. Therefore, automatic motion segmentation is recently getting a lot of attentions. Previous approaches are divided into on-line and off-line, where ow line approaches segment motions based on similarities between neighboring frames and off-line approaches segment motions by capturing the global characteristics in feature space. In this paper, we propose a graph-based high-level motion segmentation method. Since high-level motions consist of repeated frames within temporal distances, we consider similarities between neighboring frames as well as all similarities among all frames within the temporal distance. This is achieved by constructing a graph, where each vertex represents a frame and the edges between the frames are weighted by their similarity. Then, normalized cuts algorithm is used to partition the constructed graph into several sub-graphs by globally finding minimum cuts. In the experiments, the results using the proposed method showed better performance than PCA-based method in on-line and GMM-based method in off-line, as the proposed method globally segment motions from the graph constructed based similarities between neighboring frames as well as similarities among all frames within temporal distances.
PDF KSCI

A Study on the Noise-Level Measurement using the Energy and relation of closed pitch (에너지와 인근피치간에 유사도를 이용한 잡음레벨 검출에 관한 연구)

Kang InGyu;Bae MyungJin
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.77-80
- /
- 2004
인간은 "습관적 피치 레벨" 즉 자연스럽게 말할 때 평균적으로 사용하는 피치를 갖는다. 하지만 음성에 잡음이 첨가 되면 이 피치가 불규칙하게 바뀌게 된다. 이점을 이용하여 음성의 잡음레벨을 측정할 수 있다. 본 논문에서는 입력음성의 에너지를 구하고 일정 에너지레벨 이상에서의 구간에 대해 NAMDF(Normalized Average Magnitude Difference Function)방법으로 피치를 구하고, 각 프레임을 피치단위로 분절한 뒤 인근 피치간의 유사도를 측정하여 입력음성데이터의 잡음레벨을 검출하는 방법을 제안하였다.
PDF

A Study on the relation of closed pitch for Noise-Level Measurement (음성의 잡음레벨 추정을 위한 피치간 유사도 측정에 관한 연구)

Kang InGyu;Kang SungMo;Bae MyungJin
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.73-76
- /
- 2004
인간은 "습관적 피치 레벨" 즉 자연스럽게 말할 때 평균적으로 사용하는 피치를 갖는다. 하지만 음성에 잡음이 첨가되면 이 피치가 불규칙하게 바뀌게 된다. 이점을 이용하여 음성의 잡음레벨을 측정할 수 있다. 본 논문에서는 입력음성의 에너지를 구하고 일정 에너지레벨 이상에서의 구간에 대해 NAMDF(Normalized Average Magnitude Difference Function)방법으로 피치를 구하고, 각 프레임을 피치단위로 분절한 뒤 인근 피치간의 유사도를 측정하여 입력음성데이터의 잡음레벨을 검출하는 방법을 제안하였다.
PDF

Search Result 195, Processing Time 0.043 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)