Search | Korea Science

Context-Adaptive Intra Prediction Model Training and Its Coding Performance Analysis (문맥적응적 화면내 예측 모델 학습 및 부호화 성능분석)

Moon, Gihwa;Park, Dohyeon;Kim, Jae-Gon
- Journal of Broadcast Engineering
- /
- v.27 no.3
- /
- pp.332-340
- /
- 2022
Recently, with the development of deep learning and artificial neural network technologies, research on the application of neural network has been actively conducted in the field of video coding. In particular, deep learning-based intra prediction is being studied as a way to overcome the performance limitations of the existing intra prediction techniques. This paper presents a method of context-adaptive neural network-based intra prediction model training and its coding performance analysis. In other words, in this paper, we implement and train a known intra prediction model based on convolutional neural network (CNN) that predicts a current block using contextual information from reference blocks. Then, we integrate the trained model into HM16.19 as an additional intra prediction mode and evaluate the coding performance of the trained model. Experimental results show that the trained model gives 0.28% BD-rate bit saving over HEVC in All Intra (AI) coding mode. In addition, the coding performance change of training considering block partition is also presented.
https://doi.org/10.5909/JBE.2022.27.3.332 인용 PDF KSCI KPUBS

An Adaptive Intra Coding Technique Using 1-D and 2-D Integer Transforms (1차원 및 2차원 정수 변환을 이용한 적응적 화면내 코딩 기법)

Park, Min-Cheol;Kim, Dong-Won;Moon, Joo-Hee
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.46 no.5
- /
- pp.66-79
- /
- 2009
In this paper, we propose a new adaptive intra coding technique using 1-D and 2-D integer transforms for improving coding efficiency of H.264/AVC. Proposed technique selects the most effective transform and prediction mode for each block after processing 1-D and 2-D transforms of all prediction modes. In case of using 1-D transform, $4{\times}4$ block is divided into four $1{\times}4$ or $4{\times}1$ subblocks and then each subblock is predicted and subtracted by using the decoded subblock located at the nearest position in the direction of prediction. After prediction error subblock is processed by 1-D transform and quantization, four subblocks are merged back into original $4{\times}4$ block and then, reordered as 1-D signal by a DC biased zigzag scanning pattern according to the prediction mode. Finally, comparing the coding efficiency between bitstreams based on 1-D transform and conventional 2-D transform, prediction mode and quantized coefficients for each block are decided and corresponding quantized coefficients are transmitted. Experimental results show that the proposed adaptive technique increases 0.34dB in BD-PSNR and decreases 4.03% in BD-Bitrate on the average compared with H.264/AVC.
PDF KSCI

Modified Adaptive Motion Vector Resolution (수정된 적응적 움직임 벡터 해상도 부호화 방법)

Jang, Myoung-Hun;Han, Jong-Ki;Bae, Jinsoo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2011.11a
- /
- pp.46-48
- /
- 2011
기존의 참조 소프트웨어인 MPEG-2, MPEG-4, H.264/AVC에서는 움직임 벡터를 찾을 때 항상 고정된 해상도를 사용하였으며 다른 참조 소프트웨어인 KTA에서는 움직임 벡터를 찾을 때 움직임벡터의 해상도를 슬라이스 단위로 성능이 가장 높은 해상도를 선택해서 사용하였다. 하지만 움직임 벡터의 해상도는 블록마다 서로 다르기 때문에 블록별로 서로 다른 해상도를 적응적으로 사용할 필요가 있다. 적응적인 움직임 벡터 해상도 부호화 방법은 이러한 점을 이용하여 블록 별로 현재 블록의 움직임 벡터가 1/4 해상도인지 1/8 해상도인지에 판단하고 그에 대한 정보를 복호기에 전송해준다. 제안하는 알고리즘은 적응적 움직임 벡터 해상도를 사용하여 부호화 할 때 1/8 해상도 움직임 벡터가 성능이 없다고 판단되는 곳에선 적응적 움직임 벡터 해상도 방식을 사용하지 않고 1/4 해상도로만 움직임 벡터를 찾는다. 이러한 경우 해상도 정보를 복호기에 전송하지 않아 부호화 효율을 높일 수 있고 또한 1/8 해상도에 대한 움직임 예측을 하지 않기 때문에 부호화기 복잡도를 낮출 수 있다. 실험결과 평균 0.2%의 성능을 얻을 수 있었으며 부호화기 복잡도는 4% 감소하였다.
PDF

Adaptive Combination of Intra/Inter Predictions in JM KTA Software (JM KTA 소프트웨어에서 인트라 및 인터 예측블록이 혼합된 코딩 방법)

Kim, Min-Jae;Seo, Chan-Won;Jang, Myung-Hun;Han, Jong-Ki
- Journal of Broadcast Engineering
- /
- v.16 no.2
- /
- pp.190-206
- /
- 2011
We propose an adaptive combination scheme of intra and inter prediction modes, where uni-directional intra prediction, bi-directional intra prediction, and inter prediction method are adaptively selected in an EMB (extended macro block). For each EMB, after all inter blocks have been encoded and decoded, the reconstructed blocks are used as reference data for bi-directional intra prediction of other blocks. Whereas conventional intra coding scheme does not use the right and below side pixels of the current block as reference data, the proposed method uses those for bi-directional intra prediction mode. In this paper, we propose three advanced techniques; (a) filter design for bi-directional prediction, (b) adaptive coding order scheme which increases the chance to use the bi-directional intra prediction mode, (c) modification of syntax to represent coding order. The information for the coding order is informed to the decoder by using the modified syntax structure without adding any additional flag. The simulation results show that the proposed scheme reduces the BD-Rate by 0.5%, on average, compared to KTA.
https://doi.org/10.5909/JEB.2011.16.2.190 인용 PDF KSCI

Hybrid Coding for Multi-spectral Satellite Image Compression (다중스펙트럼 위성영상 압축을 위한 복합부호화 기법)

Jung, Kyeong-Hoon
- Journal of the Korean Association of Geographic Information Studies
- /
- v.3 no.1
- /
- pp.1-11
- /
- 2000
The hybrid coding algorithm for multi-spectral image obtained from satellite is discussed. As the spatial and spectral resolution of satellite image are rapidly increasing, there are enormous amounts of data to be processed for computer processing and data transmission. Therefore an efficient coding algorithm is essential for multi-spectral image processing. In this paper, VQ(vector quantization), quadtree decomposition, and DCT(discrete cosine transform) are combined to compress the multi-spectral image. VQ is employed for predictive coding by using the fact that each band of multi-spectral image has the same spatial feature, and DCT is for the compression of residual image. Moreover, the image is decomposed into quadtree structure in order to allocate the data bit according to the information content within the image block to improve the coding efficiency. Computer simulation on Landsat TM image shows the validity of the proposed coding algorithm.
PDF

Adaptive Reference Structure Decision Method for HEVC Encoder (HEVC 부호화기의 적응적 참조 구조 변경 방법)

Mok, Jung-Soo;Kim, JaeRyun;Ahn, Yong-Jo;Sim, Donggyu
- Journal of Broadcast Engineering
- /
- v.22 no.1
- /
- pp.1-14
- /
- 2017
This paper proposes adaptive reference structure decision method to improve the performance of HEVC (High Efficiency Video Coding) encoder. When an event occurs in the input sequence, such as scene change, scene rotation, fade in/out, or light on/off, the proposed algorithm changes the reference structure to improve the inter prediction performance. The proposed algorithm divides GOP (Group Of Pictures) into two sub-groups based on the picture that has such event and decides the reference pictures in the divided sub-groups. Also, this paper proposes fast encoding method which changes the picture type of first encoded picture in the GOP that has such event to CRA (Clean Random Access). With the statistical feature that intra prediction is selected by high probability for the first encoded picture in the GOP carrying such event, the proposed fast encoding method does not operate inter prediction. The experimental result shows that the proposed adaptive reference structure decision method improves the BD-rate 0.3% and reduces encoding time 4.9% on average under the CTC (Common Test Condition) for standardization. In addition, the proposed reference structure decision method with the picture type change reduces the average encoding time 12.2% with 0.11% BD-rate loss.
https://doi.org/10.5909/JBE.2017.22.1.1 인용 PDF KSCI KPUBS

Selective Multiple Reference Frames Algorithm (선택적 다중 참조프레임 적용방법)

Han, Ki-Hun
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2013.06a
- /
- pp.357-358
- /
- 2013
H.264 등 동영상 압축 표준에서는 비디오 신호의 시간적 중복 데이터를 제거하기 위해 움직임 추정/보상을 수행한다. 또한 움직임 추정/보상의 정확성을 향상하기 위해 다중 참조프레임을 지원한다. 여러 장의 참조 프레임 중 현재 블록과 가장 유사한 참조 프레임 영역으로부터 움직임 추정/보상을 수행하여 보다 정확한 예측에 의해 잔차신호의 크기가 감소하게 되고, 그 결과 부호화 효율이 더욱 개선되었다. 본 논문에서는 다중 참조 프레임을 사용한 움직임 추정/보상의 효율을 유지하면서도 참조프레임을 나타내는 참조프레임 인덱스 비트를 줄여주어 부호화 효율을 더욱 개선하는 방법을 제안한다. 본 논문에서 제안하는 방법은 움직임 추정/보상 시, 각각의 참조 프레임에서 움직임 추정/보상에 사용되는 예측화소들을 비교하여 다중 참조 프레임이 효과가 있다고 판단 되는 경우에만 다중 참조 프레임 움직임 추정/보상을 수행하고, 다중 참조 프레임이 효과가 없다고 판단 되는 경우에는 단일 참조 프레임 움직임 추정/보상을 적응적으로 수행하였다. 실험결과 제안하는 방법은 다중 참조 프레임 인덱스 부호화에 소요되는 비트를 절감하면서도 부호화 효율을 유지함을 확인 할 수 있었다. 제안하는 방법은 동영상 압축 코덱에 적용되어 압축 성능을 더욱 향상 할 수 있다.
PDF

Multi-view residual image coding technique using adaptive quantization and scanning method (적응적 양자화 및 스캔 방법을 이용한 다시점 차영상 부호화에 관한 연구)

임정은;손광훈
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.27 no.3A
- /
- pp.249-257
- /
- 2002
본 논문에서는 스테레오/다시점 영상을 효율적으로 압축할 수 있는 차 영상 부호화 방법을 제안한다. 예측된 영상과 원 영상의 차이 정보를 보다 효율적으로 전송하기 위하여 DCT를 기반으로 차 영상 부호화를 하게 되는데 DCT 계수들의 방향성을 이용하여 양자화 및 스캔 방법을 각 블록의 특성에 따라 다르게 적용하였다. 특히 다시점 영상의 부호화는 첫 번째 시점 영상을 기준 영상으로 정하여 나머지 시점 영상을 기준 영상으로부터 변이를 추정하여 복원하는 방식과 다시점 영상 중 가려진 영역의 비율을 고려하여 가려진 영역이 상대적으로 제일 적은 영상을 기준 영상으로 설정하여 나머지 영상을 변이 추정하여 복원하는 방법으로 나누어 실험하였다. 실험 결과 모든 압축률에 대하여 제안 방식이 기존의 차 영상 부호화 방법보다 우수함을 확인하였고, 가려진 영역의 상대적인 비율을 고려하여 다시점 영상을 부호화한 제안 방식이 기존의 방식 및 첫 번째 시점을 기준 영상으로 설정하여 부호화한 제안 방식보다 우수함을 확인하였다.
PDF KSCI

Adaptive Deblocking Filter based on Video Contents (영상에 적응적인 디블로킹 필터 개발)

Lee Sang Rae
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2003.11a
- /
- pp.65-68
- /
- 2003
블록화 현상은 블록 기반의 부호화와 이에 따른 거친 양자화 계수를 적용할 때 나타날 뿐 아니라 블록화가 나타난 블록을 움직임 보상으로 가져와 적용할 때 이후 영상에 전파되게 된다. 이를 방지하기 위해 H.264/MPEG-4 AVC 표준은 부호화 및 복호화 과정에 동시에 포함된 형태의 루프 필터를 적용하였다. 필터는 블록 경계에서 경계 양쪽의 블록 예측 모드에 기반 한 필터의 세기를 결정하고 양자화 계수를 이용한 한계 값과 화소 값윽 비교하여 블록 경계에 적응적으로 적용한다. 이 때 필터의 특성을 결정하는 편차 값을 부호기에서 전송하게 되는데 이 값은 부호기 구현에 따라 달라질 수 있다. 본 논문은 부호화하는 각 영상의 특성을 정의하고 편차 값을 정함으로써 영상에 적응적인 디블로킹 필터 알고리즘을 구현 및 실험을 통하여 검증한다.
PDF

Fast H.264/AVC Full Search Algorithm using Spatial and Temporal Correlation (시.공간적 상관도를 이용한 고속 H.264/AVC 전 영역 탐색 방법)

Moon, Ji-Hee;Ho, Yo-Sung
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2009.11a
- /
- pp.67-70
- /
- 2009
H.264/AVC 동영상 표준은 기존의 동영상 표준과 비교했을 때 뛰어난 압축률을 보인다. 특히 세밀한 움직임 예측을 통해 영상을 효율적으로 압축하지만 움직임 예측은 H.264/AVC 동영상 표준의 높은 복잡도의 원인 중 하나이다. 따라서 H.264/AVC의 부호화 시간을 단축하기 위해서는 고속 움직임 추정 기법이 필수적이다. 일반적으로 영상 신호는 인접한 화면과 매크로블록 사이에서 상관관계가 높고 부호화하고자 하는 매크로블록의 움직임벡터는 인접한 매크로블록에서 결정된 최적의 움직임 벡터와 유사한 방향성을 가진다. 그러므로 고정된 탐색 영역의 크기를 이용하면 불필요한 영역까지 움직임 예측 과정이 수행되어 계산량이 증가한다. 본 논문에서는 영상의 공간적, 시간적 상관도를 이용하여 탐색 영역의 크기를 결정하는 방법을 제안한다. 인접하는 블록들의 움직임 벡터의 표준편차를 이용하여 움직임이 작은 영역에서는 작은 탐색 영역을 이용하여 움직임 예측을 수행하고 반대로 움직임이 큰 영역에서는 큰 탐색 영역을 이용하여 움직임 예측을 수행한다. 또한 현재 화면과 참조 화면의 거리차가 클수록 참조 화면으로 선택되는 확률이 낮다는 사실을 이용하여 적응적으로 탐색 영역의 크기를 조절한다. 제안한 방법은 기존의 전 영역 탐색 방법과 유사한 부호화 성능을 보이면서 움직임 예측 시간이 평균 약 58.93% 감소하는 것을 확인할 수 있다.
PDF

Search Result 127, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)