• Title/Summary/Keyword: video CODEC

Search Result 315, Processing Time 0.025 seconds

Non-Reference P Frame Coding for Low-Delay Encoding in Internet Video Coding (IVC의 저지연 부호화 모드를 위한 비참조 P 프레임의 부호화 기법)

  • Kim, Dong-Hyun;Kim, Jin-Soo;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.19 no.2
    • /
    • pp.250-256
    • /
    • 2014
  • Non-reference P frame coding is used to enhance coding efficiency in low-delay encoding configuration of Internet Video Coding (IVC), which is being standardized as a royalty-free video codec in MPEG. The existing method of non-reference P frame coding which was adopted in the reference Test Model of IVC (ITM) 4.0 adaptively applies a non-reference P frame with a fixed coding structure based on the magnitude of motion vectors (MVs), however, which unexpectedly degrades the coding efficiency for some sequences. In this paper, the existing non-reference P frame coding is improved by changing non-reference P frame coding structure and applying a new adaptive method using the ratio of the amount of generated bits of non-reference frames to that of reference frames as well as MVs. Experimental results show that the proposed non-reference P frame coding gives 6.6% BD-rate bit saving in average over ITM 7.0.

Linear Sub-band Decomposition based Pre-processing Algorithm for Perceptual Video Coding (지각적 동영상 부호화를 위한 선형 부 대역 분해 기반 전처리 기법)

  • Choi, Kwang Yeon;Song, Byung Cheol
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.1
    • /
    • pp.80-87
    • /
    • 2017
  • This paper proposes a pre-processing algorithm to improve perceptual video coding efficiency which decomposes an input frame via a sub-band decomposition, and suppresses only high frequency band(s) having low visual sensitivity. First, we decompose the input frame into several frequency subbands by a linear sub-band decomposition. Next, high frequency subband(s) which is rarely recognized by human visual system (HVS) is suppressed by applying relatively small gain(s). Finally, the high frequency suppressed frame is compressed by a specific video encoder. We can find from the experimental results that if comparing before-use and after-use of the proposed pre-processing prior to the encoder, no visual difference is shown. Also, the proposed algorithm achieves bit-saving of 13.12% on average in a H.264 video encoder.

A Method of Intra Mode Coding for Joint Exploration Model (JEM) (차세대 비디오 부호화 실험모델(JEM)의 화면내 예측 모드 부호화 기법)

  • Park, Dohyeon;Lee, Jinho;Kang, Jung Won;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.23 no.4
    • /
    • pp.495-502
    • /
    • 2018
  • JVET (Joint Video Exploration Team) which explored evolving technologies of video coding with capabilities beyond HEVC (High Efficiency Video Coding), released a references software codec named the Joint Exploration Model (JEM) for performance verification of coding technologies. JEM has 67 intra prediction modes that extend the 35 modes of HEVC for intra prediction. Therefore, the enhancement of the coding performance is limited due to the overhead of prediction mode coding. In this paper, we analyze the probabilities of prediction modes selections, and then we propose a more efficient intra prediction mode coding based on the results of analyzed mode occurrence. In addition, we propose a context modeling for CABAC (Context-Adaptive Binary Arithmetic Coding) of the proposed mode coding. Experimental results show that the BD-rate gain is 0.02% on the AI (All Intra) coding structure compared to JEM 7.0. We need to optimize context modeling for additional coding performance enhancement.

Mobile Interactive Broadcasting Learning Solution Study on Development of Education (모바일 양방향 화상 교육시스템 개발에 관한 연구)

  • Kim, Tai-Dal;Lee, Byung-Kwon
    • Journal of Internet Computing and Services
    • /
    • v.13 no.1
    • /
    • pp.57-63
    • /
    • 2012
  • Recently, Development of high-speed internet networks can be a real-time video conference system. SD-quality still images using the software codec system exists, and two-way system with mobile systems, but in most video only send and receive video system. In this study, HD web-cam using the two-way live broadcasts simultaneously in the education system and the mobile system has developed an audible system. In the study using the H.264 standard video compression techniques were applied to high-bandwidth compression technology, smart phones, to develop a streaming server for broadcasting were applied. Video sharing, document, web-sharing was made possible in real time. In addition, the private IP for WiFi zone, available anytime, anywhere that can be converted to a public IP technology, IP tunneling technology applied.

Optimal Coding Model for Screen Contents Applications from the Coding Performance Analysis of High Efficient Coding Tools in HEVC (HEVC 고성능 압축 도구들의 성능 분석을 통한 스크린 콘텐츠 응용 최적 부호화 모델)

  • Han, Chan-Hee;Lee, Si-Woong
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.12
    • /
    • pp.544-554
    • /
    • 2012
  • Screen content refers to images or videos generated by various electronic devices such as computers or mobile phones, whereas natural content refers to images captured by cameras. Screen contents show different statistical characteristics from natural images, so the conventional video codecs which were developed mainly for the coding of natural videos cannot guarantee good coding performances for screen contents. Recently, researches on efficient SCC(Screen Content Coding) are being actively studied, and especially at ongoing JCT-VC(Joint Collaborative Team on Video Coding) meeting for HEVC(High Efficiency Video Coding) standard, SCC issues are being discussed steadily. In this paper, we analyze the performances of high efficient coding tools in HM(HEVC Test Model) on SCC, and present an optimized SCC model based on the analysis results. We also present the characteristics of screen contents and the future research issues as well.

Video Compression Standard Prediction using Attention-based Bidirectional LSTM (어텐션 알고리듬 기반 양방향성 LSTM을 이용한 동영상의 압축 표준 예측)

  • Kim, Sangmin;Park, Bumjun;Jeong, Jechang
    • Journal of Broadcast Engineering
    • /
    • v.24 no.5
    • /
    • pp.870-878
    • /
    • 2019
  • In this paper, we propose an Attention-based BLSTM for predicting the video compression standard of a video. Recently, in NLP, many researches have been studied to predict the next word of sentences, classify and translate sentences by their semantics using the structure of RNN, and they were commercialized as chatbots, AI speakers and translator applications, etc. LSTM is designed to solve the gradient vanishing problem in RNN, and is used in NLP. The proposed algorithm makes video compression standard prediction possible by applying BLSTM and Attention algorithm which focuses on the most important word in a sentence to a bitstream of a video, not an sentence of a natural language.

Survey on Deep learning-based Content-adaptive Video Compression Techniques (딥러닝 기반 컨텐츠 적응적 영상 압축 기술 동향)

  • Han, Changwoo;Kim, Hongil;Kang, Hyun-ku;Kwon, Hyoungjin;Lim, Sung-Chang;Jung, Seung-Won
    • Journal of Broadcast Engineering
    • /
    • v.27 no.4
    • /
    • pp.527-537
    • /
    • 2022
  • As multimedia contents demand and supply increase, internet traffic around the world increases. Several standardization groups are striving to establish more efficient compression standards to mitigate the problem. In particular, research to introduce deep learning technology into compression standards is actively underway. Despite the fact that deep learning-based technologies show high performance, they suffer from the domain gap problem when test video sequences have different characteristics of training video sequences. To this end, several methods have been made to introduce content-adaptive deep video compression. In this paper, we will look into these methods by three aspects: codec information-aware methods, model selection methods, and information signaling methods.

Performance Analysis of Super-Resolution based Video Coding for HEVC (HEVC 기반 초해상화를 이용한 비디오 부호화 효율 성능 분석)

  • Ki, Sehwan;Kim, Dae-Eun;Jun, Ki Nam;Baek, Seung Ho;Choi, Jeung Won;Kim, Dong Hyun;Kim, Munchurl
    • Journal of Broadcast Engineering
    • /
    • v.24 no.2
    • /
    • pp.306-314
    • /
    • 2019
  • Since the resolutions of videos increase rapidly, there are continuing needs for effective video compression methods despite an increase in the transmission bandwidth. In order to satisfy such a demand, a reconstructive video coding (RVC) method by using a super resolution has been proposed. Since RVC reduces the resolution of the input video, when frames are compressed to the same size, the number of bits per pixel increases, thereby reducing coding artifacts caused by video coding. However, RVC method using super resolution is not effective in all target bitrates. Comparing the size of the loss generated while downsizing the resolution and the size of the loss caused by the video compression, only when the size of loss generated in the video compression is larger, RVC method can perform the improved compression performance compared to direct video coding. In particular, since HEVC has considerably higher compression performance than the previous standard video codec, it can be experimentally confirmed that the compression distortions become larger than the distortions of downsizing the resolution only in the very low-bitrate conditions. In this paper, we applied RVC based HEVC in various video types and measured the target bitrates that RVC method can be effectively applied.

A Fast Decision Method of Quadtree plus Binary Tree (QTBT) Depth in JEM (차세대 비디오 코덱(JEM)의 고속 QTBT 분할 깊이 결정 기법)

  • Yoon, Yong-Uk;Park, Do-Hyun;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.22 no.5
    • /
    • pp.541-547
    • /
    • 2017
  • The Joint Exploration Model (JEM), which is a reference SW codec of the Joint Video Exploration Team (JVET) exploring the future video standard technology, provides a recursive Quadtree plus Binary Tree (QTBT) block structure. QTBT can achieve enhanced coding efficiency by adding new block structures at the expense of largely increased computational complexity. In this paper, we propose a fast decision algorithm of QTBT block partitioning depth that uses the rate-distortion (RD) cost of the upper and current depth to reduce the complexity of the JEM encoder. Experimental results showed that the computational complexity of JEM 5.0 can be reduced up to 21.6% and 11.0% with BD-rate increase of 0.7% and 1.2% in AI (All Intra) and RA (Random Access), respectively.

Real-time Stereo Video Generation using Graphics Processing Unit (GPU를 이용한 실시간 양안식 영상 생성 방법)

  • Shin, In-Yong;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.16 no.4
    • /
    • pp.596-601
    • /
    • 2011
  • In this paper, we propose a fast depth-image-based rendering method to generate a virtual view image in real-time using a graphic processor unit (GPU) for a 3D broadcasting system. Before the transmission, we encode the input 2D+depth video using the H.264 coding standard. At the receiver, we decode the received bitstream and generate a stereo video using a GPU which can compute in parallel. In this paper, we apply a simple and efficient hole filling method to reduce the decoder complexity and reduce hole filling errors. Besides, we design a vertical parallel structure for a forward mapping process to take advantage of the single instruction multiple thread structure of GPU. We also utilize high speed GPU memories to boost the computation speed. As a result, we can generate virtual view images 15 times faster than the case of CPU-based processing.