• Title/Summary/Keyword: Video Compression System

Comparison of Image Compression Performance based on RoI Extraction Methods for Machines Vision (RoI 추출 방법에 따른 기계를 위한 영상 압축 성능 비교)

  • Lee, Yegi;Kim, Shin;Yoon, Kyoungro
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2022.06a / pp.146-149 / 2022
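  • Conventional RDO (Rate Distortion Optimization) based compression focuses on compression performance, so the perceptual characteristics of an image may be ignored. Studies that adjust the compression rate based on the RoI (Region of Interest) have therefore been devised [1, 2, 3, 4], and most of them compress the important parts of an image at higher quality from the HVS (Human Visual System) perspective. With recent advances in artificial intelligence, demand for intelligent video analysis is growing, and with it the need for video coding and efficient transmission for machine vision. This paper proposes an RoI-based compression method that uses the dQP (delta Quantization Parameter) of VVC (Versatile Video Coding) and introduces two RoI extraction methods. A region-proposal-based RoI is extracted through the first detector of Detectron2 Faster R-CNN X101-FPN [5], and an object-based RoI is extracted through the second detector. The image is divided into object and non-object regions, which are compressed at different compression rates, and the resulting performance is compared.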

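
As a rough illustration of the RoI-based dQP idea in the abstract above, the sketch below turns a list of detected bounding boxes into a per-block delta-QP map. It is not the authors' implementation and calls no VVC encoder API. The function name `build_dqp_map`, the 128-pixel block size, and the dQP values (0 inside the RoI, +6 outside) are all illustrative assumptions.

```python
import math

def build_dqp_map(width, height, boxes, block=128, roi_dqp=0, bg_dqp=6):
    """Return a 2-D list of delta-QP values, one per block x block region.

    boxes: iterable of (x0, y0, x1, y1) pixel rectangles, x1/y1 exclusive.
    block, roi_dqp and bg_dqp are illustrative assumptions, not values
    taken from the paper.
    """
    cols = math.ceil(width / block)
    rows = math.ceil(height / block)
    # Start with every block treated as background (coarser quantization).
    dqp = [[bg_dqp] * cols for _ in range(rows)]
    for x0, y0, x1, y1 in boxes:
        # Clamp the box to the picture and skip empty boxes.
        x0, y0 = max(0, x0), max(0, y0)
        x1, y1 = min(width, x1), min(height, y1)
        if x0 >= x1 or y0 >= y1:
            continue
        # Mark every block that the box touches as RoI.
        for r in range(y0 // block, (y1 - 1) // block + 1):
            for c in range(x0 // block, (x1 - 1) // block + 1):
                dqp[r][c] = roi_dqp
    return dqp
```

Under these assumptions, swapping the box list (region proposals versus final detections) is the only change needed to compare the two RoI extraction styles on the same picture.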

An Advanced QER Selection Algorithm Based on MMT Protocol for 360-Degree VR Video Streaming (MMT 프로토콜 기반의 360도 VR 비디오 전송을 위한 개선된 QER 선택 알고리듬)

  • Kim, A-young;An, Eun-bin;Seo, Kwang-deok
    • Journal of Broadcast Engineering / v.24 no.6 / pp.948-955 / 2019
  • As interest in 360-degree VR (Virtual Reality) video services grows rapidly, compression and streaming technologies for VR video data have developed quickly. The Quality Emphasized Region (QER) based streaming scheme is a viewport-adaptive 360-degree video streaming technology that maintains an immersive experience while reducing bandwidth waste. To select the QER corresponding to the user's gaze coordinate, a QER-based streaming scheme must calculate the Quality Emphasis Center (QEC) distance and deliver a signaling message to request QER switching. The QEC distance calculation has high computational complexity because it is repeated once for every QER. Furthermore, the signaling message interval creates a trade-off between efficient bandwidth usage and flexible QER switching. In this paper, we propose an improved QER selection algorithm based on the MMT protocol to solve these problems. The proposed algorithm reduces computational complexity by using a preprocessed QER_ID_MAP. It also achieves flexible QER switching together with efficient bandwidth utilization by adaptively adjusting the signaling interval.
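
The benefit of a preprocessed lookup table can be sketched as follows. This is only an assumed reading of the QER_ID_MAP idea: each QEC is taken to be a (yaw, pitch) direction in degrees, the QEC distance is taken to be the great-circle angle, and the grid step (which must divide 180) is a free parameter. The names `qec_distance`, `build_qer_id_map`, and `lookup_qer` are hypothetical.

```python
import math

def qec_distance(yaw1, pitch1, yaw2, pitch2):
    """Great-circle angle (radians) between two directions given in degrees."""
    y1, p1, y2, p2 = map(math.radians, (yaw1, pitch1, yaw2, pitch2))
    cos_angle = (math.sin(p1) * math.sin(p2)
                 + math.cos(p1) * math.cos(p2) * math.cos(y1 - y2))
    return math.acos(max(-1.0, min(1.0, cos_angle)))

def build_qer_id_map(qecs, step=10):
    """Precompute the nearest QER index for each cell centre of a yaw/pitch grid."""
    table = []
    for pitch in range(-90, 90, step):
        row = []
        for yaw in range(-180, 180, step):
            cy, cp = yaw + step / 2, pitch + step / 2
            # The per-QER distance loop runs once here, offline.
            row.append(min(range(len(qecs)),
                           key=lambda i: qec_distance(cy, cp, *qecs[i])))
        table.append(row)
    return table

def lookup_qer(table, yaw, pitch, step=10):
    """Constant-time QER selection for a gaze direction in degrees."""
    row = min(int((pitch + 90) // step), len(table) - 1)
    col = int(((yaw + 180) % 360) // step)
    return table[row][col]
```

With the table built once, each gaze update costs a single array lookup instead of one distance evaluation per QER, at the price of quantizing the gaze direction to the grid.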

Stereo image compression based on error concealment for 3D television (3차원 텔레비전을 위한 에러 은닉 기반 스테레오 영상 압축)

  • Bak, Sungchul;Sim, Donggyu;Namkung, Jae-Chan;Oh, Seoung-jun
    • Journal of Broadcast Engineering / v.10 no.3 / pp.286-296 / 2005
  • This paper presents a stereo-based image compression and transmission system for realistic 3D television. In the proposed system, a disparity map is extracted from an input stereo image pair, and the disparity map and one of the two input images are transmitted or stored at a local or remote site. However, correspondences cannot be determined in occlusion areas, so it is not easy to recover 3D information in such regions. In this paper, a reconstruction image compensation algorithm based on error block concealment and in-loop filtering is proposed to minimize the reconstruction error when generating the stereo image pair. The effectiveness of the proposed algorithm is shown in terms of the objective accuracy of the reconstructed image for several real stereo image pairs.
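
The occlusion problem described above can be seen on a single scanline. The sketch below assumes integer disparities and the convention that a left-view pixel at column x appears at column x - d in the right view. The hole filling is a deliberately simple stand-in (copy the nearest valid pixel from the right, usually background), not the block concealment and in-loop filtering proposed in the paper. Both function names are hypothetical.

```python
def warp_scanline(left_row, disparity_row):
    """Predict one right-view row from a left-view row and its disparities.

    Positions that receive no pixel stay None; these are the occlusion holes.
    """
    right = [None] * len(left_row)
    for x, (value, d) in enumerate(zip(left_row, disparity_row)):
        target = x - d
        if 0 <= target < len(right):
            # Later (larger x) writes have larger disparity for the same
            # target, so nearer objects overwrite farther ones.
            right[target] = value
    return right

def conceal_holes(row):
    """Fill holes from the nearest valid pixel on the right, then on the left."""
    filled = list(row)
    nearest = None
    for i in range(len(filled) - 1, -1, -1):  # right-to-left pass
        if filled[i] is None:
            filled[i] = nearest
        else:
            nearest = filled[i]
    nearest = None
    for i in range(len(filled)):  # left-to-right pass for trailing holes
        if filled[i] is None:
            filled[i] = nearest
        else:
            nearest = filled[i]
    return filled
```

For example, a foreground object with disparity 2 in front of a zero-disparity background leaves a two-pixel hole next to the object, which the concealment step fills with background values.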

FPGA-based One-Chip Architecture and Design of Real-time Video CODEC with Embedded Blind Watermarking (블라인드 워터마킹을 내장한 실시간 비디오 코덱의 FPGA기반 단일 칩 구조 및 설계)

  • 서영호;김대경;유지상;김동욱
    • The Journal of Korean Institute of Communications and Information Sciences / v.29 no.8C / pp.1113-1124 / 2004
  • In this paper, we propose a hardware (H/W) structure that compresses and reconstructs the input image in real time, and implement it on an FPGA platform using VHDL (VHSIC Hardware Description Language). All the image processing elements needed for both compression and reconstruction in an FPGA were considered, and each was mapped into H/W with a structure efficient for the FPGA. We used the DWT (discrete wavelet transform), which transforms data from the spatial domain to the frequency domain, because we considered Motion JPEG2000 as the application. The implemented H/W is divided into a data path part and a control part. The data path part consists of image processing blocks and data processing blocks. The image processing blocks consist of the DWT Kernel for DWT filtering, the Quantizer/Huffman Encoder, the Inverse Adder/Buffer for adding the low-frequency coefficients to the high-frequency ones in the inverse DWT operation, and the Huffman Decoder. There are also interface blocks for communicating with external application environments and timing blocks for buffering between the internal blocks. The global operations of the designed H/W are image compression and reconstruction, and it operates in units of a field synchronized with the A/D converter. The implemented H/W used 69% (16980) of the LAB (Logic Array Block) and 9% (28352) of the ESB (Embedded System Block) resources of the ALTERA APEX20KC EP20K600CB652-7 FPGA chip, and operated stably at a 70 MHz clock frequency. We thus verified real-time operation at 60 fields/sec (30 frames/sec).

Quality Verification of Fixed and Mobile Hybrid 3DTV Services via a Subjective Test of Mixed-resolution Stereoscopic Videos (혼합 해상도 양안식 영상에 대한 주관적 화질평가를 통한 고정 및 이동 융합형 3DTV 서비스의 품질 검증)

  • Lee, Jooyoung;Kim, Sung-Hoon;Jeong, Seyoon;Choi, Jin Soo;Kang, Dong-Wook;Jung, Kyeong-Hoon;Kim, Jinwoong
    • Journal of Broadcast Engineering / v.19 no.2 / pp.148-157 / 2014
  • Various techniques have been developed for efficient compression of stereoscopic 3D video. The mixed-resolution approach is a representative bit-rate saving method that exploits a characteristic of the human visual system: mixed-resolution stereoscopic video is perceived at close to the quality of the higher-resolution view. However, when the difference between the left and right image resolutions exceeds a certain threshold, the perceived quality of the 3D images degrades. Several studies have tried to find the correlation between the resolution difference and the level of perceived quality degradation, but their experiments considered only the resolution difference and not the viewing distance, so the results differed from test to test. In this work, we calculated the optimal viewing distance based on the human visual system and conducted subjective tests at the calculated viewing distance. With the results, we demonstrate that fixed and mobile hybrid 3DTV, which is based on mixed-resolution stereoscopic images, can provide high-quality 3D services.
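
The abstract does not state how the optimal viewing distance was computed. One common HVS-based rule, used here purely as an assumption, places the viewer where a single pixel subtends about one arcminute of visual angle. The function name and the one-arcminute default are illustrative.

```python
import math

def optimal_viewing_distance(screen_height, vertical_pixels, acuity_arcmin=1.0):
    """Distance at which one pixel subtends `acuity_arcmin` arcminutes.

    The result is in the same unit as screen_height. The one-arcminute
    acuity figure is an assumed rule of thumb, not a value from the paper.
    """
    pixel_pitch = screen_height / vertical_pixels
    return pixel_pitch / math.tan(math.radians(acuity_arcmin / 60.0))
```

Under this rule a 1080-line display gives a distance of roughly 3.2 picture heights, and halving the vertical resolution doubles the distance, which is one way to see why results can differ between tests that ignore viewing distance.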

System Design and Implementation of FLV Move Picture Solution Based on IDC apply to Mini IPTV (IDC기반 FLV동영상 솔루션의 Mini IPTV 적용시스템의설계 및 구현)

  • Kwon, O-Byoung;Shin, Hyun-Cheul
    • Convergence Security Journal / v.11 no.4 / pp.11-17 / 2011
  • In this paper, we propose the design and implementation of an IDC-based FLV moving-picture solution applied to Mini IPTV. The IDC minimizes energy use and maximizes resource utilization through a Green Data Center design, and Mini IPTV delivers content from diverse content providers and network providers to Mini IPTV customers over the network. The implementation improves the motion blur and compressibility of the moving pictures by using the FLV file format and a compression technique, reduces traffic cost, and addresses security issues. We hope it will be especially helpful for e-Learning, a rapidly growing field on the web.

Post-Processing for JPEG-Coded Image Deblocking via Sparse Representation and Adaptive Residual Threshold

  • Wang, Liping;Zhou, Xiao;Wang, Chengyou;Jiang, Baochen
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.3 / pp.1700-1721 / 2017
  • Blocking artifacts are very common in block-based image and video compression, especially at very low bit rates. In this paper, we propose a post-processing method for JPEG-coded image deblocking via sparse representation and an adaptive residual threshold. The method has three steps. First, we obtain the dictionary by online dictionary learning from the compressed images. The dictionary is then modified using the histogram of oriented gradient (HOG) feature descriptor and K-means clustering. Second, an adaptive residual threshold for orthogonal matching pursuit (OMP) is proposed and used for sparse coding, in combination with blind image blocking assessment. Finally, to take advantage of the human visual system (HVS), the edge regions of the deblocked image can be further modified using the edge regions of the compressed image. The experimental results show that the proposed method preserves more texture and edge information while reducing blocking artifacts.
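
The role of a residual threshold in OMP can be sketched in a few lines. The version below selects atoms greedily and stops once the residual norm falls to a given threshold. The threshold is a plain parameter here, whereas the paper derives it adaptively. The function returns only the selected atom indices and the final residual, using Gram-Schmidt orthogonalization in place of an explicit least-squares solve. The names `dot` and `omp_support` are hypothetical.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def omp_support(signal, atoms, residual_threshold, max_atoms=None):
    """Greedy OMP atom selection with a residual-norm stopping rule.

    Returns (selected atom indices, final residual).
    """
    residual = list(signal)
    basis = []      # orthonormalized copies of the selected atoms
    selected = []
    limit = len(atoms) if max_atoms is None else max_atoms
    while (math.sqrt(dot(residual, residual)) > residual_threshold
           and len(selected) < limit):
        candidates = [j for j in range(len(atoms)) if j not in selected]
        if not candidates:
            break
        # Pick the unused atom most correlated with the current residual.
        best = max(candidates, key=lambda j: abs(dot(residual, atoms[j])))
        # Orthogonalize it against the atoms chosen so far.
        q = list(atoms[best])
        for b in basis:
            proj = dot(q, b)
            q = [qi - proj * bi for qi, bi in zip(q, b)]
        norm = math.sqrt(dot(q, q))
        if norm < 1e-12:
            break   # the atom adds no new direction
        q = [qi / norm for qi in q]
        # Remove the new direction from the residual.
        coeff = dot(residual, q)
        residual = [ri - coeff * qi for ri, qi in zip(residual, q)]
        basis.append(q)
        selected.append(best)
    return selected, residual
```

A larger threshold stops earlier and yields a sparser code, which is the lever an adaptive threshold can tie to the estimated blockiness of the image.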

Efficient Screen Splitting Methods - A Case Study in Block-wise Motion Detection

  • Layek, Md. Abu;Chung, TaeChoong;Huh, Eui-Nam
    • KSII Transactions on Internet and Information Systems (TIIS) / v.10 no.10 / pp.5074-5094 / 2016
  • Screen splitting is a fundamental task in many methods, including video and image compression, screen classification, and screen content coding. These methods in turn support applications in data communications, remote screen sharing, remote desktop delivery to assist teaching and learning, telemedicine, Desktop as a Service, and so on. Systems in the literature that require splitting assume a fixed-size split that does not change dynamically, and they do not analyze why that split was chosen in terms of performance. Through mathematical analysis, this paper first finds efficient splitting schemes that can be easily automated to make a system adaptive. Then, taking screen motion detection as a case study, it demonstrates the effects of various splitting methods on motion detection performance. The simulation results clearly show how classification performance varies with different splittings, which helps in choosing the best splitting for a specific application scenario and in making the system adaptive through dynamic splitting.
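
A minimal sketch of block-wise motion detection makes the effect of the split size concrete. It assumes frames are equal-sized 2-D lists of pixel values and that a block counts as "moving" when any of its pixels changed. The function name `changed_blocks` is hypothetical, and the paper's own detection criterion may differ.

```python
def changed_blocks(prev, curr, block_w, block_h):
    """Return the set of (block_row, block_col) cells that differ between frames."""
    changed = set()
    for y, (prev_row, curr_row) in enumerate(zip(prev, curr)):
        for x, (p, c) in enumerate(zip(prev_row, curr_row)):
            if p != c:
                # Map the pixel to the block that contains it.
                changed.add((y // block_h, x // block_w))
    return changed
```

Running the same frame pair with different `block_w` and `block_h` values shows the trade-off: small blocks localize the change tightly but produce more cells to track, while large blocks flag more unchanged pixels as moving.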

Loss Compression and Loss Correction Technique of 3D Point Cloud Data (3차원 데이터의 손실압축과 손실보정기법 연구)

  • Shin, Kwang-seong;Shin, Seong-yoon
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.05a / pp.351-352 / 2021
  • The recent rapid change in the social environment caused by COVID-19 has sharply increased the need for non-face-to-face, contactless information exchange technology. As a result, alternative systems that provide a sense of immersion and presence are urgently required. In this study, to implement a video conferencing system, we implemented a technology for transmitting large-capacity 3D data in real time without delay. For this, we used an applied GAN algorithm, a recent deep learning algorithm of the unsupervised learning family.


PSNR Evaluation of P Company DSA System between Server Display Monitor and Client Display Monitor (P사 DSA 시스템의 Server Display Monitor와 Client Display Monitor의 PSNR 평가)

  • Lee, Junhaeng
    • Journal of the Korean Society of Radiology / v.8 no.1 / pp.43-49 / 2014
  • PACS requires large-capacity storage devices for medical imaging, and slow transmission degrades PACS performance. Images kept in long-term storage are therefore compressed and stored within a range that does not compromise image quality or affect future readings. Noise generated during compression, storage, and transmission of medical images, and the resulting loss of information at monitor output, cause many problems. This study evaluates the server display monitor and the client display monitor of a Philips DSA system and suggests improvements based on PSNR, covering the process from acquisition of the server display signal to the client display monitor. The P company DSA system was used in the test. Two monitors were used to display angiography images: a 1280×1024 pixel monitor from P company and a 1536×2048 pixel monitor from Wide. MARO-view was used as the PACS program, and Visual C++ was used to implement the PSNR measurement program. In the experiment, PSNR showed no change for No. 1 and No. 3, indicating no error in transmission and display. In terms of compressibility, low compressibility produced only a small change in definition, and no remarkable drawback of compression was observed where the change in definition was small.
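
For reference, PSNR between a server-side image and the corresponding client-side image can be computed as below. The study's measurement program was written in Visual C++; this Python sketch only illustrates the standard definition, PSNR = 10 log10(MAX² / MSE), and assumes both images are equal-sized 2-D lists of grey levels with an 8-bit peak of 255.

```python
import math

def psnr(reference, test, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-sized grey images."""
    ref = [p for row in reference for p in row]
    tst = [p for row in test for p in row]
    if len(ref) != len(tst) or not ref:
        raise ValueError("images must be non-empty and the same size")
    mse = sum((a - b) ** 2 for a, b in zip(ref, tst)) / len(ref)
    if mse == 0:
        return math.inf  # identical images: no measurable error
    return 10 * math.log10(max_value ** 2 / mse)
```

Identical images give an infinite PSNR, so a finite and unchanged value across test cases points to a stable, repeatable difference between the two display paths rather than to new transmission errors.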