• Title/Summary/Keyword: 2D Video

Search Result 910, Processing Time 0.036 seconds

A Study on Motion Estimation Encoder Supporting Variable Block Size for H.264/AVC (H.264/AVC용 가변 블록 크기를 지원하는 움직임 추정 부호기의 연구)

  • Kim, Won-Sam;Sohn, Seung-Il
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.12 no.10
    • /
    • pp.1845-1852
    • /
    • 2008
  • The key elements of inter prediction are motion estimation(ME) and motion compensation(MC). Motion estimation is to find the optimum motion vectors, not only by using a distance criteria like the SAD, but also by taking into account the resulting number of 비트s in the 비트 stream. Motion compensation is compensate for movement of blocks of current frame. Inter-prediction Encoding is always the main bottleneck in high-quality streaming applications. Therefore, in real-time streaming applications, dedicated hardware for executing Inter-prediction is required. In this paper, we studied a motion estimator(ME) for H.264/AVC. The designed motion estimator is based on 2-D systolic array and it connects processing elements for fast SAD(Sum of Absolute Difference) calculation in parallel. By providing different path for the upper and lower lesion of each reference data and adjusting the input sequence, consecutive calculation for motion estimation is executed without pipeline stall. With data reuse technique, it reduces memory access, and there is no extra delay for finding optimal partitions and motion vectors. The motion estimator supports variable-block size and takes 328 cycles for macro-block calculation. The proposed architecture is local memory-free different from paper [6] using local memory. This motion estimation encoder can be applicable to real-time video processing.

A Study on an Improved H.264 Inter mode decision method (H.264 인터모드 결정 방법 개선에 관한 연구)

  • Gong, Jae-Woong;Jung, Jae-Jin;Hwang, Eui-Sung;Kim, Tae-Hyoung;Kim, Doo-Young
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.9 no.4
    • /
    • pp.245-252
    • /
    • 2008
  • In this paper, we propose a new method for improving the H 264 encoding process and motion estimation part. Our approach is a method to reduce the encoding running time through the omission of reference frame in the mode selection process of H 264 and an improvement of SAD computing process. To evaluate the proposed method, we used the H 264 standard image of QCIF size and TIN 4:2:0 format. Experimental results show that proposed SAD algorithm 1 can improve the speed of encoding runnung time by an average of 4.7% with a negligible degradation of PSNR. However, SAD algorithm 2 can improve the speed of encoding runnung time by an average of 9.6% with 0.98dB degradation of PSNR.

  • PDF

A Feature Point Recognition Ratio Improvement Method for Immersive Contents Using Deep Learning (딥 러닝을 이용한 실감형 콘텐츠 특징점 인식률 향상 방법)

  • Park, Byeongchan;Jang, Seyoung;Yoo, Injae;Lee, Jaechung;Kim, Seok-Yoon;Kim, Youngmo
    • Journal of IKEEE
    • /
    • v.24 no.2
    • /
    • pp.419-425
    • /
    • 2020
  • The market size of immersive 360-degree video contents, which are noted as one of the main technology of the fourth industry, increases every year. However, since most of the images are distributed through illegal distribution networks such as Torrent after the DRM gets lifted, the damage caused by illegal copying is also increasing. Although filtering technology is used as a technology to respond to these issues in 2D videos, most of those filtering technology has issues in that it has to overcome the technical limitation such as huge feature-point data volume and the related processing capacity due to ultra high resolution such as 4K UHD or higher in order to apply the existing technology to immersive 360° videos. To solve these problems, this paper proposes a feature-point recognition ratio improvement method for immersive 360-degree videos using deep learning technology.

Application for Furniture Arrangement based on Virtual Reality to Share Interior Design in Real-time (실시간 인테리어 공유를 위한 가상현실 기반 가구 배치 애플리케이션)

  • Han, Ah-Reum;Park, Taejung
    • Journal of Digital Contents Society
    • /
    • v.18 no.2
    • /
    • pp.249-256
    • /
    • 2017
  • As single-person households increase, demands for self interior design has been also increasing. The furniture arrangement takes big part of interior design. But sometimes people need professional helps for the proper and effective arrangement of furniture which makes self-interior design harder. Often, people share information and advices about interior design through Social Network Service. Experts could help directly but this approach would have several issues: First, the detailed and accurate advice would be difficult because experts have to examine the space through 2D pictures or video. Second, they have to make an appointment which would cause inconvenience to both customer and expert. In this paper, the furniture arrangement application for real-time sharing of interior design based on virtual reality is proposed to make up for the weak points. People can precisely share information and advice about self-interior design through this application in realtime, reducing waste of time and energy.

Enhanced Fast Luma Adjustment for High Dynamic Range Television Broadcasting (고-휘도 텔레비전 방송을 위한 개선된 빠른 휘도 조절 기법)

  • Oh, Kyung Seok;Kim, Yong-Goo
    • Journal of Broadcast Engineering
    • /
    • v.23 no.2
    • /
    • pp.302-315
    • /
    • 2018
  • Highly non-linear electro-optical transfer function of the Perceptual Quantizer was approximated by a truncated Taylor series, resulting in a closed form solution for luma adjustment. This previous solution is fast and quite suitable for the hardware implementation of luma adjustment, but the approximation error becomes relatively large in the range of 600~3,900 cd/m2 linear light. In order to reduce such approximation error, we propose a new linear model, for which a correction is performed on the position and the slope of line based on the scope of approximation. In order to verify the approximation capability of the proposed linear model, a comparative study on the luma adjustment schemes was conducted using various high dynamic range test video sequences. Via the comparative study, we identified a significant performance enhancement over the previous fast luma adjustment scheme, where a 4.65dB of adjusted luma t-PSNR gain was obtained for a test sequence having a large portion of saturated color pixels.

A Rate Control Algorithm of MPEG-2 Video Encoding Based Target Bit Matching at Scene Changes (장면전환 발생시 예상 비트 조정을 통한 MPEG-2 비디오 부호화 비트율 제어 알고리즘)

  • Moon Ho-seok;Park Sang-sung;Sohn Myung-ho;Jang Dong-sik
    • Journal of KIISE:Software and Applications
    • /
    • v.31 no.12
    • /
    • pp.1621-1627
    • /
    • 2004
  • The decrease of visual quality at scene change occurs when the difference between the amount of target bits and actual coding is high. Especially, scene change at the P-Picture can lead to severely degrade visual qualities at itself and the pictures referencing it. In this paper, under the occurrence of scene change, we propose a new method, based on the analysis of existing inaccurate bits allocation, to improve the visual qualities of scene-changed and following pictures. The method allocates extra bits to scene-changed Picture and changes them upto the level of the complexity of intra picture. Also, the method changes target bits of following pictures upto the complexity of picture prior to the scene change. Computer simulation shows that the proposed method has improved 0.5-1.2dB higher than TM5 method in terms of PSNR.

A Fast Half Pixel Motion Estimation Method based on the Correlations between Integer pixel MVs and Half pixel MVs (정 화소 움직임 벡터와 반 화소 움직임 벡터의 상관성을 이용한 빠른 반 화소 움직임 추정 기법)

  • Yoon HyoSun;Lee GueeSang
    • The KIPS Transactions:PartB
    • /
    • v.12B no.2 s.98
    • /
    • pp.131-136
    • /
    • 2005
  • Motion Estimation (ME) has been developed to remove redundant data contained in a sequence of image. And ME is an important part of video encoding systems, since it can significantly affect the qualify of an encoded sequences. Generally, ME consists of two stages, the integer pixel motion estimation and the half pixel motion estimation. Many methods have been developed to reduce the computational complexity at the integer pixel motion estimation. However, the studies are needed at the half pixel motion estimation to reduce the complexity. In this paper, a method based on the correlations between integer pixel motion vectors and half pixel motion vectors is proposed for the half pixel motion estimation. The proposed method has less computational complexity than the full half pixel search method (FHSM) that needs the bilinear interpolation of half pixels and examines nine half pixel points to the find the half pixel motion vector. Experimental results show that the speedup improvement of the proposed method over FHSM can be up to $2.5\~80$ times faster and the image quality degradation is about to $0.07\~0.69(dB)$.

Attention-Based Heart Rate Estimation using MobilenetV3

  • Yeo-Chan Yoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.12
    • /
    • pp.1-7
    • /
    • 2023
  • The advent of deep learning technologies has led to the development of various medical applications, making healthcare services more convenient and effective. Among these applications, heart rate estimation is considered a vital method for assessing an individual's health. Traditional methods, such as photoplethysmography through smart watches, have been widely used but are invasive and require additional hardware. Recent advancements allow for contactless heart rate estimation through facial image analysis, providing a more hygienic and convenient approach. In this paper, we propose a lightweight methodology capable of accurately estimating heart rate in mobile environments, using a specialized 2-channel network structure based on 2D convolution. Our method considers both subtle facial movements and color changes resulting from blood flow and muscle contractions. The approach comprises two major components: an Encoder for analyzing image features and a regression layer for evaluating Blood Volume Pulse. By incorporating both features simultaneously our methodology delivers more accurate results even in computing environments with limited resources. The proposed approach is expected to offer a more efficient way to monitor heart rate without invasive technology, particularly well-suited for mobile devices.

Investigation of Power Saving Efficiency for the OFDM Based Multimedia Communication Terminal (OFDM 기반 광대역 멀티미디어 단말의 전력절감 효율 분석에 관한 연구)

  • Moon, Jae-Pil;Lee, Eun-Seo;Kim, Dong-Hwan;Lee, Jae-Sik;Chang, Tae-Gyu
    • Proceedings of the IEEK Conference
    • /
    • 2005.11a
    • /
    • pp.155-158
    • /
    • 2005
  • An invesitigation on power consumption of a mobile multimedia system using OFDM and MDVS technique is reported here. Analysis and simulation are performed to find the significances of proposed Microscopic Dynamic Voltage Scaling(MDVS) tehnique[4] on digital processor in terms of power saving. A study is also made to show power reduction in mobile multimedia system by incorporating OFDM modulation scheme in RF front-end. Finally, overall power consumption by functionally distinguished blocks ie. RF front-end, digital processor and human interface unit is shown here. Total power consumption is 8.2W for 2Mbps SD-quality WCDMA multimedia video service - the power consumption of digital processor is 3.9W(48%), the power consumption of RF front-end is 3.2W (36%), and the power consumption of interface is 1.8W(16%). Power saving of applying purposed MDVS technique is 35% in digital processor, and power saving of OFDM technique is 10-12dB in RF front-end.

  • PDF

Efficient Real-time Multimedia Streaming System Using Partial Transport Stream for IPTV Services

  • Lee, Eun-Jo;Park, Sung-Kwon
    • Journal of Ubiquitous Convergence Technology
    • /
    • v.2 no.2
    • /
    • pp.88-96
    • /
    • 2008
  • IPTV Content delivery systems over wired networks confront scalability problems due to their high network bandwidth requirement for real-time services. Especially, VoD service provides Trick Mode features such as pause, fast forward and similar operations. However, Trick Mode services are delivered by the method of unicast only for controlling of the stream. With a point of views, this paper propose a new real-time multimedia streaming architecture over IP Networks, which tries to achieve bandwidth efficiency and supporting for mass clients better than traditional unicast services. The proposed methods divide the Transport Stream into a series of segments. After that, this divided partial Transport Stream makes multicast streaming periodically. Meanwhile Set-top Box of a client makes a rearrangement orderly by using Presentation Time Stamp field from the served Transport Stream packets. While the current Transport Stream segment is playing, it should be guaranteed that the next segment is downloaded on time. Consequently, the original video content can be played out continuously. The detail introduction of a new real-time multimedia streaming system with analysis and simulation follows as below.

  • PDF