• Title/Summary/Keyword: video representation

Search Result 194, Processing Time 0.024 seconds

Efficient Compression Technique of Multi-view Image with Color and Depth Information by Layered Depth Image Representation (계층적 깊이 영상 표현에 의한 컬러와 깊이 정보를 포함하는 다시점 영상에 대한 효율적인 압축기술)

  • Lim, Joong-Hee;Shin, Jong-Hong;Jee, Inn-Ho
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.2C
    • /
    • pp.186-193
    • /
    • 2009
  • Multi-view video is necessary to develop a new compression encoding technique for storage and transmission, because of a huge amount of data. Layered depth image is an efficient representation method of multi-view video data. This method makes a data structure that is synthesis of multi-view color and depth image. This paper proposed enhanced compression method by presentation of efficient layered depth image using real distance comparison, solution of overlap problem, and YCrCb color transformation. In experimental results, confirmed high compression performance and good reconstructed image.

Silhouette-Edge-Based Descriptor for Human Action Representation and Recognition

  • Odoyo, Wilfred O.;Choi, Jae-Ho;Moon, In-Kyu;Cho, Beom-Joon
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.2
    • /
    • pp.124-131
    • /
    • 2013
  • Extraction and representation of postures and/or gestures from human activities in videos have been a focus of research in this area of action recognition. With various applications cropping up from different fields, this paper seeks to improve the performance of these action recognition machines by proposing a shape-based silhouette-edge descriptor for the human body. Information entropy, a method to measure the randomness of a sequence of symbols, is used to aid the selection of vital key postures from video frames. Morphological operations are applied to extract and stack edges to uniquely represent different actions shape-wise. To classify an action from a new input video, a Hausdorff distance measure is applied between the gallery representations and the query images formed from the proposed procedure. The method is tested on known public databases for its validation. An effective method of human action annotation and description has been effectively achieved.

Real-Time Apartment Building Detection and Tracking with AdaBoost Procedure and Motion-Adjusted Tracker

  • Hu, Yi;Jang, Dae-Sik;Park, Jeong-Ho;Cho, Seong-Ik;Lee, Chang-Woo
    • ETRI Journal
    • /
    • v.30 no.2
    • /
    • pp.338-340
    • /
    • 2008
  • In this letter, we propose a novel approach to detecting and tracking apartment buildings for the development of a video-based navigation system that provides augmented reality representation of guidance information on live video sequences. For this, we propose a building detector and tracker. The detector is based on the AdaBoost classifier followed by hierarchical clustering. The classifier uses modified Haar-like features as the primitives. The tracker is a motion-adjusted tracker based on pyramid implementation of the Lukas-Kanade tracker, which periodically confirms and consistently adjusts the tracking region. Experiments show that the proposed approach yields robust and reliable results and is far superior to conventional approaches.

  • PDF

Multiple Description Coding Using Directional Discrete Cosine Transform

  • Lama, Ramesh Kumar;Kwon, Goo-Rak
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.4
    • /
    • pp.293-297
    • /
    • 2013
  • Delivery of high quality video over a wide area network with large number of users poses great challenges for the video communication system. To ensure video quality, multiple descriptions have recently attracted various attention as a way of encoding and visual information delivery over wireless network. We propose a new efficient multiple description coding (MDC) technique. Quincunx lattice sub-sampling is used for generating multiple descriptions of an image. In this paper, we propose the application of a directional discrete cosine transform (DCT) to a sub-sampled quincunx lattice to create an MDC representation. On the decoder side, the image is decoded from the received side information. If all the descriptions arrive successfully, the image is reconstructed by combining the descriptions. However, if only one side description is received, decoding is executed using an interpolation process. The experimental results show that such the directional DCT can achieve a better coding gain as well as energy packing efficiency than the conventional DCT with re-alignment.

A HIGH PRECISION CAMERA OPERATING PARAMETER MEASUREMENT SYSTEM AND ITS APPLICATION TO IMAGE MOTION INFERRING

  • Wentao-Zheng;Yoshiaki-Shishikui;Yasuaki-Kanatsugu;Yutaka-Tanaka
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1999.06a
    • /
    • pp.77-82
    • /
    • 1999
  • Information about camera operating such as zoom, focus, pan, tilt and tracking is useful not only for efficient video coding, but also for content-based video representation. A camera operating parameter measurement system designed specifically for these applications is therefore developed. This system, implemented in real time and synchronized with the video signal, measures the precise camera operating parameters. We calibrated the camera lens using a camera model that accounts for redial lens distortion. The system is then applied to infer image motion from pan and tilt operating parameters. The experimental results show that the inferred motion coincides with the actual motion very well, with an error of less than 0.5 pixel even for large motion up to 80 pixels.

An Energy-aware Buffer-based Video Streaming Optimization Scheme (에너지 효율적인 버퍼 기반 비디오 스트리밍 최적화 기법)

  • Kang, Young-myoung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.10
    • /
    • pp.1563-1566
    • /
    • 2022
  • Video streaming applications such as Netflix and Youtube are widely used in our daily life. A DASH based streaming client exploits adaptive bit rate (ABR) method to choose the most appropriate video source representation that the network can support. In this paper we propose a novel energy-aware ABR scheme that adds the ability to monitor energy efficiency in addition to the linear quadratic regulator algorithm we previously introduced. Our trace-driven simulation studies show that our proposed scheme mitigates and shortens re-buffering, resulting in energy savings of mobile devices while preserving the similar QoE compared to the state-of-the-art ABR algorithms.

Efficient Representation of Patch Packing Information for Immersive Video Coding (몰입형 비디오 부호화를 위한 패치 패킹 정보의 효율적인 표현)

  • Lim, Sung-Gyun;Yoon, Yong-Uk;Kim, Jae-Gon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • fall
    • /
    • pp.126-128
    • /
    • 2021
  • MPEG(Moving Picture Experts Group) 비디오 그룹은 사용자에게 움직임 시차(motion parallax)를 제공하면서 3D 공간 내에서 임의의 위치와 방향의 시점(view)을 렌더링(rendering) 가능하게 하는 6DoF(Degree of Freedom)의 몰입형 비디오 부호화 표준인 MIV(MPEG Immersive Video) 표준화를 진행하고 있다. MIV 표준화 과정에서 참조 SW 인 TMIV(Test Model for Immersive Video)도 함께 개발하고 있으며 점진적으로 부호화 성능을 개선하고 있다. TMIV 는 여러 뷰로 구성된 방대한 크기의 6DoF 비디오를 압축하기 위하여 입력되는 뷰 비디오들 간의 중복성을 제거하고 남은 영역들은 각각 개별적인 패치(patch)로 만든 후 아틀라스에 패킹(packing)하여 부호화되는 화소수를 줄인다. 이때 아틀라스 비디오에 패킹된 패치들의 위치 정보를 메타데이터로 압축 비트열과 함께 전송하게 되며, 본 논문에서는 이러한 패킹 정보를 보다 효율적으로 표현하기 위한 방법을 제안한다. 제안방법은 기존 TMIV10.0 에 비해 약 10%의 메타데이터를 감소시키고 종단간 BD-rate 성능을 0.1% 향상시킨다.

  • PDF

VLSI Architecture for Video Object Boundary Enhancement (비디오객체의 경계향상을 위한 VLSI 구조)

  • Kim, Jinsang-
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.11A
    • /
    • pp.1098-1103
    • /
    • 2005
  • The edge and contour information are very much appreciated by the human visual systems and are responsible for our perceptions and recognitions. Therefore, if edge information is integrated during extracting video objects, we can generate boundaries of oects closer to human visual systems for multimedia applications such as interaction between video objects, object-based coding, and representation. Most of object extraction methods are difficult to implement real-time systems due to their iterative and complex arithmetic operations. In this paper, we propose a VLSI architecture integrating edge information to extract video objects for precisely located object boundaries. The proposed architecture can be easily implemented into hardware due to simple arithmetic operations. Also, it can be applied to real-time object extraction for object-oriented multimedia applications.

A Study on Image Representation of Bisexual Lighting (바이섹슈얼 라이팅(Bisexual Lighting)의 영상 표현 연구)

  • QIAO, YINA
    • Trans-
    • /
    • v.11
    • /
    • pp.119-142
    • /
    • 2021
  • Video was a cultural practice based on image. The audience longs to experience new things, not everyday things through by video images. There are many components of the image, but among them, color, a visual representation, plays a big role. Since the advent of color films, color has constantly evolved as an important component of visual art and has become an important role in innovative visual art design. According to film history data, filmmakers were interested in color since the film was created in 1895, but in the early stages of film development, film colors were only black and white. Because these two colors no longer satisfy viewers, more natural colors began to emerge from the film as it was colored. However, with the development of historical paintings, the lack of artistic creation and the public's level increased, making people more active in using colors because simple reproduction of natural colors alone does not satisfy people. The colors in the video are both techniques of expression and can be understood by mind and thought. It is also an indication that colors do not just exist, but they work strongly on human psychology. Now people are so motivated by repetitive and unimportant information that they find that the human intuitive system simplifies the information they receive unconsciously that they have certain customs and characteristics when they see things. Color is part of the film language, or color language can express the film's ideological themes or portray vivid characters in the film, and people are receiving more intuitive messages. This study analyzed the basic color components of bisexual lighting, namely, pink, blue, and purple, and analyzed how human psychology is affected through color, combining the scenes from the video. The purpose of this paper is to explore what color language bisexual lighting is expressed using color properties in images and how bisexual lighting interacts with human psychology through color.

Similarity Search Algorithm Based on Hyper-Rectangular Representation of Video Data Sets (비디오 데이터 세트의 하이퍼 사각형 표현에 기초한 비디오 유사성 검색 알고리즘)

  • Lee, Seok-Lyong
    • The KIPS Transactions:PartD
    • /
    • v.11D no.4
    • /
    • pp.823-834
    • /
    • 2004
  • In this research, the similarity search algorithms are provided for large video data streams. A video stream that consists of a number of frames can be expressed by a sequence in the multidimensional data space, by representing each frame with a multidimensional vector By analyzing various characteristics of the sequence, it is partitioned into multiple video segments and clusters which are represented by hyper-rectangles. Using the hyper-rectangles of video segments and clusters, similarity functions between two video streams are defined, and two similarity search algorithms are proposed based on the similarity functions algorithms by hyper-rectangles and by representative frames. The former is an algorithm that guarantees the correctness while the latter focuses on the efficiency with a slight sacrifice of the correctness Experiments on different types of video streams and synthetically generated stream data show the strength of our proposed algorithms.