• Title/Summary/Keyword: Video sequence

Search Result 507, Processing Time 0.028 seconds

Video Image Mosaicing Technique Using 3 Dimensional Multi Base Lines (3차원 다중 기선을 사용만 비데오 영상 모자이크 기술)

  • 전재춘;서용철
    • Korean Journal of Remote Sensing
    • /
    • v.20 no.2
    • /
    • pp.125-137
    • /
    • 2004
  • In case of using image sequence taken from a moving camera along a road in an urban area, general video mosaicing technique based on a single baseline cannot create 2-D image mosaics. To solve the drawback, this paper proposed a new image mosaicing technique through 3-D multi-baselines that can create image mosaics in 3-D space. The core of the proposed method is that each image frame has a dependent baseline, an equation of first order, calculated by using ground control point (GCP) of optical flows. The proposed algorithm consists of 4 steps: calculation of optical flows using hierarchical strategy, calculation of camera exterior orientation, determination of multi-baselines, and seamless image mosaics. This paper realized and showed the proposed algorithm that can create efficient image mosaics in 3-D space from real image sequence.

New Prefiltering Methods based on a Histogram Matching to Compensate Luminance and Chrominance Mismatch for Multi-view Video (다시점 비디오의 휘도 및 색차 성분 불일치 보상을 위한 히스토그램 매칭 기반의 전처리 기법)

  • Lee, Dong-Seok;Yoo, Ji-Sang
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.6
    • /
    • pp.127-136
    • /
    • 2010
  • In multi-view video, illumination disharmony between neighboring views can occur on account of different location of each camera and imperfect camera calibration, and so on. Such discrepancy can be the cause of the performance decrease of multi-view video coding by mismatch of inter-view prediction which refer to the pictures obtained from the neighboring views at the same time. In this paper, we propose an efficient histogram-based prefiltering algorithm to compensate mismatches between the luminance and chrominance components in multi-view video for improving its coding efficiency. To compensate illumination variation efficiently, all camera frames of a multi-view sequence are adjusted to a predefined reference through the histogram matching. A Cosited filter that is used for chroma subsampling in many video encoding schemes is applied to each color component prior to histogram matching to improve its performance. The histogram matching is carried out in the RGB color space after color space converting from YCbCr color space. The effective color conversion skill that has respect to direction of edge and range of pixel value in an image is employed in the process. Experimental results show that the compression ratio for the proposed algorithm is improved comparing with other methods.

An Intelligent Display Scheme of Soccer Video for Multimedia Mobile Devices (멀티미디어 이동형 단말을 위한 축구경기 비디오의 지능적 디스플레이 방법)

  • Seo Kee-Won;Kim Chang-Ick
    • Journal of Broadcast Engineering
    • /
    • v.11 no.2 s.31
    • /
    • pp.207-221
    • /
    • 2006
  • A fully automatic and computationally efficient method is proposed for intelligent display of soccer video on small multimedia mobile devices. The rapid progress of the multimedia signal processing has contributed to the extensive use of multimedia devices with a small LCD panel. With these emerging small mobile devices, the video sequences captured for standard- or HDTV broadcasting may give the small-display-viewers uncomfortable experiences in understanding what is happening in a scene. For instance, in a soccer video sequence taken by a long-shot camera technique, the tiny objects (e.g., soccer ball and players) may not be clearly viewed on the small LCD panel. Thus, an intelligent display technique is needed for small-display-viewers. To this end, one of the key technologies is to determine region of interest (ROI), which is a part of the scene that viewers pay more attention to than other regions. In this paper, the focus is on soccer video display for mobile devices. Instead of taking visual saliency into account, we take domain-specific approach to exploit the characteristics of the soccer video. The proposed scheme includes three modules; ground color learning, shot classification, and ROI determination. The experimental results show the propose scheme is capable of intelligent video display on mobile devices.

An Efficient Face Region Detection for Content-based Video Summarization (내용기반 비디오 요약을 위한 효율적인 얼굴 객체 검출)

  • Kim Jong-Sung;Lee Sun-Ta;Baek Joong-Hwan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.7C
    • /
    • pp.675-686
    • /
    • 2005
  • In this paper, we propose an efficient face region detection technique for the content-based video summarization. To segment video, shot changes are detected from a video sequence and key frames are selected from the shots. We select one frame that has the least difference between neighboring frames in each shot. The proposed face detection algorithm detects face region from selected key frames. And then, we provide user with summarized frames included face region that has an important meaning in dramas or movies. Using Bayes classification rule and statistical characteristic of the skin pixels, face regions are detected in the frames. After skin detection, we adopt the projection method to segment an image(frame) into face region and non-face region. The segmented regions are candidates of the face object and they include many false detected regions. So, we design a classifier to minimize false lesion using CART. From SGLD matrices, we extract the textual feature values such as Inertial, Inverse Difference, and Correlation. As a result of our experiment, proposed face detection algorithm shows a good performance for the key frames with a complex and variant background. And our system provides key frames included the face region for user as video summarized information.

A Study on the Sequence Analysis Technique of Urban Landscape Color and Urban Color Characteristics in accordance with Spatial Openness - Focusing on the View of the Daegu Monorail - (도시 경관색채의 시퀀스 분석기법과 공간 개방도에 따른 도시색채 특성연구 - 대구광역시 지상철 조망을 중심으로 -)

  • Koo, Min-Ah
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.44 no.6
    • /
    • pp.120-136
    • /
    • 2016
  • This study, views the color of scenery not as a static state, but rather as a continuous sequence of perceptions that incorporates the concept of time. This study derived techniques to quantitatively analyze the flow and data from this sequence. By utilizing this, urban color trends can be based on openness. This is very close to what would be experienced by an actual viewer: it extracted color data and visual amount from frames at 2-second intervals by shooting a video of the color sequence of the city as seen from both the left and right sides from the inside of the monorail (line 3 of the Daegu urban railway). These images were classified by color group, brightness, chroma, high chroma distribution derived techniques such as openness of space, brightness level, clarity level, high-chroma distribution and code, advantage of visual amount, dominant factor exposure, hot and cold color image and dynamic of sequence rhythm. During the derived sequence, the data determines the openness in the visual amount of sky and it was found that the tendency of the colors of the city was opening regression analysis. The more colorful the city is opening the brightness is lowered, the chroma increased slightly, cold colors significantly increased, which also had a very deep relationship with Lynch enclosed proportion, color change of the city trends through the actual scenery could grasp in more detail.

An Efficient Smoothing Algorithm Using the Change of Frame Sequence in GOP (GOP를 구성하는 프레임들의 순서 변경을 이용한 효율적인 스무딩 알고리즘)

  • Lee, Myoun-Jae
    • Journal of Korea Game Society
    • /
    • v.6 no.2
    • /
    • pp.51-60
    • /
    • 2006
  • Smoothing is a transmission plan where variable rate video data is converted to a constant bit rate stream. Among them are CBA, MCBA, MVBA, PCRTT and others. But, in these algorithm, a transmission plan is made in according to stored frame sequence in these algorithms. In case that the number of bytes in frames in GOP differs greatly each other, this may cause unnecessary transmission rate changes and may require high transmission rates abruptly when frame's byte is large. In result, it is difficult to use efficient network resource. In this paper, we proposed a smoothing algorithm that find the optimal frame sequence in short time by using backtracking method and smoothing's structure for the proposed smoothing algorithm. This algorithm decides the sequence of frames which requires the lowest variance of frame's bytes in GOP and make a transmission plan. In order to show the performance, we compared with MVBA algorithm by various evaluation factors such as the number of rate changes, peak rate, rate variability.

  • PDF

Raising Visual Experience of Soccer Video for Mobile Viewers (이동형 단말기 사용자를 위한 축구경기 비디오의 시청경험 향상 방법)

  • Ahn, Il-Koo;Ko, Jae-Seung;Kim, Won-Jun;Kim, Chang-Ick
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.13 no.3
    • /
    • pp.165-178
    • /
    • 2007
  • The recent progress in multimedia signal processing and transmission technologies has contributed to the extensive use of multimedia devices to watch sports games with small LCD panel. However, the most of video sequences are captured for normal viewing on standard TV or HDTV, for cost reasons, merely resized and delivered without additional editing. This may give the small-display-viewers uncomfortable experiences in understanding what is happening in a scene. For instance, in a soccer video sequence taken by a long-shot camera techniques, the tiny objects (e.g., soccer ball and players) may not be clearly viewed on the small LCD panel. Moreover, it is also difficult to recognize the contents of the scorebox which contains the elapsed time and scores. This renuires intelligent display technique to provide small-display-viewers with better experience. To this end, one of the key technologies is to determine region of interest (ROI) and display the magnified ROI on the screen, where ROI is a part of the scene that viewers pay more attention to than other regions. Examples include a region surrounding a ball in long-shot and a scorebox located in the comer of each frame. In this paper, we propose a scheme for raising viewing experiences of multimedia mobile device users. Instead of taking generic approaches utilizing visually salient features for extraction of ROI in a scene, we take domain-specific approach to exploit unique attributes of the soccer video. The proposed scheme consists of two modules: ROI determination and scorebox extraction. The experimental results show that the proposed scheme offers useful tools for intelligent video display on multimedia mobile devices.

Rate control to reduce bitrate fluctuation on HEVC

  • Yoo, Jonghun;Nam, Junghak;Ryu, Jiwoo;Sim, Donggyu
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.1 no.3
    • /
    • pp.152-160
    • /
    • 2012
  • This paper proposes a frame-level rate control algorithm for low delay video applications to reduce the fluctuations in the bitrate. The proposed algorithm minimizes the bitrate fluctuations in two ways with minimal coding loss. First, the proposed rate control applies R-Q model to all frames including the first frame of every group of pictures (GOP) except for the first one of a sequence. Conventional rate control algorithms do not use any R-Q models for the first frame of each GOP and do not estimate the generated-bit. An unexpected output rate result from the first frame affects the remainder of the pictures in the rate control. Second, a rate-distortion (R-D) cost is calculated regardless of the hierarchical coding structure for low bitrate fluctuations because the hierarchical coding structure controls the output bitrate in rate distortion optimization (RDO) process. The experimental results show that the average variance of per-frame bits with the proposed algorithm can reduce by approximately 33.8% with a delta peak signal-to-noise ratio (PSNR) degradation of 1.4dB for a "low-delay B" coding structure and by approximately 35.7% with a delta-PSNR degradation of 1.3dB for a "low-delay P" coding structure, compared to HM 8.0 rate control.

  • PDF

Implementation of a Multi-DSP Board for High-definition Video Signal Processing and a Real-time Tracking System for Objects in the Video Sequence (고해상도 영상처리에 적합한 다중 DSP 보드의 구현 및 비디오 영상 내 물체의 실시간 추적 시스템)

  • Jeong, Cheol-Jun;Kim, Jin-Yul;Lee, Cheol-Woo;Yang, Yoon-Gi
    • Proceedings of the KIEE Conference
    • /
    • 2008.04a
    • /
    • pp.113-114
    • /
    • 2008
  • 본 논문에서는 HD 비디오 영상 처리를 효과적으로 수행할 수 있는 다중 DSP 아키텍쳐를 제안하고 프로토타입 보드를 설계 제작하였다. 또한, 구현된 보드를 이용하여 비디오 영상 내 물체(얼굴)의 실시간 추석시스템을 구현하였다. 물체 추적 기법인 PF(Particle Filtering) 기법은 배경 클러터가 존재하는 환경에서도 강인하게 물체를 추적할 수 있지만 많은 수의 샘플을 사용하는 경우 필요한 계산량이 많아져 실시간 구현이 매우 어렵다는 문제점을 가지고 있다. 본 논문에서는 이러한 경우에도 실시간 추적이 가능하도록 병렬화된 PF 추적 방법을 제안하고 제작된 보드 상에 구현하였다. 구현된 병렬 처리 추적에서는 150개의 PF 샘플들을 5개의 슬레이브 DSP로 분산하여 컬러 유사도 기탄의 관측 확률을 계산하고 그 결과를 마스터 DSP에서 종합하여 추적의 정확도를 높이고자 하였다. 실험에는 $720{\times}480$ 픽셀 영상이 사용되었으며, 실험 결과 배경 클러터가 존재하는 경우에도 충분한 PF 샘플 수의 사용에 따라 대상 물체를 강인하게 추적하는 우수한 성능을 확인할 수 있었다.

  • PDF

A Study on the Tendencies of the Motion Graphic Expressions in the Title sequence (타이틀 시퀀스에서 모션그래픽의 표현경향에 관한 연구)

  • Jung, Hee-Jin;Na, Jun-Ki
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2006.11a
    • /
    • pp.195-199
    • /
    • 2006
  • In the field of video design, motion graphics tends to be realized to deliver a certain message properly incorporating basic motion graphics components such as space, form of expression, and time, and has continued to be used as a powerful communication tool. Many cases proved that title sequences were produced as a result of combination of a variety of quick images and effects with appropriately chosen sounds to meet the demands of audience. This study indicated that motion graphics began to be widely used as a more powerful video communication tool for title sequences and also applied for M-NET, DMB, CABLE TV, IPTV, and others.

  • PDF