Energy Minimization Based Semantic Video Object Extraction

  • Kim, Dong-Hyun (School of Electrical and Electronic Engineering, Yonsei University) ;
  • Choi, Sung-Hwan (School of Electrical and Electronic Engineering, Yonsei University) ;
  • Kim, Bong-Joe (School of Electrical and Electronic Engineering, Yonsei University) ;
  • Shin, Hyung-Chul (School of Electrical and Electronic Engineering, Yonsei University) ;
  • Sohn, Kwang-Hoon (School of Electrical and Electronic Engineering, Yonsei University)
  • 김동현 (연세대학교 전기전자공학과) ;
  • 최성환 (연세대학교 전기전자공학과) ;
  • 김봉조 (연세대학교 전기전자공학과) ;
  • 신형철 (연세대학교 전기전자공학과) ;
  • 손광훈 (연세대학교 전기전자공학과)
  • Published : 2010.07.08

Abstract

In this paper, we propose a semi-automatic method for semantic video object extraction which extracts meaningful objects from an input sequence with one correctly segmented training image. Given one correctly segmented image acquired by the user's interaction in the first frame, the proposed method automatically segments and tracks the objects in the following frames. We formulate the semantic object extraction procedure as an energy minimization problem at the fragment level instead of pixel level. The proposed energy function consists of two terms: data term and smoothness term. The data term is computed by considering patch similarity, color, and motion information. Then, the smoothness term is introduced to enforce the spatial continuity. Finally, iterated conditional modes (ICM) optimization is used to minimize energy function in a globally optimal manner. The proposed semantic video object extraction method provides faithful results for various types of image sequences.

Keywords