• Title/Summary/Keyword: Mean shift segmentation

Search Result 46, Processing Time 0.027 seconds

A design of MPEG-4 video object segmentation using color/motion information (칼라/움직임 정보를 이용한 MPEG-4 비디오 객체 분할 설계)

  • 김준기;이호석
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2000.10b
    • /
    • pp.206-208
    • /
    • 2000
  • 본 논문은 칼라 정보와 움직임 정보를 이용한 객체 분할 기법의 설계에 대하여 소개한다. 객체 분할 알고리즘은 L*u*v 공간의 칼라 특성과 움직임 특성을 결합하여 설계하였다. 즉 공간 분할은 mean shift 칼라 클러스터링 알고리즘(color clustering algorithm)을 사용하여 중심 칼라 영역에 따라 동일한 칼라 지역으로 통합한다. 시간 분할은 움직임 검출을 위하여 affine six parameter 움직임 모델과 optical flow equation를 이용하여 움직임이 발생한 부분을 검출한다. 다음에 공간 분할과 시간 분할에 따라 결과를 통합하고 MAD(mean absolute difference)를 사용하여 객체를 추출하는 알고리즘을 설계하였다.

  • PDF

Video-based Stained Glass

  • Kang, Dongwann;Lee, Taemin;Shin, Yong-Hyeon;Seo, Sanghyun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.7
    • /
    • pp.2345-2358
    • /
    • 2022
  • This paper presents a method to generate stained-glass animation from video inputs. The method initially segments an input video volume into several regions considered as fragments of glass by mean-shift segmentation. However, the segmentation predominantly results in over-segmentation, causing several tiny segments in a highly textured area. In practice, assembling significantly tiny or large glass fragments is avoided to ensure architectural stability in stained glass manufacturing. Therefore, we use low-frequency components in the segmentation to prevent over-segmentation and subdivide segmented regions that are oversized. The subdividing must be coherent between adjacent frames to prevent temporal artefacts, such as flickering and the shower door effect. To temporally subdivide regions coherently, we obtain a panoramic image from the segmented regions in input frames, subdivide it using a weighted Voronoi diagram, and thereafter project the subdivided regions onto the input frames. To render stained glass fragment for each coherent region, we determine the optimal match glass fragment for the region from a dataset consisting of real stained-glass fragment images and transfer its color and texture to the region. Finally, applying lead came at the boundary of the regions in each frame yields temporally coherent stained-glass animation.

Design of a Recognizing System for Vehicle's License Plates with English Characters

  • Xing, Xiong;Choi, Byung-Jae;Chae, Seog;Lee, Mun-Hee
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.9 no.3
    • /
    • pp.166-171
    • /
    • 2009
  • In recent years, video detection systems have been implemented in various infrastructures such as airport, public transportation, power generation system, water dam and so on. Recognizing moving objects in video sequence is an important problem in computer vision, with applications in several fields, such as video surveillance and target tracking. Segmentation and tracking of multiple vehicles in crowded situations is made difficult by inter-object occlusion. In the system described in this paper, the mean shift algorithm is firstly used to filter and segment a color vehicle image in order to get candidate regions. These candidate regions are then analyzed and classified in order to decide whether a candidate region contains a license plate or not. And then some characters in the license plate is recognized by using the fuzzy ARTMAP neural network, which is a relatively new architecture of the neural network family and has the capability to learn incrementally unlike the conventional BP network. We finally design a license plate recognition system using the mean shift algorithm and fuzzy ARTMAP neural network and show its performance via some computer simulations.

An Effective Framework for Contented-Based Image Retrieval with Multi-Instance Learning Techniques

  • Peng, Yu;Wei, Kun-Juan;Zhang, Da-Li
    • Journal of Ubiquitous Convergence Technology
    • /
    • v.1 no.1
    • /
    • pp.18-22
    • /
    • 2007
  • Multi-Instance Learning(MIL) performs well to deal with inherently ambiguity of images in multimedia retrieval. In this paper, an effective framework for Contented-Based Image Retrieval(CBIR) with MIL techniques is proposed, the effective mechanism is based on the image segmentation employing improved Mean Shift algorithm, and processes the segmentation results utilizing mathematical morphology, where the goal is to detect the semantic concepts contained in the query. Every sub-image detected is represented as a multiple features vector which is regarded as an instance. Each image is produced to a bag comprised of a flexible number of instances. And we apply a few number of MIL algorithms in this framework to perform the retrieval. Extensive experimental results illustrate the excellent performance in comparison with the existing methods of CBIR with MIL.

  • PDF

Human Tracking and Body Silhouette Extraction System for Humanoid Robot (휴머노이드 로봇을 위한 사람 검출, 추적 및 실루엣 추출 시스템)

  • Kwak, Soo-Yeong;Byun, Hye-Ran
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.6C
    • /
    • pp.593-603
    • /
    • 2009
  • In this paper, we propose a new integrated computer vision system designed to track multiple human beings and extract their silhouette with an active stereo camera. The proposed system consists of three modules: detection, tracking and silhouette extraction. Detection was performed by camera ego-motion compensation and disparity segmentation. For tracking, we present an efficient mean shift based tracking method in which the tracking objects are characterized as disparity weighted color histograms. The silhouette was obtained by two-step segmentation. A trimap is estimated in advance and then this was effectively incorporated into the graph cut framework for fine segmentation. The proposed system was evaluated with respect to ground truth data and it was shown to detect and track multiple people very well and also produce high quality silhouettes. The proposed system can assist in gesture and gait recognition in field of Human-Robot Interaction (HRI).

Adaptive Segmentation Approach to Extraction of Road and Sky Regions (도로와 하늘 영역 추출을 위한 적응적 분할 방법)

  • Park, Kyoung-Hwan;Nam, Kwang-Woo;Rhee, Yang-Won;Lee, Chang-Woo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.7
    • /
    • pp.105-115
    • /
    • 2011
  • In Vision-based Intelligent Transportation System(ITS) the segmentation of road region is a very basic functionality. Accordingly, in this paper, we propose a region segmentation method using adaptive pattern extraction technique to segment road regions and sky regions from original images. The proposed method consists of three steps; firstly we perform the initial segmentation using Mean Shift algorithm, the second step is the candidate region selection based on a static-pattern matching technique and the third is the region growing step based on a dynamic-pattern matching technique. The proposed method is able to get more reliable results than the classic region segmentation methods which are based on existing split and merge strategy. The reason for the better results is because we use adaptive patterns extracted from neighboring regions of the current segmented regions to measure the region homogeneity. To evaluate advantages of the proposed method, we compared our method with the classical pattern matching method using static-patterns. In the experiments, the proposed method was proved that the better performance of 8.12% was achieved when we used adaptive patterns instead of static-patterns. We expect that the proposed method can segment road and sky areas in the various road condition in stable, and take an important role in the vision-based ITS applications.

Multi-scale Image Segmentation Using MSER and its Application (MSER을 이용한 다중 스케일 영상 분할과 응용)

  • Lee, Jin-Seon;Oh, Il-Seok
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.3
    • /
    • pp.11-21
    • /
    • 2014
  • Multi-scale image segmentation is important in many applications such as image stylization and medical diagnosis. This paper proposes a novel segmentation algorithm based on MSER(maximally stable extremal region) which captures multi-scale structure and is stable and efficient. The algorithm collects MSERs and then partitions the image plane by redrawing MSERs in specific order. To denoise and smooth the region boundaries, hierarchical morphological operations are developed. To illustrate effectiveness of the algorithm's multi-scale structure, effects of various types of LOD control are shown for image stylization. The proposed technique achieves this without time-consuming multi-level Gaussian smoothing. The comparisons of segmentation quality and timing efficiency with mean shift-based Edison system are presented.

Contrast Enhancement based on Gaussian Region Segmentation (가우시안 영역 분리 기반 명암 대비 향상)

  • Shim, Woosung
    • Journal of Broadcast Engineering
    • /
    • v.22 no.5
    • /
    • pp.608-617
    • /
    • 2017
  • Methods of contrast enhancement have problem such as side effect of over-enhancement with non-gaussian histogram distribution, tradeoff enhancement efficiency against brightness preserving. In order to enhance contrast at various histogram distribution, segmentation to region with gaussian distribution and then enhance contrast each region. First, we segment an image into several regions using GMM(Gaussian Mixture Model)fitting by that k-mean clustering and EM(Expectation-Maximization) in $L^*a^*b^*$ color space. As a result region segmentation, we get the region map and probability map. Then we apply local contrast enhancement algorithm that mean shift to minimum overlapping of each region and preserve brightness histogram equalization. Experiment result show that proposed region based contrast enhancement method compare to the conventional method as AMBE(AbsoluteMean Brightness Error) and AE(Average Entropy), brightness is maintained and represented detail information.

Adaptive Region Segmentation using Static/Dynamic Pattern Matching (정적/동적 패턴을 이용한 적응적 영역 분할 방법)

  • Park, Kyoung-Hwan;Lee, Chi-Won;Lee, Chang-Woo
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2010.07a
    • /
    • pp.145-148
    • /
    • 2010
  • 본 논문에서 우리는 도로 영역과 하늘 영역, 그리고 도로와 하늘이 아닌 나머지 영역으로 분할하기 위해 동적인(dynamic) 패턴을 이용한 적응적인(adaptive) 병합 방법을 제안한다. 원본영상에서 Mean Shift 알고리즘과 라벨링(Labeling)을 수행하고 영역을 과분할 한다. 컬러에 의해서 도로와 하늘영역이 검출되지 못하는 영역을 위해서 도로 영역과 하늘 영역에서 동적인 패턴 추출한 후 매칭을 통해 유사 영역을 병합한다. 이것은 도로와 하늘의 정보를 현재 환경에서 적응적으로 추출하는 방법이다. 실험에서 정적인(static) 패턴을 사용해서 병합하는 방법과 동적인 패턴을 사용해서 병합하는 방법을 비교하였다. 그 결과, 동적인 패턴을 사용하였을 때 8.12%의 향상된 성능을 보였다.

  • PDF

Image-based Soft Drink Type Classification and Dietary Assessment System Using Deep Convolutional Neural Network with Transfer Learning

  • Rubaiya Hafiz;Mohammad Reduanul Haque;Aniruddha Rakshit;Amina khatun;Mohammad Shorif Uddin
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.2
    • /
    • pp.158-168
    • /
    • 2024
  • There is hardly any person in modern times who has not taken soft drinks instead of drinking water. The rate of people taking soft drinks being surprisingly high, researchers around the world have cautioned from time to time that these drinks lead to weight gain, raise the risk of non-communicable diseases and so on. Therefore, in this work an image-based tool is developed to monitor the nutritional information of soft drinks by using deep convolutional neural network with transfer learning. At first, visual saliency, mean shift segmentation, thresholding and noise reduction technique, collectively known as 'pre-processing' are adopted to extract the location of drinks region. After removing backgrounds and segment out only the desired area from image, we impose Discrete Wavelength Transform (DWT) based resolution enhancement technique is applied to improve the quality of image. After that, transfer learning model is employed for the classification of drinks. Finally, nutrition value of each drink is estimated using Bag-of-Feature (BoF) based classification and Euclidean distance-based ratio calculation technique. To achieve this, a dataset is built with ten most consumed soft drinks in Bangladesh. These images were collected from imageNet dataset as well as internet and proposed method confirms that it has the ability to detect and recognize different types of drinks with an accuracy of 98.51%.