• Title/Summary/Keyword: mean shift segmentation

Search Result 46, Processing Time 0.018 seconds

Detection of Multiple Salient Objects by Categorizing Regional Features

  • Oh, Kang-Han;Kim, Soo-Hyung;Kim, Young-Chul;Lee, Yu-Ra
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.1
    • /
    • pp.272-287
    • /
    • 2016
  • Recently, various and effective contrast based salient object detection models to focus on a single target have been proposed. However, there is a lack of research on detection of multiple objects, and also it is a more challenging task than single target process. In the multiple target problem, we are confronted by new difficulties caused by distinct difference between properties of objects. The characteristic of existing models depending on the global maximum distribution of data point would become a drawback for detection of multiple objects. In this paper, by analyzing limitations of the existing methods, we have devised three main processes to detect multiple salient objects. In the first stage, regional features are extracted from over-segmented regions. In the second stage, the regional features are categorized into homogeneous cluster using the mean-shift algorithm with the kernel function having various sizes. In the final stage, we compute saliency scores of the categorized regions using only spatial features without the contrast features, and then all scores are integrated for the final salient regions. In the experimental results, the scheme achieved superior detection accuracy for the SED2 and MSRA-ASD benchmarks with both a higher precision and better recall than state-of-the-art approaches. Especially, given multiple objects having different properties, our model significantly outperforms all existing models.

Image Retrieval Using Color & Spatial Distribution between Pixel Layers (Pixel layer 들 간의 색상 공간 분포에 따른 공간적 분포를 이용한 영상 검색)

  • An, Jaehyun;Ha, Seong Jong;Lee, Sang Hwa;Cho, Nam Ik
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.294-297
    • /
    • 2012
  • 본 논문에서는 컬러 영상의 검색을 위하여 영상을 색상 정보에 기반한 pixel layer (cluster)의 집합체로 모델링하고, 두 layer 간의 유사도를 각 layer 를 이루는 pixel 들의 색상 분포에 따른 공간적 분포를 이용하여 측정하는 기법을 제안한다. 먼저 pixel layering 단계에서는 HSV 색 공간에서 mean-shift clustering 알고리즘을 통해 초기 layer 들을 얻고, 비슷한 색상의 layer 들은 합쳐 영상의 soft segmentation 과 유사한 결과를 얻는다. 비교할 두 영상에서 pixel layering 을 한 후, 각 layer 를 이진화된 공간분포 지도로 형성하고 그 차이를 비교함으로써 유사도를 측정한다. 이 때, 사용하는 가중치로서 HSV 색 공간 분포의 비슷한 정도를 정의하는데, 이는 HSV 색 공간을 XYZ 의 3 차원 좌표로 설정하고, overlap 되는 pixel 수로 정의하였다. 본 논문에서 제안한 pixel layer 들 간의 색상 공간 분포에 따른 공간적 분포를 이용한 영상 검색 기법은 MPEG-7 에서 정의한 대표색상 기반의 영상 검색보다 우수한 성능을 보여주었다.

  • PDF

Color image segmentation by level set method (레벨셋 기법을 이용한 컬러 이미지 분할)

  • Yoo, Ju-Han;Jung, Moon-Ryul
    • Journal of the Korea Computer Graphics Society
    • /
    • v.18 no.2
    • /
    • pp.9-15
    • /
    • 2012
  • In this paper, we propose a method to segment a color image into several meaningful regions. We suppose that the meaningful region has a set of colors with high frequency in the color image. To find these colors, the color image is represented as several sets of color points in RGB space. And when we use the density of points defined in this method, color belonging to a dense region of color points in RGB space refers to the color that appeared frequently in the image. Eventually, we can find meaningful regions by looking for regions with high density of color points using our level set function in RGB space. However, if a meaningful region does not have a contiguous region of the sufficient size in the image, this is not a meaningful region but meaningless region. Thus, the pixels in the meaningless region are assigned to the biggest meaningful region belonging to its neighboring pixels in the color image. Our method divides the color image into meaningful regions by applying the density of color points to level set function in RGB space. This is different from the existing level set method that is defined only in 2D image.

Mapping Man-Made Levee Line Using LiDAR Data and Aerial Orthoimage (라이다 데이터와 항공 정사영상을 활용한 인공 제방선 지도화)

  • Choung, Yun-Jae;Park, Hyen-Cheol;Chung, Youn-In;Jo, Myung-Hee
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.14 no.1
    • /
    • pp.84-93
    • /
    • 2011
  • Levee line mapping is critical to the protection of environments in river zones, the prevention of river flood and the development of river zones. Use of the remote sensing data such as LiDAR and aerial orthoimage is efficient for river mapping due to their accessibility and higher accuracy in horizontal and vertical direction. Airborne laser scanning (LiDAR) has been used for river zone mapping due to its ability to penetrate shallow water and its high vertical accuracy. Use of image source is also efficient for extraction of features by analysis of its image source. Therefore, aerial orthoimage also have been used for river zone mapping tasks due to its image source and its higher accuracy in horizontal direction. Due to these advantages, in this paper, research on three dimensional levee line mapping is implemented using LiDAR and aerial orthoimage separately. Accuracy measurement is implemented for both extracted lines generated by each data using the ground truths and statistical comparison is implemented between two measurement results. Statistical results show that the generated 3D levee line using LiDAR data has higher accuracy than the generated 3D levee line using aerial orthoimage in horizontal direction and vertical direction.

3D Stereoscopic Image Generation of a 2D Medical Image (2D 의료영상의 3차원 입체영상 생성)

  • Kim, Man-Bae;Jang, Seong-Eun;Lee, Woo-Keun;Choi, Chang-Yeol
    • Journal of Broadcast Engineering
    • /
    • v.15 no.6
    • /
    • pp.723-730
    • /
    • 2010
  • Recently, diverse 3D image processing technologies have been applied in industries. Among them, stereoscopic conversion is a technology to generate a stereoscopic image from a conventional 2D image. The technology can be applied to movie and broadcasting contents and the viewer can watch 3D stereoscopic contents. Further the stereoscopic conversion is required to be applied to other fields. Following such trend, the aim of this paper is to apply the stereoscopic conversion to medical fields. The medical images can deliver more detailed 3D information with a stereoscopic image compared with a 2D plane image. This paper presents a novel methodology for converting a 2D medical image into a 3D stereoscopic image. For this, mean shift segmentation, edge detection, intensity analysis, etc are utilized to generate a final depth map. From an image and the depth map, left and right images are constructed. In the experiment, the proposed method is performed on a medical image such as CT (Computed Tomograpy). The stereoscopic image displayed on a 3D monitor shows a satisfactory performance.

Low Resolution Depth Interpolation using High Resolution Color Image (고해상도 색상 영상을 이용한 저해상도 깊이 영상 보간법)

  • Lee, Gyo-Yoon;Ho, Yo-Sung
    • Smart Media Journal
    • /
    • v.2 no.4
    • /
    • pp.60-65
    • /
    • 2013
  • In this paper, we propose a high-resolution disparity map generation method using a low-resolution time-of-flight (TOF) depth camera and color camera. The TOF depth camera is efficient since it measures the range information of objects using the infra-red (IR) signal in real-time. It also quantizes the range information and provides the depth image. However, there are some problems of the TOF depth camera, such as noise and lens distortion. Moreover, the output resolution of the TOF depth camera is too small for 3D applications. Therefore, it is essential to not only reduce the noise and distortion but also enlarge the output resolution of the TOF depth image. Our proposed method generates a depth map for a color image using the TOF camera and the color camera simultaneously. We warp the depth value at each pixel to the color image position. The color image is segmented using the mean-shift segmentation method. We define a cost function that consists of color values and segmented color values. We apply a weighted average filter whose weighting factor is defined by the random walk probability using the defined cost function of the block. Experimental results show that the proposed method generates the depth map efficiently and we can reconstruct good virtual view images.

  • PDF