• 제목/요약/키워드: Segmentation Processing

검색결과 1,038건 처리시간 0.036초

A Deep Learning-Based Image Semantic Segmentation Algorithm

  • Chaoqun, Shen;Zhongliang, Sun
    • Journal of Information Processing Systems
    • /
    • 제19권1호
    • /
    • pp.98-108
    • /
    • 2023
  • This paper is an attempt to design segmentation method based on fully convolutional networks (FCN) and attention mechanism. The first five layers of the Visual Geometry Group (VGG) 16 network serve as the coding part in the semantic segmentation network structure with the convolutional layer used to replace pooling to reduce loss of image feature extraction information. The up-sampling and deconvolution unit of the FCN is then used as the decoding part in the semantic segmentation network. In the deconvolution process, the skip structure is used to fuse different levels of information and the attention mechanism is incorporated to reduce accuracy loss. Finally, the segmentation results are obtained through pixel layer classification. The results show that our method outperforms the comparison methods in mean pixel accuracy (MPA) and mean intersection over union (MIOU).

ESRGAN과 Semantic Soft Segmentation을 이용한 객체 분할의 성능 개선 (Performance Improvement of Object Segmentation Using ESRGAN and Semantic Soft Segmentation)

  • 윤동식;곽노윤
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2020년도 춘계학술발표대회
    • /
    • pp.468-471
    • /
    • 2020
  • 본 논문은 ESRGAN(Enhanced Super Resolution GAN)과 Semantic Soft Segmentation을 이용한 객체 분할의 성능 개선에 관한 것이다. 본 논문의 연구진이 이미 제안한 Mask R-CNN과 Semantic Soft Segmentation을 이용한 객체 분할 방법은 전반적으로 객체 분할 성능이 양호한 반면, 객체의 크기가 상대적으로 작으면 분할 성능이 저조해지는 문제점이 있었다. 본 논문은 이러한 문제점을 해결하기 위한 것으로, Mask R-CNN을 통해 검출된 객체의 크기가 일정 기준치 이하인 경우, ESRGAN을 통해 초해상화를 수행한 후, Semantic Soft Segmentation을 수행함으로써 소형 객체의 분할 성능을 개선함에 그 목적이 있다. 제안된 방법에 따르면, 기존의 방볍에 비해 크기가 작은 객체의 분할 특성을 좀 더 효과적으로 개선할 수 있음을 확인할 수 있었다.

Car detection area segmentation using deep learning system

  • Dong-Jin Kwon;Sang-hoon Lee
    • International journal of advanced smart convergence
    • /
    • 제12권4호
    • /
    • pp.182-189
    • /
    • 2023
  • A recently research, object detection and segmentation have emerged as crucial technologies widely utilized in various fields such as autonomous driving systems, surveillance and image editing. This paper proposes a program that utilizes the QT framework to perform real-time object detection and precise instance segmentation by integrating YOLO(You Only Look Once) and Mask R CNN. This system provides users with a diverse image editing environment, offering features such as selecting specific modes, drawing masks, inspecting detailed image information and employing various image processing techniques, including those based on deep learning. The program advantage the efficiency of YOLO to enable fast and accurate object detection, providing information about bounding boxes. Additionally, it performs precise segmentation using the functionalities of Mask R CNN, allowing users to accurately distinguish and edit objects within images. The QT interface ensures an intuitive and user-friendly environment for program control and enhancing accessibility. Through experiments and evaluations, our proposed system has been demonstrated to be effective in various scenarios. This program provides convenience and powerful image processing and editing capabilities to both beginners and experts, smoothly integrating computer vision technology. This paper contributes to the growth of the computer vision application field and showing the potential to integrate various image processing algorithms on a user-friendly platform

Scale Invariant Auto-context for Object Segmentation and Labeling

  • Ji, Hongwei;He, Jiangping;Yang, Xin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제8권8호
    • /
    • pp.2881-2894
    • /
    • 2014
  • In complicated environment, context information plays an important role in image segmentation/labeling. The recently proposed auto-context algorithm is one of the effective context-based methods. However, the standard auto-context approach samples the context locations utilizing a fixed radius sequence, which is sensitive to large scale-change of objects. In this paper, we present a scale invariant auto-context (SIAC) algorithm which is an improved version of the auto-context algorithm. In order to achieve scale-invariance, we try to approximate the optimal scale for the image in an iterative way and adopt the corresponding optimal radius sequence for context location sampling, both in training and testing. In each iteration of the proposed SIAC algorithm, we use the current classification map to estimate the image scale, and the corresponding radius sequence is then used for choosing context locations. The algorithm iteratively updates the classification maps, as well as the image scales, until convergence. We demonstrate the SIAC algorithm on several image segmentation/labeling tasks. The results demonstrate improvement over the standard auto-context algorithm when large scale-change of objects exists.

Statistical Image Processing using Java on the Web

  • Lim, Dong Hoon;Park, Eun Hee
    • Communications for Statistical Applications and Methods
    • /
    • 제9권2호
    • /
    • pp.355-366
    • /
    • 2002
  • The web is one of the most plentiful sources of images. The web has an immediate need for image processing technology in Java. This paper provides a practical introduction to statistical image processing using Java on the web. The paper describes how images are represented in Java and deals with four image processing operations based on basic statistical methods: point processing, spatial filtering, edge detection and image segmentation.

영상 특성과 스켈레톤 분석을 이용한 실시간 인간 객체 추출 (Realtime Human Object Segmentation Using Image and Skeleton Characteristics)

  • 김민준;이주철;김원하
    • 방송공학회논문지
    • /
    • 제21권5호
    • /
    • pp.782-791
    • /
    • 2016
  • 영상에서 배경으로부터 객체를 추출하는 영상 segmentation 알고리즘은 물체 인식 및 추적 등 다양한 응용분야에서 활용될 수 있다. 본 논문에서는 고정된 카메라에서 다수의 초기 프레임을 참조하여 실시간 객체 segmentation 방법을 제안한다. 먼저 객체와 배경을 분류하는 확률모델을 제안하였으며 초기 프레임 동안에 카메라의 color consistency와 focus 특성을 분석하여 안정적인 segmentation 성능을 증가시켰다. 또한 분류된 객체에서 human의 skeleton 특성을 이용하여 추출 결과를 보정하는 방법을 제안한다. 마지막으로 제안된 알고리즘은 객체 segmentation 실시간 처리를 위하여 복잡도를 최소화하므로 다양한 mobile 단말에 확대 적용 가능하다.

의료 영상처리에서의 물리적 이론을 활용한 객체 유효 인식 방법 (Effective Object Recognition based on Physical Theory in Medical Image Processing)

  • 은성종;황보택근
    • 한국콘텐츠학회논문지
    • /
    • 제12권12호
    • /
    • pp.63-70
    • /
    • 2012
  • 의료 영상처리 분야에서의 일반적인 객체 인식 방법은 영역 분할 알고리즘을 기반으로 처리되어진다. 컴퓨팅 분야에서의 이러한 영역 분할 알고리즘은 대부분 밝기 정보, 형태 정보, 패턴 분석 등 다양한 입력정보의 컴퓨팅 처리를 통해 처리된다. 그러나 이러한 컴퓨팅 방법으로는 앞서 언급된 입력 정보들이 의미가 없을 경우, 영역 분할에 많은 제약이 따르게 된다. 따라서 본 논문은 이러한 컴퓨팅 처리의 근본적인 제약사항을 해결하고자, MR 이론의 R2-map 정보 기반의 효과적인 영역 분할 방법은 제안하였다. 본 방법은 간 영역이 포함된 영상에서 실험하였으며, R2-map의 특징점들을 2차원 영역성장법의 씨앗점으로 설정한 후, 검출된 영역의 최종 경계선 보정작업을 통해 경계가 모호하더라도 영역 분할이 가능하게끔 하였다. 해당 영상의 실험 결과, 평균 7.5%의 평균 영역 차이로 기존의 대표 영역 분할 알고리즘에 비해 높은 정확도가 산출되었다.

시뮬레이션을 이용한 라이다 데이터 분할 기법의 성능 평가 (Performance Assessment of a LIDAR Data Segmentation Method based on Simulation)

  • 김성준;이임평
    • 한국측량학회:학술대회논문집
    • /
    • 한국측량학회 2010년 춘계학술발표회 논문집
    • /
    • pp.231-233
    • /
    • 2010
  • Many algorithms for processing LIDAR data are being developed for diverse applications not limited to patch segmentation, bare-earth filtering and building extraction. However, since we cannot exactly know the true locations of LIDAR points, it is difficult to assess the performance of a LIDAR data processing algorithm. In this paper, we thus attempted the performance assessment of the segmentation algorithm developed by Lee (2006) using the LIDAR data generated through simulation based on sensor modelling. Consequently, based on simulation, we can perform the performance assessment of a LIDAR processing algorithm more objectively and quantitatively with an automatic procedure.

  • PDF

Disparity-based Error Concealment for Stereoscopic Images with Superpixel Segmentation

  • Zhang, Yizhang;Tang, Guijin;Liu, Xiaohua;Sun, Changming
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제12권9호
    • /
    • pp.4375-4388
    • /
    • 2018
  • To solve the problem of transmission errors in stereoscopic images, this paper proposes a novel error concealment (EC) method using superpixel segmentation and adaptive disparity selection (SSADS). Our algorithm consists of two steps. The first step is disparity estimation for each pixel in a reference image. In this step, the numbers of superpixel segmentation labels of stereoscopic images are used as a new constraint for disparity matching to reduce the effect of mismatching. The second step is disparity selection for a lost block. In this step, a strategy based on boundary smoothness is proposed to adaptively select the optimal disparity which is used for error concealment. Experimental results demonstrate that compared with other methods, the proposed method has significant advantages in both objective and subjective quality assessment.