• 제목/요약/키워드: Model Based Segmentation

검색결과 602건 처리시간 0.03초

움직임열화를 갖는 영상의 화질개선을 위한 객체기반 영상복원기법 (Object-based Image Restoration Method for Enhancing Motion Blurred Images)

  • 정유찬;백준기
    • 전자공학회논문지S
    • /
    • 제35S권12호
    • /
    • pp.77-83
    • /
    • 1998
  • 일반적으로 동영상은 물체의 움직임에 의해 움직임 열화를 겪는다. 본 논문의 목적은 이러한 움직임 열화의 해석을 위한 모델을 제시하고 정칙화된 반복 기법을 이용하여 이를 제거하기위한 복원방식을 제안하는 것이다. 제안된 모델에서는 기존의 공간 불변적인 모델의 한계를 극복하기 위하여 움직이는 물체와 정지된 배경과의 경계에서 일어나는 현상을 수학적으로 해석하게 된다. 그리고 복원 과정에서의 객체기반적 처리를 위하여 움직임을 기반으로 하는 영상 분할 기법을 소개하는데, 이 기법은 기존의 연구를 바탕으로 본 연구에 맞도록 응용하여 사용한다. 제안된 모델을 근거로 한 영상복원 기법은 제약조건을 이용한 반복적 방법으로서 사전에 추정된 열화정보를 이용하여 움직임 열화를 제거하개 된다. 제안된 방법의 성능은 실험결과로서 확인할 수 있다.

  • PDF

Enhanced CNN Model for Brain Tumor Classification

  • Kasukurthi, Aravinda;Paleti, Lakshmikanth;Brahmaiah, Madamanchi;Sree, Ch.Sudha
    • International Journal of Computer Science & Network Security
    • /
    • 제22권5호
    • /
    • pp.143-148
    • /
    • 2022
  • Brain tumor classification is an important process that allows doctors to plan treatment for patients based on the stages of the tumor. To improve classification performance, various CNN-based architectures are used for brain tumor classification. Existing methods for brain tumor segmentation suffer from overfitting and poor efficiency when dealing with large datasets. The enhanced CNN architecture proposed in this study is based on U-Net for brain tumor segmentation, RefineNet for pattern analysis, and SegNet architecture for brain tumor classification. The brain tumor benchmark dataset was used to evaluate the enhanced CNN model's efficiency. Based on the local and context information of the MRI image, the U-Net provides good segmentation. SegNet selects the most important features for classification while also reducing the trainable parameters. In the classification of brain tumors, the enhanced CNN method outperforms the existing methods. The enhanced CNN model has an accuracy of 96.85 percent, while the existing CNN with transfer learning has an accuracy of 94.82 percent.

Texture superpixels merging by color-texture histograms for color image segmentation

  • Sima, Haifeng;Guo, Ping
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제8권7호
    • /
    • pp.2400-2419
    • /
    • 2014
  • Pre-segmented pixels can reduce the difficulty of segmentation and promote the segmentation performance. This paper proposes a novel segmentation method based on merging texture superpixels by computing inner similarity. Firstly, we design a set of Gabor filters to compute the amplitude responses of original image and compute the texture map by a salience model. Secondly, we employ the simple clustering to extract superpixles by affinity of color, coordinates and texture map. Then, we design a normalized histograms descriptor for superpixels integrated color and texture information of inner pixels. To obtain the final segmentation result, all adjacent superpixels are merged by the homogeneity comparison of normalized color-texture features until the stop criteria is satisfied. The experiments are conducted on natural scene images and synthesis texture images demonstrate that the proposed segmentation algorithm can achieve ideal segmentation on complex texture regions.

Development of ResNet-based WBC Classification Algorithm Using Super-pixel Image Segmentation

  • Lee, Kyu-Man;Kang, Soon-Ah
    • 한국컴퓨터정보학회논문지
    • /
    • 제23권4호
    • /
    • pp.147-153
    • /
    • 2018
  • In this paper, we propose an efficient WBC 14-Diff classification which performs using the WBC-ResNet-152, a type of CNN model. The main point of view is to use Super-pixel for the segmentation of the image of WBC, and to use ResNet for the classification of WBC. A total of 136,164 blood image samples (224x224) were grouped for image segmentation, training, training verification, and final test performance analysis. Image segmentation using super-pixels have different number of images for each classes, so weighted average was applied and therefore image segmentation error was low at 7.23%. Using the training data-set for training 50 times, and using soft-max classifier, TPR average of 80.3% for the training set of 8,827 images was achieved. Based on this, using verification data-set of 21,437 images, 14-Diff classification TPR average of normal WBCs were at 93.4% and TPR average of abnormal WBCs were at 83.3%. The result and methodology of this research demonstrates the usefulness of artificial intelligence technology in the blood cell image classification field. WBC-ResNet-152 based morphology approach is shown to be meaningful and worthwhile method. And based on stored medical data, in-depth diagnosis and early detection of curable diseases is expected to improve the quality of treatment.

Data-Driven Approaches for Evaluating Countries in the International Construction Market

  • Lee, Kang-Wook;Han, Seung H.
    • 국제학술발표논문집
    • /
    • The 6th International Conference on Construction Engineering and Project Management
    • /
    • pp.496-500
    • /
    • 2015
  • International construction projects are inherently more risky than domestic projects with multi-dimensional uncertainties that require complementary risk management at both the country and project levels. However, despite a growing need for systematic country evaluations, most studies have focused on project-level decisions and lack country-based approaches for firms in the construction industry. Accordingly, this study suggests data-driven approaches for evaluating countries using two quantitative models. The first is a two-stage country segmentation model that not only screens negative countries based on country attractiveness (macro-segmentation) but also identifies promising countries based on the level of past project performance in a given country (micro-segmentation). The second is a multi-criteria country segmentation model that combines a firm's business objective with the country evaluation process based on Kraljic's matrix and fuzzy preference relations (FPR). These models utilize not only secondary data from internationally reputable institutions but also performance data on Korean firms from 1990 to 2014 to evaluate 29 countries. The proposed approaches enable firms to enhance their decision-making capacity for evaluating and selecting countries at the early stage of corporate strategy development.

  • PDF

저비트율 동영상 전송을 위한 움직임 기반 동영상 분할 (The Motion-Based Video Segmentation for Low Bit Rate Transmission)

  • 이범로;정진현
    • 한국정보처리학회논문지
    • /
    • 제6권10호
    • /
    • pp.2838-2844
    • /
    • 1999
  • The motion-based video segmentation provides a powerful method of video compression, because it defines a region with similar motion, and it makes video compression system to more efficiently describe motion video. In this paper, we propose the Modified Fuzzy Competitive Learning Algorithm (MFCLA) to improve the traditional K-menas clustering algorithm to implement the motion-based video segmentation efficiently. The segmented region is described with the affine model, which consists of only six parameters. This affine model was calculated with optical flow, describing the movements of pixels by frames. This method could be applied in the low bit rate video transmission, such as video conferencing system.

  • PDF

관개용수로 CCTV 이미지를 이용한 CNN 딥러닝 이미지 모델 적용 (Application of CCTV Image and Semantic Segmentation Model for Water Level Estimation of Irrigation Channel)

  • 김귀훈;김마가;윤푸른;방재홍;명우호;최진용;최규훈
    • 한국농공학회논문집
    • /
    • 제64권3호
    • /
    • pp.63-73
    • /
    • 2022
  • A more accurate understanding of the irrigation water supply is necessary for efficient agricultural water management. Although we measure water levels in an irrigation canal using ultrasonic water level gauges, some errors occur due to malfunctions or the surrounding environment. This study aims to apply CNN (Convolutional Neural Network) Deep-learning-based image classification and segmentation models to the irrigation canal's CCTV (Closed-Circuit Television) images. The CCTV images were acquired from the irrigation canal of the agricultural reservoir in Cheorwon-gun, Gangwon-do. We used the ResNet-50 model for the image classification model and the U-Net model for the image segmentation model. Using the Natural Breaks algorithm, we divided water level data into 2, 4, and 8 groups for image classification models. The classification models of 2, 4, and 8 groups showed the accuracy of 1.000, 0.987, and 0.634, respectively. The image segmentation model showed a Dice score of 0.998 and predicted water levels showed R2 of 0.97 and MAE (Mean Absolute Error) of 0.02 m. The image classification models can be applied to the automatic gate-controller at four divisions of water levels. Also, the image segmentation model results can be applied to the alternative measurement for ultrasonic water gauges. We expect that the results of this study can provide a more scientific and efficient approach for agricultural water management.

실시간 고압축 MPEG-4 부호화를 위한 비디오 객체 분할과 프레임 전처리 (Video object segmentation and frame preprocessing for real-time and high compression MPEG-4 encoding)

  • 김준기;이호석
    • 한국통신학회논문지
    • /
    • 제28권2C호
    • /
    • pp.147-161
    • /
    • 2003
  • 비디오 객체 분할(Video Object Segmentation)은 MPEG-4 부호화의 핵심기술로 실시간 요구사항을 위해 빠르고 정확하여야 한다. 그러나 대부분의 존재하는 알고리즘은 계산량이 많으며 실시간 응용을 위해 적합하지 않다. 또한 이전 MPEG-4 VM(Verification Model) 기본 모델은 MPEG-4 부호화 처리를 위한 기본 알고리즘을 제공하였으나 실시간 요구사항을 위한 카메라 입력 시스템, 실용적인 소프트웨어 개발, 비디오 객체 분할 그리고 압축효율에 많은 제한이 있다. 이에 본 논문은 기본 MPEG-4 VM모델에 내용 기반 비디오 코딩의 핵심인 VOP 추출알고리즘, 실시간 카메라 입력 시스템, 압축율을 높일 수 있는 움직임 감지 알고리즘을 추가하여 최대 180:1의 압축율을 보여주는 실시간 고압축 MPEG-4 전처리 시스템을 개발하였다.

자기공명영상의 비지도 분할을 위한 통계적 모델기반 적응적 방법 (A Statistically Model-Based Adaptive Technique to Unsupervised Segmentation of MR Images)

  • 김태우
    • 한국정보처리학회논문지
    • /
    • 제7권1호
    • /
    • pp.286-295
    • /
    • 2000
  • 본 논문은 MR 영상의 비지도 분할을 위하여 MDL원리를 이용한 통계적 모델기반의 적응적 방법을 제안한다. 이 방법에서 조직 영역을 MRF로 모델링함으로써 잡음에 대응하고, 창으로 정의되는 국소영역 내의 밝기값을 가우스 혼합으로 모델링함으로써 영상의 비균일성을 흡수한다. 분할 알고리즘은 ICM을 기반으로 하며 MAP를 근사적으로 추정하고, 모델 파라미터를 국소영역으로부터 구한다. 파라미터 추정과 분할을 위한 창의 크기는 MDL원리를 이용하여 영상으로부터 추정한다. 실험에서 제안한 방법이 특히 비균일성이 있는 MR영상의 분할에서 국소영역의 영상특성을 잘 반영하였으며, 기존의 방법보다 더 좋은 결과를 보여주었다.

  • PDF

Impacts of label quality on performance of steel fatigue crack recognition using deep learning-based image segmentation

  • Hsu, Shun-Hsiang;Chang, Ting-Wei;Chang, Chia-Ming
    • Smart Structures and Systems
    • /
    • 제29권1호
    • /
    • pp.207-220
    • /
    • 2022
  • Structural health monitoring (SHM) plays a vital role in the maintenance and operation of constructions. In recent years, autonomous inspection has received considerable attention because conventional monitoring methods are inefficient and expensive to some extent. To develop autonomous inspection, a potential approach of crack identification is needed to locate defects. Therefore, this study exploits two deep learning-based segmentation models, DeepLabv3+ and Mask R-CNN, for crack segmentation because these two segmentation models can outperform other similar models on public datasets. Additionally, impacts of label quality on model performance are explored to obtain an empirical guideline on the preparation of image datasets. The influence of image cropping and label refining are also investigated, and different strategies are applied to the dataset, resulting in six alternated datasets. By conducting experiments with these datasets, the highest mean Intersection-over-Union (mIoU), 75%, is achieved by Mask R-CNN. The rise in the percentage of annotations by image cropping improves model performance while the label refining has opposite effects on the two models. As the label refining results in fewer error annotations of cracks, this modification enhances the performance of DeepLabv3+. Instead, the performance of Mask R-CNN decreases because fragmented annotations may mistake an instance as multiple instances. To sum up, both DeepLabv3+ and Mask R-CNN are capable of crack identification, and an empirical guideline on the data preparation is presented to strengthen identification successfulness via image cropping and label refining.