• Title/Summary/Keyword: Image semantic segmentation

Search Result 145, Processing Time 0.028 seconds

Semantic Information Inference among Objects in Image Using Ontology (온톨로지를 이용한 이미지 내 객체사이의 의미 정보 추론)

  • Kim, Ji-Won;Kim, Chul-Won
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.15 no.3
    • /
    • pp.579-586
    • /
    • 2020
  • There is a large amount of multimedia data on the web page, and a method of extracting semantic information from low level visual information for accurate retrieval is being studied. However, most of these techniques extract one of information from a single image, so it is difficult to extract semantic information when multiple objects are combined in the image. In this paper, each low-level feature is extracted to extract various objects and backgrounds in an image, and these are divided into predefined backgrounds and objects using SVM. The objects and backgrounds divided in this way are constructed with ontology, infer the semantic information of location and association using inference engine. It's possible to extract the semantic information. We propose this method process the complex and high-level semantic information in image.

Exploratory Study of the Applicability of Kompsat 3/3A Satellite Pan-sharpened Imagery Using Semantic Segmentation Model (아리랑 3/3A호 위성 융합영상의 Semantic Segmentation을 통한 활용 가능성 탐색 연구)

  • Chae, Hanseong;Rhim, Heesoo;Lee, Jaegwan;Choi, Jinmu
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_4
    • /
    • pp.1889-1900
    • /
    • 2022
  • Roads are an essential factor in the physical functioning of modern society. The spatial information of the road has much longer update cycle than the traffic situation information, and it is necessary to generate the information faster and more accurately than now. In this study, as a way to achieve that goal, the Pan-sharpening technique was applied to satellite images of Kompsat 3 and 3A to improve spatial resolution. Then, the data were used for road extraction using the semantic segmentation technique, which has been actively researched recently. The acquired Kompsat 3/3A pan-sharpened images were trained by putting it into a U-Net based segmentation model along with Massachusetts road data, and the applicability of the images were evaluated. As a result of training and verification, it was found that the model prediction performance was maintained as long as certain conditions were maintained for the input image. Therefore, it is expected that the possibility of utilizing satellite images such as Kompsat satellite will be even higher if rich training data are constructed by applying a method that minimizes the impact of surrounding environmental conditions affecting models such as shadows and surface conditions.

Improved Performance of Image Semantic Segmentation using NASNet (NASNet을 이용한 이미지 시맨틱 분할 성능 개선)

  • Kim, Hyoung Seok;Yoo, Kee-Youn;Kim, Lae Hyun
    • Korean Chemical Engineering Research
    • /
    • v.57 no.2
    • /
    • pp.274-282
    • /
    • 2019
  • In recent years, big data analysis has been expanded to include automatic control through reinforcement learning as well as prediction through modeling. Research on the utilization of image data is actively carried out in various industrial fields such as chemical, manufacturing, agriculture, and bio-industry. In this paper, we applied NASNet, which is an AutoML reinforced learning algorithm, to DeepU-Net neural network that modified U-Net to improve image semantic segmentation performance. We used BRATS2015 MRI data for performance verification. Simulation results show that DeepU-Net has more performance than the U-Net neural network. In order to improve the image segmentation performance, remove dropouts that are typically applied to neural networks, when the number of kernels and filters obtained through reinforcement learning in DeepU-Net was selected as a hyperparameter of neural network. The results show that the training accuracy is 0.5% and the verification accuracy is 0.3% better than DeepU-Net. The results of this study can be applied to various fields such as MRI brain imaging diagnosis, thermal imaging camera abnormality diagnosis, Nondestructive inspection diagnosis, chemical leakage monitoring, and monitoring forest fire through CCTV.

Training a semantic segmentation model for cracks in the concrete lining of tunnel (터널 콘크리트 라이닝 균열 분석을 위한 의미론적 분할 모델 학습)

  • Ham, Sangwoo;Bae, Soohyeon;Kim, Hwiyoung;Lee, Impyeong;Lee, Gyu-Phil;Kim, Donggyou
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.23 no.6
    • /
    • pp.549-558
    • /
    • 2021
  • In order to keep infrastructures such as tunnels and underground facilities safe, cracks of concrete lining in tunnel should be detected by regular inspections. Since regular inspections are accomplished through manual efforts using maintenance lift vehicles, it brings about traffic jam, exposes works to dangerous circumstances, and deteriorates consistency of crack inspection data. This study aims to provide methodology to automatically extract cracks from tunnel concrete lining images generated by the existing tunnel image acquisition system. Specifically, we train a deep learning based semantic segmentation model with open dataset, and evaluate its performance with the dataset from the existing tunnel image acquisition system. In particular, we compare the model performance in case of using all of a public dataset, subset of the public dataset which are related to tunnel surfaces, and the tunnel-related subset with negative examples. As a result, the model trained using the tunnel-related subset with negative examples reached the best performance. In the future, we expect that this research can be used for planning efficient model training strategy for crack detection.

Assembly performance evaluation method for prefabricated steel structures using deep learning and k-nearest neighbors

  • Hyuntae Bang;Byeongjun Yu;Haemin Jeon
    • Smart Structures and Systems
    • /
    • v.32 no.2
    • /
    • pp.111-121
    • /
    • 2023
  • This study proposes an automated assembly performance evaluation method for prefabricated steel structures (PSSs) using machine learning methods. Assembly component images were segmented using a modified version of the receptive field pyramid. By factorizing channel modulation and the receptive field exploration layers of the convolution pyramid, highly accurate segmentation results were obtained. After completing segmentation, the positions of the bolt holes were calculated using various image processing techniques, such as fuzzy-based edge detection, Hough's line detection, and image perspective transformation. By calculating the distance ratio between bolt holes, the assembly performance of the PSS was estimated using the k-nearest neighbors (kNN) algorithm. The effectiveness of the proposed framework was validated using a 3D PSS printing model and a field test. The results indicated that this approach could recognize assembly components with an intersection over union (IoU) of 95% and evaluate assembly performance with an error of less than 5%.

Drone Image AI Analysis Model for Ecological Environment Investigation (생태 환경 조사를 위한 드론영상 AI분석 모델)

  • Shin, Kwang-seong;Shin, Seong-yoon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.05a
    • /
    • pp.355-356
    • /
    • 2021
  • Geological and biological surveys are conducted every year to investigate the state of tidal flat loss and ecological changes in the Saemangeum embankment. In addition, various activities for forest monitoring and large-scale environmental monitoring are being actively carried out throughout Korea. Due to the recent development of drone technology and artificial intelligence technology, various studies are being conducted to perform these activities more efficiently and economically. In this study, we propose an image segmentation technique using semantic segmentation to efficiently investigate and analyze large-scale ecological environments using Drone.

  • PDF

Image Retrieval System of semantic Inference using Objects in Images (이미지의 객체에 대한 의미 추론 이미지 검색 시스템)

  • Kim, Ji-Won;Kim, Chul-Won
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.11 no.7
    • /
    • pp.677-684
    • /
    • 2016
  • With the increase of multimedia information such as image, researches on extracting high-level semantic information from low-level visual information has been realized, and in order to automatically generate this kind of information. Various technologies have been developed. Generally, image retrieval is widely preceded by comparing colors and shapes among images. In some cases, images with similar color, shape and even meaning are hard to retrieve. In this article, in order to retrieve the object in an image, technical value of middle level is converted into meaning value of middle level. Furthermore, to enhance accuracy of segmentation, K-means algorithm is engaged to compute k values for various images. Thus, object retrieval can be achieved by segmented low-level feature and relationship of meaning is derived from ontology. The method mentioned in this paper is supposed to be an effective approach to retrieve images as required by users.

Deep Learning-based Interior Design Recognition (딥러닝 기반 실내 디자인 인식)

  • Wongyu Lee;Jihun Park;Jonghyuk Lee;Heechul Jung
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.19 no.1
    • /
    • pp.47-55
    • /
    • 2024
  • We spend a lot of time in indoor space, and the space has a huge impact on our lives. Interior design plays a significant role to make an indoor space attractive and functional. However, it should consider a lot of complex elements such as color, pattern, and material etc. With the increasing demand for interior design, there is a growing need for technologies that analyze these design elements accurately and efficiently. To address this need, this study suggests a deep learning-based design analysis system. The proposed system consists of a semantic segmentation model that classifies spatial components and an image classification model that classifies attributes such as color, pattern, and material from the segmented components. Semantic segmentation model was trained using a dataset of 30000 personal indoor interior images collected for research, and during inference, the model separate the input image pixel into 34 categories. And experiments were conducted with various backbones in order to obtain the optimal performance of the deep learning model for the collected interior dataset. Finally, the model achieved good performance of 89.05% and 0.5768 in terms of accuracy and mean intersection over union (mIoU). In classification part convolutional neural network (CNN) model which has recorded high performance in other image recognition tasks was used. To improve the performance of the classification model we suggests an approach that how to handle data that has data imbalance and vulnerable to light intensity. Using our methods, we achieve satisfactory results in classifying interior design component attributes. In this paper, we propose indoor space design analysis system that automatically analyzes and classifies the attributes of indoor images using a deep learning-based model. This analysis system, used as a core module in the A.I interior recommendation service, can help users pursuing self-interior design to complete their designs more easily and efficiently.

AAW-based Cell Image Segmentation Method (적응적 관심윈도우 기반의 세포영상 분할 기법)

  • Seo, Mi-Suk;Ko, Byoung-Chul;Nam, Jae-Yeal
    • The KIPS Transactions:PartB
    • /
    • v.14B no.2
    • /
    • pp.99-106
    • /
    • 2007
  • In this paper, we present an AAW(Adaptive Attention Window) based cell image segmentation method. For semantic AAW detection we create an initial Attention Window by using a luminance map. Then the initial AW is reduced to the optimal size of the real ROI(Region of Interest) by using a quad tree segmentation. The purpose of AAW is to remove the background and to reduce the amount of processing time for segmenting ROIs. Experimental results show that the proposed method segments one or more ROIs efficiently and gives the similar segmentation result as compared with the human perception.

Real-time semantic segmentation of gastric intestinal metaplasia using a deep learning approach

  • Vitchaya Siripoppohn;Rapat Pittayanon;Kasenee Tiankanon;Natee Faknak;Anapat Sanpavat;Naruemon Klaikaew;Peerapon Vateekul;Rungsun Rerknimitr
    • Clinical Endoscopy
    • /
    • v.55 no.3
    • /
    • pp.390-400
    • /
    • 2022
  • Background/Aims: Previous artificial intelligence (AI) models attempting to segment gastric intestinal metaplasia (GIM) areas have failed to be deployed in real-time endoscopy due to their slow inference speeds. Here, we propose a new GIM segmentation AI model with inference speeds faster than 25 frames per second that maintains a high level of accuracy. Methods: Investigators from Chulalongkorn University obtained 802 histological-proven GIM images for AI model training. Four strategies were proposed to improve the model accuracy. First, transfer learning was employed to the public colon datasets. Second, an image preprocessing technique contrast-limited adaptive histogram equalization was employed to produce clearer GIM areas. Third, data augmentation was applied for a more robust model. Lastly, the bilateral segmentation network model was applied to segment GIM areas in real time. The results were analyzed using different validity values. Results: From the internal test, our AI model achieved an inference speed of 31.53 frames per second. GIM detection showed sensitivity, specificity, positive predictive, negative predictive, accuracy, and mean intersection over union in GIM segmentation values of 93%, 80%, 82%, 92%, 87%, and 57%, respectively. Conclusions: The bilateral segmentation network combined with transfer learning, contrast-limited adaptive histogram equalization, and data augmentation can provide high sensitivity and good accuracy for GIM detection and segmentation.