• Title/Summary/Keyword: semantic segmentation

Search Result 244, Processing Time 0.022 seconds

A Hybrid Semantic-Geometric Approach for Clutter-Resistant Floorplan Generation from Building Point Clouds

  • Kim, Seongyong;Yajima, Yosuke;Park, Jisoo;Chen, Jingdao;Cho, Yong K.
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.792-799
    • /
    • 2022
  • Building Information Modeling (BIM) technology is a key component of modern construction engineering and project management workflows. As-is BIM models that represent the spatial reality of a project site can offer crucial information to stakeholders for construction progress monitoring, error checking, and building maintenance purposes. Geometric methods for automatically converting raw scan data into BIM models (Scan-to-BIM) often fail to make use of higher-level semantic information in the data. Whereas, semantic segmentation methods only output labels at the point level without creating object level models that is necessary for BIM. To address these issues, this research proposes a hybrid semantic-geometric approach for clutter-resistant floorplan generation from laser-scanned building point clouds. The input point clouds are first pre-processed by normalizing the coordinate system and removing outliers. Then, a semantic segmentation network based on PointNet++ is used to label each point as ceiling, floor, wall, door, stair, and clutter. The clutter points are removed whereas the wall, door, and stair points are used for 2D floorplan generation. A region-growing segmentation algorithm paired with geometric reasoning rules is applied to group the points together into individual building elements. Finally, a 2-fold Random Sample Consensus (RANSAC) algorithm is applied to parameterize the building elements into 2D lines which are used to create the output floorplan. The proposed method is evaluated using the metrics of precision, recall, Intersection-over-Union (IOU), Betti error, and warping error.

  • PDF

Driving Assist System using Semantic Segmentation based on Deep Learning (딥러닝 기반의 의미론적 영상 분할을 이용한 주행 보조 시스템)

  • Kim, Jung-Hwan;Lee, Tae-Min;Lim, Joonhong
    • Journal of IKEEE
    • /
    • v.24 no.1
    • /
    • pp.147-153
    • /
    • 2020
  • Conventional lane detection algorithms have problems in that the detection rate is lowered in road environments having a large change in curvature and illumination. The probabilistic Hough transform method has low lane detection rate since it exploits edges and restrictive angles. On the other hand, the method using a sliding window can detect a curved lane as the lane is detected by dividing the image into windows. However, the detection rate of this method is affected by road slopes because it uses affine transformation. In order to detect lanes robustly and avoid obstacles, we propose driving assist system using semantic segmentation based on deep learning. The architecture for segmentation is SegNet based on VGG-16. The semantic image segmentation feature can be used to calculate safety space and predict collisions so that we control a vehicle using adaptive-MPC to avoid objects and keep lanes. Simulation results with CARLA show that the proposed algorithm detects lanes robustly and avoids unknown obstacles in front of vehicle.

Graph-based Segmentation for Scene Understanding of an Autonomous Vehicle in Urban Environments (무인 자동차의 주변 환경 인식을 위한 도시 환경에서의 그래프 기반 물체 분할 방법)

  • Seo, Bo Gil;Choe, Yungeun;Roh, Hyun Chul;Chung, Myung Jin
    • The Journal of Korea Robotics Society
    • /
    • v.9 no.1
    • /
    • pp.1-10
    • /
    • 2014
  • In recent years, the research of 3D mapping technique in urban environments obtained by mobile robots equipped with multiple sensors for recognizing the robot's surroundings is being studied actively. However, the map generated by simple integration of multiple sensors data only gives spatial information to robots. To get a semantic knowledge to help an autonomous mobile robot from the map, the robot has to convert low-level map representations to higher-level ones containing semantic knowledge of a scene. Given a 3D point cloud of an urban scene, this research proposes a method to recognize the objects effectively using 3D graph model for autonomous mobile robots. The proposed method is decomposed into three steps: sequential range data acquisition, normal vector estimation and incremental graph-based segmentation. This method guarantees the both real-time performance and accuracy of recognizing the objects in real urban environments. Also, it can provide plentiful data for classifying the objects. To evaluate a performance of proposed method, computation time and recognition rate of objects are analyzed. Experimental results show that the proposed method has efficiently in understanding the semantic knowledge of an urban environment.

Implementation of Photovoltaic Panel failure detection system using semantic segmentation (시멘틱세그멘테이션을 활용한 태양광 패널 고장 감지 시스템 구현)

  • Shin, Kwang-Seong;Shin, Seong-Yoon
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1777-1783
    • /
    • 2021
  • The use of drones is gradually increasing for the efficient maintenance of large-scale renewable energy power generation complexes. For a long time, photovoltaic panels have been photographed with drones to manage panel loss and contamination. Various approaches using artificial intelligence are being tried for efficient maintenance of large-scale photovoltaic complexes. Recently, semantic segmentation-based application techniques have been developed to solve the image classification problem. In this paper, we propose a classification model using semantic segmentation to determine the presence or absence of failures such as arcs, disconnections, and cracks in solar panel images obtained using a drone equipped with a thermal imaging camera. In addition, an efficient classification model was implemented by tuning several factors such as data size and type and loss function customization in U-Net, which shows robust classification performance even with a small dataset.

Effect of Learning Data on the Semantic Segmentation of Railroad Tunnel Using Deep Learning (딥러닝을 활용한 철도 터널 객체 분할에 학습 데이터가 미치는 영향)

  • Ryu, Young-Moo;Kim, Byung-Kyu;Park, Jeongjun
    • Journal of the Korean Geotechnical Society
    • /
    • v.37 no.11
    • /
    • pp.107-118
    • /
    • 2021
  • Scan-to-BIM can be precisely mod eled by measuring structures with Light Detection And Ranging (LiDAR) and build ing a 3D BIM (Building Information Modeling) model based on it, but has a limitation in that it consumes a lot of manpower, time, and cost. To overcome these limitations, studies are being conducted to perform semantic segmentation of 3D point cloud data applying deep learning algorithms, but studies on how segmentation result changes depending on learning data are insufficient. In this study, a parametric study was conducted to determine how the size and track type of railroad tunnels constituting learning data affect the semantic segmentation of railroad tunnels through deep learning. As a result of the parametric study, the similar size of the tunnels used for learning and testing, the higher segmentation accuracy, and the better results when learning through a double-track tunnel than a single-line tunnel. In addition, when the training data is composed of two or more tunnels, overall accuracy (OA) and mean intersection over union (MIoU) increased by 10% to 50%, it has been confirmed that various configurations of learning data can contribute to efficient learning.

Exploratory Study of the Applicability of Kompsat 3/3A Satellite Pan-sharpened Imagery Using Semantic Segmentation Model (아리랑 3/3A호 위성 융합영상의 Semantic Segmentation을 통한 활용 가능성 탐색 연구)

  • Chae, Hanseong;Rhim, Heesoo;Lee, Jaegwan;Choi, Jinmu
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_4
    • /
    • pp.1889-1900
    • /
    • 2022
  • Roads are an essential factor in the physical functioning of modern society. The spatial information of the road has much longer update cycle than the traffic situation information, and it is necessary to generate the information faster and more accurately than now. In this study, as a way to achieve that goal, the Pan-sharpening technique was applied to satellite images of Kompsat 3 and 3A to improve spatial resolution. Then, the data were used for road extraction using the semantic segmentation technique, which has been actively researched recently. The acquired Kompsat 3/3A pan-sharpened images were trained by putting it into a U-Net based segmentation model along with Massachusetts road data, and the applicability of the images were evaluated. As a result of training and verification, it was found that the model prediction performance was maintained as long as certain conditions were maintained for the input image. Therefore, it is expected that the possibility of utilizing satellite images such as Kompsat satellite will be even higher if rich training data are constructed by applying a method that minimizes the impact of surrounding environmental conditions affecting models such as shadows and surface conditions.

Parallel Dense Merging Network with Dilated Convolutions for Semantic Segmentation of Sports Movement Scene

  • Huang, Dongya;Zhang, Li
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.11
    • /
    • pp.3493-3506
    • /
    • 2022
  • In the field of scene segmentation, the precise segmentation of object boundaries in sports movement scene images is a great challenge. The geometric information and spatial information of the image are very important, but in many models, they are usually easy to be lost, which has a big influence on the performance of the model. To alleviate this problem, a parallel dense dilated convolution merging Network (termed PDDCM-Net) was proposed. The proposed PDDCMNet consists of a feature extractor, parallel dilated convolutions, and dense dilated convolutions merged with different dilation rates. We utilize different combinations of dilated convolutions that expand the receptive field of the model with fewer parameters than other advanced methods. Importantly, PDDCM-Net fuses both low-level and high-level information, in effect alleviating the problem of accurately segmenting the edge of the object and positioning the object position accurately. Experimental results validate that the proposed PDDCM-Net achieves a great improvement compared to several representative models on the COCO-Stuff data set.

Compound Loss Function of semantic segmentation models for imbalanced construction data

  • Chern, Wei-Chih;Kim, Hongjo;Asari, Vijayan;Nguyen, Tam
    • International conference on construction engineering and project management
    • /
    • 2022.06a
    • /
    • pp.808-813
    • /
    • 2022
  • This study presents the problems of data imbalance, varying difficulties across target objects, and small objects in construction object segmentation for far-field monitoring and utilize compound loss functions to address it. Construction site scenes of assembling scaffolds were analyzed to test the effectiveness of compound loss functions for five construction object classes---workers, hardhats, harnesses, straps, hooks. The challenging problem was mitigated by employing a focal and Jaccard loss terms in the original loss function of LinkNet segmentation model. The findings indicates the importance of the loss function design for model performance on construction site scenes for far-field monitoring.

  • PDF

Semiautomatic segmentation for MPEG-4 encoding (MPEG-4 부호화를 위한 반자동 영상분할)

  • 김진철;김재환;하종수;김영로;고성제
    • Proceedings of the IEEK Conference
    • /
    • 2001.06d
    • /
    • pp.97-100
    • /
    • 2001
  • In this paper, We propose a new semiautomatic segmentation method using spatio-temporal similarity. In the proposed scheme, segmentation is performed using gradual region merging and hi-direction at spatio-temporal refinement. Simulation results show the efficiency of the proposed method in semantic object extraction.

  • PDF

Modified Pyramid Scene Parsing Network with Deep Learning based Multi Scale Attention (딥러닝 기반의 Multi Scale Attention을 적용한 개선된 Pyramid Scene Parsing Network)

  • Kim, Jun-Hyeok;Lee, Sang-Hun;Han, Hyun-Ho
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.11
    • /
    • pp.45-51
    • /
    • 2021
  • With the development of deep learning, semantic segmentation methods are being studied in various fields. There is a problem that segmenation accuracy drops in fields that require accuracy such as medical image analysis. In this paper, we improved PSPNet, which is a deep learning based segmentation method to minimized the loss of features during semantic segmentation. Conventional deep learning based segmentation methods result in lower resolution and loss of object features during feature extraction and compression. Due to these losses, the edge and the internal information of the object are lost, and there is a problem that the accuracy at the time of object segmentation is lowered. To solve these problems, we improved PSPNet, which is a semantic segmentation model. The multi-scale attention proposed to the conventional PSPNet was added to prevent feature loss of objects. The feature purification process was performed by applying the attention method to the conventional PPM module. By suppressing unnecessary feature information, eadg and texture information was improved. The proposed method trained on the Cityscapes dataset and use the segmentation index MIoU for quantitative evaluation. As a result of the experiment, the segmentation accuracy was improved by about 1.5% compared to the conventional PSPNet.