• Title/Summary/Keyword: 3D U-Net

Search Result 68, Processing Time 0.027 seconds

Architecture of 3D-GIS Visualization Application Based on OpenGL (OpenGL 기반 3D_GIS 가시화 어플리케이션 아키텍쳐)

  • Kim, Seung-Yeop;Lee, Gi-Won
    • 한국공간정보시스템학회:학술대회논문집
    • /
    • 2005.05a
    • /
    • pp.97-100
    • /
    • 2005
  • 3차원 공간정보는 u-Korea, 전자정부, 유비쿼터스, LBS등의 기반 인프라 및 3차원 그래픽 처리기술, 가상현실 기술 등의 종합적으로 적용되는 고부가가치 통합 기술로 필요성이 부각되고 있다. 본 연구에서는 컴퓨터 그래픽스 분야에서 많이 적용되고 있는 공개 그래픽 라이브러리인 OpenGL(Open Graphics Library) 기반의 3D-GIS 가시화 어플리케이션 아키텍쳐를 중심으로 렌더링 기법을 분석하고자 한다. 한편 본 연구의 실험은 Visual Studio.NET환경에서 3D-GIS 모델 Prototype을 구현하여 수행하였으며 향후 실시간 모바일 3D-GIS 렌더링을 위한 기반 기술로 적용될 수 있는 OpcnGL-ES의 확장 가능성을 검토하고자 하였다.

  • PDF

Synthetic Computed Tomography Generation while Preserving Metallic Markers for Three-Dimensional Intracavitary Radiotherapy: Preliminary Study

  • Jin, Hyeongmin;Kang, Seonghee;Kang, Hyun-Cheol;Choi, Chang Heon
    • Progress in Medical Physics
    • /
    • v.32 no.4
    • /
    • pp.172-178
    • /
    • 2021
  • Purpose: This study aimed to develop a deep learning architecture combining two task models to generate synthetic computed tomography (sCT) images from low-tesla magnetic resonance (MR) images to improve metallic marker visibility. Methods: Twenty-three patients with cervical cancer treated with intracavitary radiotherapy (ICR) were retrospectively enrolled, and images were acquired using both a computed tomography (CT) scanner and a low-tesla MR machine. The CT images were aligned to the corresponding MR images using a deformable registration, and the metallic dummy source markers were delineated using threshold-based segmentation followed by manual modification. The deformed CT (dCT), MR, and segmentation mask pairs were used for training and testing. The sCT generation model has a cascaded three-dimensional (3D) U-Net-based architecture that converts MR images to CT images and segments the metallic marker. The performance of the model was evaluated with intensity-based comparison metrics. Results: The proposed model with segmentation loss outperformed the 3D U-Net in terms of errors between the sCT and dCT. The structural similarity score difference was not significant. Conclusions: Our study shows the two-task-based deep learning models for generating the sCT images using low-tesla MR images for 3D ICR. This approach will be useful to the MR-only workflow in high-dose-rate brachytherapy.

3D Mesh Reconstruction Technique from Single Image using Deep Learning and Sphere Shape Transformation Method (딥러닝과 구체의 형태 변형 방법을 이용한 단일 이미지에서의 3D Mesh 재구축 기법)

  • Kim, Jeong-Yoon;Lee, Seung-Ho
    • Journal of IKEEE
    • /
    • v.26 no.2
    • /
    • pp.160-168
    • /
    • 2022
  • In this paper, we propose a 3D mesh reconstruction method from a single image using deep learning and a sphere shape transformation method. The proposed method has the following originality that is different from the existing method. First, the position of the vertex of the sphere is modified to be very similar to the 3D point cloud of an object through a deep learning network, unlike the existing method of building edges or faces by connecting nearby points. Because 3D point cloud is used, less memory is required and faster operation is possible because only addition operation is performed between offset value at the vertices of the sphere. Second, the 3D mesh is reconstructed by covering the surface information of the sphere on the modified vertices. Even when the distance between the points of the 3D point cloud created by correcting the position of the vertices of the sphere is not constant, it already has the face information of the sphere called face information of the sphere, which indicates whether the points are connected or not, thereby preventing simplification or loss of expression. can do. In order to evaluate the objective reliability of the proposed method, the experiment was conducted in the same way as in the comparative papers using the ShapeNet dataset, which is an open standard dataset. As a result, the IoU value of the method proposed in this paper was 0.581, and the chamfer distance value was It was calculated as 0.212. The higher the IoU value and the lower the chamfer distance value, the better the results. Therefore, the efficiency of the 3D mesh reconstruction was demonstrated compared to the methods published in other papers.

Prediction of aerodynamics using VGG16 and U-Net (VGG16 과 U-Net 구조를 이용한 공력특성 예측)

  • Bo Ra, Kim;Seung Hun, Lee;Seung Hyun, Jang;Gwang Il, Hwang;Min, Yoon
    • Journal of the Korean Society of Visualization
    • /
    • v.20 no.3
    • /
    • pp.109-116
    • /
    • 2022
  • The optimized design of airfoils is essential to increase the performance and efficiency of wind turbines. The aerodynamic characteristics of airfoils near the stall show large deviation from experiments and numerical simulations. Hence, it is needed to perform repetitive analysis of various shapes near the stall. To overcome this, the artificial intelligence is used and combined with numerical simulations. In this study, three types of airfoils are chosen, which are S809, S822 and SD7062 used in wind turbines. A convolutional neural network model is proposed in the combination of VGG16 and U-Net. Learning data are constructed by extracting pressure fields and aerodynamic characteristics through numerical analysis of 2D shape. Based on these data, the pressure field and lift coefficient of untrained airfoils are predicted. As a result, even in untrained airfoils, the pressure field is accurately predicted with an error of within 0.04%.

CATHARE simulation results of the natural circulation characterisation test of the PKL test facility

  • Salah, Anis Bousbia
    • Nuclear Engineering and Technology
    • /
    • v.53 no.5
    • /
    • pp.1446-1453
    • /
    • 2021
  • In the past, several experimental investigations aiming at characterizing the natural circulation (NC) behavior in test facilities were carried out. They showed a variety of flow patterns characterized by an inverted U-shape of the NC flow curve versus primary mass inventory. On the other hand, attempts to reproduce such curves using thermal-hydraulic system codes, showed 10-30% differences between the measured and calculated NC mass flow rate. Actually, the used computer codes are generally based upon nodalization using single U-tube representation. Such model may not allow getting accurate simulation of most of the NC phenomena occurring during such tests (like flow redistribution and flow reversal in some SG U-tubes). Simulations based on multi-U-tubes model, showed better agreement with the overall behavior, but remain unable to predict NC phenomena taking place in the steam generator (SG) during the experiment. In the current study, the CATHARE code is considered in order to assess a NC characterization test performed in the four loops PKL facility. For this purpose, four different SG nodalizations including, single and multi-U-tubes, 1D and 3D SG inlet/outlet zones are considered. In general, it is shown that the 1D and 3D models exhibit similar prediction results up to a certain point of the rising part of the inverted U-shape of the NC flow curve. After that, the results bifurcate with, on the one hand, a tendency of the 1D models to over-predict the measured NC mass flow rate and on the other hand, a tendency of the 3D models to under-predict the NC flow rate.

Deep Learning Approach for Automatic Discontinuity Mapping on 3D Model of Tunnel Face (터널 막장 3차원 지형모델 상에서의 불연속면 자동 매핑을 위한 딥러닝 기법 적용 방안)

  • Chuyen Pham;Hyu-Soung Shin
    • Tunnel and Underground Space
    • /
    • v.33 no.6
    • /
    • pp.508-518
    • /
    • 2023
  • This paper presents a new approach for the automatic mapping of discontinuities in a tunnel face based on its 3D digital model reconstructed by LiDAR scan or photogrammetry techniques. The main idea revolves around the identification of discontinuity areas in the 3D digital model of a tunnel face by segmenting its 2D projected images using a deep-learning semantic segmentation model called U-Net. The proposed deep learning model integrates various features including the projected RGB image, depth map image, and local surface properties-based images i.e., normal vector and curvature images to effectively segment areas of discontinuity in the images. Subsequently, the segmentation results are projected back onto the 3D model using depth maps and projection matrices to obtain an accurate representation of the location and extent of discontinuities within the 3D space. The performance of the segmentation model is evaluated by comparing the segmented results with their corresponding ground truths, which demonstrates the high accuracy of segmentation results with the intersection-over-union metric of approximately 0.8. Despite still being limited in training data, this method exhibits promising potential to address the limitations of conventional approaches, which only rely on normal vectors and unsupervised machine learning algorithms for grouping points in the 3D model into distinct sets of discontinuities.

Alzheimer progression classification using fMRI data (fMRI 데이터를 이용한 알츠하이머 진행상태 분류)

  • Ju Hyeon-Noh;Hee-Deok Yang
    • Smart Media Journal
    • /
    • v.13 no.4
    • /
    • pp.86-93
    • /
    • 2024
  • The development of functional magnetic resonance imaging (fMRI) has significantly contributed to mapping brain functions and understanding brain networks during rest. This paper proposes a CNN-LSTM-based classification model to classify the progression stages of Alzheimer's disease. Firstly, four preprocessing steps are performed to remove noise from the fMRI data before feature extraction. Secondly, the U-Net architecture is utilized to extract spatial features once preprocessing is completed. Thirdly, the extracted spatial features undergo LSTM processing to extract temporal features, ultimately leading to classification. Experiments were conducted by adjusting the temporal dimension of the data. Using 5-fold cross-validation, an average accuracy of 96.4% was achieved, indicating that the proposed method has high potential for identifying the progression of Alzheimer's disease by analyzing fMRI data.

ASUSD nuclear data sensitivity and uncertainty program package: Validation on fusion and fission benchmark experiments

  • Kos, Bor;Cufar, Aljaz;Kodeli, Ivan A.
    • Nuclear Engineering and Technology
    • /
    • v.53 no.7
    • /
    • pp.2151-2161
    • /
    • 2021
  • Nuclear data (ND) sensitivity and uncertainty (S/U) quantification in shielding applications is performed using deterministic and probabilistic approaches. In this paper the validation of the newly developed deterministic program package ASUSD (ADVANTG + SUSD3D) is presented. ASUSD was developed with the aim of automating the process of ND S/U while retaining the computational efficiency of the deterministic approach to ND S/U analysis. The paper includes a detailed description of each of the programs contained within ASUSD, the computational workflow and validation results. ASUSD was validated on two shielding benchmark experiments from the Shielding Integral Benchmark Archive and Database (SINBAD) - the fission relevant ASPIS Iron 88 experiment and the fusion relevant Frascati Neutron Generator (FNG) Helium Cooled Pebble Bed (HCPB) Test Blanket Module (TBM) mock-up experiment. The validation process was performed in two stages. Firstly, the Denovo discrete ordinates transport solver was validated as a standalone solver. Secondly, the ASUSD program package as a whole was validated as a ND S/U analysis tool. Both stages of the validation process yielded excellent results, with a maximum difference of 17% in final uncertainties due to ND between ASUSD and the stochastic ND S/U approach. Based on these results, ASUSD has proven to be a user friendly and computationally efficient tool for deterministic ND S/U analysis of shielding geometries.

Road Extraction from Images Using Semantic Segmentation Algorithm (영상 기반 Semantic Segmentation 알고리즘을 이용한 도로 추출)

  • Oh, Haeng Yeol;Jeon, Seung Bae;Kim, Geon;Jeong, Myeong-Hun
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.40 no.3
    • /
    • pp.239-247
    • /
    • 2022
  • Cities are becoming more complex due to rapid industrialization and population growth in modern times. In particular, urban areas are rapidly changing due to housing site development, reconstruction, and demolition. Thus accurate road information is necessary for various purposes, such as High Definition Map for autonomous car driving. In the case of the Republic of Korea, accurate spatial information can be generated by making a map through the existing map production process. However, targeting a large area is limited due to time and money. Road, one of the map elements, is a hub and essential means of transportation that provides many different resources for human civilization. Therefore, it is essential to update road information accurately and quickly. This study uses Semantic Segmentation algorithms Such as LinkNet, D-LinkNet, and NL-LinkNet to extract roads from drone images and then apply hyperparameter optimization to models with the highest performance. As a result, the LinkNet model using pre-trained ResNet-34 as the encoder achieved 85.125 mIoU. Subsequent studies should focus on comparing the results of this study with those of studies using state-of-the-art object detection algorithms or semi-supervised learning-based Semantic Segmentation techniques. The results of this study can be applied to improve the speed of the existing map update process.

Effective Multi-Modal Feature Fusion for 3D Semantic Segmentation with Multi-View Images (멀티-뷰 영상들을 활용하는 3차원 의미적 분할을 위한 효과적인 멀티-모달 특징 융합)

  • Hye-Lim Bae;Incheol Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.505-518
    • /
    • 2023
  • 3D point cloud semantic segmentation is a computer vision task that involves dividing the point cloud into different objects and regions by predicting the class label of each point. Existing 3D semantic segmentation models have some limitations in performing sufficient fusion of multi-modal features while ensuring both characteristics of 2D visual features extracted from RGB images and 3D geometric features extracted from point cloud. Therefore, in this paper, we propose MMCA-Net, a novel 3D semantic segmentation model using 2D-3D multi-modal features. The proposed model effectively fuses two heterogeneous 2D visual features and 3D geometric features by using an intermediate fusion strategy and a multi-modal cross attention-based fusion operation. Also, the proposed model extracts context-rich 3D geometric features from input point cloud consisting of irregularly distributed points by adopting PTv2 as 3D geometric encoder. In this paper, we conducted both quantitative and qualitative experiments with the benchmark dataset, ScanNetv2 in order to analyze the performance of the proposed model. In terms of the metric mIoU, the proposed model showed a 9.2% performance improvement over the PTv2 model using only 3D geometric features, and a 12.12% performance improvement over the MVPNet model using 2D-3D multi-modal features. As a result, we proved the effectiveness and usefulness of the proposed model.