• Title/Summary/Keyword: 합성곱 신경망

Search Result 529, Processing Time 0.026 seconds

Generation of virtual mandibular first molar teeth and accuracy analysis using deep convolutional generative adversarial network (심층 합성곱 생성적 적대 신경망을 활용한 하악 제1대구치 가상 치아 생성 및 정확도 분석)

  • Eun-Jeong Bae;Sun-Young Ihm
    • Journal of Technologic Dentistry
    • /
    • v.46 no.2
    • /
    • pp.36-41
    • /
    • 2024
  • Purpose: This study aimed to generate virtual mandibular left first molar teeth using deep convolutional generative adversarial networks (DCGANs) and analyze their matching accuracy with actual tooth morphology to propose a new paradigm for using medical data. Methods: Occlusal surface images of the mandibular left first molar scanned using a dental model scanner were analyzed using DCGANs. Overall, 100 training sets comprising 50 original and 50 background-removed images were created, thus generating 1,000 virtual teeth. These virtual teeth were classified based on the number of cusps and occlusal surface ratio, and subsequently, were analyzed for consistency by expert dental technicians over three rounds of examination. Statistical analysis was conducted using IBM SPSS Statistics ver. 23.0 (IBM), including intraclass correlation coefficient for intrarater reliability, one-way ANOVA, and Tukey's post-hoc analysis. Results: Virtual mandibular left first molars exhibited high consistency in the occlusal surface ratio but varied in other criteria. Moreover, consistency was the highest in the occlusal buccal lingual criteria at 91.9%, whereas discrepancies were observed most in the occusal buccal cusp criteria at 85.5%. Significant differences were observed among all groups (p<0.05). Conclusion: Based on the classification of the virtually generated left mandibular first molar according to several criteria, DCGANs can generate virtual data highly similar to real data. Thus, subsequent research in the dental field, including the development of improved neural network structures, is necessary.

Algorithm development for texture and color style transfer of cultural heritage images (문화유산 이미지의 질감과 색상 스타일 전이를 위한 알고리즘 개발 연구)

  • Baek Seohyun;Cho Yeeun;Ahn Sangdoo;Choi Jongwon
    • Conservation Science in Museum
    • /
    • v.31
    • /
    • pp.55-70
    • /
    • 2024
  • Style transfer algorithms are currently undergoing active research and are used, for example, to convert ordinary images into classical painting styles. However, such algorithms have yet to produce appropriate results when applied to Korean cultural heritage images, while the number of cases for such applications also remains insufficient. Accordingly, this study attempts to develop a style transfer algorithm that can be applied to styles found among Korean cultural heritage. The algorithm was produced by improving data comprehension by enabling it to learn meaningful characteristics of the styles through representation learning and to separate the cultural heritage from the background in the target images, allowing it to extract the style-relevant areas with the desired color and texture from the style images. This study confirmed that, by doing so, a new image can be created by effectively transferring the characteristics of the style image while maintaining the form of the target image, which thereby enables the transfer of a variety of cultural heritage styles.

Deep-Learning-Based Mine Detection Using Simulated Data (시뮬레이션 데이터 기반으로 학습된 딥러닝 모델을 활용한 지뢰식별연구)

  • Buhwan Jeon;Chunju Lee
    • Journal of The Korean Institute of Defense Technology
    • /
    • v.5 no.4
    • /
    • pp.16-21
    • /
    • 2023
  • Although the global number of landmines is on a declining trend, the damages caused by previously buried landmines persist. In light of this, the present study contemplates solutions to issues and constraints that may arise due to the improvement of mine detection equipment and the reduction in the number of future soldiers. Current mine detectors lack data storage capabilities, posing limitations on data collection for research purposes. Additionally, practical data collection in real-world environments demands substantial time and manpower. Therefore, in this study, gprMax simulation was utilized to generate data. The lightweight CNN-based model, MobileNet, was trained and validated with real data, achieving a high identification rate of 97.35%. Consequently, the potential integration of technologies such as deep learning and simulation into geographical detection equipment is highlighted, offering a pathway to address potential future challenges. The study aims to somewhat alleviate these issues and anticipates contributing to the development of our military capabilities in becoming a future scientific and technological force.

  • PDF

Proposal of a Step-by-Step Optimized Campus Power Forecast Model using CNN-LSTM Deep Learning (CNN-LSTM 딥러닝 기반 캠퍼스 전력 예측 모델 최적화 단계 제시)

  • Kim, Yein;Lee, Seeun;Kwon, Youngsung
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.10
    • /
    • pp.8-15
    • /
    • 2020
  • A forecasting method using deep learning does not have consistent results due to the differences in the characteristics of the dataset, even though they have the same forecasting models and parameters. For example, the forecasting model X optimized with dataset A would not produce the optimized result with another dataset B. The forecasting model with the characteristics of the dataset needs to be optimized to increase the accuracy of the forecasting model. Therefore, this paper proposes novel optimization steps for outlier removal, dataset classification, and a CNN-LSTM-based hyperparameter tuning process to forecast the daily power usage of a university campus based on the hourly interval. The proposing model produces high forecasting accuracy with a 2% of MAPE with a single power input variable. The proposing model can be used in EMS to suggest improved strategies to users and consequently to improve the power efficiency.

Multi-Object Goal Visual Navigation Based on Multimodal Context Fusion (멀티모달 맥락정보 융합에 기초한 다중 물체 목표 시각적 탐색 이동)

  • Jeong Hyun Choi;In Cheol Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.9
    • /
    • pp.407-418
    • /
    • 2023
  • The Multi-Object Goal Visual Navigation(MultiOn) is a visual navigation task in which an agent must visit to multiple object goals in an unknown indoor environment in a given order. Existing models for the MultiOn task suffer from the limitation that they cannot utilize an integrated view of multimodal context because use only a unimodal context map. To overcome this limitation, in this paper, we propose a novel deep neural network-based agent model for MultiOn task. The proposed model, MCFMO, uses a multimodal context map, containing visual appearance features, semantic features of environmental objects, and goal object features. Moreover, the proposed model effectively fuses these three heterogeneous features into a global multimodal context map by using a point-wise convolutional neural network module. Lastly, the proposed model adopts an auxiliary task learning module to predict the observation status, goal direction and the goal distance, which can guide to learn the navigational policy efficiently. Conducting various quantitative and qualitative experiments using the Habitat-Matterport3D simulation environment and scene dataset, we demonstrate the superiority of the proposed model.

Comparison of CNN and GAN-based Deep Learning Models for Ground Roll Suppression (그라운드-롤 제거를 위한 CNN과 GAN 기반 딥러닝 모델 비교 분석)

  • Sangin Cho;Sukjoon Pyun
    • Geophysics and Geophysical Exploration
    • /
    • v.26 no.2
    • /
    • pp.37-51
    • /
    • 2023
  • The ground roll is the most common coherent noise in land seismic data and has an amplitude much larger than the reflection event we usually want to obtain. Therefore, ground roll suppression is a crucial step in seismic data processing. Several techniques, such as f-k filtering and curvelet transform, have been developed to suppress the ground roll. However, the existing methods still require improvements in suppression performance and efficiency. Various studies on the suppression of ground roll in seismic data have recently been conducted using deep learning methods developed for image processing. In this paper, we introduce three models (DnCNN (De-noiseCNN), pix2pix, and CycleGAN), based on convolutional neural network (CNN) or conditional generative adversarial network (cGAN), for ground roll suppression and explain them in detail through numerical examples. Common shot gathers from the same field were divided into training and test datasets to compare the algorithms. We trained the models using the training data and evaluated their performances using the test data. When training these models with field data, ground roll removed data are required; therefore, the ground roll is suppressed by f-k filtering and used as the ground-truth data. To evaluate the performance of the deep learning models and compare the training results, we utilized quantitative indicators such as the correlation coefficient and structural similarity index measure (SSIM) based on the similarity to the ground-truth data. The DnCNN model exhibited the best performance, and we confirmed that other models could also be applied to suppress the ground roll.

CNN Model-based Arrhythmia Classification using Image-typed ECG Data (이미지 타입의 ECG 데이터를 사용한 CNN 모델 기반 부정맥 분류)

  • Yeon-Suk Bang;Myung-Soo Jang;Yousik Hong;Sang-Suk Lee;Jun-Sang Yu;Woo-Beom Lee
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.24 no.4
    • /
    • pp.205-212
    • /
    • 2023
  • Among cardiac diseases, arrhythmias can lead to serious complications such as stroke, heart attack, and heart failure if left untreated, so continuous and accurate ECG monitoring is crucial for clinical care. However, the accurate interpretation of electrocardiogram (ECG) data is entirely dependent on medical doctors, which requires additional time and cost. Therefore, this paper proposes an arrhythmia recognition module for the purpose of developing a medical platform through the analysis of abnormal pulse waveforms based on Lifelogs. The proposed method is to convert ECG data into image format instead of time series data, apply visual pattern recognition technology, and then detect arrhythmia using CNN model. In order to validate the arrhythmia classification of the CNN model by image type conversion of ECG data proposed in this paper, the MIT-BIH arrhythmia dataset was used, and the result showed an accuracy of 97%.

Rainfall image DB construction for rainfall intensity estimation from CCTV videos: focusing on experimental data in a climatic environment chamber (CCTV 영상 기반 강우강도 산정을 위한 실환경 실험 자료 중심 적정 강우 이미지 DB 구축 방법론 개발)

  • Byun, Jongyun;Jun, Changhyun;Kim, Hyeon-Joon;Lee, Jae Joon;Park, Hunil;Lee, Jinwook
    • Journal of Korea Water Resources Association
    • /
    • v.56 no.6
    • /
    • pp.403-417
    • /
    • 2023
  • In this research, a methodology was developed for constructing an appropriate rainfall image database for estimating rainfall intensity based on CCTV video. The database was constructed in the Large-Scale Climate Environment Chamber of the Korea Conformity Laboratories, which can control variables with high irregularity and variability in real environments. 1,728 scenarios were designed under five different experimental conditions. 36 scenarios and a total of 97,200 frames were selected. Rain streaks were extracted using the k-nearest neighbor algorithm by calculating the difference between each image and the background. To prevent overfitting, data with pixel values greater than set threshold, compared to the average pixel value for each image, were selected. The area with maximum pixel variability was determined by shifting with every 10 pixels and set as a representative area (180×180) for the original image. After re-transforming to 120×120 size as an input data for convolutional neural networks model, image augmentation was progressed under unified shooting conditions. 92% of the data showed within the 10% absolute range of PBIAS. It is clear that the final results in this study have the potential to enhance the accuracy and efficacy of existing real-world CCTV systems with transfer learning.

Detection of Steel Ribs in Tunnel GPR Images Based on YOLO Algorithm (YOLO 알고리즘을 활용한 터널 GPR 이미지 내 강지보재 탐지)

  • Bae, Byongkyu;Ahn, Jaehun;Jung, Hyunjun;Yoo, Chang Kyoon
    • Journal of the Korean Geotechnical Society
    • /
    • v.39 no.7
    • /
    • pp.31-37
    • /
    • 2023
  • Since tunnels are built underground, it is impossible to check visually the location and degree of deterioration of steel ribs. Therefore, in tunnel maintenance, GPR images are generally used to detect steel ribs. While research on GPR image analysis employing artificial neural networks has primarily focused on detecting underground pipes and road damage, there have been limited applications for analyzing tunnel GPR data, specifically for steel rib detection, both internationally and domestically. In this study, a one-step object detection algorithm called YOLO, based on a convolutional neural network, was utilized to automate the localization of steel ribs using GPR data. The performance of the algorithm is then analyzed. Two datasets were employed for the analysis. A dataset comprising 512 original images and another dataset consisting of 2,048 augmented images. The omission rate, which represents the ratio of undetected steel ribs to the total number of steel ribs, was 0.38% for the model using the augmented data, whereas the omission rate for the model using only the original data was 7.18%. Thus, from an automation standpoint, it is more practical to employ an augmented dataset.

Turtle Neck Syndrome Posture Correction Service Using CNN-based Learning Model (CNN기반의 학습모델을 활용한 거북목 증후군 자세 교정 시스템)

  • Han, Ji-Ye;Park, Jin-Ho
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.7
    • /
    • pp.47-55
    • /
    • 2020
  • Along with the increased use of smart devices, the incidence of turtle neck syndrome among modern people has increased. Turtle neck syndrome is a posture in which the head is forward compared to the torso due to longer front muscles in the neck and shorter upper muscles, and it is more effective to fix the usual posture habits than surgery or medication. Thus, in this paper, a system is proposed to detect and warn posture that can cause turtle neck syndrome in real time. Image data of correct posture and turtle neck posture are collected to create a CNN-based learning model. Using only the webcam(Built-in camera), the sitting position that enters the camera is verified in real time through the learning model, and if it is a turtle neck position, it generates a warning sound and induces the correct posture. The system can induce people to correct their usual posture habits to treat turtle neck syndrome and prevent more serious diseases such as neck discs.