• Title/Summary/Keyword: Deep Learning System

Search Result 1,745, Processing Time 0.029 seconds

Deep Learning-based Approach for Visitor Detection and Path Tracking to Enhance Safety in Indoor Cultural Facilities (실내 문화시설 안전을 위한 딥러닝 기반 방문객 검출 및 동선 추적에 관한 연구)

  • Wonseop Shin;Seungmin, Rho
    • Journal of Platform Technology
    • /
    • v.11 no.4
    • /
    • pp.3-12
    • /
    • 2023
  • In the post-COVID era, the importance of quarantine measures is greatly emphasized, and accordingly, research related to the detection of mask wearing conditions and prevention of other infectious diseases using deep learning is being conducted. However, research on the detection and tracking of visitors to cultural facilities to prevent the spread of diseases is equally important, so research on this should be conducted. In this paper, a convolutional neural network-based object detection model is trained through transfer learning using a pre-collected dataset. The weights of the trained detection model are then applied to a multi-object tracking model to monitor visitors. The visitor detection model demonstrates results with a precision of 96.3%, recall of 85.2%, and an F1-score of 90.4%. Quantitative results of the tracking model include a MOTA (Multiple Object Tracking Accuracy) of 65.6%, IDF1 (ID F1 Score) of 68.3%, and HOTA (Higher Order Tracking Accuracy) of 57.2%. Furthermore, a qualitative comparison with other multi-object tracking models showcased superior results for the model proposed in this paper. The research of this paper can be applied to the hygiene systems within cultural facilities in the post-COVID era.

  • PDF

Estimation of Heading Date of Paddy Rice from Slanted View Images Using Deep Learning Classification Model

  • Hyeokjin Bak;Hoyoung Ban;SeongryulChang;Dongwon Gwon;Jae-Kyeong Baek;Jeong-Il Cho;Wan-Gyu Sang
    • Proceedings of the Korean Society of Crop Science Conference
    • /
    • 2022.10a
    • /
    • pp.80-80
    • /
    • 2022
  • Estimation of heading date of paddy rice is laborious and time consuming. Therefore, automatic estimation of heading date of paddy rice is highly essential. In this experiment, deep learning classification models were used to classify two difference categories of rice (vegetative and reproductive stage) based on the panicle initiation of paddy field. Specifically, the dataset includes 444 slanted view images belonging to two categories and was then expanded to include 1,497 images via IMGAUG data augmentation technique. We adopt two transfer learning strategies: (First, used transferring model weights already trained on ImageNet to six classification network models: VGGNet, ResNet, DenseNet, InceptionV3, Xception and MobileNet, Second, fine-tuned some layers of the network according to our dataset). After training the CNN model, we used several evaluation metrics commonly used for classification tasks, including Accuracy, Precision, Recall, and F1-score. In addition, GradCAM was used to generate visual explanations for each image patch. Experimental results showed that the InceptionV3 is the best performing model in terms of the accuracy, average recall, precision, and F1-score. The fine-tuned InceptionV3 model achieved an overall classification accuracy of 0.95 with a high F1-score of 0.95. Our CNN model also represented the change of rice heading date under different date of transplanting. This study demonstrated that image based deep learning model can reliably be used as an automatic monitoring system to detect the heading date of rice crops using CCTV camera.

  • PDF

Prerequisite Research for the Development of an End-to-End System for Automatic Tooth Segmentation: A Deep Learning-Based Reference Point Setting Algorithm (자동 치아 분할용 종단 간 시스템 개발을 위한 선결 연구: 딥러닝 기반 기준점 설정 알고리즘)

  • Kyungdeok Seo;Sena Lee;Yongkyu Jin;Sejung Yang
    • Journal of Biomedical Engineering Research
    • /
    • v.44 no.5
    • /
    • pp.346-353
    • /
    • 2023
  • In this paper, we propose an innovative approach that leverages deep learning to find optimal reference points for achieving precise tooth segmentation in three-dimensional tooth point cloud data. A dataset consisting of 350 aligned maxillary and mandibular cloud data was used as input, and both end coordinates of individual teeth were used as correct answers. A two-dimensional image was created by projecting the rendered point cloud data along the Z-axis, where an image of individual teeth was created using an object detection algorithm. The proposed algorithm is designed by adding various modules to the Unet model that allow effective learning of a narrow range, and detects both end points of the tooth using the generated tooth image. In the evaluation using DSC, Euclid distance, and MAE as indicators, we achieved superior performance compared to other Unet-based models. In future research, we will develop an algorithm to find the reference point of the point cloud by back-projecting the reference point detected in the image in three dimensions, and based on this, we will develop an algorithm to divide the teeth individually in the point cloud through image processing techniques.

Development of a Deep Learning-Based Automated Analysis System for Facial Vitiligo Treatment Evaluation (안면 백반증 치료 평가를 위한 딥러닝 기반 자동화 분석 시스템 개발)

  • Sena Lee;Yeon-Woo Heo;Solam Lee;Sung Bin Park
    • Journal of Biomedical Engineering Research
    • /
    • v.45 no.2
    • /
    • pp.95-100
    • /
    • 2024
  • Vitiligo is a condition characterized by the destruction or dysfunction of melanin-producing cells in the skin, resulting in a loss of skin pigmentation. Facial vitiligo, specifically affecting the face, significantly impacts patients' appearance, thereby diminishing their quality of life. Evaluating the efficacy of facial vitiligo treatment typically relies on subjective assessments, such as the Facial Vitiligo Area Scoring Index (F-VASI), which can be time-consuming and subjective due to its reliance on clinical observations like lesion shape and distribution. Various machine learning and deep learning methods have been proposed for segmenting vitiligo areas in facial images, showing promising results. However, these methods often struggle to accurately segment vitiligo lesions irregularly distributed across the face. Therefore, our study introduces a framework aimed at improving the segmentation of vitiligo lesions on the face and providing an evaluation of vitiligo lesions. Our framework for facial vitiligo segmentation and lesion evaluation consists of three main steps. Firstly, we perform face detection to minimize background areas and identify the face area of interest using high-quality ultraviolet photographs. Secondly, we extract facial area masks and vitiligo lesion masks using a semantic segmentation network-based approach with the generated dataset. Thirdly, we automatically calculate the vitiligo area relative to the facial area. We evaluated the performance of facial and vitiligo lesion segmentation using an independent test dataset that was not included in the training and validation, showing excellent results. The framework proposed in this study can serve as a useful tool for evaluating the diagnosis and treatment efficacy of vitiligo.

Addressing Inter-floor Noise Issues in Apartment Buildings using On-Sensor AI Embedded with TinyML on Ultra-Low-Power Systems

  • Jae-Won Kwak;In-Yeop Choi
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.3
    • /
    • pp.75-81
    • /
    • 2024
  • In this paper, we proposes a method for real-time processing of inter-floor noise problems by embedding TinyML, which includes a deep learning model, into ultra-low-power systems. The reason this method is feasible is because of lightweight deep learning model technology, which allows even systems with small computing resources to perform inference autonomously. The conventional method proposed to solve inter-floor noise problems was to send data collected from sensors to a server for analysis and processing. However, this centralized processing method has issues with high costs, complexity, and difficulty in real-time processing. In this paper, we address these limitations by employing On-Sensor AI using TinyML. The method presented in this paper is simple to install, cost-effective, and capable of processing problems in real-time.

Deep Learning Based Pine Nut Detection in UAV Aerial Video (UAV 항공 영상에서의 딥러닝 기반 잣송이 검출)

  • Kim, Gyu-Min;Park, Sung-Jun;Hwang, Seung-Jun;Kim, Hee Yeong;Baek, Joong-Hwan
    • Journal of Advanced Navigation Technology
    • /
    • v.25 no.1
    • /
    • pp.115-123
    • /
    • 2021
  • Pine nuts are Korea's representative nut forest products and profitable crops. However, pine nuts are harvested by climbing the trees themselves, thus the risk is high. In order to solve this problem, it is necessary to harvest pine nuts using a robot or an unmanned aerial vehicle(UAV). In this paper, we propose a deep learning based detection method for harvesting pine nut in UAV aerial images. For this, a video was recorded in a real pine forest using UAV, and a data augmentation technique was used to supplement a small number of data. As the data for 3D detection, Unity3D was used to model the virtual pine nut and the virtual environment, and the labeling was acquired using the 3D transformation method of the coordinate system. Deep learning algorithms for detection of pine nuts distribution area and 2D and 3D detection of pine nuts objects were used DeepLabV3+, YOLOv4, and CenterNet, respectively. As a result of the experiment, the detection rate of pine nuts distribution area was 82.15%, the 2D detection rate was 86.93%, and the 3D detection rate was 59.45%.

Artificial intelligence-based indoor positioning technology trends and prospects (인공지능 기반 실내 측위 기술 동향 및 전망)

  • An, Hyeon-U;Mun, Nam-Mi
    • Broadcasting and Media Magazine
    • /
    • v.25 no.1
    • /
    • pp.75-82
    • /
    • 2020
  • 디지털 트윈이나 증강현실, 가상현실, 자율주행 등과 같이 현실 좌표계의 위치를 다루거나 현실과 가상세계를 융합하는 기술들에 있어 측위 기술은 상당히 주요하게 작용한다. 측위 기술은 그 목적과 타겟 디바이스에 따라 매우 다양하게 존재하며, 기존 측위 기술들에 인공지능을 융합하여 정밀도와 측위 주기를 개선시키는 등 다양한 연구가 진행되고 있는 분야이다. 본 고에서는 기존의 다양한 측위 기술들의 동향과 인공지능을 융합하여 성능을 높인 사례들에 대해 설명한다.

Automatic Detection System of Underground Pipe Using 3D GPR Exploration Data and Deep Convolutional Neural Networks

  • Son, Jeong-Woo;Moon, Gwi-Seong;Kim, Yoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.2
    • /
    • pp.27-37
    • /
    • 2021
  • In this paper, we propose Automatic detection system of underground pipe which automatically detects underground pipe to help experts. Actual location of underground pipe does not match with blueprint due to various factors such as ground changes over time, construction discrepancies, etc. So, various accidents occur during excavation or just by ageing. Locating underground utilities is done through GPR exploration to prevent these accidents but there are shortage of experts, because GPR data is enormous and takes long time to analyze. In this paper, To analyze 3D GPR data automatically, we use 3D image segmentation, one of deep learning technique, and propose proper data generation algorithm. We also propose data augmentation technique and pre-processing module that are adequate to GPR data. In experiment results, we found the possibility for pipe analysis using image segmentation through our system recorded the performance of F1 score 40.4%.

Rear-Approaching Vehicle Detection Research using Region of Interesting based on Faster R-CNN (Faster R-CNN 기반의 관심영역 유사도를 이용한 후방 접근차량 검출 연구)

  • Lee, Yeung-Hak;Kim, Joong-Soo;Shim, Jae-Chnag
    • Journal of IKEEE
    • /
    • v.23 no.1
    • /
    • pp.235-241
    • /
    • 2019
  • In this paper, we propose a new algorithm to detect rear-approaching vehicle using the frame similarity of ROI(Region of Interest) based on deep learning algorithm for use in agricultural machinery systems. Since the vehicle detection system for agricultural machinery needs to detect only a vehicle approaching from the rear. we use Faster R-CNN model that shows excellent accuracy rate in deep learning for vehicle detection. And we proposed an algorithm that uses the frame similarity for ROI using constrained conditions. Experimental results show that the proposed method has a detection rate of 99.9% and reduced the false positive values.

Localization and size estimation for breaks in nuclear power plants

  • Lin, Ting-Han;Chen, Ching;Wu, Shun-Chi;Wang, Te-Chuan;Ferng, Yuh-Ming
    • Nuclear Engineering and Technology
    • /
    • v.54 no.1
    • /
    • pp.193-206
    • /
    • 2022
  • Several algorithms for nuclear power plant (NPP) break event detection, isolation, localization, and size estimation are proposed. A break event can be promptly detected and isolated after its occurrence by simultaneously monitoring changes in the sensing readings and by employing an interquartile range-based isolation scheme. By considering the multi-sensor data block of a break to be rank-one, it can be located as the position whose lead field vector is most orthogonal to the noise subspace of that data block using the Multiple Signal Classification (MUSIC) algorithm. Owing to the flexibility of deep neural networks in selecting the best regression model for the available data, we can estimate the break size using multiple-sensor recordings of the break regardless of the sensor types. The efficacy of the proposed algorithms was evaluated using the data generated by Maanshan NPP simulator. The experimental results demonstrated that the MUSIC method could distinguish two near breaks. However, if the two breaks were close and of small sizes, the MUSIC method might wrongly locate them. The break sizes estimated by the proposed deep learning model were close to their actual values, but relative errors of more than 8% were seen while estimating small breaks' sizes.