• Title/Summary/Keyword: improved YOLOv5

A Study on Biomass Estimation Technique of Invertebrate Grazers Using Multi-object Tracking Model Based on Deep Learning (딥러닝 기반 다중 객체 추적 모델을 활용한 조식성 무척추동물 현존량 추정 기법 연구)

  • Bak, Suho;Kim, Heung-Min;Lee, Heeone;Han, Jeong-Ik;Kim, Tak-Young;Lim, Jae-Young;Jang, Seon Woong
    • Korean Journal of Remote Sensing / v.38 no.3 / pp.237-250 / 2022
  • In this study, we propose a method for estimating the biomass of invertebrate grazers from videos captured by underwater drones, using a deep-learning-based multi-object tracking model. To detect invertebrate grazers by class, we used YOLOv5 (You Only Look Once version 5). For biomass estimation, we used DeepSORT (Deep Simple Online and Realtime Tracking). The performance of each model was evaluated on a workstation with a GPU accelerator. YOLOv5 averaged a mean Average Precision (mAP) of 0.9 or higher, and we confirmed that the combination of the YOLOv5s model and the DeepSORT algorithm runs at about 59 fps at 4K resolution. When the proposed method was applied in the field, biomass tended to be overestimated by about 28%, but the error was lower than that of biomass estimation using an object detection model alone. A follow-up study is needed to improve accuracy in cases where frames remain out of focus for an extended period or the underwater drone turns rapidly. Once these issues are addressed, however, the method can be used to produce decision-support data for the control and monitoring of invertebrate grazers.
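The abstract above reports that tracking reduces error compared with detection alone. The following minimal Python sketch illustrates one plausible reason, assuming (this is not stated in the abstract) that the estimate is built from per-class counts of individuals: summing detections frame by frame re-counts the same animal, whereas counting unique track IDs counts it once. The function names, class labels, and sample data are hypothetical, and the tracker output is mocked as `(track_id, class_name)` pairs rather than produced by YOLOv5 or DeepSORT.

```python
from collections import defaultdict

def count_by_detection(frames):
    """Naive count: sum detections over all frames (re-counts the same animal)."""
    counts = defaultdict(int)
    for detections in frames:
        for _track_id, class_name in detections:
            counts[class_name] += 1
    return dict(counts)

def count_by_track(frames):
    """Tracking-based count: each track ID contributes once per class."""
    seen = defaultdict(set)
    for detections in frames:
        for track_id, class_name in detections:
            seen[class_name].add(track_id)
    return {name: len(ids) for name, ids in seen.items()}

# Hypothetical tracker output: one list of (track_id, class_name) per frame.
frames = [
    [(1, "sea_urchin"), (2, "sea_urchin")],
    [(1, "sea_urchin"), (2, "sea_urchin"), (3, "starfish")],
    [(2, "sea_urchin"), (3, "starfish")],
]

print(count_by_detection(frames))
print(count_by_track(frames))
```

In this toy input the same two sea urchins appear in several frames, so the detection-only count inflates while the track-based count does not. A real tracker can still over-count when an ID is lost and reassigned, for example after defocus or a rapid turn.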

Multi-Human Behavior Recognition Based on Improved Posture Estimation Model

  • Zhang, Ning;Park, Jin-Ho;Lee, Eung-Joo
    • Journal of Korea Multimedia Society / v.24 no.5 / pp.659-666 / 2021
  • With the continuous development of deep learning, human behavior recognition algorithms have achieved good results. However, in multi-person settings, the complexity of the scene poses a great challenge to recognition efficiency. To this end, this paper proposes a multi-person pose estimation model. First, the human detectors in top-down frameworks mostly use two-stage target detection models, which run slowly. The single-stage YOLOv3 target detection model is used instead to improve the running speed and generalization of the model, and depthwise separable convolution is applied to further improve detection speed and the model's ability to extract target region proposals. Second, based on a feature pyramid network combined with contextual semantic information in the pose estimation model, the OHEM (online hard example mining) algorithm is used to handle hard-to-detect key points, improving the accuracy of multi-person pose estimation. Finally, the Euclidean distance is used to calculate the spatial distance between key points, determine the similarity of postures in the frame, and eliminate redundant postures.
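The last step of the abstract above, removing redundant postures by Euclidean distance between key points, can be sketched as follows. This is only an illustration under assumptions: the abstract does not give the exact similarity measure or threshold, so the mean per-keypoint distance, the greedy keep-first strategy, the `threshold` value, and the function names are all hypothetical.

```python
import math

def pose_distance(pose_a, pose_b):
    """Mean Euclidean distance between corresponding key points of two poses."""
    dists = [math.dist(p, q) for p, q in zip(pose_a, pose_b)]
    return sum(dists) / len(dists)

def remove_redundant_poses(poses, threshold=10.0):
    """Keep a pose only if it is farther than `threshold` from every kept pose."""
    kept = []
    for pose in poses:
        if all(pose_distance(pose, other) > threshold for other in kept):
            kept.append(pose)
    return kept

# Hypothetical poses with two (x, y) key points each, in pixels.
poses = [
    [(0.0, 0.0), (10.0, 10.0)],
    [(1.0, 1.0), (11.0, 11.0)],      # near-duplicate of the first pose
    [(100.0, 100.0), (110.0, 110.0)],
]

unique_poses = remove_redundant_poses(poses)
print(len(unique_poses))
```

A pixel threshold is scale dependent; in practice the distance would likely be normalized by the size of the person's bounding box.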

A study on the detection of pedestrians in crosswalks using multi-spectrum (다중스펙트럼을 이용한 횡단보도 보행자 검지에 관한 연구)

  • Kim, Junghun;Choi, Doo-Hyun;Lee, JongSun;Lee, Donghwa
    • Journal of Korea Society of Industrial Information Systems / v.27 no.1 / pp.11-18 / 2022
  • The use of multi-spectral cameras is essential for day and night pedestrian detection. In this paper, a color camera and a thermal infrared camera were used to detect pedestrians near a crosswalk for 24 hours at an intersection with a high risk of traffic accidents. For pedestrian detection, the YOLOv5 object detector was used, and detection performance was improved by using color images and thermal images at the same time. The proposed system achieved a high mAP of 0.940 on a day/night multi-spectral (color and thermal image) pedestrian dataset obtained from the actual crosswalk site.

Deep-learning-based gestational sac detection in ultrasound images using modified YOLOv7-E6E model

  • Tae-kyeong Kim;Jin Soo Kim;Hyun-chong Cho
    • Journal of Animal Science and Technology / v.65 no.3 / pp.627-637 / 2023
  • As population and income levels rise, meat consumption increases steadily every year. However, the number of farms and farmers producing meat has decreased over the same period, reducing meat self-sufficiency. Information and Communications Technology (ICT) has begun to be applied to reduce the labor and production costs of livestock farms and to improve productivity. This technology can be used for rapid pregnancy diagnosis of sows; the location and size of the gestational sacs of sows are directly related to the productivity of the farm. In this study, a system is proposed to determine the number of gestational sacs of sows from ultrasound images. The system uses the YOLOv7-E6E model, with the activation function changed from the sigmoid-weighted linear unit (SiLU) to a multi-activation function (SiLU + Mish). In addition, the upsampling method was changed from nearest-neighbor to bicubic to improve performance. The original model trained on the original data achieved a mean average precision of 86.3%. When the proposed multi-activation function, upsampling, and AutoAugment were applied, the performance improved by 0.3%, 0.9%, and 0.9%, respectively. When all three proposed methods were applied simultaneously, performance improved significantly, by 3.5% to 89.8%.
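For reference, the two activation functions named in the abstract above have standard definitions: SiLU(x) = x · sigmoid(x) and Mish(x) = x · tanh(softplus(x)). The abstract does not say how the two are combined inside the network, so the sketch below only defines each function in plain Python and compares them at a few points. The function names are illustrative, and the simple `softplus` is only suitable for moderate inputs.

```python
import math

def sigmoid(x):
    """Logistic sigmoid."""
    return 1.0 / (1.0 + math.exp(-x))

def softplus(x):
    """softplus(x) = ln(1 + e^x); adequate for moderate x in this sketch."""
    return math.log1p(math.exp(x))

def silu(x):
    """Sigmoid-weighted linear unit: x * sigmoid(x)."""
    return x * sigmoid(x)

def mish(x):
    """Mish: x * tanh(softplus(x))."""
    return x * math.tanh(softplus(x))

for x in (-2.0, 0.0, 1.0, 2.0):
    print(f"x={x:+.1f}  silu={silu(x):.4f}  mish={mish(x):.4f}")
```

Both functions are smooth, pass through zero at x = 0, and allow small negative outputs, so they behave similarly while differing slightly in shape.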

Vehicle Detection Algorithm Using Super Resolution Based on Deep Residual Dense Block for Remote Sensing Images (원격 영상에서 심층 잔차 밀집 기반의 초고해상도 기법을 이용한 차량 검출 알고리즘)

  • Oh-Seol Kwon
    • Journal of Broadcast Engineering / v.28 no.1 / pp.124-131 / 2023
  • Object detection techniques are increasingly used to obtain information on the physical characteristics or conditions of a specific area from remote images. Object detection accuracy decreases in low-resolution remote sensing images because low resolution reduces the amount of detail that can be captured in an image. A single neural network is proposed that combines a super-resolution method with an object detection method. The proposed method constructs a deep residual-based network to restore object features in low-resolution images, and improves object detection performance by joining this network with YOLOv5 into a single network. The proposed method is tested experimentally on low-resolution images from the VEDAI dataset. The results show that vehicle detection performance improved by 81.38% on mAP@0.5 for VISIBLE data.

A Development on Deep Learning-based Detecting Technology of Rebar Placement for Improving Building Supervision Efficiency (감리업무 효율성 향상을 위한 딥러닝 기반 철근배근 디텍팅 기술 개발)

  • Park, Jin-Hui;Kim, Tae-Hoon;Choo, Seung-Yeon
    • Journal of the Architectural Institute of Korea Planning & Design / v.36 no.5 / pp.93-103 / 2020
  • The purpose of this study is to propose a way to improve the efficiency of building supervision using deep learning, especially object detection technology. Since the building supervision system was established in Korea, the system itself has been changed and improved many times, but it is hard to find any improvement in the methods used to carry it out. Supervision therefore remains an area that requires a great deal of money, time, and manpower. This can leave room for superficial, perfunctory, paperwork-only supervision that could lead to faulty construction. This study proposes a more automated and effective approach to building supervision that can save time, effort, and money. The approach is to detect the hoop bars of a column and count them automatically. For this study, we built a hoop-bar detection network by transfer learning of a YOLOv2 network in MATLAB. Among many training experiments, the most accurate network was selected; it detected rebar placement in building site pictures with an accuracy of 92.85% for images similar to those used in training, and 90% or more for new images taken at a specific distance. It was also able to count the number of hoop bars. The results show the feasibility of automated building supervision and its potential to improve efficiency.

A Study on Image Preprocessing Methods for Automatic Detection of Ship Corrosion Based on Deep Learning (딥러닝 기반 선박 부식 자동 검출을 위한 이미지 전처리 방안 연구)

  • Yun, Gwang-ho;Oh, Sang-jin;Shin, Sung-chul
    • Journal of the Korean Society of Industry Convergence / v.25 no.4_2 / pp.573-586 / 2022
  • Corrosion can cause dangerous and expensive damage to and failures of ship hulls and equipment. Therefore, vessels must be maintained through periodic corrosion inspections. During visual inspection, many corrosion locations are inaccessible for various reasons, especially from a safety point of view. The subjective judgment of inspectors is another issue with visual inspection. Many studies have attempted to automate visual inspection. In this study, we propose image preprocessing methods based on image patch segmentation and thresholding. YOLOv5 was used as the object detection model after image preprocessing. The evaluation showed that the proposed method improved corrosion detection performance in terms of mean average precision.
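The two preprocessing ideas named in the abstract above, patch segmentation and thresholding, can be illustrated generically as below. The abstract gives no patch size, threshold level, or color space, so everything here is an assumption: the sketch works on a small grayscale image stored as a list of rows, uses non-overlapping square patches, and applies a fixed binary threshold. The function names and values are hypothetical and are not the authors' procedure.

```python
def split_into_patches(image, patch_size):
    """Split a 2-D grayscale image (list of rows) into non-overlapping square patches."""
    height, width = len(image), len(image[0])
    patches = []
    for top in range(0, height - patch_size + 1, patch_size):
        for left in range(0, width - patch_size + 1, patch_size):
            patch = [row[left:left + patch_size] for row in image[top:top + patch_size]]
            patches.append(patch)
    return patches

def threshold_patch(patch, level):
    """Binary threshold: 1 where the pixel is at or above `level`, else 0."""
    return [[1 if pixel >= level else 0 for pixel in row] for row in patch]

# Hypothetical 4x4 grayscale image with values in 0..255.
image = [
    [10, 20, 200, 210],
    [30, 40, 220, 230],
    [50, 60, 70, 80],
    [90, 100, 110, 250],
]

patches = split_into_patches(image, patch_size=2)
masks = [threshold_patch(p, level=128) for p in patches]
print(len(patches))
print(masks[1])
```

Splitting a large inspection photo into patches keeps small corroded regions at a usable scale for the detector, and thresholding can suppress background before detection.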

GAN-based Video Denoising for Robust Pig Detection System (GAN 기반의 영상 잡음에 강인한 돼지 탐지 시스템)

  • Bo, Zhao;Lee, Jonguk;Atif, Othmane;Park, Daihee;Chung, Yongwha
    • Proceedings of the Korea Information Processing Society Conference / 2021.11a / pp.700-703 / 2021
  • Infrared cameras are widely used in recent research for automatically monitoring abnormal pig behaviors. However, when deployed in real pig farms, infrared cameras always become contaminated due to the harsh environment, which negatively affects the performance of pig monitoring. In this paper, we propose a real-time, noise-robust, infrared camera-based automatic pig monitoring system to improve the robustness of automatic pig monitoring in real pig farms. The proposed system first uses a preprocessor with a U-Net architecture, trained as a GAN generator, to transform noisy images into clean images, and then uses a YOLOv5-based detector to detect pigs. The experimental results show that adding the preprocessing step greatly improved the average pig detection precision, from 0.639 to 0.759.

A study on accident prevention AI system based on estimation of bus passengers' intentions (시내버스 승하차 의도분석 기반 사고방지 AI 시스템 연구)

  • Seonghwan Park;Sunoh Byun;Junghoon Park
    • Smart Media Journal / v.12 no.11 / pp.57-66 / 2023
  • In this paper, we present a study on an AI-based system that uses the CCTV system within city buses to predict the intentions of boarding and alighting passengers, with the aim of preventing accidents. The proposed system employs the YOLOv7 Pose model to detect passengers and an LSTM model to predict the intentions of tracked passengers. The system can be installed on the bus's CCTV terminals, allowing real-time visual confirmation of passengers' intentions throughout driving. It also provides alerts to the driver, mitigating potential accidents during passenger transitions. Test results show accuracy rates of 0.81 for analyzing boarding intentions and 0.79 for predicting alighting intentions onboard. To ensure real-time performance, we verified that analysis at a minimum of 5 frames per second is achievable in a GPU environment. This algorithm enhances the safety of passenger transitions during bus operations. In the future, with improved hardware specifications and abundant data collection, the system's expansion into various safety-related metrics is promising. This algorithm is anticipated to play a pivotal role in ensuring safety when autonomous driving becomes commercialized. Additionally, its applicability could extend to other modes of public transportation, such as subways and other forms of mass transit, contributing to the overall safety of public transportation systems.