• 제목/요약/키워드: YOLOv5

검색결과 169건 처리시간 0.026초

YOLO 모델 앙상블을 이용한 복잡한 장면에서의 Mask Detection 기법 (Mask detection in complex scenes using an ensemble of YOLO models)

  • 후쉬펑;임현석;곽정환
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2022년도 제65차 동계학술대회논문집 30권1호
    • /
    • pp.97-98
    • /
    • 2022
  • 코로나바이러스-19 팬데믹 이후 매일 수만 명의 환자가 발생하고 있다. 보건당국은 사람들의 생활 안전을 보호하기 위해 공항, 정류장 등 공공장소에서는 반드시 마스크를 착용하라고 지시하고 있다. 마스크를 착용하는 목적은 감염으로부터 신체를 보호하고 바이러스 전파와 확산을 막기 위한 것이다. 공공장소에서는 많은 인원에 대한 일괄적인 마스크 착용 검사를 하기 어렵고, 육안으로 확인하는 마스크 착용 검사 방법은 인파가 몰리는 장소에서 검사 효율이 떨어지며 누락되는 경우도 많이 발생한다. 본 연구에서는 입력 이미지에 존재하는 얼굴 영역을 YOLOv4와 YOLOv5 모델을 통해 예측하여 마스크의 착용 여부를 판단하되, 앙상블 기법을 적용하여 보다 효과적인 BB(Bounding Box) 추출 및 마스크 착용 탐지 기법을 적용한다. 따라서 공공장소의 마스크 착용실태를 효과적으로 모니터링 할 수 있는 방법을 제안한다.

  • PDF

딥러닝을 이용한 육불화텅스텐(WF6) 제조 공정의 지능형 영상 감지 시스템 구현 (Implementation of an Intelligent Video Detection System using Deep Learning in the Manufacturing Process of Tungsten Hexafluoride)

  • 손승용;김영목;최두현
    • 한국재료학회지
    • /
    • 제31권12호
    • /
    • pp.719-726
    • /
    • 2021
  • Through the process of chemical vapor deposition, Tungsten Hexafluoride (WF6) is widely used by the semiconductor industry to form tungsten films. Tungsten Hexafluoride (WF6) is produced through manufacturing processes such as pulverization, wet smelting, calcination and reduction of tungsten ores. The manufacturing process of Tungsten Hexafluoride (WF6) is required thorough quality control to improve productivity. In this paper, a real-time detection system for oxidation defects that occur in the manufacturing process of Tungsten Hexafluoride (WF6) is proposed. The proposed system is implemented by applying YOLOv5 based on Convolutional Neural Network (CNN); it is expected to enable more stable management than existing management, which relies on skilled workers. The implementation method of the proposed system and the results of performance comparison are presented to prove the feasibility of the method for improving the efficiency of the WF6 manufacturing process in this paper. The proposed system applying YOLOv5s, which is the most suitable material in the actual production environment, demonstrates high accuracy (mAP@0.5 99.4 %) and real-time detection speed (FPS 46).

딥러닝 기반 소형선박 승선자 조난 인지 시스템 (Deep Learning based Distress Awareness System for Small Boat)

  • 전해명;노재규
    • 대한임베디드공학회논문지
    • /
    • 제17권5호
    • /
    • pp.281-288
    • /
    • 2022
  • According to statistics conducted by the Korea Coast Guard, the number of accidents on small boats under 5 tons is increasing every year. This is because only a small number of people are on board. The previously developed maritime distress and safety systems are not well distributed because passengers must be equipped with additional remote equipment. The purpose of this study is to develop a distress awareness system that recognizes man over-board situations in real time. This study aims to present the part of the passenger tracking system among the small ship's distress awareness situational system that can generate passenger's location information in real time using deep learning based object detection and tracking technologies. The system consisted of the following steps. 1) the passenger location information is generated in the form of Bounding box using its detection model (YOLOv3). 2) Based on the Bounding box data, Deep SORT predicts the Bounding box's position in the next frame of the image with Kalman filter. 3) When the actual Bounding Box is created within the range predicted by Kalman-filter, Deep SORT repeats the process of recognizing it as the same object. 4) If the Bounding box deviates the ship's area or an error occurs in the number of tracking occupant, the system is decided the distress situation and issues an alert. This study is expected to complement the problems of existing technologies and ensure the safety of individuals aboard small boats.

Multi-Human Behavior Recognition Based on Improved Posture Estimation Model

  • Zhang, Ning;Park, Jin-Ho;Lee, Eung-Joo
    • 한국멀티미디어학회논문지
    • /
    • 제24권5호
    • /
    • pp.659-666
    • /
    • 2021
  • With the continuous development of deep learning, human behavior recognition algorithms have achieved good results. However, in a multi-person recognition environment, the complex behavior environment poses a great challenge to the efficiency of recognition. To this end, this paper proposes a multi-person pose estimation model. First of all, the human detectors in the top-down framework mostly use the two-stage target detection model, which runs slow down. The single-stage YOLOv3 target detection model is used to effectively improve the running speed and the generalization of the model. Depth separable convolution, which further improves the speed of target detection and improves the model's ability to extract target proposed regions; Secondly, based on the feature pyramid network combined with context semantic information in the pose estimation model, the OHEM algorithm is used to solve difficult key point detection problems, and the accuracy of multi-person pose estimation is improved; Finally, the Euclidean distance is used to calculate the spatial distance between key points, to determine the similarity of postures in the frame, and to eliminate redundant postures.

딥러닝을 위한 마스크 착용 유형별 데이터셋 구축 및 검출 모델에 관한 연구 (The Study for Type of Mask Wearing Dataset for Deep learning and Detection Model)

  • 황호성;김동현;김호철
    • 대한의용생체공학회:의공학회지
    • /
    • 제43권3호
    • /
    • pp.131-135
    • /
    • 2022
  • Due to COVID-19, Correct method of wearing mask is important to prevent COVID-19 and the other respiratory tract infections. And the deep learning technology in the image processing has been developed. The purpose of this study is to create the type of mask wearing dataset for deep learning models and select the deep learning model to detect the wearing mask correctly. The Image dataset is the 2,296 images acquired using a web crawler. Deep learning classification models provided by tensorflow are used to validate the dataset. And Object detection deep learning model YOLOs are used to select the detection deep learning model to detect the wearing mask correctly. In this process, this paper proposes to validate the type of mask wearing datasets and YOLOv5 is the effective model to detect the type of mask wearing. The experimental results show that reliable dataset is acquired and the YOLOv5 model effectively recognize type of mask wearing.

Detection and Recognition of Vehicle License Plates using Deep Learning in Video Surveillance

  • Farooq, Muhammad Umer;Ahmed, Saad;Latif, Mustafa;Jawaid, Danish;Khan, Muhammad Zofeen;Khan, Yahya
    • International Journal of Computer Science & Network Security
    • /
    • 제22권11호
    • /
    • pp.121-126
    • /
    • 2022
  • The number of vehicles has increased exponentially over the past 20 years due to technological advancements. It is becoming almost impossible to manually control and manage the traffic in a city like Karachi. Without license plate recognition, traffic management is impossible. The Framework for License Plate Detection & Recognition to overcome these issues is proposed. License Plate Detection & Recognition is primarily performed in two steps. The first step is to accurately detect the license plate in the given image, and the second step is to successfully read and recognize each character of that license plate. Some of the most common algorithms used in the past are based on colour, texture, edge-detection and template matching. Nowadays, many researchers are proposing methods based on deep learning. This research proposes a framework for License Plate Detection & Recognition using a custom YOLOv5 Object Detector, image segmentation techniques, and Tesseract's optical character recognition OCR. The accuracy of this framework is 0.89.

딥러닝을 활용한 루푸스 신염 진단을 위한 생검 조직 내 사구체 검출 (Glomerular Detection for Diagnosis of Lupus Nephritis using Deep Learning)

  • 정제현;하석민;임종우;김현성;박호섭;명재경
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2022년도 제66차 하계학술대회논문집 30권2호
    • /
    • pp.85-87
    • /
    • 2022
  • 루푸스 신염을 정확히 진단하기 위해서는 신장의 침 생검을 통한 조직검사를 통해 사구체들을 찾아내고, 각각의 염증 정도를 분류해야 한다. 하지만 이에는 의료진의 많은 시간과 노력이 소요된다. 따라서 본 연구에서는 이러한 한계를 극복하기 위해 합성곱 신경망 (Convolutional neural network, CNN)에 기반한 검출 및 분할에 딥 러닝 접근법을 적용하는 YOLOv5 알고리즘을 통해 검체 이미지 내에서 사구체를 자동으로 검출해 내도록 하였다. 그리고 루푸스 신염 환자의 슬라이드 이미지에 대한 태깅 작업을 거쳐 학습을 위한 데이터와 테스트를 위한 데이터를 생성하여 학습 및 테스트에 활용하였다. 그 결과 고화질의 검체 이미지 내에서 대부분의 사구체를 0.9 이상의 높은 precision과 recall로 검출해 낼 수 있었다. 이를 통해 신장 내부의 사구체 검출을 자동화하고 추후 연구를 통해 사구체 염증 정도를 단계화 할 수 있는 발판을 마련하였다.

  • PDF

다중 카메라 환경에서의 안면인식 기반의 영유아 활동 사진 자동 생성 시스템 (A system for automatically generating activity photos of infants based on facial recognition in a multi-camera environment)

  • 이정석;이규호;김건희;최창훈;박경로;손호준;유홍석
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2023년도 제68차 하계학술대회논문집 31권2호
    • /
    • pp.481-483
    • /
    • 2023
  • 본 논문에서는 다중 카메라환경에서의 안면인식 기반 영유아 활동 사진 자동 생성 시스템을 개발했다. 개발한 시스템은 어린이집에서 알림장 작성을 위한 촬영하는 동안 보육에 부주의하여 안전사고가 발생하는 것을 방지 할 수 있다. 시스템은 이동식 수집기와 분류 서버로 나뉘어 작동하게 된다. 이동식 수집기는 Raspberry Pi를 이용하였고 초당 1장 내외의 사진을 촬영하여 SAMBA를 사용 공유폴더에 저장한다. 분류 서버에서는 YOLOv5를 사용해 안면을 인식해 분류한다. OpenCV와 TensorFlow-Keras를 통해 분류된 사진에서의 표정을 파악하여 부모에게 전송할 웃는사진만을 분류하여 남겨둔다. 이외의 사진은 /dev/null로 이동하여 삭제된다.

  • PDF

Deep-learning-based gestational sac detection in ultrasound images using modified YOLOv7-E6E model

  • Tae-kyeong Kim;Jin Soo Kim;Hyun-chong Cho
    • Journal of Animal Science and Technology
    • /
    • 제65권3호
    • /
    • pp.627-637
    • /
    • 2023
  • As the population and income levels rise, meat consumption steadily increases annually. However, the number of farms and farmers producing meat decrease during the same period, reducing meat sufficiency. Information and Communications Technology (ICT) has begun to be applied to reduce labor and production costs of livestock farms and improve productivity. This technology can be used for rapid pregnancy diagnosis of sows; the location and size of the gestation sacs of sows are directly related to the productivity of the farm. In this study, a system proposes to determine the number of gestation sacs of sows from ultrasound images. The system used the YOLOv7-E6E model, changing the activation function from sigmoid-weighted linear unit (SiLU) to a multi-activation function (SiLU + Mish). Also, the upsampling method was modified from nearest to bicubic to improve performance. The model trained with the original model using the original data achieved mean average precision of 86.3%. When the proposed multi-activation function, upsampling, and AutoAugment were applied, the performance improved by 0.3%, 0.9%, and 0.9%, respectively. When all three proposed methods were simultaneously applied, a significant performance improvement of 3.5% to 89.8% was achieved.