• 제목/요약/키워드: Deep Learning based System

검색결과 1,198건 처리시간 0.029초

Evaluation of maxillary sinusitis from panoramic radiographs and cone-beam computed tomographic images using a convolutional neural network

  • Serindere, Gozde;Bilgili, Ersen;Yesil, Cagri;Ozveren, Neslihan
    • Imaging Science in Dentistry
    • /
    • 제52권2호
    • /
    • pp.187-195
    • /
    • 2022
  • Purpose: This study developed a convolutional neural network (CNN) model to diagnose maxillary sinusitis on panoramic radiographs(PRs) and cone-beam computed tomographic (CBCT) images and evaluated its performance. Materials and Methods: A CNN model, which is an artificial intelligence method, was utilized. The model was trained and tested by applying 5-fold cross-validation to a dataset of 148 healthy and 148 inflamed sinus images. The CNN model was implemented using the PyTorch library of the Python programming language. A receiver operating characteristic curve was plotted, and the area under the curve, accuracy, sensitivity, specificity, positive predictive value, and negative predictive values for both imaging techniques were calculated to evaluate the model. Results: The average accuracy, sensitivity, and specificity of the model in diagnosing sinusitis from PRs were 75.7%, 75.7%, and 75.7%, respectively. The accuracy, sensitivity, and specificity of the deep-learning system in diagnosing sinusitis from CBCT images were 99.7%, 100%, and 99.3%, respectively. Conclusion: The diagnostic performance of the CNN for maxillary sinusitis from PRs was moderately high, whereas it was clearly higher with CBCT images. Three-dimensional images are accepted as the "gold standard" for diagnosis; therefore, this was not an unexpected result. Based on these results, deep-learning systems could be used as an effective guide in assisting with diagnoses, especially for less experienced practitioners.

차원축소 없는 채널집중 네트워크를 이용한 SAR 변형표적 식별 (SAR Recognition of Target Variants Using Channel Attention Network without Dimensionality Reduction)

  • 박지훈;최여름;채대영;임호
    • 한국군사과학기술학회지
    • /
    • 제25권3호
    • /
    • pp.219-230
    • /
    • 2022
  • In implementing a robust automatic target recognition(ATR) system with synthetic aperture radar(SAR) imagery, one of the most important issues is accurate classification of target variants, which are the same targets with different serial numbers, configurations and versions, etc. In this paper, a deep learning network with channel attention modules is proposed to cope with the recognition problem for target variants based on the previous research findings that the channel attention mechanism selectively emphasizes the useful features for target recognition. Different from other existing attention methods, this paper employs the channel attention modules without dimensionality reduction along the channel direction from which direct correspondence between feature map channels can be preserved and the features valuable for recognizing SAR target variants can be effectively derived. Experiments with the public benchmark dataset demonstrate that the proposed scheme is superior to the network with other existing channel attention modules.

Fast Convergence GRU Model for Sign Language Recognition

  • Subramanian, Barathi;Olimov, Bekhzod;Kim, Jeonghong
    • 한국멀티미디어학회논문지
    • /
    • 제25권9호
    • /
    • pp.1257-1265
    • /
    • 2022
  • Recognition of sign language is challenging due to the occlusion of hands, accuracy of hand gestures, and high computational costs. In recent years, deep learning techniques have made significant advances in this field. Although these methods are larger and more complex, they cannot manage long-term sequential data and lack the ability to capture useful information through efficient information processing with faster convergence. In order to overcome these challenges, we propose a word-level sign language recognition (SLR) system that combines a real-time human pose detection library with the minimized version of the gated recurrent unit (GRU) model. Each gate unit is optimized by discarding the depth-weighted reset gate in GRU cells and considering only current input. Furthermore, we use sigmoid rather than hyperbolic tangent activation in standard GRUs due to performance loss associated with the former in deeper networks. Experimental results demonstrate that our pose-based optimized GRU (Pose-OGRU) outperforms the standard GRU model in terms of prediction accuracy, convergency, and information processing capability.

Enhanced 3D Residual Network for Human Fall Detection in Video Surveillance

  • Li, Suyuan;Song, Xin;Cao, Jing;Xu, Siyang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권12호
    • /
    • pp.3991-4007
    • /
    • 2022
  • In the public healthcare, a computational system that can automatically and efficiently detect and classify falls from a video sequence has significant potential. With the advancement of deep learning, which can extract temporal and spatial information, has become more widespread. However, traditional 3D CNNs that usually adopt shallow networks cannot obtain higher recognition accuracy than deeper networks. Additionally, some experiences of neural network show that the problem of gradient explosions occurs with increasing the network layers. As a result, an enhanced three-dimensional ResNet-based method for fall detection (3D-ERes-FD) is proposed to directly extract spatio-temporal features to address these issues. In our method, a 50-layer 3D residual network is used to deepen the network for improving fall recognition accuracy. Furthermore, enhanced residual units with four convolutional layers are developed to efficiently reduce the number of parameters and increase the depth of the network. According to the experimental results, the proposed method outperformed several state-of-the-art methods.

파편 탐지 성능 향상을 위한 딥러닝 초해상도화 효과 분석 (Analysis of the Effect of Deep-learning Super-resolution for Fragments Detection Performance Enhancement)

  • 이유석
    • 한국군사과학기술학회지
    • /
    • 제26권3호
    • /
    • pp.234-245
    • /
    • 2023
  • The Arena Fragmentation Test(AFT) is designed to analyze warhead performance by measuring fragmentation data. In order to evaluate the results of the AFT, a set of AFT images are captured by high-speed cameras. To detect objects in the AFT image set, ResNet-50 based Faster R-CNN is used as a detection model. However, because of the low resolution of the AFT image set, a detection model has shown low performance. To enhance the performance of the detection model, Super-resolution(SR) methods are used to increase the AFT image set resolution. To this end, The Bicubic method and three SR models: ZSSR, EDSR, and SwinIR are used. The use of SR images results in an increase in the performance of the detection model. While the increase in the number of pixels representing a fragment flame in the AFT images improves the Recall performance of the detection model, the number of pixels representing noise also increases, leading to a slight decreases in Precision performance. Consequently, the F1 score is increased by up to 9 %, demonstrating the effectiveness of SR in enhancing the performance of the detection model.

딥러닝 기반 자동차 모델 및 번호판 인식 시스템 구현 (Implementation of Deep Learning-Based Vehicle Model and License Plate Recognition System)

  • 함경윤;강길남;이장현;이정우;박동훈;류명춘
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2022년도 제66차 하계학술대회논문집 30권2호
    • /
    • pp.465-466
    • /
    • 2022
  • 본 논문에서는 딥러닝 영상인식 기술을 활용한 객체검출 모델인 YOLOv4를 활용하여 차량의 모델과 번호판인식 시스템을 제안한다. 본 논문에서 제안하는 시스템은 실시간 영상처리기술인 YOLOv4를 사용하여 차량모델 인식과 번호판 영역 검출을 하고, CNN(Convolutional Neural Network)알고리즘을 이용하여 번호판의 글자와 숫자를 인식한다. 이러한 방법을 이용한다면 카메라 1대로 차량의 모델 인식과 번호판 인식이 가능하다. 차량모델 인식과 번호판 영역 검출에는 실제 데이터를 사용하였으며, 차량 번호판 문자 인식의 경우 실제 데이터와 가상 데이터를 사용하였다. 차량 모델 인식 정확도는 92.3%, 번호판 검출 98.9%, 번호판 문자 인식 94.2%를 기록하였다.

  • PDF

딥러닝 영상인식을 이용한 디지털 트윈 기반 구역별 유동 인구 추정 시스템 설계 (Design of a Zone-based Population Estimation System using Deep Learning Image Recognition for Digital Twin)

  • 하옥균;김진찬;김용진;옥용훈;나동훈;이욱렬
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2023년도 제68차 하계학술대회논문집 31권2호
    • /
    • pp.41-42
    • /
    • 2023
  • 인구 밀집도가 높은 곳에서의 안전사고 대응과 이에 대한 예방을 위한 기술 및 해결 방안의 필요성이 증가하고 있다. 이를 위한 기존의 기술들은 지능형 CCTV 기반의 경고 알림을 울리는 방식과 스마트폰의 신호를 수집하여 유동인구를 측정하는 기술 등이 사용되고 있다. 그러나 군중 밀집 사고의 원인인 병목현상과 군중 난류 현상까지 대응하지는 못하는 문제점이 있다. 본 논문에서는 CCTV로부터 수집된 영상 정보만으로 딥러닝 영상인식 기술을 이용하여 병목현상이 일어나기 쉬운 출입구의 유·출입 인구 카운팅과 광장의 밀집도 분석을 디지털 트윈 기반으로 실시하고 이를 통해 위험 상황 발생 시 출입구의 통제와 대피를 위한 안내가 가능한 시스템을 제시한다. 제시하는 시스템은 유동 인구가 많고 인구의 급격한 밀집으로 인해 발생할 수 있는 안전사고의 예방과 이를 해결하기 위한 통제 및 안내를 위한 대처 방법으로 활용할 수 있다.

  • PDF

딥러닝 영상인식을 이용한 PCB 기판 비전 검사 시스템 개발 (Development of PCB board vision inspection system using image recognition based on deep learning)

  • 이창훈;이민성;심정민;강동원;윤태진
    • 한국컴퓨터정보학회:학술대회논문집
    • /
    • 한국컴퓨터정보학회 2024년도 제69차 동계학술대회논문집 32권1호
    • /
    • pp.289-290
    • /
    • 2024
  • PCB(Printed circuit board)생산시에 중요한 역할을 담당하는 비전검사 시스템의 성능은 지속적으로 발전해왔다. 기존 머신 비전 검사 시스템은 이미지가 불규칙하고 비정형일 경우 해석이 어렵고 전문가의 경험에 의존한다. 그리고 비전검사 시스템 개발 당시의 기준과 다른 불량이 발생한다면 검출이 불가능 하거나 정확도가 낮게 나온다. 본 논문에서는 이를 개선하고자 딥러닝 영상인식을 이용한 PCB 기판 비전 검사 시스템을 구현하였다. 딥러닝 영상인식 알고리즘은 YOLOv4를 이용하고, 워핑(warping)과 시킨 PCB 이미지를 학습하여 비전검사 시스템을 구성하였다. 딥러닝 영상인식 기술의 처리 속도를 보완하고자 QR코드로 PCB 기판 종류를 인식하고, 해당 PCB 부품의 미삽은 정답 이미지 바운딩 박스 좌표와 비교하여 불량품을 발견하면 표시해준다. 기판의 부품 인식을 위해 기판 데이터는 직접 촬영하여 수집하였다. 이를 활용하여 PCB 생산 공정에서 비전검사 시스템의 성능이 향상되었고,, 다양한 PCB를 생산에 신속하게 대응할 수 있다.

  • PDF

Customer-based Recommendation Model for Next Merchant Recommendation

  • Bayartsetseg Kalina;Ju-Hong Lee
    • 스마트미디어저널
    • /
    • 제12권5호
    • /
    • pp.9-16
    • /
    • 2023
  • In the recommendation system of the credit card company, it is necessary to understand the customer patterns to predict a customer's next merchant based on their histories. The data we want to model is much more complex and there are various patterns that customers choose. In such a situation, it is necessary to use an effective model that not only shows the relevance of the merchants, but also the relevance of the customers relative to these merchants. The proposed model aims to predict the next merchant for the customer. To improve prediction performance, we propose a novel model, called Customer-based Recommendation Model (CRM), to produce a more efficient representation of customers. For the next merchant recommendation system, we use a synthetic credit card usage dataset, BC'17. To demonstrate the applicability of the proposed model, we also apply it to the next item recommendation with another real-world transaction dataset, IJCAI'16.

프로세싱 인 메모리 시스템에서의 PolyBench 구동에 대한 동작 성능 및 특성 분석과 고찰 (Performance Analysis and Identifying Characteristics of Processing-in-Memory System with Polyhedral Benchmark Suite)

  • 김정근
    • 반도체디스플레이기술학회지
    • /
    • 제22권3호
    • /
    • pp.142-148
    • /
    • 2023
  • In this paper, we identify performance issues in executing compute kernels from PolyBench, which includes compute kernels that are the core computational units of various data-intensive workloads, such as deep learning and data-intensive applications, on Processing-in-Memory (PIM) devices. Therefore, using our in-house simulator, we measured and compared the various performance metrics of workloads based on traditional out-of-order and in-order processors with Processing-in-Memory-based systems. As a result, the PIM-based system improves performance compared to other computing models due to the short-term data reuse characteristic of computational kernels from PolyBench. However, some kernels perform poorly in PIM-based systems without a multi-layer cache hierarchy due to some kernel's long-term data reuse characteristics. Hence, our evaluation and analysis results suggest that further research should consider dynamic and workload pattern adaptive approaches to overcome performance degradation from computational kernels with long-term data reuse characteristics and hidden data locality.

  • PDF