• Title/Summary/Keyword: Deep learning based control

Search Result 237, Processing Time 0.027 seconds

Optimization-based Deep Learning Model to Localize L3 Slice in Whole Body Computerized Tomography Images (컴퓨터 단층촬영 영상에서 3번 요추부 슬라이스 검출을 위한 최적화 기반 딥러닝 모델)

  • Seongwon Chae;Jae-Hyun Jo;Ye-Eun Park;Jin-Hyoung, Jeong;Sung Jin Kim;Ahnryul Choi
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.16 no.5
    • /
    • pp.331-337
    • /
    • 2023
  • In this paper, we propose a deep learning model to detect lumbar 3 (L3) CT images to determine the occurrence and degree of sarcopenia. In addition, we would like to propose an optimization technique that uses oversampling ratio and class weight as design parameters to address the problem of performance degradation due to data imbalance between L3 level and non-L3 level portions of CT data. In order to train and test the model, a total of 150 whole-body CT images of 104 prostate cancer patients and 46 bladder cancer patients who visited Gangneung Asan Medical Center were used. The deep learning model used ResNet50, and the design parameters of the optimization technique were selected as six types of model hyperparameters, data augmentation ratio, and class weight. It was confirmed that the proposed optimization-based L3 level extraction model reduced the median L3 error by about 1.0 slices compared to the control model (a model that optimized only 5 types of hyperparameters). Through the results of this study, accurate L3 slice detection was possible, and additionally, we were able to present the possibility of effectively solving the data imbalance problem through oversampling through data augmentation and class weight adjustment.

Active control of flow around a 2D square cylinder using plasma actuators (2차원 사각주 주위 유동의 플라즈마 능동제어에 대한 연구)

  • Paraskovia Kolesova;Mustafa G. Yousif;Hee-Chang Lim
    • Journal of the Korean Society of Visualization
    • /
    • v.22 no.2
    • /
    • pp.44-54
    • /
    • 2024
  • This study investigates the effectiveness of using a plasma actuator for active control of turbulent flow around a finite square cylinder. The primary objective is to analyze the impact of plasma actuators on flow separation and wake region characteristics, which are critical for reducing drag and suppressing vortex-induced vibrations. Direct Numerical Simulation (DNS) was employed to explore the flow dynamics at various operational parameters, including different actuation frequencies and voltages. The proposed methodology employs a neural network trained using the Proximal Policy Optimization (PPO) algorithm to determine optimal control policies for plasma actuators. This network is integrated with a computational fluid dynamics (CFD) solver for real-time control. Results indicate that this deep reinforcement learning (DRL)-based strategy outperforms existing methods in controlling flow, demonstrating robustness and adaptability across various flow conditions, which highlights its potential for practical applications.

Improving Confidence in Synthetic Infrared Image Refinement using Grad-CAM-based Explainable AI Techniques (Grad-CAM 기반의 설명가능한 인공지능을 사용한 합성 이미지 개선 방법)

  • Taeho Kim;Kangsan Kim;Hyochoong Bang
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.6
    • /
    • pp.665-676
    • /
    • 2024
  • Infrared imaging is a powerful non-destructive and non-invasive technique to detect infrared radiations and capture valuable insights inaccessible through the visible spectrum. It has been widely used in the military for reconnaissance, hazard detection, night vision, guidance systems, and countermeasures. There is a huge potential for machine learning models to improve trivial infrared imaging tasks in military applications. One major roadblock is the scarcity and control over infrared imaging datasets related to military applications. One possible solution is to use synthetic infrared images to train machine learning networks. However, synthetic IR images present a domain gap and produce weak learning models that do not generalize well. We investigate adversarial networks and Explainable AI(XAI) techniques to refine synthetic infrared imaging data, enhance their realism, and synthesize refiner networks with XAI. We use a U-Net-based refiner network to refine synthetic infrared data and a PatchGAN discriminator to distinguish between the refined and real IR images. Grad-CAM XAI technique is used for network synthesis. We also analyzed the frequency domain patterns and power spectra of real and synthetic infrared images to find key attributes to distinguish real from synthetic. We tested our refined images on the realism benchmarks using frequency domain analysis.

Deep Learning-Based Modulation Detection for NOMA Systems

  • Xie, Wenwu;Xiao, Jian;Yang, Jinxia;Wang, Ji;Peng, Xin;Yu, Chao;Zhu, Peng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.15 no.2
    • /
    • pp.658-672
    • /
    • 2021
  • Since the signal with strong power need be demodulated first for successive interference cancellation (SIC) receiver in non-orthogonal multiple access (NOMA) systems, the base station (BS) need inform the near user terminal (UT), which has allocated higher power, of the far UT's modulation mode. To avoid unnecessary signaling overhead of control channel, a blind detection algorithm of NOMA signal modulation mode is designed in this paper. Taking the joint constellation density diagrams of NOMA signal as the detection features, the deep residual network is built for classification, so as to detect the modulation mode of NOMA signal. In view of the fact that the joint constellation diagrams are easily polluted by high intensity noise and lose their real distribution pattern, the wavelet denoising method is adopted to improve the quality of constellations. The simulation results represent that the proposed algorithm can achieve satisfactory detection accuracy in NOMA systems. In addition, the factors affecting the recognition performance are also verified and analyzed.

Personalized Speech Classification Scheme for the Smart Speaker Accessibility Improvement of the Speech-Impaired people (언어장애인의 스마트스피커 접근성 향상을 위한 개인화된 음성 분류 기법)

  • SeungKwon Lee;U-Jin Choe;Gwangil Jeon
    • Smart Media Journal
    • /
    • v.11 no.11
    • /
    • pp.17-24
    • /
    • 2022
  • With the spread of smart speakers based on voice recognition technology and deep learning technology, not only non-disabled people, but also the blind or physically handicapped can easily control home appliances such as lights and TVs through voice by linking home network services. This has greatly improved the quality of life. However, in the case of speech-impaired people, it is impossible to use the useful services of the smart speaker because they have inaccurate pronunciation due to articulation or speech disorders. In this paper, we propose a personalized voice classification technique for the speech-impaired to use for some of the functions provided by the smart speaker. The goal of this paper is to increase the recognition rate and accuracy of sentences spoken by speech-impaired people even with a small amount of data and a short learning time so that the service provided by the smart speaker can be actually used. In this paper, data augmentation and one cycle learning rate optimization technique were applied while fine-tuning ResNet18 model. Through an experiment, after recording 10 times for each 30 smart speaker commands, and learning within 3 minutes, the speech classification recognition rate was about 95.2%.

A Study on Vehicle Number Recognition Technology in the Side Using Slope Correction Algorithm (기울기 보정 알고리즘을 이용한 측면에서의 차량 번호 인식 기술 연구)

  • Lee, Jaebeom;Jang, Jongwook;Jang, Sungjin
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.05a
    • /
    • pp.465-468
    • /
    • 2022
  • The incidence of traffic accidents is increasing every year, and Korea is among the top OECD countries. In order to improve this, various road traffic laws are being implemented, and various traffic control methods using equipment such as unmanned speed cameras and traffic control cameras are being applied. However, as drivers avoid crackdowns by detecting the location of traffic control cameras in advance through navigation, a mobile crackdown system that can be cracked down is needed, and research is needed to increase the recognition rate of vehicle license plates on the side of the road for accurate crackdown. This paper proposes a method to improve the vehicle number recognition rate on the road side by applying a gradient correction algorithm using image processing. In addition, custom data learning was conducted using a CNN-based YOLO algorithm to improve character recognition accuracy. It is expected that the algorithm can be used for mobile traffic control cameras without restrictions on the installation location.

  • PDF

Development of AI Detection Model based on CCTV Image for Underground Utility Tunnel (지하공동구의 CCTV 영상 기반 AI 연기 감지 모델 개발)

  • Kim, Jeongsoo;Park, Sangmi;Hong, Changhee;Park, Seunghwa;Lee, Jaewook
    • Journal of the Society of Disaster Information
    • /
    • v.18 no.2
    • /
    • pp.364-373
    • /
    • 2022
  • Purpose: The purpose of this paper is to develope smoke detection using AI model for detecting the initial fire in underground utility tunnels using CCTV Method: To improve detection performance of smoke which is high irregular, a deep learning model for fire detection was trained to optimize smoke detection. Also, several approaches such as dataset cleansing and gradient exploding release were applied to enhance model, and compared with results of those. Result: Results show the proposed approaches can improve the model performance, and the final model has good prediction capability according to several indexes such as mAP. However, the final model has low false negative but high false positive capacities. Conclusion: The present model can apply to smoke detection in underground utility tunnel, fixing the defect by linking between the model and the utility tunnel control system.

Quality Control Plan of Water Level in Agricultural Reservoirs using a Deep-Learning Based LSTM Model (딥러닝 기반 LSTM 모형을 이용한 농업용 저수지 수위자료 품질관리 방안)

  • Yang, Mi-Hye;Nam, Won-Ho;Shin, An-Kook;Kang, Mun-Sung;Kim, Taegon
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2020.06a
    • /
    • pp.128-128
    • /
    • 2020
  • 최근 농업환경의 변화와 기후변화에 대응하기 위해 농업용수 관리 정보화 및 과학화의 필요성이 증대되어 실시간으로 저수지 저수량과 농업용수 공급량을 파악하기 위해 자동 수위계측시설이 도입되었다. 농림축산식품부의 저수지 자동수위측정기 설치 및 운영지침에 따라 현재 농어촌공사 관리 저수지 1,734개소 및 수로부 1,880개소에 자동수위계가 설치되어 있으며, 저수지와 수로에서 10분 간격으로 수위자료가 생성되고 있다. 농업용 저수지 수문자료의 공인지점은 2016년 6개소에서 2019년 49개소로 증대되고 있으며, 데이터 품질 저하의 최소화 및 신뢰성 있는 수문자료 생성의 필요성이 증가함에 따라 농업용 저수지의 특성을 반영한 저수지 수위 오결측 데이터 보정 방안 및 수문 자료 품질관리 방안이 요구된다. 농업용 저수지의 수위 변화 및 강우-유출 현상은 물리적 모형을 구축하여 기상, 지형 등 영향 인자와 수위(또는 유출)와의 상관관계를 분석하는 것은 무적으로 불가능하였지만, 최근 인공신경망 (Artificial Neural Network, ANN) 등과 같이 black-box 형태의 모형을 이용하여 비선형적인 수문해석이 가능해졌다. 본 연구에서는 빅데이터와 인공신경망을 결합시킨 알고리즘인 딥러닝 (Deep Learning) 기반의 LSTM (Long Short-Term Memory) 모형을 활용하여 농업용 저수지 수위자료를 검토하여 자동계측기에서 발생하는 오류 보정을 위해 품질관리 방안을 제시하고자 한다.

  • PDF

Speckle Noise Reduction and Image Quality Improvement in U-net-based Phase Holograms in BL-ASM (BL-ASM에서 U-net 기반 위상 홀로그램의 스펙클 노이즈 감소와 이미지 품질 향상)

  • Oh-Seung Nam;Ki-Chul Kwon;Jong-Rae Jeong;Kwon-Yeon Lee;Nam Kim
    • Korean Journal of Optics and Photonics
    • /
    • v.34 no.5
    • /
    • pp.192-201
    • /
    • 2023
  • The band-limited angular spectrum method (BL-ASM) causes aliasing errors due to spatial frequency control problems. In this paper, a sampling interval adjustment technique for phase holograms and a technique for reducing speckle noise and improving image quality using a deep-learningbased U-net model are proposed. With the proposed technique, speckle noise is reduced by first calculating the sampling factor and controlling the spatial frequency by adjusting the sampling interval so that aliasing errors can be removed in a wide range of propagation. The next step is to improve the quality of the reconstructed image by learning the phase hologram to which the deep learning model is applied. In the S/W simulation of various sample images, it was confirmed that the peak signal-to-noise ratio (PSNR) and structural similarity index measure (SSIM) were improved by 5% and 0.14% on average, compared with the existing BL-ASM.

Prognosis of Blade Icing of Rotorcraft Drones through Vibration Analysis (진동분석을 통한 회전익 드론의 블레이드 착빙 예지)

  • Seonwoo Lee;Jaeseok Do;Jangwook Hur
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.27 no.1
    • /
    • pp.1-7
    • /
    • 2024
  • Weather is one of the main causes of aircraft accidents, and among the phenomena caused by weather, icing is a phenomenon in which an ice layer is formed when an object exposed to an atmosphere below a freezing temperature collides with supercooled water droplets. If this phenomenon occurs in the rotor blades, it causes defects such as severe vibration in the airframe and eventually leads to loss of control and an accident. Therefore, it is necessary to foresee the icing situation so that it can ascend and descend at an altitude without a freezing point. In this study, vibration data in normal and faulty conditions was acquired, data features were extracted, and vibration was predicted through deep learning-based algorithms such as CNN, LSTM, CNN-LSTM, Transformer, and TCN, and performance was compared to evaluate blade icing. A method for minimizing operating loss is suggested.