Search | Korea Science

Object Detection Performance Analysis between On-GPU and On-Board Analysis for Military Domain Images

Du-Hwan Hur;Dae-Hyeon Park;Deok-Woong Kim;Jae-Yong Baek;Jun-Hyeong Bak;Seung-Hwan Bae
- Journal of the Korea Society of Computer and Information
- /
- v.29 no.8
- /
- pp.157-164
- /
- 2024
In this paper, we propose a discussion that the feasibility of deploying a deep learning-based detector on the resource-limited board. Although many studies evaluate the detector on machines with high-performed GPUs, evaluation on the board with limited computation resources is still insufficient. Therefore, in this work, we implement the deep-learning detectors and deploy them on the compact board by parsing and optimizing a detector. To figure out the performance of deep learning based detectors on limited resources, we monitor the performance of several detectors with different H/W resource. On COCO detection datasets, we compare and analyze the evaluation results of detection model in On-Board and the detection model in On-GPU in terms of several metrics with mAP, power consumption, and execution speed (FPS). To demonstrate the effect of applying our detector for the military area, we evaluate them on our dataset consisting of thermal images considering the flight battle scenarios. As a results, we investigate the strength of deep learning-based on-board detector, and show that deep learning-based vision models can contribute in the flight battle scenarios.
https://doi.org/10.9708/jksci.2024.29.08.157 인용 PDF HTML

Use of Unmanned Aerial Vehicle Imagery and Deep Learning UNet to Classification Upland Crop in Small Scale Agricultural Land (무인항공기와 딥러닝(UNet)을 이용한 소규모 농지의 밭작물 분류)

Choi, Seokkeun;Lee, Soungki;Kang, Yeonbin;Choi, Do Yeon;Choi, Juweon
- Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
- /
- v.38 no.6
- /
- pp.671-679
- /
- 2020
In order to increase the food self-sufficiency rate, monitoring and analysis of crop conditions in the cultivated area is important, and the existing measurement methods in which agricultural personnel perform measurement and sampling analysis in the field are time-consuming and labor-intensive for this reason inefficient. In order to overcome this limitation, it is necessary to develop an efficient method for monitoring crop information in a small area where many exist. In this study, RGB images acquired from unmanned aerial vehicles and vegetation index calculated using RGB image were applied as deep learning input data to classify complex upland crops in small farmland. As a result of each input data classification, the classification using RGB images showed an overall accuracy of 80.23% and a Kappa coefficient of 0.65, In the case of using the RGB image and vegetation index, the additional data of 3 vegetation indices (ExG, ExR, VDVI) were total accuracy 89.51%, Kappa coefficient was 0.80, and 6 vegetation indices (ExG, ExR, VDVI, RGRI, NRGDI, ExGR) showed 90.35% and Kappa coefficient of 0.82. As a result, the accuracy of the data to which the vegetation index was added was relatively high compared to the method using only RGB images, and the data to which the vegetation index was added showed a significant improvement in accuracy in classifying complex crops.
https://doi.org/10.7848/ksgpc.2020.38.6.671 인용 PDF KSCI

A Study on Lightweight CNN-based Interpolation Method for Satellite Images (위성 영상을 위한 경량화된 CNN 기반의 보간 기술 연구)

Kim, Hyun-ho;Seo, Doochun;Jung, JaeHeon;Kim, Yongwoo
- Korean Journal of Remote Sensing
- /
- v.38 no.2
- /
- pp.167-177
- /
- 2022
In order to obtain satellite image products using the image transmitted to the ground station after capturing the satellite images, many image pre/post-processing steps are involved. During the pre/post-processing, when converting from level 1R images to level 1G images, geometric correction is essential. An interpolation method necessary for geometric correction is inevitably used, and the quality of the level 1G images is determined according to the accuracy of the interpolation method. Also, it is crucial to speed up the interpolation algorithm by the level processor. In this paper, we proposed a lightweight CNN-based interpolation method required for geometric correction when converting from level 1R to level 1G. The proposed method doubles the resolution of satellite images and constructs a deep learning network with a lightweight deep convolutional neural network for fast processing speed. In addition, a feature map fusion method capable of improving the image quality of multispectral (MS) bands using panchromatic (PAN) band information was proposed. The images obtained through the proposed interpolation method improved by about 0.4 dB for the PAN image and about 4.9 dB for the MS image in the quantitative peak signal-to-noise ratio (PSNR) index compared to the existing deep learning-based interpolation methods. In addition, it was confirmed that the time required to acquire an image that is twice the resolution of the 36,500×36,500 input image based on the PAN image size is improved by about 1.6 times compared to the existing deep learning-based interpolation method.
https://doi.org/10.7780/kjrs.2022.38.2.3 인용 PDF KSCI HTML

A Deep Learning-based Real-time Deblurring Algorithm on HD Resolution (HD 해상도에서 실시간 구동이 가능한 딥러닝 기반 블러 제거 알고리즘)

Shim, Kyujin;Ko, Kangwook;Yoon, Sungjoon;Ha, Namkoo;Lee, Minseok;Jang, Hyunsung;Kwon, Kuyong;Kim, Eunjoon;Kim, Changick
- Journal of Broadcast Engineering
- /
- v.27 no.1
- /
- pp.3-12
- /
- 2022
Image deblurring aims to remove image blur, which can be generated while shooting the pictures by the movement of objects, camera shake, blurring of focus, and so forth. With the rise in popularity of smartphones, it is common to carry portable digital cameras daily, so image deblurring techniques have become more significant recently. Originally, image deblurring techniques have been studied using traditional optimization techniques. Then with the recent attention on deep learning, deblurring methods based on convolutional neural networks have been actively proposed. However, most of them have been developed while focusing on better performance. Therefore, it is not easy to use in real situations due to the speed of their algorithms. To tackle this problem, we propose a novel deep learning-based deblurring algorithm that can be operated in real-time on HD resolution. In addition, we improved the training and inference process and could increase the performance of our model without any significant effect on the speed and the speed without any significant effect on the performance. As a result, our algorithm achieves real-time performance by processing 33.74 frames per second at 1280×720 resolution. Furthermore, it shows excellent performance compared to its speed with a PSNR of 29.78 and SSIM of 0.9287 with the GoPro dataset.
https://doi.org/10.5909/JBE.2022.27.1.3 인용 PDF KSCI KPUBS

Recent R&D Trends for Pretrained Language Model (딥러닝 사전학습 언어모델 기술 동향)

Lim, J.H.;Kim, H.K.;Kim, Y.K.
- Electronics and Telecommunications Trends
- /
- v.35 no.3
- /
- pp.9-19
- /
- 2020
Recently, a technique for applying a deep learning language model pretrained from a large corpus to fine-tuning for each application task has been widely used as a language processing technology. The pretrained language model shows higher performance and satisfactory generalization performance than existing methods. This paper introduces the major research trends related to deep learning pretrained language models in the field of language processing. We describe in detail the motivations, models, learning methods, and results of the BERT language model that had significant influence on subsequent studies. Subsequently, we introduce the results of language model studies after BERT, focusing on SpanBERT, RoBERTa, ALBERT, BART, and ELECTRA. Finally, we introduce the KorBERT pretrained language model, which shows satisfactory performance in Korean language. In addition, we introduce techniques on how to apply the pretrained language model to Korean (agglutinative) language, which consists of a combination of content and functional morphemes, unlike English (refractive) language whose endings change depending on the application.
https://doi.org/10.22648/ETRI.2020.J.350302 인용 PDF

Worker's Behavior Monitoring using Deep Learning (딥러닝을 이용한 작업자 행동 모니터링)

Lee, Se-hoon;Kim, Kim-woo;Yu, Jin-hwan;Tak, Jin-hyun
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2019.01a
- /
- pp.57-58
- /
- 2019
본 논문에서는 앞서 진행한 연구들과 딥러닝을 이용한 고소작업자 행동 모니터링 논문에 이어 작업자 위험 행동분류 시스템을 개선할 수 있는 연구 결과를 비교, 설명한다. 이번 연구에서는 작업자의 행동에 따른 고도계 센서의 데이터를 추가로 수집하여 작업자의 더 다양한 행동을 분류하고 위험 행동 패턴 분석을 위한 방향을 제시한다.
PDF

Multi-object Tracking System for Disaster Context-aware using Deep Learning (드론 영상에서 재난 상황인지를 위한 딥러닝 기반 다중 객체 추적 시스템)

Kim, Chanran;Song, Jein;Lee, Jaehoon
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.07a
- /
- pp.697-700
- /
- 2020
고위험의 재난 상황에서 사람이 상황을 판단하고, 요구조자를 탐색하며, 구조하는 것은 추가 피해를 발생시킬 수 있다. 따라서 재난 상황에서도 이동과 접근이 용이한 무인항공에 관한 연구와 개발이 활발히 이루어지고 있다. 재난 상황에서 신속하게 대처하기 위해서는 선제적 상황인지 기술이 필요하다. 이에 본 논문은 구조 및 대피를 위해 사람, 자동차, 자전거 등의 객체를 인식하고 중복 인식을 피하기 위해 추적하는 딥러닝 기반 다중 객체 추적 시스템을 제안한다. 2019 인공지능 R&D 그랜드 챌린지 상황인지 부문에서의 대회 결과로 실험 성능을 증명한다.
PDF

Predictions of dam inflow on Han-river basin using LSTM (LSTM을 이용한 한강유역 댐유입량 예측)

Kim, Jongho;Tran, Trung Duc
- Proceedings of the Korea Water Resources Association Conference
- /
- 2020.06a
- /
- pp.319-319
- /
- 2020
최근 데이터 과학의 획기적인 발전 덕분에 딥러닝 (Deep Learning) 알고리즘이 개발되어 다양한 분야에 널리 적용되고 있다. 본 연구에서는 인공신경망 중 하나인 LSTM(Long-Short Term Memory) 네트워크를 사용하여 댐 유입량을 예측하였다. 구체적인 내용으로, (1) LSTM에 필요한 입력 데이터를 효율적으로 사전 처리하는 방법, (2) LSTM의 하이퍼 매개변수를 결정하는 방법 및 (3) 다양한 손실 함수(Loss function)를 선택하고 그 영향을 평가하는 방법 등을 다루었다. 제안된 LSTM 모델은 강우량(R), 댐유입량(Q) 기온(T), 기저유량(BF) 등을 포함한 다양한 입력 변수들의 함수로 가정하였으며, CCF(Cross Correlations), ACF(Autocorrelations) 및 PACF(Partial Autocorrelations) 등의 기법을 사용하여 입력 변수를 결정하였다. 다양한 sequence length를 갖는 (즉 t, t-1, … t-n의 시간 지연을 갖는) 입력 변수를 적용하여 데이터 학습에 최적의 시퀀스 길이를 결정하였다. LSTM 네트워크 모델을 적용하여 2014년부터 2020년까지 한강 유역 9개의 댐 유입량을 추정하였다. 본 연구로부터 댐 유입량을 예측하는 것은 홍수 및 가뭄 통제를 위한 필수 요건들 중 하나이며 수자원 계획 및 관리에 도움이 될 것이다.
PDF

Metonymy Resolution based on Neural Approach (딥러닝 방식을 이용한 환유 해소)

Whang, Taesun;Lee, Chanhee;Yang, Kisu;Lee, Dongyub;Koo, Youngeun;Jeon, Taehee;Lim, Heuiseok
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.375-379
- /
- 2019
언어학에서의 환유법은 표현을 위해 빌려온 대상이 다양한 의미로 해석 가능하기에 매우 어렵고 난해한 분야이다. 환유의 특성 상 주어진 엔티티의 환유 여부를 구분하기 위해서는 앞뒤 단어와의 연관성 뿐만 아니라 문장 전체의 문맥 정보에 대한 고려가 필수적이다. 최근 이러한 문맥 정보를 고려하여 학습된 다양한 모델들이 등장하면서 환유법에 대한 연구를 하기에 좋은 환경이 구축되고 있다. 본 논문에서는 언어학적 자질 정보를 최소화한 딥러닝을 이용한 환유 해소 모델을 제안한다. LSTM 기반의 feature-based 모델과 및 BERT, XLNet, RoBERTa와 같은 fine-tuning 모델들에 대한 실험을 진행하였다. 실험 결과, fine-tuning 모델들이 baseline과 비교하여 뛰어난 성능 향상을 가져왔으며, 특히 XLNet 모델은 두 개의 환유 해소 데이터 SemEval 2007와 ReLocaR에 대해 각각 90.1%과 95.8%의 정확도를 보여주었다.
PDF

Optimization of Sensor Data Window Size for Deep Learning Regression Model (딥러닝 회귀 모델 개발을 위한 센서 데이터 윈도우 사이즈 최적화 기법)

Choi, Min-Seo;Yoo, Dong-Yeon;Lee, Jung-Won
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.05a
- /
- pp.610-613
- /
- 2022
센서 데이터의 중요성이 커지면서 센서 데이터 처리 연구의 수요가 증가하고 있다. 센서 데이터 기반의 딥러닝 모델 개발 시, 센서 데이터 단일 값에 의한 출력이 아닌 시계열적인 특성을 반영하여 연속적인 데이터 간의 연관성을 파악할 수 있는 슬라이딩 윈도우 기법을 통해 효율적으로 데이터를 분석하고 처리할 수 있다. 하지만, 기존의 방법들은 학습 성능(학습 시간 및 모델 성능)에 미치는 영향을 평가하는 기준 없이 입력 데이터의 윈도우 사이즈를 임의로 설정하여 데이터를 처리해 왔다. 따라서, 본 논문은 학습 시간과 모델 성능을 기준으로 센서 데이터의 윈도우 사이즈 최적화 기법을 제안한다. 제안한 방법은 전류를 이용하여 스위치와 다이오드 온도를 추정하는 가상 센서(virtual sensor) 실험 테스트베드에 적용하여, 학습 시간 중심으로는 5%의 윈도우 사이즈를, 모델 성능 중심으로는 R2 SCORE 의 값을 0.9295 로 갖는 8%의 윈도우 사이즈가 최적으로 도출되었다.
https://doi.org/10.3745/PKIPS.y2022m05a.610 인용 PDF

Search Result 351, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)