• 제목/요약/키워드: image deep learning

검색결과 1,769건 처리시간 0.045초

A Deep Learning Approach for Classification of Cloud Image Patches on Small Datasets

  • Phung, Van Hiep;Rhee, Eun Joo
    • Journal of information and communication convergence engineering
    • /
    • 제16권3호
    • /
    • pp.173-178
    • /
    • 2018
  • Accurate classification of cloud images is a challenging task. Almost all the existing methods rely on hand-crafted feature extraction. Their limitation is low discriminative power. In the recent years, deep learning with convolution neural networks (CNNs), which can auto extract features, has achieved promising results in many computer vision and image understanding fields. However, deep learning approaches usually need large datasets. This paper proposes a deep learning approach for classification of cloud image patches on small datasets. First, we design a suitable deep learning model for small datasets using a CNN, and then we apply data augmentation and dropout regularization techniques to increase the generalization of the model. The experiments for the proposed approach were performed on SWIMCAT small dataset with k-fold cross-validation. The experimental results demonstrated perfect classification accuracy for most classes on every fold, and confirmed both the high accuracy and the robustness of the proposed model.

웨이블릿 퓨전에 의한 딥러닝 색상화의 성능 향상 (High-performance of Deep learning Colorization With Wavelet fusion)

  • 김영백;최현;조중휘
    • 대한임베디드공학회논문지
    • /
    • 제13권6호
    • /
    • pp.313-319
    • /
    • 2018
  • We propose a post-processing algorithm to improve the quality of the RGB image generated by deep learning based colorization from the gray-scale image of an infrared camera. Wavelet fusion is used to generate a new luminance component of the RGB image luminance component from the deep learning model and the luminance component of the infrared camera. PSNR is increased for all experimental images by applying the proposed algorithm to RGB images generated by two deep learning models of SegNet and DCGAN. For the SegNet model, the average PSNR is improved by 1.3906dB at level 1 of the Haar wavelet method. For the DCGAN model, PSNR is improved 0.0759dB on the average at level 5 of the Daubechies wavelet method. It is also confirmed that the edge components are emphasized by the post-processing and the visibility is improved.

이미지-텍스트 쌍을 활용한 이미지 분류 정확도 향상에 관한 연구 (A Study on Improvement of Image Classification Accuracy Using Image-Text Pairs)

  • 김미희;이주혁
    • 전기전자학회논문지
    • /
    • 제27권4호
    • /
    • pp.561-566
    • /
    • 2023
  • 딥러닝의 발전으로 다양한 컴퓨터 비전 연구를 수행할 수 있게 됐다. 딥러닝은 컴퓨터 비전 연구 중 이미지 처리에서 높은 정확도와 성능을 보여줬다. 하지만 대부분의 이미지 처리 방식은 이미지의 시각 정보만을 이용해 이미지를 처리하는 경우가 대부분이다. 이미지-텍스트 쌍을 활용할 경우 이미지와 관련된 설명, 주석 등의 텍스트 데이터가 이미지 자체에서는 얻기 힘든 추가적인 맥락과 시각 정보를 제공할 수 있다. 본 논문에서는 이미지-텍스트 쌍을 활용하여 이미지와 텍스트를 분석하는 딥러닝 모델 제안한다. 제안 모델은 이미지 정보만을 사용한 딥러닝 모델보다 약 11% 향상된 분류 정확도 결과를 보였다.

직물 이미지 결함 탐지를 위한 딥러닝 기술 연구: 트랜스포머 기반 이미지 세그멘테이션 모델 실험 (Deep Learning Models for Fabric Image Defect Detection: Experiments with Transformer-based Image Segmentation Models)

  • 이현상;하성호;오세환
    • 한국정보시스템학회지:정보시스템연구
    • /
    • 제32권4호
    • /
    • pp.149-162
    • /
    • 2023
  • Purpose In the textile industry, fabric defects significantly impact product quality and consumer satisfaction. This research seeks to enhance defect detection by developing a transformer-based deep learning image segmentation model for learning high-dimensional image features, overcoming the limitations of traditional image classification methods. Design/methodology/approach This study utilizes the ZJU-Leaper dataset to develop a model for detecting defects in fabrics. The ZJU-Leaper dataset includes defects such as presses, stains, warps, and scratches across various fabric patterns. The dataset was built using the defect labeling and image files from ZJU-Leaper, and experiments were conducted with deep learning image segmentation models including Deeplabv3, SegformerB0, SegformerB1, and Dinov2. Findings The experimental results of this study indicate that the SegformerB1 model achieved the highest performance with an mIOU of 83.61% and a Pixel F1 Score of 81.84%. The SegformerB1 model excelled in sensitivity for detecting fabric defect areas compared to other models. Detailed analysis of its inferences showed accurate predictions of diverse defects, such as stains and fine scratches, within intricated fabric designs.

BM3D and Deep Image Prior based Denoising for the Defense against Adversarial Attacks on Malware Detection Networks

  • Sandra, Kumi;Lee, Suk-Ho
    • International journal of advanced smart convergence
    • /
    • 제10권3호
    • /
    • pp.163-171
    • /
    • 2021
  • Recently, Machine Learning-based visualization approaches have been proposed to combat the problem of malware detection. Unfortunately, these techniques are exposed to Adversarial examples. Adversarial examples are noises which can deceive the deep learning based malware detection network such that the malware becomes unrecognizable. To address the shortcomings of these approaches, we present Block-matching and 3D filtering (BM3D) algorithm and deep image prior based denoising technique to defend against adversarial examples on visualization-based malware detection systems. The BM3D based denoising method eliminates most of the adversarial noise. After that the deep image prior based denoising removes the remaining subtle noise. Experimental results on the MS BIG malware dataset and benign samples show that the proposed denoising based defense recovers the performance of the adversarial attacked CNN model for malware detection to some extent.

심층 전이 학습을 이용한 이미지 검색의 문화적 특성 분석 (Analysis of Cultural Context of Image Search with Deep Transfer Learning)

  • Kim, Hyeon-sik;Jeong, Jin-Woo
    • 한국정보통신학회논문지
    • /
    • 제24권5호
    • /
    • pp.674-677
    • /
    • 2020
  • The cultural background of users utilizing image search engines has a significant impact on the satisfaction of the search results. Therefore, it is important to analyze and understand the cultural context of images for more accurate image search. In this paper, we investigate how the cultural context of images can affect the performance of image classification. To this end, we first collected various types of images (e.g,. food, temple, etc.) with various cultural contexts (e.g., Korea, Japan, etc.) from web search engines. Afterwards, a deep transfer learning approach using VGG19 and MobileNetV2 pre-trained with ImageNet was adopted to learn the cultural features of the collected images. Through various experiments we show the performance of image classification can be differently affected according to the cultural context of images.

흉부 X-ray 기반 딥 러닝 손실함수 성능 비교·분석 (Comparison and analysis of chest X-ray-based deep learning loss function performance)

  • 서진범;조영복
    • 한국정보통신학회논문지
    • /
    • 제25권8호
    • /
    • pp.1046-1052
    • /
    • 2021
  • 4차 산업의 발전과 고성능의 컴퓨팅 환경 구축으로 다양한 산업분야에서 인공지능이 적용되고 있다. 의료분야에서는 X-Ray, MRI, PET 등의 의료 영상 및 임상 자료를 이용하여 암, COVID-19, 골 연령 측정 등의 딥 러닝 학습이 진행되었다. 또한 스마트 의료기기, IoT 디바이스와 딥 러닝 알고리즘을 적용하여 ICT 의료 융합 기술 등이 연구되고 있다. 이러한 기술 중 의료 영상 기반 딥 러닝 학습은 의료 영상의 바이오마커를 정확히 찾아내고, 최소한의 손실률과 높은 정확도가 필요하다. 따라서 본 논문은 흉부 X-Ray 이미지 기반 딥 러닝 학습 과정에서 손실률을 도출하는 손실 함수 중 영상분류 알고리즘에서 사용되는 Cross-Entropy 함수들의 성능을 비교·분석하고자 한다.

딥러닝 기반 고성능 얼굴인식 기술 동향 (Research Trends for Deep Learning-Based High-Performance Face Recognition Technology)

  • 김형일;문진영;박종열
    • 전자통신동향분석
    • /
    • 제33권4호
    • /
    • pp.43-53
    • /
    • 2018
  • As face recognition (FR) has been well studied over the past decades, FR technology has been applied to many real-world applications such as surveillance and biometric systems. However, in the real-world scenarios, FR performances have been known to be significantly degraded owing to variations in face images, such as the pose, illumination, and low-resolution. Recently, visual intelligence technology has been rapidly growing owing to advances in deep learning, which has also improved the FR performance. Furthermore, the FR performance based on deep learning has been reported to surpass the performance level of human perception. In this article, we discuss deep-learning based high-performance FR technologies in terms of representative deep-learning based FR architectures and recent FR algorithms robust to face image variations (i.e., pose-robust FR, illumination-robust FR, and video FR). In addition, we investigate big face image datasets widely adopted for performance evaluations of the most recent deep-learning based FR algorithms.

선삭공정에서 딥러닝 영상처리 기법을 이용한 작업자 위험 감소 방안 연구 (A Study on Worker Risk Reduction Methods using the Deep Learning Image Processing Technique in the Turning Process)

  • 배용환;이영태;김호찬
    • 한국기계가공학회지
    • /
    • 제20권12호
    • /
    • pp.1-7
    • /
    • 2021
  • The deep learning image processing technique was used to prevent accidents in lathe work caused by worker negligence. During lathe operation, when the chuck is rotated, it is very dangerous if the operator's hand is near the chuck. However, if the chuck is stopped during operation, it is not dangerous for the operator's hand to be in close proximity to the chuck for workpiece measurement, chip removal or tool change. We used YOLO (You Only Look Once), a deep learning image processing program for object detection and classification. Lathe work images such as hand, chuck rotation and chuck stop are used for learning, object detection and classification. As a result of the experiment, object detection and class classification were performed with a success probability of over 80% at a confidence score 0.5. Thus, we conclude that the artificial intelligence deep learning image processing technique can be effective in preventing incidents resulting from worker negligence in future manufacturing systems.

적외선영상내 전력선 검출을 위한 하이브리드 방법 (A Hybrid Method for Recognizing Existence of Power Lines in Infrared Images)

  • 김종희;정찬호
    • 전기전자학회논문지
    • /
    • 제26권4호
    • /
    • pp.742-745
    • /
    • 2022
  • 본 논문에서 우리는 열화상에서 전력선 유무를 검출하는 영상처리 기법과 딥러닝 기반의 하이브리드 방법을 제안한다. 딥러닝은 다수의 데이터로부터 목적에 부합하는 특징 벡터를 학습할 수 있는 장점 덕분에 영상 인식, 객체 검출 등 다양한 분야에서 기존의 직접 설계한 특징 벡터를 사용하는 방법들보다 높은 성능을 달성할 수 있는 장점이 있고, 영상처리 기법은 사람의 직관을 그대로 적용할 수 있다는 장점이 있다. 두 장점을 모두 이용하여 열화상에서 전력선 유무를 검출하는 방법을 제안한다. 전력선 유무 검출에 가장 적합한 영상처리 기법을 찾기 위해 총 5가지 방법을 적용 및 비교하였고, 그 결과로 제안하는 방법은 기존의 영상처리 기반 방법과 딥러닝 기반의 방법 두 가지 모두에 비해 더 높은 99.48%의 정확도로 전력선 유무를 검출할 수 있다.