• Title/Summary/Keyword: SRGAN

Search Result 13, Processing Time 0.021 seconds

Performance Improvement of SRGAN's Discriminator via Mutual Distillation (상호증류를 통한 SRGAN 판별자의 성능 개선)

  • Yeojin Lee;Hanhoon Park
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.23 no.3
    • /
    • pp.160-165
    • /
    • 2022
  • Mutual distillation is a knowledge distillation method that guides a cohort of neural networks to learn cooperatively by transferring knowledge between them, without the help of a teacher network. This paper aims to confirm whether mutual distillation is also applicable to super-resolution networks. To this regard, we conduct experiments to apply mutual distillation to the discriminators of SRGANs and analyze the effect of mutual distillation on improving SRGAN's performance. As a result of the experiment, it was confirmed that SRGANs whose discriminators shared their knowledge through mutual distillation can produce super-resolution images enhanced in both quantitative and qualitative qualities.

Enhancement Method of CCTV Video Quality Based on SRGAN (SRGAN 기반의 CCTV 영상 화질 개선 기법)

  • Ha, Hyunsoo;Hwang, Byung-Yeon
    • Journal of Korea Multimedia Society
    • /
    • v.21 no.9
    • /
    • pp.1027-1034
    • /
    • 2018
  • CCTV has been known to possess high level of objectivity and utility. Hence, the government has recently focused on replacing low quality CCTV with higher quality ones or even by adding high resolution CCTV. However, converting all existing low-quality CCTV to high quality can be extremely costly. Furthermore, low quality videos prior to CCTV replacement are likely to be of poor quality and thus not utilized correctly. In order to solve these problems, this paper proposes a method to improve videos quality of images using SRGAN(Super Resolution Generative Advisory Networks). Through experiments, we have proven that it is possible to improve low quality CCTV videos clearly. For this experiment, a total of 4 types of CCTV videos were used and 10,000 images were sampled from each type. Those images could then be used for machine learning. The fact that the pre-process for machine learning has been done manually and the long time that required for machine learning seems to be complementary.

A Study of Lightening SRGAN Using Knowledge Distillation (지식증류 기법을 사용한 SRGAN 경량화 연구)

  • Lee, Yeojin;Park, Hanhoon
    • Journal of Korea Multimedia Society
    • /
    • v.24 no.12
    • /
    • pp.1598-1605
    • /
    • 2021
  • Recently, convolutional neural networks (CNNs) have been widely used with excellent performance in various computer vision fields, including super-resolution (SR). However, CNN is computationally intensive and requires a lot of memory, making it difficult to apply to limited hardware resources such as mobile or Internet of Things devices. To solve these limitations, network lightening studies have been actively conducted to reduce the depth or size of pre-trained deep CNN models while maintaining their performance as much as possible. This paper aims to lighten the SR CNN model, SRGAN, using the knowledge distillation among network lightening technologies; thus, it proposes four techniques with different methods of transferring the knowledge of the teacher network to the student network and presents experiments to compare and analyze the performance of each technique. In our experimental results, it was confirmed through quantitative and qualitative evaluation indicators that student networks with knowledge transfer performed better than those without knowledge transfer, and among the four knowledge transfer techniques, the technique of conducting adversarial learning after transferring knowledge from the teacher generator to the student generator showed the best performance.

Super Resolution Performance Analysis of GAN according to Feature Extractor (특징 추출기에 따른 SRGAN의 초해상 성능 분석)

  • Park, Sung-Wook;Kim, Jun-Yeong;Park, Jun;Jung, Se-Hoon;Sim, Chun-Bo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.501-503
    • /
    • 2022
  • 초해상이란 해상도가 낮은 영상을 해상도가 높은 영상으로 합성하는 기술이다. 딥러닝은 영상의 해상도를 높이는 초해상 기술에도 응용되며 실현은 2아4년에 발표된 SRCNN(Super Resolution Convolutional Neural Network) 모델로부터 시작됐다. 이후 오토인코더 (Autoencoders) 구조로는 SRCAE(Super Resolution Convolutional Autoencoders), 합성된 영상을 실제 영상과 통계적으로 구분되지 않도록 강제하는 GAN (Generative Adversarial Networks) 구조로는 SRGAN(Super Resolution Generative Adversarial Networks) 모델이 발표됐다. 모두 SRCNN의 성능을 웃도는 모델들이나 그중 가장 높은 성능을 끌어내는 SRGAN 조차 아직 완벽한 성능을 내진 못한다. 본 논문에서는 SRGAN의 성능을 개선하기 위해 사전 훈련된 특징 추출기(Pre-trained Feature Extractor) VGG(Visual Geometry Group)-19 모델을 변경하고, 기존 모델과 성능을 비교한다. 실험 결과, VGG-19 모델보다 윤곽이 뚜렷하고, 실제 영상과 더 가까운 영상을 합성할 수 있는 모델을 발견할 수 있을 것으로 기대된다.

Convergence of Artificial Intelligence Techniques and Domain Specific Knowledge for Generating Super-Resolution Meteorological Data (기상 자료 초해상화를 위한 인공지능 기술과 기상 전문 지식의 융합)

  • Ha, Ji-Hun;Park, Kun-Woo;Im, Hyo-Hyuk;Cho, Dong-Hee;Kim, Yong-Hyuk
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.10
    • /
    • pp.63-70
    • /
    • 2021
  • Generating a super-resolution meteological data by using a high-resolution deep neural network can provide precise research and useful real-life services. We propose a new technique of generating improved training data for super-resolution deep neural networks. To generate high-resolution meteorological data with domain specific knowledge, Lambert conformal conic projection and objective analysis were applied based on observation data and ERA5 reanalysis field data of specialized institutions. As a result, temperature and humidity analysis data based on domain specific knowledge showed improved RMSE by up to 42% and 46%, respectively. Next, a super-resolution generative adversarial network (SRGAN) which is one of the aritifial intelligence techniques was used to automate the manual data generation technique using damain specific techniques as described above. Experiments were conducted to generate high-resolution data with 1 km resolution from global model data with 10 km resolution. Finally, the results generated with SRGAN have a higher resoltuion than the global model input data, and showed a similar analysis pattern to the manually generated high-resolution analysis data, but also showed a smooth boundary.

A Research on Re-examining Discriminator Design Space for Performance Improvement of ESRGAN (ESRGAN의 성능 향상을 위한 판별자 설계 공간 재검토에 관한 연구)

  • Sung-Wook Park;Jun-Yeong Kim;Jun Park;Se-Hoon Jung;Chun-Bo Sim
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.513-514
    • /
    • 2023
  • 초해상은 저해상도의 영상을 고해상도 영상으로 합성하는 기술이다. 이 기술에 딥러닝이 적용되어, 2014년에는 SRCNN(Super Resolution Convolutional Neural Network) 모델이 발표됐다. 이후에는 SRCAE(Super Resolution Convolutional Autoencoders)와 GAN(Generative Adversarial Networks)을 기반으로 한 SRGAN(Super Resolution Generative Adversarial Networks) 등, SRCNN의 성능을 능가하는 모델들이 발표됐다. ESRGAN(Enhanced Super Resolution Generative Adversarial Networks)은 SRGAN 모델의 성능을 개선했지만, 완벽한 성능을 내지 못하는 문제점이 있다. 이에 본 논문에서는 판별자(Discriminator) 구조를 변경하여 ESRGAN의 성능을 개선한다. 실험 결과, 제안하는 모델이 ESRGAN보다 더 높은 성능을 보일 것으로 기대된다.

Improved Method of License Plate Detection and Recognition Facilitated by Fast Super-Resolution GAN (Fast Super-Resolution GAN 기반 자동차 번호판 검출 및 인식 성능 고도화 기법)

  • Min, Dongwook;Lim, Hyunseok;Gwak, Jeonghwan
    • Smart Media Journal
    • /
    • v.9 no.4
    • /
    • pp.134-143
    • /
    • 2020
  • Vehicle License Plate Recognition is one of the approaches for transportation and traffic safety networks, such as traffic control, speed limit enforcement and runaway vehicle tracking. Although it has been studied for decades, it is attracting more and more attention due to the recent development of deep learning and improved performance. Also, it is largely divided into license plate detection and recognition. In this study, experiments were conducted to improve license plate detection performance by utilizing various object detection methods and WPOD-Net(Warped Planar Object Detection Network) model. The accuracy was improved by selecting the method of detecting the vehicle(s) and then detecting the license plate(s) instead of the conventional method of detecting the license plate using the object detection model. In particular, the final performance was improved through the process of removing noise existing in the image by using the Fast-SRGAN model, one of the Super-Resolution methods. As a result, this experiment showed the performance has improved an average of 4.34% from 92.38% to 96.72% compared to previous studies.

Optimization And Performance Analysis Via GAN Model Layer Pruning (레이어 프루닝을 이용한 생성적 적대 신경망 모델 경량화 및 성능 분석 연구)

  • Kim, Dong-hwi;Park, Sang-hyo;Bae, Byeong-jun;Cho, Suk-hee
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • fall
    • /
    • pp.80-81
    • /
    • 2021
  • 딥 러닝 모델 사용에 있어서, 일반적인 사용자가 이용할 수 있는 하드웨어 리소스는 제한적이기 때문에 기존 모델을 경량화 할 수 있는 프루닝 방법을 통해 제한적인 리소스를 효과적으로 활용할 수 있도록 한다. 그 방법으로, 여러 딥 러닝 모델들 중 비교적 파라미터 수가 많은 것으로 알려진 GAN 아키텍처에 네트워크 프루닝을 적용함으로써 비교적 무거운 모델을 적은 파라미터를 통해 학습할 수 있는 방법을 제시한다. 또한, 본 논문을 통해 기존의 SRGAN 논문에서 가장 효과적인 결과로 제시했던 16 개의 residual block 의 개수를 실제로 줄여 봄으로써 기존 논문에서 제시했던 결과와의 차이에 대해 서술한다.

  • PDF

A Study on the Video Quality Improvement of National Intangible Cultural Heritage Documentary Film (국가무형문화재 기록영상 화질 개선에 관한 연구)

  • Kwon, Do-Hyung;Yu, Jeong-Min
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2020.07a
    • /
    • pp.439-441
    • /
    • 2020
  • 본 논문에서는 국가무형문화재 기록영상의 화질 개선에 관한 연구를 진행한다. 기록영상의 화질 개선을 위해 SRGAN 기반의 초해상화 복원영상 생성 프레임워크의 적용을 제안한다. Image aumentation과 median filter를 적용한 데이터셋과 적대적 신경망인 Generative Adversarial Network (GAN)을 기반으로 딥러닝 네트워크를 구축하여 입력된 Low-Resolution 이미지를 통해 High-Resolution의 복원 영상을 생성한다. 이 연구를 통해 국가무형문화재 기록영상 뿐만 아니라 문화재 전반의 사진 및 영상 기록 자료의 품질 개선 가능성을 제시하고, 영상 기록 자료의 아카이브 구축을 통해 지속적인 활용의 기초연구가 되는 것을 목표로 한다.

  • PDF

A Study on Image Quality Improvement for 3D Pagoda Restoration (3D 탑복원을 위한 화질 개선에 관한 연구)

  • Kim, Beom Jun-Ji;Lee, Hyun-woo;Kim, Ki-hyeop;Kim, Eun-ji;Kim, Young-jin;Lee, Byong-Kwon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.07a
    • /
    • pp.145-147
    • /
    • 2022
  • 본 논문에서는 훼손되어 식별할 수 없는 탑 이미지를 비롯해 낮은 해상도의 탑 이미지를 개선하기 위해 우리는 탑 이미지의 화질 개선을 인공지능을 이용하여 빠르게 개선을 해 보고자 한다. 최근에 Generative Adversarial Networks(GANS) 알고리즘에서 SrGAN 알고리즘이 나오면서 이미지 생성, 이미지 복원, 해상도 변화 분야가 지속해서 발전하고 있다. 이에 본 연구에서는 다양한 GAN 알고리즘을 화질 개선에 적용해 보았다. 탑 이미지에 GAN 알고리즘 중 SrGan을 적용하였으며 실험한 결과 Srgan 알고리즘은 학습이 진행되었으며, 낮은 해상도의 탑 이미지가 높은 해상도, 초고해상도 이미지가 생성되는 것을 확인했다.

  • PDF