• Title/Summary/Keyword: super 해상도

Search Result 240, Processing Time 0.021 seconds

Super-resolution Algorithm using Discrete Wavelet Transform for Single-image (이산 웨이블릿 변환을 이용한 영상의 초고해상도 기법)

  • Lim, Jong-Myeong;Yoo, Ji-Sang
    • Journal of Broadcast Engineering
    • /
    • v.17 no.2
    • /
    • pp.344-353
    • /
    • 2012
  • In this paper, we propose a super-resolution algorithm using discrete wavelet transform. In general super-resolution algorithms for single-image, probability based operations have been used for searching high-frequency components. Consequently, the complexity of the algorithm causes the increase of processing time. In the proposed algorithm, we use discrete wavelet transform to find high-frequency sub-bands. We perform inverse discrete wavelet transform using input image and high-frequency sub-bands of the same resolution as the input image which are obtained by performing discrete wavelet transform without down-sampling and then we obtain image with high-resolution. In the proposed algorithm, we use the down-sampled version of the original image ($512{\times}512$) as a test image ($256{\times}256$) to compare the performance of algorithms. Through experimental results, we confirm the improved efficiency of the proposed algorithm comparing with conventional interpolation algorithms and also decreased processing time comparing the probability based operations.

Deep Learning-based Super Resolution for Phase-only Holograms (위상 홀로그램을 위한 딥러닝 기반의 초고해상도)

  • Kim, Woosuk;Park, Byung-Seo;Kim, Jin-Kyum;Oh, Kwan-Jung;Kim, Jin-Woong;Kim, Dong-Wook;Seo, Young-Ho
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.935-943
    • /
    • 2020
  • In this paper, we propose a method using deep learning for high-resolution display of phase holograms. If a general interpolation method is used, the brightness of the reconstruction result is lowered, and noise and afterimages occur. To solve this problem, a hologram was trained with a neural network structure that showed good performance in the single-image super resolution (SISR). As a result, it was possible to improve the problem that occurred in the reconstruction result and increase the resolution. In addition, by adjusting the number of channels to increase performance, the result increased by more than 0.3dB in same training.

A Study on Lightweight Transformer Based Super Resolution Model Using Knowledge Distillation (지식 증류 기법을 사용한 트랜스포머 기반 초해상화 모델 경량화 연구)

  • Dong-hyun Kim;Dong-hun Lee;Aro Kim;Vani Priyanka Galia;Sang-hyo Park
    • Journal of Broadcast Engineering
    • /
    • v.28 no.3
    • /
    • pp.333-336
    • /
    • 2023
  • Recently, the transformer model used in natural language processing is also applied to the image super resolution field, showing good performance. However, these transformer based models have a disadvantage that they are difficult to use in small mobile devices because they are complex and have many learning parameters and require high hardware resources. Therefore, in this paper, we propose a knowledge distillation technique that can effectively reduce the size of a transformer based super resolution model. As a result of the experiment, it was confirmed that by applying the proposed technique to the student model with reduced number of transformer blocks, performance similar to or higher than that of the teacher model could be obtained.

SAR Image Processing Using SVD-Pseudo Spectrum Technique (SAR에 적용된 SVD-Pseudo Spectrum 기술)

  • Kim, Binhee;Kong, Seung-Hyun
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.3
    • /
    • pp.212-218
    • /
    • 2013
  • This paper presents an SVD(Singular Value Decomposition)-Pseudo Spectrum method for SAR (Synthetic Aperture Radar) imaging. The purpose of this work is to improve resolution and target separability of SAR images. This paper proposes SVD-Pseudo Spectrum method whose advantages are noise robustness, reduction of sidelobes and high resolution of spectral estimation. SVD-Pseudo Spectrum method uses Hankel Matrix of signal components and SVD (Singular Value Decomposition) method. In this paper, it is demonstrated that the SVD-Pseudo Spectrum method shows better performance than the matched filtering method and the conventional super-resolution based multiple signal classification (MUSIC) method in SAR image processing. The targets to be separated are modeled, and this modeled data is used to demonstrate the performance of algorithms.

CCTV Image Quality Enhancement using Histogram Loss and Sequential Task (히스토그램 손실함수와 순차적 작업을 이용한 CCTV 영상 화질 향상)

  • Jeong, Minkyo;Choi, Jongin;Jeong, Jechang
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.217-220
    • /
    • 2022
  • 본 논문에서는 CCTV 영상 화질을 향상하고 해상도를 높이기 위해 딥 러닝(Deep Learning)을 이용하여 잡음 제거(Denoising) 와 초해상도(Super-resolution) 작업을 수행한다. 데이터 증강(Data Augmentation)을 통한 초해상도 성능 향상을 위해서 잡음 제거 네트워크의 출력 영상을 초해상도 네트워크의 입력으로 사용하는 순차적 작업을 사용한다. 또한 딥 러닝을 이용한 영상처리에서 발생하는 평균 밝기 오차 문제를 해결하기 위한 손실함수(Loss Function)와 두 가지 이상의 순차적인 딥 러닝 작업에서 발생하는 문제점을 극복하기 위한 손실함수를 제안한다. 제안하는 손실함수는 네트워크의 출력 영상과 타겟 영상의 밝기 오차를 줄이는 것이 가능하고, 순차적 작업에서 보다 정확한 모델 성능 판단이 가능하다.

  • PDF

Improvement of Mask-RCNN Performance Using Deep-Learning-Based Arbitrary-Scale Super-Resolution Module (딥러닝 기반 임의적 스케일 초해상도 모듈을 이용한 Mask-RCNN 성능 향상)

  • Ahn, Young-Pill;Park, Hyun-Jun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.3
    • /
    • pp.381-388
    • /
    • 2022
  • In instance segmentation, Mask-RCNN is mostly used as a base model. Increasing the performance of Mask-RCNN is meaningful because it affects the performance of the derived model. Mask-RCNN has a transform module for unifying size of input images. In this paper, to improve the Mask-RCNN, we apply deep-learning-based ASSR to the resizing part in the transform module and inject calculated scale information into the model using IM(Integration Module). The proposed IM improves instance segmentation performance by 2.5 AP higher than Mask-RCNN in the COCO dataset, and in the periment for optimizing the IM location, the best performance was shown when it was located in the 'Top' before FPN and backbone were combined. Therefore, the proposed method can improve the performance of models using Mask-RCNN as a base model.

Multiple Binarization Quadtree Framework for Optimizing Deep Learning-Based Smoke Synthesis Method

  • Kim, Jong-Hyun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.4
    • /
    • pp.47-53
    • /
    • 2021
  • In this paper, we propose a quadtree-based optimization technique that enables fast Super-resolution(SR) computation by efficiently classifying and dividing physics-based simulation data required to calculate SR. The proposed method reduces the time required for quadtree computation by downscaling the smoke simulation data used as input data. By binarizing the density of the smoke in this process, a quadtree is constructed while mitigating the problem of numerical loss of density in the downscaling process. The data used for training is the COCO 2017 Dataset, and the artificial neural network uses a VGG19-based network. In order to prevent data loss when passing through the convolutional layer, similar to the residual method, the output value of the previous layer is added and learned. In the case of smoke, the proposed method achieved a speed improvement of about 15 to 18 times compared to the previous approach.

A study on the Performance Analysis of Super-Resolution Algorithms by the activation functions using Jetson Nano (젯슨 나노 기반 활성 함수에 따른 초해상화 알고리즘 성능 분석 연구)

  • Lim, Jae-Yoon;Kim, Yu-Min;Kim, Yongwoo
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.05a
    • /
    • pp.691-694
    • /
    • 2022
  • 최근 고해상도 영상이 필요하게 되었으며, 저해상도 영상을 고해상도 영상으로 변환하는 딥러닝 기반의 초해상도 알고리즘에 대한 연구가 활발히 진행되고 있다. 그럼에도 불구하고 딥러닝 기반의 초해상도 알고리즘은 하드웨어의 한계로 인해 임베디드 시스템에서 실행시간이 느린 단점이 있다. 본 논문에서는 심층신경망 기반의 초해상도 알고리즘의 네트워크 구조를 제시하고 다양한 활성화 함수에 따른 화질 및 실행시간 성능을 분석한다. 실험 결과, 젯슨 나노보드의 다양한 활성화 함수 중 화질과 실행 시간의 관계에서 도출한 최적의 활성화 함수가 PReLU 함수임을 확인하였다.

A Model Compression for Super Resolution Multi Scale Residual Networks based on a Layer-wise Quantization (계층별 양자화 기반 초해상화 다중 스케일 잔차 네트워크 압축)

  • Hwang, Jiwon;Bae, Sung-Ho
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.540-543
    • /
    • 2020
  • 기존의 초해상도 딥러닝 기법은 모델의 깊이가 깊어지면서, 좋은 성능을 내지만 점점 더 복잡해지고 있고, 실제로 사용하는데 있어 많은 시간을 요구한다. 이를 해결하기 위해, 우리는 딥러닝 모델의 가중치를 양자화 하여 추론시간을 줄이고자 한다. 초해상도 모델은 feature extraction, non-linear mapping, reconstruction 세 부분으로 나누어져 있으며, 레이어 사이에 많은 skip-connection 이 존재하는 특징이 있다. 따라서 양자화 시 최종 성능 하락에 미치는 영향력이 레이어 별로 다르며, 이를 감안하여 강화학습으로 레이어 별 최적 bit 를 찾아 성능 하락을 최소화한다. 본 논문에서는 Skip-connection 이 많이 존재하는 MSRN 을 사용하였으며, 결과에서 feature extraction, reconstruction 부분과 블록 내 특정 위치의 레이어가 항상 높은 bit 를 가짐을 알 수 있다. 기존에 영상 분류에 한정되어 사용되었던 혼합 bit 양자화를 사용하여 초해상도 딥러닝 기법의 모델 사이즈를 줄인 최초의 논문이며, 제안 방법은 모바일 등 제한된 환경에 적용 가능할 것으로 생각된다.

  • PDF