Search | Korea Science

Korean Text Image Super-Resolution for Improving Text Recognition Accuracy (텍스트 인식률 개선을 위한 한글 텍스트 이미지 초해상화)

Junhyeong Kwon;Nam Ik Cho
- Journal of Broadcast Engineering
- /
- v.28 no.2
- /
- pp.178-184
- /
- 2023
Finding texts in general scene images and recognizing their contents is a very important task that can be used as a basis for robot vision, visual assistance, and so on. However, for the low-resolution text images, the degradations, such as noise or blur included in text images, are more noticeable, which leads to severe performance degradation of text recognition accuracy. In this paper, we propose a new Korean text image super-resolution based on a Transformer-based model, which generally shows higher performance than convolutional neural networks. In the experiments, we show that text recognition accuracy for Korean text images can be improved when our proposed text image super-resolution method is used. We also propose a new Korean text image dataset for training our model, which contains massive HR-LR Korean text image pairs.
https://doi.org/10.5909/JBE.2023.28.2.178 인용 PDF

Multi-Frame-Based Super Resolution Algorithm by Using Motion Vector Normalization and Edge Pattern Analysis (움직임 벡터의 정규화 및 에지의 패턴 분석을 이용한 복수 영상 기반 초해상도 영상 생성 기법)

Kwon, Soon-Chan;Yoo, Jisang
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.38A no.2
- /
- pp.164-173
- /
- 2013
In this paper, we propose multi-frame based super resolution algorithm by using motion vector normalization and edge pattern analysis. Existing algorithms have constraints of sub-pixel motion and global translation between frames. Thus, applying of algorithms is limited. And single-frame based super resolution algorithm by using discrete wavelet transform which robust to these problems is proposed but it has another problem that quantity of information for interpolation is limited. To solve these problems, we propose motion vector normalization and edge pattern analysis for 2*2 block motion estimation. The experimental results show that the proposed algorithm has better performance than other conventional algorithms.
https://doi.org/10.7840/kics.2013.38A.2.164 인용 PDF KSCI

Performance Analysis of Super-Resolution based Video Coding for HEVC (HEVC 기반 초해상화를 이용한 비디오 부호화 효율 성능 분석)

Ki, Sehwan;Kim, Dae-Eun;Jun, Ki Nam;Baek, Seung Ho;Choi, Jeung Won;Kim, Dong Hyun;Kim, Munchurl
- Journal of Broadcast Engineering
- /
- v.24 no.2
- /
- pp.306-314
- /
- 2019
Since the resolutions of videos increase rapidly, there are continuing needs for effective video compression methods despite an increase in the transmission bandwidth. In order to satisfy such a demand, a reconstructive video coding (RVC) method by using a super resolution has been proposed. Since RVC reduces the resolution of the input video, when frames are compressed to the same size, the number of bits per pixel increases, thereby reducing coding artifacts caused by video coding. However, RVC method using super resolution is not effective in all target bitrates. Comparing the size of the loss generated while downsizing the resolution and the size of the loss caused by the video compression, only when the size of loss generated in the video compression is larger, RVC method can perform the improved compression performance compared to direct video coding. In particular, since HEVC has considerably higher compression performance than the previous standard video codec, it can be experimentally confirmed that the compression distortions become larger than the distortions of downsizing the resolution only in the very low-bitrate conditions. In this paper, we applied RVC based HEVC in various video types and measured the target bitrates that RVC method can be effectively applied.
https://doi.org/10.5909/JBE.2019.24.2.306 인용 PDF KSCI KPUBS HTML

A Study on Lightweight Transformer Based Super Resolution Model Using Knowledge Distillation (지식 증류 기법을 사용한 트랜스포머 기반 초해상화 모델 경량화 연구)

Dong-hyun Kim;Dong-hun Lee;Aro Kim;Vani Priyanka Galia;Sang-hyo Park
- Journal of Broadcast Engineering
- /
- v.28 no.3
- /
- pp.333-336
- /
- 2023
Recently, the transformer model used in natural language processing is also applied to the image super resolution field, showing good performance. However, these transformer based models have a disadvantage that they are difficult to use in small mobile devices because they are complex and have many learning parameters and require high hardware resources. Therefore, in this paper, we propose a knowledge distillation technique that can effectively reduce the size of a transformer based super resolution model. As a result of the experiment, it was confirmed that by applying the proposed technique to the student model with reduced number of transformer blocks, performance similar to or higher than that of the teacher model could be obtained.
https://doi.org/10.5909/JBE.2023.28.3.333 인용 PDF

Regularization-based Superresolution Demosaicing using Aperture Mask Wheels (조리개 마스크 휠을 이용한 정칙화 기반 초해상도 디모자이킹)

Shin, Jeongho
- Journal of Broadcast Engineering
- /
- v.23 no.1
- /
- pp.146-153
- /
- 2018
This paper presents a superresolution demosaicing technique that can restore high-resolution color image from differently blurred low resolution images in Bayer domain. The proposed superresolution demosaicing algorithm uses an aperture mask wheel to get differently blurred low resolution images, so we just need to estimate point spread function at each frame. In addition, it does not require image registration because there is no translational motion between low resolution images. By using a rotatable aperture mask wheel, consecutive captured images provide sufficiently exclusive information for superresolution. Therefore, the proposed method can reduce the registration error between the low-resolution image as well as the calculation amount for superresolution restoration. The existing lens system of the camera can be extended to obtain a superresolution image by only adding an rotatable aperture mask wheels. Finally, in order to verify the performance of the proposed system, experimental results are performed. The proposed method showed the significant improvements in the sense of spatial and color resolution.
https://doi.org/10.5909/JBE.2018.23.1.146 인용 PDF KSCI KPUBS

UHD TV Image Enhancement using Multi-frame Example-based Super-resolution (멀티프레임 예제기반 초해상도 영상복원을 이용한 UHD TV 영상 개선)

Jeong, Seokhwa;Yoon, Inhye;Paik, Joonki
- Journal of the Institute of Electronics and Information Engineers
- /
- v.52 no.3
- /
- pp.154-161
- /
- 2015
A novel multiframe super-resolution (SR) algorithm is presented to overcome the limitation of existing single-image SR algorithms using motion information from adjacent frames in a video. The proposed SR algorithm consists of three steps: i) definition of a local region using interframe motion vectors, ii) multiscale patch generation and adaptive selection of multiple optimum patches, and iii) combination of optimum patches for super-resolution. The proposed algorithm increases the accuracy of patch selection using motion information and multiscale patches. Experimental results show that the proposed algorithm performs better than existing patch-based SR algorithms in the sense of both subjective and objective measures including the peak signal-to-noise ratio (PSNR) and structural similarity measure (SSIM).
https://doi.org/10.5573/ieie.2015.52.3.154 인용 PDF KSCI

SqueezeNet based Single Image Super Resolution using Knowledge Distillation (SqueezeNet 기반의 지식 증류 가법을 활용한 초해상화 기법)

Seo, Yu lim;Kang, Suk-Ju
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2020.11a
- /
- pp.226-227
- /
- 2020
근래의 초해상화 (super-resolution, SR) 연구는 네트워크를 깊고, 넓게 만들어 성능을 높이는데 주를 이뤘다. 그러나 동시에 높은 연산량과 메모리 소비량이 증가하는 문제가 발생하기 때문에 이를 실제로 하드웨어로 구현하기에는 어려운 문제가 존재한다. 그렇기에 우리는 네트워크 최적화를 통해 성능 감소를 최소화하면서 파라미터 수를 줄이는 네트워크 SqueezeSR을 설계하였다. 또한 지식 증류(Knowledge Distillation, KD)를 이용해 추가적인 파라미터 수 증가 없이 성능을 높일 수 있는 학습 방법을 제안한다. 또한 KD 시 teacher network의 성능이 보다 student network에 잘 전달되도록 feature map 간의 비교를 통해 학습 효율을 높일 수 있었다. 결과적으로 우리는 KD 기법을 통해 추가적인 파라미터 수 증가 없이 성능을 높여 다른 SR네트워크 대비 더 빠르고 성능 감소를 최소화한 네트워크를 제안한다.
PDF

A Study on the Video Quality Improvement of National Intangible Cultural Heritage Documentary Film (국가무형문화재 기록영상 화질 개선에 관한 연구)

Kwon, Do-Hyung;Yu, Jeong-Min
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2020.07a
- /
- pp.439-441
- /
- 2020
본 논문에서는 국가무형문화재 기록영상의 화질 개선에 관한 연구를 진행한다. 기록영상의 화질 개선을 위해 SRGAN 기반의 초해상화 복원영상 생성 프레임워크의 적용을 제안한다. Image aumentation과 median filter를 적용한 데이터셋과 적대적 신경망인 Generative Adversarial Network (GAN)을 기반으로 딥러닝 네트워크를 구축하여 입력된 Low-Resolution 이미지를 통해 High-Resolution의 복원 영상을 생성한다. 이 연구를 통해 국가무형문화재 기록영상 뿐만 아니라 문화재 전반의 사진 및 영상 기록 자료의 품질 개선 가능성을 제시하고, 영상 기록 자료의 아카이브 구축을 통해 지속적인 활용의 기초연구가 되는 것을 목표로 한다.
PDF

Super-resolution based on multi-channel input convolutional residual neural network (다중 채널 입력 Convolution residual neural networks 기반의 초해상화 기법)

Youm, Gwang-Young;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.06a
- /
- pp.37-39
- /
- 2016
최근 Convolutional neural networks(CNN) 기반의 초해상화 기법인 Super-Resolution Convolutional Neural Networks (SRCNN) 이 좋은 PSNR 성능을 발휘하는 것으로 보고되었다 [1]. 하지만 많은 제안 방법들이 고주파 성분을 복원하는데 한계를 드러내는 것처럼, SRCNN 도 고주파 성분 복원에 한계점을 지니고 있다. 또한 SRCNN 의 네트워크 층을 깊게 만들면 좋은 PSNR 성능을 발휘하는 것으로 널리 알려져 있지만, 네트워크의 층을 깊게 하는 것은 네트워크 파라미터 학습을 어렵게 하는 경향이 있다. 네트워크의 층을 깊게 할 경우, gradient 값이 아래(역방향) 층으로 갈수록 발산하거나 0 으로 수렴하여, 네트워크 파라미터 학습이 제대로 되지 않는 현상이 발생하기 때문이다. 따라서 본 논문에서는 네트워크 층을 깊게 하는 대신에, 입력을 다중 채널로 구성하여, 네트워크에 고주파 성분에 관한 추가적인 정보를 주는 방법을 제안하였다. 많은 초해상화 기법들이 고주파 성분의 복원 능력이 부족하다는 점에 착안하여, 우리는 네트워크가 고주파 성분에 관한 많은 정보를 필요로 한다는 것을 가정하였다. 따라서 우리는 네트워크의 입력을 고주파 성분이 여러 가지 강도로 입력되도록 저해상도 입력 영상들을 구성하였다. 또한 잔차신호 네트워크(residual networks)를 도입하여, 네트워크 파라미터를 학습할 때 고주파 성분의 복원에 집중할 수 있도록 하였다. 본 논문의 효율성을 검증하기 위하여 set5 데이터와 set14 데이터에 관하여 실험을 진행하였고, SRCNN 과 비교하여 set5 데이터에서는 2, 3, 4 배에 관하여 각각 평균 0.29, 0.35, 0.17dB 의 PSNR 성능 향상이 있었으며, set14 데이터에서는 3 배의 관하여 평균 0.20dB 의 PSNR 성능 향상이 있었다.
PDF

Stochastic Weight Averaging for Improving the Performance of Image Super-Resolution (Stochastic Weight Averaging 알고리즘을 이용한 이미지 초해상도 성능 개선)

Yoon, Jeong Hwan;Cho, Nam Ik
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2021.06a
- /
- pp.345-347
- /
- 2021
단일 이미지 초해상도는 딥러닝의 발전과 함께 놀라운 성능 향상이 이루어 졌다. 이러한 딥러닝 모델은 매우 많은 파라미터를 갖고 있어 많은 연산량과 메모리를 필요로 한다. 하지만 사용할 수 있는 리소스는 한정되어 있기 때문에 네트워크를 경량화 시키려는 연구도 지속되어 왔다. 본 논문에서는 Stochastic Weight Averaging (SWA) 알고리즘을 이용하여 상대적으로 적은 양의 메모리와 연산을 추가해 이미지 초해상도 모델의 성능을 높이고 안정적인 학습을 달성하였다. SWA 알고리즘을 적용한 모델은 그렇지 않은 모델에 비해 테스트셋에서 최대 0.13dB 의 성능 향상을 보였다.
PDF

Search Result 102, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)