통합 검색 | Korea Science

초고해상도 복원에서 성능 향상을 위한 다양한 Attention 연구 (A Study on Various Attention for Improving Performance in Single Image Super Resolution)

문환복;윤상민
- 방송공학회논문지
- /
- 제25권6호
- /
- pp.898-910
- /
- 2020
컴퓨터 비전에서 단일 영상 기반의 초고해상도 영상 복원의 중요성과 확장성으로 관련 분야에서 많은 연구가 진행되어 왔으며, 최근 딥러닝에 대한 관심이 증가하면서 딥러닝을 활용한 단안 영상 기반 초고해상도 연구가 활발히 진행되고 있다. 대부분의 딥러닝을 기반으로 하는 단안 영상 기반 초고해상도 복원 연구는 복원 성능을 향상시키기 위해 네트워크의 구조, 손실 함수, 학습 방법에 초점이 맞추어 연구가 진행되었다. 한편, 딥러닝 네트워크를 깊게 쌓지 않고 초고해상도 영상 복원 성능을 향상시키기 위해 추출된 특징 맵을 강조하는 Attention Module에 대한 연구가 다양한 분야에 적용되어 왔다. Attention Module은 다양한 관점에서 네트워크의 목적에 맞는 특징 정보를 강조 및 스케일링 한다. 본 논문에서는 초고해상도 복원 네트워크를 기반으로 다양한 구조의 Channel Attention과 Spatial Attention을 설계하고, 다양한 관점에서 특징 맵을 강조하기 위해 다중 Attention Module 구조를 설계하여 성능을 분석 및 비교한다.
https://doi.org/10.5909/JBE.2020.25.6.898 인용 PDF KSCI KPUBS

DA-Res2Net: a novel Densely connected residual Attention network for image semantic segmentation

Zhao, Xiaopin;Liu, Weibin;Xing, Weiwei;Wei, Xiang
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제14권11호
- /
- pp.4426-4442
- /
- 2020
Since scene segmentation is becoming a hot topic in the field of autonomous driving and medical image analysis, researchers are actively trying new methods to improve segmentation accuracy. At present, the main issues in image semantic segmentation are intra-class inconsistency and inter-class indistinction. From our analysis, the lack of global information as well as macroscopic discrimination on the object are the two main reasons. In this paper, we propose a Densely connected residual Attention network (DA-Res2Net) which consists of a dense residual network and channel attention guidance module to deal with these problems and improve the accuracy of image segmentation. Specifically, in order to make the extracted features equipped with stronger multi-scale characteristics, a densely connected residual network is proposed as a feature extractor. Furthermore, to improve the representativeness of each channel feature, we design a Channel-Attention-Guide module to make the model focusing on the high-level semantic features and low-level location features simultaneously. Experimental results show that the method achieves significant performance on various datasets. Compared to other state-of-the-art methods, the proposed method reaches the mean IOU accuracy of 83.2% on PASCAL VOC 2012 and 79.7% on Cityscapes dataset, respectively.
https://doi.org/10.3837/tiis.2020.11.010 인용 PDF KSCI HTML

Attention-based for Multiscale Fusion Underwater Image Enhancement

Huang, Zhixiong;Li, Jinjiang;Hua, Zhen
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제16권2호
- /
- pp.544-564
- /
- 2022
Underwater images often suffer from color distortion, blurring and low contrast, which is caused by the propagation of light in the underwater environment being affected by the two processes: absorption and scattering. To cope with the poor quality of underwater images, this paper proposes a multiscale fusion underwater image enhancement method based on channel attention mechanism and local binary pattern (LBP). The network consists of three modules: feature aggregation, image reconstruction and LBP enhancement. The feature aggregation module aggregates feature information at different scales of the image, and the image reconstruction module restores the output features to high-quality underwater images. The network also introduces channel attention mechanism to make the network pay more attention to the channels containing important information. The detail information is protected by real-time superposition with feature information. Experimental results demonstrate that the method in this paper produces results with correct colors and complete details, and outperforms existing methods in quantitative metrics.
https://doi.org/10.3837/tiis.2022.02.010 인용 PDF KSCI HTML

Crack detection based on ResNet with spatial attention

Yang, Qiaoning;Jiang, Si;Chen, Juan;Lin, Weiguo
- Computers and Concrete
- /
- 제26권5호
- /
- pp.411-420
- /
- 2020
Deep Convolution neural network (DCNN) has been widely used in the healthy maintenance of civil infrastructure. Using DCNN to improve crack detection performance has attracted many researchers' attention. In this paper, a light-weight spatial attention network module is proposed to strengthen the representation capability of ResNet and improve the crack detection performance. It utilizes attention mechanism to strengthen the interested objects in global receptive field of ResNet convolution layers. Global average spatial information over all channels are used to construct an attention scalar. The scalar is combined with adaptive weighted sigmoid function to activate the output of each channel's feature maps. Salient objects in feature maps are refined by the attention scalar. The proposed spatial attention module is stacked in ResNet50 to detect crack. Experiments results show that the proposed module can got significant performance improvement in crack detection.
https://doi.org/10.12989/cac.2020.26.5.411 인용 KSCI

Recovery of underwater images based on the attention mechanism and SOS mechanism

Li, Shiwen;Liu, Feng;Wei, Jian
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제16권8호
- /
- pp.2552-2570
- /
- 2022
Underwater images usually have various problems, such as the color cast of underwater images due to the attenuation of different lights in water, the darkness of image caused by the lack of light underwater, and the haze effect of underwater images because of the scattering of light. To address the above problems, the channel attention mechanism, strengthen-operate-subtract (SOS) boosting mechanism and gated fusion module are introduced in our paper, based on which, an underwater image recovery network is proposed. First, for the color cast problem of underwater images, the channel attention mechanism is incorporated in our model, which can well alleviate the color cast of underwater images. Second, as for the darkness of underwater images, the similarity between the target underwater image after dehazing and color correcting, and the image output by our model is used as the loss function, so as to increase the brightness of the underwater image. Finally, we employ the SOS boosting module to eliminate the haze effect of underwater images. Moreover, experiments were carried out to evaluate the performance of our model. The qualitative analysis results show that our method can be applied to effectively recover the underwater images, which outperformed most methods for comparison according to various criteria in the quantitative analysis.
https://doi.org/10.3837/tiis.2022.08.005 인용 PDF KSCI HTML

합성곱 신경망의 Channel Attention 모듈 및 제한적인 각도 다양성 조건에서의 SAR 표적영상 식별로의 적용 (Channel Attention Module in Convolutional Neural Network and Its Application to SAR Target Recognition Under Limited Angular Diversity Condition)

박지훈;서승모;유지희
- 한국군사과학기술학회지
- /
- 제24권2호
- /
- pp.175-186
- /
- 2021
In the field of automatic target recognition(ATR) with synthetic aperture radar(SAR) imagery, it is usually impractical to obtain SAR target images covering a full range of aspect views. When the database consists of SAR target images with limited angular diversity, it can lead to performance degradation of the SAR-ATR system. To address this problem, this paper proposes a deep learning-based method where channel attention modules(CAMs) are inserted to a convolutional neural network(CNN). Motivated by the idea of the squeeze-and-excitation(SE) network, the CAM is considered to help improve recognition performance by selectively emphasizing discriminative features and suppressing ones with less information. After testing various CAM types included in the ResNet18-type base network, the SE CAM and its modified forms are applied to SAR target recognition using MSTAR dataset with different reduction ratios in order to validate recognition performance improvement under the limited angular diversity condition.
https://doi.org/10.9766/KIMST.2021.24.2.175 인용 PDF KSCI

Dual Attention Based Image Pyramid Network for Object Detection

Dong, Xiang;Li, Feng;Bai, Huihui;Zhao, Yao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제15권12호
- /
- pp.4439-4455
- /
- 2021
Compared with two-stage object detection algorithms, one-stage algorithms provide a better trade-off between real-time performance and accuracy. However, these methods treat the intermediate features equally, which lacks the flexibility to emphasize meaningful information for classification and location. Besides, they ignore the interaction of contextual information from different scales, which is important for medium and small objects detection. To tackle these problems, we propose an image pyramid network based on dual attention mechanism (DAIPNet), which builds an image pyramid to enrich the spatial information while emphasizing multi-scale informative features based on dual attention mechanisms for one-stage object detection. Our framework utilizes a pre-trained backbone as standard detection network, where the designed image pyramid network (IPN) is used as auxiliary network to provide complementary information. Here, the dual attention mechanism is composed of the adaptive feature fusion module (AFFM) and the progressive attention fusion module (PAFM). AFFM is designed to automatically pay attention to the feature maps with different importance from the backbone and auxiliary network, while PAFM is utilized to adaptively learn the channel attentive information in the context transfer process. Furthermore, in the IPN, we build an image pyramid to extract scale-wise features from downsampled images of different scales, where the features are further fused at different states to enrich scale-wise information and learn more comprehensive feature representations. Experimental results are shown on MS COCO dataset. Our proposed detector with a 300 × 300 input achieves superior performance of 32.6% mAP on the MS COCO test-dev compared with state-of-the-art methods.
https://doi.org/10.3837/tiis.2021.12.010 인용 PDF KSCI

휴대용 PC내에 실장된 강제공랭 모듈 주위의 유체유동과 온도분포 (Fluid Flow and Temperature Distribution around a Surface-Mounted Module Cooled by Forced Air Flow in a Portable Personal Computers)

박상희;신대종;이인태
- 대한기계학회:학술대회논문집
- /
- 대한기계학회 2002년도 학술대회지
- /
- pp.729-732
- /
- 2002
This paper reports an experimental study around a module about forced air flow by blower($35{\times}35{\times}6mm^3$) in portable PC(10mm high, 200mm wide, and 235mm long). The channel inlet flow velocity has been varied between 0.26, 0.52 and 0.78m/s. The power input to the module is 4Wthis report, particular attention is directed to the fluid flow and adiabatic wall temperature($T_(ad)$) around a module which is under fluid mechanical and thermal influences of the module. The fluid flow around a module was visualized using PIV system. Liquid crystal thernography is used to determine the adiabatic wall temperature around a heated module on an acrylic board. Plots of $T_(ad)$ (or F) show marked effects of dispersion of thermal wake near the module.
PDF

Multi-scale context fusion network for melanoma segmentation

Zhenhua Li;Lei Zhang
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제18권7호
- /
- pp.1888-1906
- /
- 2024
Aiming at the problems that the edge of melanoma image is fuzzy, the contrast with the background is low, and the hair occlusion makes it difficult to segment accurately, this paper proposes a model MSCNet for melanoma segmentation based on U-net frame. Firstly, a multi-scale pyramid fusion module is designed to reconstruct the skip connection and transmit global information to the decoder. Secondly, the contextural information conduction module is innovatively added to the top of the encoder. The module provides different receptive fields for the segmented target by using the hole convolution with different expansion rates, so as to better fuse multi-scale contextural information. In addition, in order to suppress redundant information in the input image and pay more attention to melanoma feature information, global channel attention mechanism is introduced into the decoder. Finally, In order to solve the problem of lesion class imbalance, this paper uses a combined loss function. The algorithm of this paper is verified on ISIC 2017 and ISIC 2018 public datasets. The experimental results indicate that the proposed algorithm has better accuracy for melanoma segmentation compared with other CNN-based image segmentation algorithms.
https://doi.org/10.3837/tiis.2024.07.009 인용 PDF HTML

Real Scene Text Image Super-Resolution Based on Multi-Scale and Attention Fusion

Xinhua Lu;Haihai Wei;Li Ma;Qingji Xue;Yonghui Fu
- Journal of Information Processing Systems
- /
- 제19권4호
- /
- pp.427-438
- /
- 2023
Plenty of works have indicated that single image super-resolution (SISR) models relying on synthetic datasets are difficult to be applied to real scene text image super-resolution (STISR) for its more complex degradation. The up-to-date dataset for realistic STISR is called TextZoom, while the current methods trained on this dataset have not considered the effect of multi-scale features of text images. In this paper, a multi-scale and attention fusion model for realistic STISR is proposed. The multi-scale learning mechanism is introduced to acquire sophisticated feature representations of text images; The spatial and channel attentions are introduced to capture the local information and inter-channel interaction information of text images; At last, this paper designs a multi-scale residual attention module by skillfully fusing multi-scale learning and attention mechanisms. The experiments on TextZoom demonstrate that the model proposed increases scene text recognition's (ASTER) average recognition accuracy by 1.2% compared to text super-resolution network.
https://doi.org/10.3745/JIPS.02.0199 인용 PDF

검색결과 19건 처리시간 0.021초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)