Search | Korea Science

Boundary and Reverse Attention Module for Lung Nodule Segmentation in CT Images (CT 영상에서 폐 결절 분할을 위한 경계 및 역 어텐션 기법)

Hwang, Gyeongyeon;Ji, Yewon;Yoon, Hakyoung;Lee, Sang Jun
- IEMEK Journal of Embedded Systems and Applications
- /
- v.17 no.5
- /
- pp.265-272
- /
- 2022
As the risk of lung cancer has increased, early-stage detection and treatment of cancers have received a lot of attention. Among various medical imaging approaches, computer tomography (CT) has been widely utilized to examine the size and growth rate of lung nodules. However, the process of manual examination is a time-consuming task, and it causes physical and mental fatigue for medical professionals. Recently, many computer-aided diagnostic methods have been proposed to reduce the workload of medical professionals. In recent studies, encoder-decoder architectures have shown reliable performances in medical image segmentation, and it is adopted to predict lesion candidates. However, localizing nodules in lung CT images is a challenging problem due to the extremely small sizes and unstructured shapes of nodules. To solve these problems, we utilize atrous spatial pyramid pooling (ASPP) to minimize the loss of information for a general U-Net baseline model to extract rich representations from various receptive fields. Moreover, we propose mixed-up attention mechanism of reverse, boundary and convolutional block attention module (CBAM) to improve the accuracy of segmentation small scale of various shapes. The performance of the proposed model is compared with several previous attention mechanisms on the LIDC-IDRI dataset, and experimental results demonstrate that reverse, boundary, and CBAM (RB-CBAM) are effective in the segmentation of small nodules.
https://doi.org/10.14372/IEMEK.2022.17.5.265 인용 PDF KSCI

Main Cause of the Interference between Visual Search and Spatial Working Memory Task (시각 탐색과 공간적 작업기억간 상호 간섭의 원인)

Ahn Jae-Won;Kim Min-Shik
- Korean Journal of Cognitive Science
- /
- v.16 no.3
- /
- pp.155-174
- /
- 2005
Oh and Kim (2004) and Woodman and Lurk (2004) demonstrated that spatial working memory (SWM) load Interfered concurrent visual search and that search process also impaired the maintenance of spatial information implying that visual search and SWM task both require access to the same limited-capacity mechanism. Two obvious possibilities have been suggested about what this shared limited-capacity mechanism is: common demand for attention to the locations where the items f9r the two tasks were presented (spatial attention load hypothesis), and common use of working memory to maintain a record of locations have been processed(SWM load hypothesis). To test these two hypothetical explanations, Experiment 1 replicated the mutual interference between visual search and SWM task in spite of difference of procedure with preceding researches; possible areas where the items for two tasks were presented were not separated. In Experiment 2, we presented the items for visual search either in the same quadrants where the items for SWM task had appeared (same-location rendition) or in the different quadrants (different-location condition). As a result, search efficiency was more impaired in the different-location condition than in the same-location condition. The memory accuracy was worse in the different-location rendition than in the same-location rendition. Overall results of study indicate that the mutual interference between SWM and visual search might be related to the overload of spatial attention, but not to that of SWM.
PDF

Dual Attention Based Image Pyramid Network for Object Detection

Dong, Xiang;Li, Feng;Bai, Huihui;Zhao, Yao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.15 no.12
- /
- pp.4439-4455
- /
- 2021
Compared with two-stage object detection algorithms, one-stage algorithms provide a better trade-off between real-time performance and accuracy. However, these methods treat the intermediate features equally, which lacks the flexibility to emphasize meaningful information for classification and location. Besides, they ignore the interaction of contextual information from different scales, which is important for medium and small objects detection. To tackle these problems, we propose an image pyramid network based on dual attention mechanism (DAIPNet), which builds an image pyramid to enrich the spatial information while emphasizing multi-scale informative features based on dual attention mechanisms for one-stage object detection. Our framework utilizes a pre-trained backbone as standard detection network, where the designed image pyramid network (IPN) is used as auxiliary network to provide complementary information. Here, the dual attention mechanism is composed of the adaptive feature fusion module (AFFM) and the progressive attention fusion module (PAFM). AFFM is designed to automatically pay attention to the feature maps with different importance from the backbone and auxiliary network, while PAFM is utilized to adaptively learn the channel attentive information in the context transfer process. Furthermore, in the IPN, we build an image pyramid to extract scale-wise features from downsampled images of different scales, where the features are further fused at different states to enrich scale-wise information and learn more comprehensive feature representations. Experimental results are shown on MS COCO dataset. Our proposed detector with a 300 × 300 input achieves superior performance of 32.6% mAP on the MS COCO test-dev compared with state-of-the-art methods.
https://doi.org/10.3837/tiis.2021.12.010 인용 PDF KSCI

Region of Interest Detection Based on Visual Attention and Threshold Segmentation in High Spatial Resolution Remote Sensing Images

Zhang, Libao;Li, Hao
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.7 no.8
- /
- pp.1843-1859
- /
- 2013
The continuous increase of the spatial resolution of remote sensing images brings great challenge to image analysis and processing. Traditional prior knowledge-based region detection and target recognition algorithms for processing high resolution remote sensing images generally employ a global searching solution, which results in prohibitive computational complexity. In this paper, a more efficient region of interest (ROI) detection algorithm based on visual attention and threshold segmentation (VA-TS) is proposed, wherein a visual attention mechanism is used to eliminate image segmentation and feature detection to the entire image. The input image is subsampled to decrease the amount of data and the discrete moment transform (DMT) feature is extracted to provide a finer description of the edges. The feature maps are combined with weights according to the amount of the "strong points" and the "salient points". A threshold segmentation strategy is employed to obtain more accurate region of interest shape information with the very low computational complexity. Experimental statistics have shown that the proposed algorithm is computational efficient and provide more visually accurate detection results. The calculation time is only about 0.7% of the traditional Itti's model.
https://doi.org/10.3837/tiis.2013.08.006 인용 PDF KSCI

A New Residual Attention Network based on Attention Models for Human Action Recognition in Video

Kim, Jee-Hyun;Cho, Young-Im
- Journal of the Korea Society of Computer and Information
- /
- v.25 no.1
- /
- pp.55-61
- /
- 2020
With the development of deep learning technology and advances in computing power, video-based research is now gaining more and more attention. Video data contains a large amount of temporal and spatial information, which is the biggest difference compared with image data. It has a larger amount of data. It has attracted intense attention in computer vision. Among them, motion recognition is one of the research focuses. However, the action recognition of human in the video is extremely complex and challenging subject. Based on many research in human beings, we have found that artificial intelligence-like attention mechanisms are an efficient model for cognition. This efficient model is ideal for processing image information and complex continuous video information. We introduce this attention mechanism into video action recognition, paying attention to human actions in video and effectively improving recognition efficiency. In this paper, we propose a new 3D residual attention network using convolutional neural network based on two attention models to identify human action behavior in the video. An evaluation result of our model showed up to 90.7% accuracy.
https://doi.org/10.9708/jksci.2020.25.01.055 인용 PDF KSCI

Skin Lesion Segmentation with Codec Structure Based Upper and Lower Layer Feature Fusion Mechanism

Yang, Cheng;Lu, GuanMing
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.16 no.1
- /
- pp.60-79
- /
- 2022
The U-Net architecture-based segmentation models attained remarkable performance in numerous medical image segmentation missions like skin lesion segmentation. Nevertheless, the resolution gradually decreases and the loss of spatial information increases with deeper network. The fusion of adjacent layers is not enough to make up for the lost spatial information, thus resulting in errors of segmentation boundary so as to decline the accuracy of segmentation. To tackle the issue, we propose a new deep learning-based segmentation model. In the decoding stage, the feature channels of each decoding unit are concatenated with all the feature channels of the upper coding unit. Which is done in order to ensure the segmentation effect by integrating spatial and semantic information, and promotes the robustness and generalization of our model by combining the atrous spatial pyramid pooling (ASPP) module and channel attention module (CAM). Extensive experiments on ISIC2016 and ISIC2017 common datasets proved that our model implements well and outperforms compared segmentation models for skin lesion segmentation.
https://doi.org/10.3837/tiis.2022.01.004 인용 PDF KSCI HTML

Boundary-Aware Dual Attention Guided Liver Segment Segmentation Model

Jia, Xibin;Qian, Chen;Yang, Zhenghan;Xu, Hui;Han, Xianjun;Ren, Hao;Wu, Xinru;Ma, Boyang;Yang, Dawei;Min, Hong
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.16 no.1
- /
- pp.16-37
- /
- 2022
Accurate liver segment segmentation based on radiological images is indispensable for the preoperative analysis of liver tumor resection surgery. However, most of the existing segmentation methods are not feasible to be used directly for this task due to the challenge of exact edge prediction with some tiny and slender vessels as its clinical segmentation criterion. To address this problem, we propose a novel deep learning based segmentation model, called Boundary-Aware Dual Attention Liver Segment Segmentation Model (BADA). This model can improve the segmentation accuracy of liver segments with enhancing the edges including the vessels serving as segment boundaries. In our model, the dual gated attention is proposed, which composes of a spatial attention module and a semantic attention module. The spatial attention module enhances the weights of key edge regions by concerning about the salient intensity changes, while the semantic attention amplifies the contribution of filters that can extract more discriminative feature information by weighting the significant convolution channels. Simultaneously, we build a dataset of liver segments including 59 clinic cases with dynamically contrast enhanced MRI(Magnetic Resonance Imaging) of portal vein stage, which annotated by several professional radiologists. Comparing with several state-of-the-art methods and baseline segmentation methods, we achieve the best results on this clinic liver segment segmentation dataset, where Mean Dice, Mean Sensitivity and Mean Positive Predicted Value reach 89.01%, 87.71% and 90.67%, respectively.
https://doi.org/10.3837/tiis.2022.01.002 인용 PDF KSCI HTML

Real Scene Text Image Super-Resolution Based on Multi-Scale and Attention Fusion

Xinhua Lu;Haihai Wei;Li Ma;Qingji Xue;Yonghui Fu
- Journal of Information Processing Systems
- /
- v.19 no.4
- /
- pp.427-438
- /
- 2023
Plenty of works have indicated that single image super-resolution (SISR) models relying on synthetic datasets are difficult to be applied to real scene text image super-resolution (STISR) for its more complex degradation. The up-to-date dataset for realistic STISR is called TextZoom, while the current methods trained on this dataset have not considered the effect of multi-scale features of text images. In this paper, a multi-scale and attention fusion model for realistic STISR is proposed. The multi-scale learning mechanism is introduced to acquire sophisticated feature representations of text images; The spatial and channel attentions are introduced to capture the local information and inter-channel interaction information of text images; At last, this paper designs a multi-scale residual attention module by skillfully fusing multi-scale learning and attention mechanisms. The experiments on TextZoom demonstrate that the model proposed increases scene text recognition's (ASTER) average recognition accuracy by 1.2% compared to text super-resolution network.
https://doi.org/10.3745/JIPS.02.0199 인용 PDF

DATCN: Deep Attention fused Temporal Convolution Network for the prediction of monitoring indicators in the tunnel

Bowen, Du;Zhixin, Zhang;Junchen, Ye;Xuyan, Tan;Wentao, Li;Weizhong, Chen
- Smart Structures and Systems
- /
- v.30 no.6
- /
- pp.601-612
- /
- 2022
The prediction of structural mechanical behaviors is vital important to early perceive the abnormal conditions and avoid the occurrence of disasters. Especially for underground engineering, complex geological conditions make the structure more prone to disasters. Aiming at solving the problems existing in previous studies, such as incomplete consideration factors and can only predict the continuous performance, the deep attention fused temporal convolution network (DATCN) is proposed in this paper to predict the spatial mechanical behaviors of structure, which integrates both the temporal effect and spatial effect and realize the cross-time prediction. The temporal convolution network (TCN) and self-attention mechanism are employed to learn the temporal correlation of each monitoring point and the spatial correlation among different points, respectively. Then, the predicted result obtained from DATCN is compared with that obtained from some classical baselines, including SVR, LR, MLP, and RNNs. Also, the parameters involved in DATCN are discussed to optimize the prediction ability. The prediction result demonstrates that the proposed DATCN model outperforms the state-of-the-art baselines. The prediction accuracy of DATCN model after 24 hours reaches 90 percent. Also, the performance in last 14 hours plays a domain role to predict the short-term behaviors of the structure. As a study case, the proposed model is applied in an underwater shield tunnel to predict the stress variation of concrete segments in space.
https://doi.org/10.12989/sss.2022.30.6.601 인용 KSCI

Local Climate Mediates Spatial and Temporal Variation in Carabid Beetle Communities on Hyangnobong, Korea

Park, Yong Hwan;Jang, Tae Woong;Jeong, Jong Cheol;Chae, Hee Mun;Kim, Jong Kuk
- Journal of Forest and Environmental Science
- /
- v.33 no.3
- /
- pp.161-171
- /
- 2017
Global environmental changes have the capacity to make dramatic alterations to floral and faunal composition, and elucidation of the mechanism is important for predicting its outcomes. Studies on global climate change have traditionally focused on statistical summaries within relatively wide scales of spatial and temporal changes, and less attention has been paid to variability in microclimates across spatial and temporal scales. Microclimate is a suite of climatic conditions measured in local areas near the earth's surface. Environmental variables in microclimatic scale can be critical for the ecology of organisms inhabiting there. Here we examine the effect of spatial and temporal changes in microclimates on those of carabid beetle communities in Hyangnobong, Korea. We found that climatic variables and the patterns of annual changes in carabid beetle communities differed among sites even within the single mountain system. Our results indicate the importance of temporal survey of communities at local scales, which is expected to reveal an additional fraction of variation in communities and underlying processes that has been overlooked in studies of global community patterns and changes.
https://doi.org/10.7747/JFES.2017.33.3.161 인용 PDF KSCI

Search Result 40, Processing Time 0.02 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)