• 제목/요약/키워드: Spatial attention mechanism

검색결과 40건 처리시간 0.025초

시각적 선택에 대한 신경 망 모형FeatureGate 모형의 하향식 기제 (A Neural Network Model for Visual Selection: Top-down mechanism of Feature Gate model)

  • 김민식
    • 인지과학
    • /
    • 제10권3호
    • /
    • pp.1.2-1.2
    • /
    • 1999
  • 시각적 선택에 대한 과거 정신물리학적, 신경 생리학적 연구결과를 토대로 Feature Gate 라는 신경 망 모형을 제안하였다. 이 모형에는 공간 배치도가 위계 적으로 구성되어 있으며, 정보의 흐름이 위계의 각 수준으로부터 그 다음 수준으로 넘어갈 때 주의 게이트에 의해 조절되도록 되어 있다. 주의 게이트들은 독특한 세부 특징을 가진 위치에 반응하는 상향식 시스템과 표적 세부 특징이 있는 위치에 반응하는 하향식 기제 모두에 의해 조절된다. 본 연구는 Feature Gate 모형의 하향식 기제에 초점을 맞추어 모형을 설명하고, 현재 다른 모형들이 설명하지 못하는 Moran & Desimone(1985)의 연구결과를 이 모형이 어떻게 설명하는지를 제시하고자 한다. Feature Gate 모형은 병렬 적인 세부특징 검색, 계열 적 접합표적 검색, 단서에 의한 주의의 점진적 감소 모형, 세부특징-주도적인 공간적 선택, 주의의 분할, 방해자극 위치의 억제, 주변 억제 등을 포함한 시각적 주의 연구의 여러 가지 많은 현상들을 설명하는데 하나의 일관적인 해석을 제공해 준다. 앞으로 이 모형을 더욱 확장, 발전 시켜 세부특징의 조합된 배열에 반응하는 상위 수준의 유닛을 사용한다면 시각적 선택과정이 포함된 형태 재인 모형으로 개발될 수 있다.

인간의 주의시각에 기반한 시각정보 선택 방법 (Visual Information Selection Mechanism Based on Human Visual Attention)

  • 최경주;박민철
    • 한국멀티미디어학회논문지
    • /
    • 제14권3호
    • /
    • pp.378-391
    • /
    • 2011
  • 본 논문에서는 입력장치로 들어오는 수많은 시각정보 중 현 시점에서 가장 유용하다고 생각되는 정보를 인간의 상향식 주의시각에 기반하여 선택하는 시각정보 선택기법에 대해 소개한다. 제안하는 시스템은 색상, 명도, 방위, 형태 등 저수준의 공간특징 외에 시간특징으로서 움직임 정보와 3차원 정보인 깊이 정보를 추가적으로 사용함으로써 기존방법에 비해 정보 선택의 정확도를 높혔다. 움직임 정보 추출 시 발생할 수 있는 노이즈를 제거하기 위해 인간의 움직임 인지에 대한 연구결과를 이용하는 새로운 접근법을 사용하였으며, 입력 영상 내 객체들이 부분적으로 겹쳐있다거나 동일한 현저도를 가지고 있을 때에도 현저한 영역을 제대로 선택해낼 수 있도록 깊이 정보를 사용하여 유의미한 영역을 선별하고 우선순위를 부여하였다. 실험결과를 통해 제안하는 방법이 기존의 방법에 비해 높은 정확도를 가짐을 확인할 수 있었다.

Depth Images-based Human Detection, Tracking and Activity Recognition Using Spatiotemporal Features and Modified HMM

  • Kamal, Shaharyar;Jalal, Ahmad;Kim, Daijin
    • Journal of Electrical Engineering and Technology
    • /
    • 제11권6호
    • /
    • pp.1857-1862
    • /
    • 2016
  • Human activity recognition using depth information is an emerging and challenging technology in computer vision due to its considerable attention by many practical applications such as smart home/office system, personal health care and 3D video games. This paper presents a novel framework of 3D human body detection, tracking and recognition from depth video sequences using spatiotemporal features and modified HMM. To detect human silhouette, raw depth data is examined to extract human silhouette by considering spatial continuity and constraints of human motion information. While, frame differentiation is used to track human movements. Features extraction mechanism consists of spatial depth shape features and temporal joints features are used to improve classification performance. Both of these features are fused together to recognize different activities using the modified hidden Markov model (M-HMM). The proposed approach is evaluated on two challenging depth video datasets. Moreover, our system has significant abilities to handle subject's body parts rotation and body parts missing which provide major contributions in human activity recognition.

라이프스타일에 따른 디자인 호텔 로비 디자인의 연상 기제에 관한 연구 (A Study on Association Mechanism of Lobby Design in Design Hotels according to Lifestyles)

  • 윤현주;류호창
    • 한국실내디자인학회논문집
    • /
    • 제25권6호
    • /
    • pp.116-126
    • /
    • 2016
  • In modern society which changes from quantity-seeking society to value-seeking one, people's various lifestyles have great effect on consumption patterns and work as an important factor in choosing hotels. The fact that design hotels, which provide unique experiences with differentiated and sensitive designs by reflecting various lifestyles, recently attract attention can be understood in the same context. As a matter of fact, design hotels recently serve as destinations as they become cultural and artistic icons which reflect customer lifestyles. Especially, the designs of lobby spaces in hotels play deciding role in customers' choices while representing the nature of hotels. In this respect, under the premise that the kinds of accumulated experiences are different depending on lifestyles and preferences for specific interior spaces are influenced by association mechanism formed by experiences, this study analyzed lobby spaces of design hotels which focus on specific lifestyles from the perspective of association mechanism based on experiences. As the method of analysis, this study classified the types of lifestyles and conducted case analysis to investigate what association mechanism works to enhance the preference of design hotels by types. Study classified lifestyles into experiential activity type, social meeting type, fashion-pursuing type and hideout-preferring type and analyzed cases of lobby designs in design hotels. The results of this case analysis are as follows; First, experiential activity type mainly utilized quasi-association and approach association through senses and social meeting type utilized quasi-association and memory association through emotions while fashion-pursuing type utilized quasi-association and presumption association through intuition and hideout-preferring type utilized quasi-association and approach association through thoughts. Second, it was found that most lobby designs are characterized by association mechanism in visual formative nature and that in temporal spatial nature working in complex way, and, through such process of association expansion, space stories are created. Stories of spaces created this way become unique identities of design hotels that provide new experiences for customers.

생리적 신호를 이용한 통증 인식을 위한 딥 러닝 네트워크 (Deep Learning Network Approach for Pain Recognition Using Physiological Signals)

  • ;이귀상;양형정;김수형
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2021년도 추계학술발표대회
    • /
    • pp.1001-1004
    • /
    • 2021
  • Pain is an unpleasant experience for the patient. The recognition and assessment of pain help tailor the treatment to the patient, and they are also challenging in the medical. In this paper, we propose an approach for pain recognition through a deep neural network applied to pre-processed physiological. The proposed approach applies the idea of shortcut connections to concatenate the spatial information of a convolutional neural network and the temporal information of a recurrent neural network. In addition, our proposed approach applies the attention mechanism and achieves competitive performance on the BioVid Heat Pain dataset.

시선추적을 이용한 카페 공간 마감재 차이의 시각주의력 특성 (Characteristics of Visual Attention for the Different Type of Material Finishing in Cafe Space Using by Eye-tracking)

  • 최진경;김주연
    • 한국실내디자인학회논문집
    • /
    • 제27권2호
    • /
    • pp.3-11
    • /
    • 2018
  • This study aims to investigate whether there is intensionally changing eye - gaze on the cafe space images with floor finishing materials. In the Yarbus' experiment, he argued that changing information that an observer is asked to obtain from an image changes pattern of eye movements. Based on the scan path evidence, this research have questions as (1) the difference of visual attention on finishing floor material stimulus, (2) visual attention of initial activity time and type of movement paths on AOIs, and (3) visual relation floor area with another AOIs. Eye movements were recorded with the SMI REDn Scientific, which sampled eye position at 30Hz and lasted 2 minutes(120s). Although viewing was binocular, only the right eye was tracked. Of the 66 observers(mean age 22 years, standard deviation: ${\pm}1.82$) who participated in the experiment done by the four point calibration and validation procedures at the beginning tasks. Analyzing qualitative data from the number of fixation and duration on AOIs divided into four parts (AOI I-Floor, AOI II-Wall, AOI III-Ceiling, and AOI IV-Counter) in the stimulus. The results from this experiment analyzed as follows. First, it was significant in the difference of the average number of AOIs fixation times observed for the spatial image using the wood tile flooring material and the polishing tile. The wood tile flooring of stimulus had higher fixation number on AOI-II, AOI-III, and AOI-IV than the polishing tile. On seeing AOI-I was higher attention in the polishing tile stimulus. Second, the observers examined AOI-II intensively in both stimuli. However, the visual intensity was also followed by on the AOI-IV and AOI-I in the wood tile flooring stimulus, and on AOI-I, AO-IV in the polishing tile. Third, visual attention data on each AOIs have divided into the time range of "5 sec" for both images. In the wood tile stimulus, the horizontal movement path followed by AOI-II, AOI-IV, and AOI-II. In the polished tile stimulus, the movement path followed by moving vertically to AOI-II, AOI-I, and AOI-II. This study approached meaningfully and found out the characteristics of visual attention, according to the different intentions of visual attention, the relationship pathways of visual mechanism appeared and also activated by eye-tracking experiments.

A Tuberculosis Detection Method Using Attention and Sparse R-CNN

  • Xu, Xuebin;Zhang, Jiada;Cheng, Xiaorui;Lu, Longbin;Zhao, Yuqing;Xu, Zongyu;Gu, Zhuangzhuang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권7호
    • /
    • pp.2131-2153
    • /
    • 2022
  • To achieve accurate detection of tuberculosis (TB) areas in chest radiographs, we design a chest X-ray TB area detection algorithm. The algorithm consists of two stages: the chest X-ray TB classification network (CXTCNet) and the chest X-ray TB area detection network (CXTDNet). CXTCNet is used to judge the presence or absence of TB areas in chest X-ray images, thereby excluding the influence of other lung diseases on the detection of TB areas. It can reduce false positives in the detection network and improve the accuracy of detection results. In CXTCNet, we propose a channel attention mechanism (CAM) module and combine it with DenseNet. This module enables the network to learn more spatial and channel features information about chest X-ray images, thereby improving network performance. CXTDNet is a design based on a sparse object detection algorithm (Sparse R-CNN). A group of fixed learnable proposal boxes and learnable proposal features are using for classification and location. The predictions of the algorithm are output directly without non-maximal suppression post-processing. Furthermore, we use CLAHE to reduce image noise and improve image quality for data preprocessing. Experiments on dataset TBX11K show that the accuracy of the proposed CXTCNet is up to 99.10%, which is better than most current TB classification algorithms. Finally, our proposed chest X-ray TB detection algorithm could achieve AP of 45.35% and AP50 of 74.20%. We also establish a chest X-ray TB dataset with 304 sheets. And experiments on this dataset showed that the accuracy of the diagnosis was comparable to that of radiologists. We hope that our proposed algorithm and established dataset will advance the field of TB detection.

내현적 주의와 재정향이 탐지과제 수행에 미치는 영향 (The effects of endogenous attention and reorienting on performance of detection task)

  • 고재형;김신우;이형철
    • 감성과학
    • /
    • 제15권1호
    • /
    • pp.37-46
    • /
    • 2012
  • 내현적 주의와 재정향이 탐지과제 수행에 미치는 영향을 반응촉진과 회귀억제를 통해 탐색하였다. Posner와 Cohen(1980)의 고전적인 탐지패러다임에서는 단서를 제시하고 일정한 자극제시시차 후 그 단서와 같거나 또는 다른 위치에 나타나는 표적에 대한 탐지수행을 관찰한다. 본 연구에서는 내현적 주의를 유발하기 위해 단서가 표적에 대해 예측력을 가지도록 조작하였고, 단서자극과 표적 사이에 새로운 단서(재정향단서)를 삽입하여 재정향을 유도하였다. 실험 1에서는 재정향단서가 제시되는 시기를 초기, 중기, 후기로 구분하여 실험을 실시하였다. 그 결과 재정향이 제시되는 시기별로 자극제시시차(150ms, 400ms, 850ms)에 따라 다른 양상의 반응촉진 및 회귀억제가 발생하였으나, 해석 가능한 일정 패턴을 확인하기는 어려웠다. 하지만 재정향이 초기에 발생한 실험조건을 재분석한 결과, 반응촉진과 회귀억제가 자극제시시차에 따라 교차하여 나타나는 단순탐지과제의 전형적인 결과를 얻을 수 있었다. 실험 2에서는 실험 1에서 재정향이 초기에 발생하는 조건에 대한 추가 실험을 실시하였다. 실험 결과, 자극제시시차가 짧을 때는 반응촉진이 발생하였으며 자극제시시차가 길때는 회귀억제가 발생하였다. 이 결과는 단서자극이 표적에 대한 예측력을 가질 때 자극제시시차가 긴 조건에서 회귀억제가 사라진다는 기존의 보고(Wright & Richard, 2000)와 반대되는 결과이다. 이 결과는 최초 단서가 제시된 후 회귀억제의 효과가 소멸되기 전에 매우 빠르게 제시되는 재정향단서는 이중 회귀억제를 가져온다는 것을 제안한다. 본 연구는 내현적 주의에 의해 특정한 위치에 주의를 할당할 때에도 반복적으로 빠르게 제시되는 단서자극은 회귀억제를 극대화함으로써 내현적 주의를 상쇄할 수 있음을 시사한다.

  • PDF

Grouting diffusion mechanism in an oblique crack in rock masses considering temporal and spatial variation of viscosity of fast-curing grouts

  • Huang, Shuling;Pei, Qitao;Ding, Xiuli;Zhang, Yuting;Liu, Dengxue;He, Jun;Bian, Kang
    • Geomechanics and Engineering
    • /
    • 제23권2호
    • /
    • pp.151-163
    • /
    • 2020
  • Grouting method is an effective way of reinforcing cracked rock masses and plugging water gushing. Current grouting diffusion models are generally developed for horizontal cracks, which is contradictory to the fact that the crack generally occurs in rock masses with irregular spatial distribution characteristics in real underground environments. To solve this problem, this study selected a cement-sodium silicate slurry (C-S slurry) generally used in engineering as a fast-curing grouting material and regarded the C-S slurry as a Bingham fluid with time-varying viscosity for analysis. Based on the theory of fluid mechanics, and by simultaneously considering the deadweight of slurry and characteristics of non-uniform spatial distribution of viscosity of fast-curing grouts, a theoretical model of slurry diffusion in an oblique crack in rock masses at constant grouting rate was established. Moreover, the viscosity and pressure distribution equations in the slurry diffusion zone were deduced, thus quantifying the relationship between grouting pressure, grouting time, and slurry diffusion distance. On this basis, by using a 3-d finite element program in multi-field coupled software Comsol, the numerical simulation results were compared with theoretical calculation values, further verifying the effectiveness of the theoretical model. In addition, through the analysis of two engineering case studies, the theoretical calculations and measured slurry diffusion radius were compared, to evaluate the application effects of the model in engineering practice. Finally, by using the established theoretical model, the influence of cracking in rock masses on the diffusion characteristics of slurry was analysed. The results demonstrate that the inclination angle of the crack in rock masses and azimuth angle of slurry diffusion affect slurry diffusion characteristics. More attention should be paid to the actual grouting process. The results can provide references for determining grouting parameters of fast-curing grouts in engineering practice.

DeepPTP: A Deep Pedestrian Trajectory Prediction Model for Traffic Intersection

  • Lv, Zhiqiang;Li, Jianbo;Dong, Chuanhao;Wang, Yue;Li, Haoran;Xu, Zhihao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제15권7호
    • /
    • pp.2321-2338
    • /
    • 2021
  • Compared with vehicle trajectories, pedestrian trajectories have stronger degrees of freedom and complexity, which poses a higher challenge to trajectory prediction tasks. This paper designs a mode to divide the trajectory of pedestrians at a traffic intersection, which converts the trajectory regression problem into a trajectory classification problem. This paper builds a deep model for pedestrian trajectory prediction at intersections for the task of pedestrian short-term trajectory prediction. The model calculates the spatial correlation and temporal dependence of the trajectory. More importantly, it captures the interactive features among pedestrians through the Attention mechanism. In order to improve the training speed, the model is composed of pure convolutional networks. This design overcomes the single-step calculation mode of the traditional recurrent neural network. The experiment uses Vulnerable Road Users trajectory dataset for related modeling and evaluation work. Compared with the existing models of pedestrian trajectory prediction, the model proposed in this paper has advantages in terms of evaluation indicators, training speed and the number of model parameters.