• Title/Summary/Keyword: the object-based attention

Search Result 219, Processing Time 0.022 seconds

Change Detection Using Deep Learning Based Semantic Segmentation for Nuclear Activity Detection and Monitoring (핵 활동 탐지 및 감시를 위한 딥러닝 기반 의미론적 분할을 활용한 변화 탐지)

  • Song, Ahram;Lee, Changhui;Lee, Jinmin;Han, Youkyung
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.6_1
    • /
    • pp.991-1005
    • /
    • 2022
  • Satellite imaging is an effective supplementary data source for detecting and verifying nuclear activity. It is also highly beneficial in regions with limited access and information, such as nuclear installations. Time series analysis, in particular, can identify the process of preparing for the conduction of a nuclear experiment, such as relocating equipment or changing facilities. Differences in the semantic segmentation findings of time series photos were employed in this work to detect changes in meaningful items connected to nuclear activity. Building, road, and small object datasets made of KOMPSAT 3/3A photos given by AIHub were used to train deep learning models such as U-Net, PSPNet, and Attention U-Net. To pick relevant models for targets, many model parameters were adjusted. The final change detection was carried out by including object information into the first change detection, which was obtained as the difference in semantic segmentation findings. The experiment findings demonstrated that the suggested approach could effectively identify altered pixels. Although the suggested approach is dependent on the accuracy of semantic segmentation findings, it is envisaged that as the dataset for the region of interest grows in the future, so will the relevant scope of the proposed method.

Fast, Accurate Vehicle Detection and Distance Estimation

  • Ma, QuanMeng;Jiang, Guang;Lai, DianZhi;cui, Hua;Song, Huansheng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.14 no.2
    • /
    • pp.610-630
    • /
    • 2020
  • A large number of people suffered from traffic accidents each year, so people pay more attention to traffic safety. However, the traditional methods use laser sensors to calculate the vehicle distance at a very high cost. In this paper, we propose a method based on deep learning to calculate the vehicle distance with a monocular camera. Our method is inexpensive and quite convenient to deploy on the mobile platforms. This paper makes two contributions. First, based on Light-Head RCNN, we propose a new vehicle detection framework called Light-Car Detection which can be used on the mobile platforms. Second, the planar homography of projective geometry is used to calculate the distance between the camera and the vehicles ahead. The results show that our detection system achieves 13FPS detection speed and 60.0% mAP on the Adreno 530 GPU of Samsung Galaxy S7, while only requires 7.1MB of storage space. Compared with the methods existed, the proposed method achieves a better performance.

Super-Resolution Reconstruction of Humidity Fields based on Wasserstein Generative Adversarial Network with Gradient Penalty

  • Tao Li;Liang Wang;Lina Wang;Rui Han
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.18 no.5
    • /
    • pp.1141-1162
    • /
    • 2024
  • Humidity is an important parameter in meteorology and is closely related to weather, human health, and the environment. Due to the limitations of the number of observation stations and other factors, humidity data are often not as good as expected, so high-resolution humidity fields are of great interest and have been the object of desire in the research field and industry. This study presents a novel super-resolution algorithm for humidity fields based on the Wasserstein generative adversarial network(WGAN) framework, with the objective of enhancing the resolution of low-resolution humidity field information. WGAN is a more stable generative adversarial networks(GANs) with Wasserstein metric, and to make the training more stable and simple, the gradient cropping is replaced with gradient penalty, and the network feature representation is improved by sub-pixel convolution, residual block combined with convolutional block attention module(CBAM) and other techniques. We evaluate the proposed algorithm using ERA5 relative humidity data with an hourly resolution of 0.25°×0.25°. Experimental results demonstrate that our approach outperforms not only conventional interpolation techniques, but also the super-resolution generative adversarial network(SRGAN) algorithm.

Design of the Mobile Electronic Voucher based on NFC (NFC 기반의 모바일 전자상품권 설계)

  • Lee, Seong Ho;Kim, KyungJun;No, Hyeo-Won;Ji, Yoo-Kang;Joung, Ki-Bong
    • Smart Media Journal
    • /
    • v.2 no.3
    • /
    • pp.34-38
    • /
    • 2013
  • Several costs including printing and management are added in commerce by vouchers. In this case that a voucher is digitalized and flown through mobile phones, these costs are diminished considerably. Also, as mobile commerce is on the increase according to the activation of smart phones, NFC technology with near distance communication is gave attention to. Therefore, researches for using NFC-based electronic vouchers to pay electronically on mobile are made. To analyze the flow path of electronic vouchers, which helps marketing strategy, this paper proposes the method to put useful data into them. The proposed method utilizes object memory model. The proposed electronic voucher includes the block for flow history data in addition to its information. In the future, we will implement and test this electronic voucher.

  • PDF

Outdoor Augmented Reality based 3D Model Visualization System of Cultural Heritage Sites (야외 증강현실 기반의 문화 유적지 3D 모델 시각화 시스템)

  • Han, Jong-Gil;Park, Kyoung-Wook;Ban, Kyeong-Jin;Kim, Eung-Kon
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.8 no.3
    • /
    • pp.459-464
    • /
    • 2013
  • Recently, at home and abroad cultural content industry has developed as the growing importance of history. Among them, the reconstruction contents area which combined with IT technology is attracting attention. Specially, using augmented reality technology, 3D visualization researches which restore contents of architectural heritage, cultural heritage sites, and artifacts have been performed in cultural content area. The existing cultural site restore contents are mostly made based on the images taken from indoor. In this paper, efficiently visualize the restore contents in indoor, but outdoors is limited. This theses presents the cultural heritage sites 3D model visualization system using augmented reality in outdoor. Proposed system augments 3D model to cultural heritage site in outdoor by using Smart Phone.

A Study on Reconstruction of Digital Space in Multi-layer Structure (다층적 구조에서 보여 지는 디지털 공간의 재구성에 관한 연구)

  • Chung, Kue-Hyung
    • Journal of Digital Convergence
    • /
    • v.12 no.12
    • /
    • pp.513-520
    • /
    • 2014
  • Since the beginning of history, men have done mimesis and produced illusion and succeeded art and culture instinctually. The subject which mention above included the object which can order and space around that. Perspective which began the Renaissance age was dominant way about understanding space in western history and it made modern visual system. Direction way of space which based perspective is changed as horizontal data included multi-layer structure in digital media age. This character make us possible to represent the space more efficiently. So we must have pay attention the direction way of space based on digital media, because it has meaning to show human value beyond a methodology of visual art culture.

Threat Situation Determination System Through AWS-Based Behavior and Object Recognition (AWS 기반 행위와 객체 인식을 통한 위협 상황 판단 시스템)

  • Ye-Young Kim;Su-Hyun Jeong;So-Hyun Park;Young-Ho Park
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.4
    • /
    • pp.189-198
    • /
    • 2023
  • As crimes frequently occur on the street, the spread of CCTV is increasing. However, due to the shortcomings of passively operated CCTV, the need for intelligent CCTV is attracting attention. Due to the heavy system of such intelligent CCTV, high-performance devices are required, which has a problem in that it is expensive to replace the general CCTV. To solve this problem, an intelligent CCTV system that recognizes low-quality images and operates even on devices with low performance is required. Therefore, this paper proposes a Saying CCTV system that can detect threats in real time by using the AWS cloud platform to lighten the system and convert images into text. Based on the data extracted using YOLO v4 and OpenPose, it is implemented to determine the risk object, threat behavior, and threat situation, and calculate the risk using machine learning. Through this, the system can be operated anytime and anywhere as long as the network is connected, and the system can be used even with devices with minimal performance for video shooting and image upload. Furthermore, it is possible to quickly prevent crime by automating meaningful statistics on crime by analyzing the video and using the data stored as text.

Design and Implementation of Mobile Vision-based Augmented Galaga using Real Objects (실제 물체를 이용한 모바일 비전 기술 기반의 실감형 갤러그의 설계 및 구현)

  • Park, An-Jin;Yang, Jong-Yeol;Jung, Kee-Chul
    • Journal of Korea Game Society
    • /
    • v.8 no.2
    • /
    • pp.85-96
    • /
    • 2008
  • Recently, research on augmented games as a new game genre has attracted a lot of attention. An augmented game overlaps virtual objects in an augmented reality(AR) environment, allowing game players to interact with the AR environment through manipulating real and virtual objects. However, it is difficult to release existing augmented games to ordinary game players, as the games generally use very expensive and inconvenient 'backpack' systems: To solve this problem, several augmented games have been proposed using mobile devices equipped with cameras, but it can be only enjoyed at a previously-installed location, as a ‘color marker' or 'pattern marker’ is used to overlap the virtual object with the real environment. Accordingly, this paper introduces an augmented game, called augmented galaga based on traditional well-known galaga, executed on mobile devices to make game players experience the game without any economic burdens. Augmented galaga uses real object in real environments, and uses scale-invariant features(SIFT), and Euclidean distance to recognize the real objects. The virtural aliens are randomly appeared around the specific objects, several specific objects are used to improve the interest aspect, andgame players attack the virtual aliens by moving the mobile devices towards specific objects and clicking a button of mobile devices. As a result, we expect that augmented galaga provides an exciting experience without any economic burdens for players based on the game paradigm, where the user interacts with both the physical world captured by a mobile camera and the virtual aliens automatically generated by a mobile devices.

  • PDF

A Study on the i-YOLOX Architecture for Multiple Object Detection and Classification of Household Waste (생활 폐기물 다중 객체 검출과 분류를 위한 i-YOLOX 구조에 관한 연구)

  • Weiguang Wang;Kyung Kwon Jung;Taewon Lee
    • Convergence Security Journal
    • /
    • v.23 no.5
    • /
    • pp.135-142
    • /
    • 2023
  • In addressing the prominent issues of climate change, resource scarcity, and environmental pollution associated with household waste, extensive research has been conducted on intelligent waste classification methods. These efforts range from traditional classification algorithms to machine learning and neural networks. However, challenges persist in effectively classifying waste in diverse environments and conditions due to insufficient datasets, increased complexity in neural network architectures, and performance limitations for real-world applications. Therefore, this paper proposes i-YOLOX as a solution for rapid classification and improved accuracy. The proposed model is evaluated based on network parameters, detection speed, and accuracy. To achieve this, a dataset comprising 10,000 samples of household waste, spanning 17 waste categories, is created. The i-YOLOX architecture is constructed by introducing the Involution channel convolution operator and the Convolution Branch Attention Module (CBAM) into the YOLOX structure. A comparative analysis is conducted with the performance of the existing YOLO architecture. Experimental results demonstrate that i-YOLOX enhances the detection speed and accuracy of waste objects in complex scenes compared to conventional neural networks. This confirms the effectiveness of the proposed i-YOLOX architecture in the detection and classification of multiple household waste objects.

A Critical Review on C. Norberg Schulz's Theory of the 'Placeness' - Centering around Heidegger's Thought of "Openness" - (노베르그-슐츠(C. Norberg-Schulz)의 '장소성' 이론에 대한 비판적 고찰 - 하이데거(Martin Heidegger)의 "개방성(Openness)"과 "틈새내기(Rift-design)" 사유를 근거로 -)

  • Lee, Seung-Heon;Lee, Dong-Eon
    • Journal of architectural history
    • /
    • v.12 no.3
    • /
    • pp.149-162
    • /
    • 2003
  • Schulz accepted the existentialist view based on Heidegger's thought and at the same time the objectivist view making fixed this living world, evoking controversies for discussion. He could not see various presentations of the meaning of place because he perceived elements of this world individually. Thus Schulz's mixed system of understanding is sternly different from Heidegger's thought. First, Heidegger suggests that place as existential space represents the occasion revelation of incidents in Dasein. While Schulz recognizes that place is a systematic space predetermined for Dasein. Second, Heidegger interprets the placeness as creative openness in which elements comprising this world face and interact with each other into one. In contrast, Schulz defines each of the elements through signification and regards it as invariable and static. Third, Heidegger perceives that the placeness is expressed with sustainable, complex images through "rift-design" which seeks dynamic interactions between the ground and the world. While Schulz attempts to take "Genius Loci" or "habituated scene" through "gathering" as a concept he regards static and then visualize such structural two factors, producing certain internal images of place. However, limits of Schulz's theory prevent us from exerting complete imagination and discovering the inner creative world of the object. Thus the ultimate goal of paying attention to the placeness, that is, the recovery of individual identity, fails due to the prevalence and abstraction of objectified thinking. In contrast, Heidegger's thought about "openness" is a useful means of realizing the placeness. Openness may be referred to a dynamic coordination in which the earth and the world sustain each other under incessant mutual tensions, but not sticking o each other. "Rift-design" is an openness strategy to cause tense relations by preventing structuralization intentively. This is a creative design that allows seeing original seams of the object.

  • PDF