• Title/Summary/Keyword: Multi-Vision

Search Result 491, Processing Time 0.031 seconds

A Multi-Stage Convolution Machine with Scaling and Dilation for Human Pose Estimation

  • Nie, Yali;Lee, Jaehwan;Yoon, Sook;Park, Dong Sun
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.6
    • /
    • pp.3182-3198
    • /
    • 2019
  • Vision-based Human Pose Estimation has been considered as one of challenging research subjects due to problems including confounding background clutter, diversity of human appearances and illumination changes in scenes. To tackle these problems, we propose to use a new multi-stage convolution machine for estimating human pose. To provide better heatmap prediction of body joints, the proposed machine repeatedly produces multiple predictions according to stages with receptive field large enough for learning the long-range spatial relationship. And stages are composed of various modules according to their strategic purposes. Pyramid stacking module and dilation module are used to handle problem of human pose at multiple scales. Their multi-scale information from different receptive fields are fused with concatenation, which can catch more contextual information from different features. And spatial and channel information of a given input are converted to gating factors by squeezing the feature maps to a single numeric value based on its importance in order to give each of the network channels different weights. Compared with other ConvNet-based architectures, we demonstrated that our proposed architecture achieved higher accuracy on experiments using standard benchmarks of LSP and MPII pose datasets.

Deep Multi-task Network for Simultaneous Hazy Image Semantic Segmentation and Dehazing (안개영상의 의미론적 분할 및 안개제거를 위한 심층 멀티태스크 네트워크)

  • Song, Taeyong;Jang, Hyunsung;Ha, Namkoo;Yeon, Yoonmo;Kwon, Kuyong;Sohn, Kwanghoon
    • Journal of Korea Multimedia Society
    • /
    • v.22 no.9
    • /
    • pp.1000-1010
    • /
    • 2019
  • Image semantic segmentation and dehazing are key tasks in the computer vision. In recent years, researches in both tasks have achieved substantial improvements in performance with the development of Convolutional Neural Network (CNN). However, most of the previous works for semantic segmentation assume the images are captured in clear weather and show degraded performance under hazy images with low contrast and faded color. Meanwhile, dehazing aims to recover clear image given observed hazy image, which is an ill-posed problem and can be alleviated with additional information about the image. In this work, we propose a deep multi-task network for simultaneous semantic segmentation and dehazing. The proposed network takes single haze image as input and predicts dense semantic segmentation map and clear image. The visual information getting refined during the dehazing process can help the recognition task of semantic segmentation. On the other hand, semantic features obtained during the semantic segmentation process can provide cues for color priors for objects, which can help dehazing process. Experimental results demonstrate the effectiveness of the proposed multi-task approach, showing improved performance compared to the separate networks.

Automatic assessment of post-earthquake buildings based on multi-task deep learning with auxiliary tasks

  • Zhihang Li;Huamei Zhu;Mengqi Huang;Pengxuan Ji;Hongyu Huang;Qianbing Zhang
    • Smart Structures and Systems
    • /
    • v.31 no.4
    • /
    • pp.383-392
    • /
    • 2023
  • Post-earthquake building condition assessment is crucial for subsequent rescue and remediation and can be automated by emerging computer vision and deep learning technologies. This study is based on an endeavour for the 2nd International Competition of Structural Health Monitoring (IC-SHM 2021). The task package includes five image segmentation objectives - defects (crack/spall/rebar exposure), structural component, and damage state. The structural component and damage state tasks are identified as the priority that can form actionable decisions. A multi-task Convolutional Neural Network (CNN) is proposed to conduct the two major tasks simultaneously. The rest 3 sub-tasks (spall/crack/rebar exposure) were incorporated as auxiliary tasks. By synchronously learning defect information (spall/crack/rebar exposure), the multi-task CNN model outperforms the counterpart single-task models in recognizing structural components and estimating damage states. Particularly, the pixel-level damage state estimation witnesses a mIoU (mean intersection over union) improvement from 0.5855 to 0.6374. For the defect detection tasks, rebar exposure is omitted due to the extremely biased sample distribution. The segmentations of crack and spall are automated by single-task U-Net but with extra efforts to resample the provided data. The segmentation of small objects (spall and crack) benefits from the resampling method, with a substantial IoU increment of nearly 10%.

Employing SNMP to Manage Ubiquitous Environments

  • Murtaza Syed Shariyar;Amin Syed Obaid;Hong Choong Seon;Choi Sang Hyun
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2005.11a
    • /
    • pp.196-198
    • /
    • 2005
  • The vision of ubiquitous computing is becoming a reality now .Service discovery in ubiquitous environment, as well as adding semantics to the data is quite established. But, not many researchers have thought about the management of these devices. We envisage that by integrating SNMP with service discovery protocols, we could leverage the use of management factors (like performance, fault and security management etc) in the business, organizations, and other multi-user environments.

  • PDF

Development of Monitering system for Layers Rearing in Multi-tier Layers Battery by machine Vision (I) - Development of Image Processing Algorithm for Finding The Sick or The Dead Layers - (기계시각을 이용한 고단 직립식 산란계 케이지의 감시시스템 개발 (I) - 병${\cdot}$폐사계 판정알고리즘 개발 -)

  • Im, Song-Su;Jang, Dong-Il;Jeong, Ssang-Yang;Lee, Seung-Ju
    • Proceedings of the Korean Society for Agricultural Machinery Conference
    • /
    • v.10 no.2
    • /
    • pp.273-279
    • /
    • 2005
  • PDF

Development of a multi-robot control system with sensor integrating capability (센서 통합 능력을 갖는 다중 로보트 제어 시스템의 개발)

  • 서일홍;현웅근;김태원;여희주;김재욱;윤승중
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1992.10a
    • /
    • pp.1008-1013
    • /
    • 1992
  • 본 논문에서는 다중 로보느의 협조제어(Coordinated Control)를 위한 로보트 콘트롤러의 설계에 대해서 연구한다. 첫 부분에서는 다중 로보느의 연구배경 및 연구동기에 대해서 논의하고 이어서 Coordinated Task를 묘사하기 위한 Programming Primiitive Set을 정의하며 구현에 대해서도 논의한다. 특히 Motopn Primitive는 synchronous(Coordinated Motion), Asynchronous Motion, Conditional Motion, 특수 Motion으로 분류하고, 각각의 궤적계획 및 구현에 대해서도 간단히 논의한다. 특히 본 논문에서는 외부의 변화하는 환경에 효과적으로 적응할 수 있게 하기 위하여 Vision센서, Encoder신호와 Limit센서, Force센서 등의 다양한 외부 센서를 융합 처리할수 있는 다중 로보트 제어 시스템을 개발하였다.

  • PDF

Development of multi-robot control system (다중 로보트 제어시스템의 개발)

  • 서일홍;현웅근;김태원;여희주;허우정;이경호;양승원;임준홍;오상록
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1990.10a
    • /
    • pp.497-501
    • /
    • 1990
  • 본 논문에서는 다중로보트의 제어를 위하여 비젼 센서 등 여러 가지 센서신호를 처리할 수 있고 로봇의 충돌회피 및 협조제어를 할 수 있는 제어시스템을 개발하였다. 본 제어장치는 시스템 전체를 관리하며 언어 및 로보트 동작의 교시 그리고 자기진단 등의 기능을 하는 Supervisory Processor, vision에 대한 정보를 담당하는 로보트 제어 processor 등 여러개의 프로세서로 나누어 분산처리 구조를 갖도록 하여 확장성 및 유연성이 높은 시스템이 되도록 하였다. 실험적으로 본 시스템을 이용하여 로보트로 하여금 puzzle을 맞추는 작업을 수행시킴으로써 본 시스템의 우수성을 입증하였다.

  • PDF

Objective Image Quality Metric for Block-Based DCT Image Coder Using Structural Distortion Measurement (구조적 왜곡특성 측정을 이용한 블록기반 DCT 영상 부호화기의 객관적 화질평가)

  • Chung Tae-Yun
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.52 no.7
    • /
    • pp.434-441
    • /
    • 2003
  • This paper proposes a new quantitative and objective image quality metric which is essential to verify the performance of block-based DCT image coding. The proposed metric considers not only global distortion of coded image such as spatial frequency sensitivity and channel masking using HVS based multi-channel model, but also structural distortions caused block-based coding. The experimental results show a strong correlation between proposed metric and subjective metric.

Objective Image Quality Metric for Block-Based DCT Image Coder-using Structural Distortion Measurement (구조적 왜곡특성 측정을 이용한 블록기반 DCT 영상 부호화기의 객관적 화질평가)

  • Jeong, Tae Yun
    • The Transactions of the Korean Institute of Electrical Engineers C
    • /
    • v.52 no.7
    • /
    • pp.434-434
    • /
    • 2003
  • This paper proposes a new quantitative and objective image quality metric which is essential to verify the performance of block-based DCT image coding The proposed metric considers not only global distortion of coded image such as spatial frequency sensitivity and channel masking using HVS based multi-channel model, but also structural distortions caused block-based coding. The experimental results show a strong correlation between propose(B metric and subjective metric.