Search | Korea Science

Object Detection and 3D Position Estimation based on Stereo Vision (스테레오 영상 기반의 객체 탐지 및 객체의 3차원 위치 추정)

Son, Haengseon;Lee, Seonyoung;Min, Kyoungwon;Seo, Seongjin
- The Journal of Korea Institute of Information, Electronics, and Communication Technology
- /
- v.10 no.4
- /
- pp.318-324
- /
- 2017
We introduced a stereo camera on the aircraft to detect flight objects and to estimate the 3D position of them. The Saliency map algorithm based on PCT was proposed to detect a small object between clouds, and then we processed a stereo matching algorithm to find out the disparity between the left and right camera. In order to extract accurate disparity, cost aggregation region was used as a variable region to adapt to detection object. In this paper, we use the detection result as the cost aggregation region. In order to extract more precise disparity, sub-pixel interpolation is used to extract float type-disparity at sub-pixel level. We also proposed a method to estimate the spatial position of an object by using camera parameters. It is expected that it can be applied to image - based object detection and collision avoidance system of autonomous aircraft in the future.
https://doi.org/10.17661/jkiiect.2017.10.4.318 인용 PDF KSCI

Preprocessing Technique for Improving Action Recognition Performance in ERP Video with Multiple Objects (다중 객체가 존재하는 ERP 영상에서 행동 인식 모델 성능 향상을 위한 전처리 기법)

Park, Eun-Soo;Kim, Seunghwan;Ryu, Eun-Seok
- Journal of Broadcast Engineering
- /
- v.25 no.3
- /
- pp.374-385
- /
- 2020
In this paper, we propose a preprocessing technique to solve the problems of action recognition with Equirectangular Projection (ERP) video. The preprocessing technique proposed in this paper assumes the person object as the subject of action, that is, the Object of Interest (OOI), and the surrounding area of the OOI as the ROI. The preprocessing technique consists of three modules. I) Recognize person object in the image with object recognition model. II) Create a saliency map from the input image. III) Select subject of action using recognized person object and saliency map. The subject boundary box of the selected action is input to the action recognition model in order to improve the action recognition performance. When comparing the performance of the proposed preprocessing method to the action recognition model and the performance of the original ERP image input method, the performance is improved up to 99.6%, and the action is obtained when only the OOI is detected. It can also see the effects of related video summaries.
https://doi.org/10.5909/JBE.2020.25.3.374 인용 PDF KSCI KPUBS

Saliency Detection using Mutual Information of Wavelet Subbands (웨이블릿 부밴드의 상호 정보량을 이용한 세일리언시 검출)

Moon, Sang Whan;Lee, Ho Sang;Moon, Yong Ho;Eom, Il Kyu
- Journal of the Institute of Electronics and Information Engineers
- /
- v.54 no.6
- /
- pp.72-79
- /
- 2017
In this paper, we present a new saliency detection algorithm using the mutual information of wavelet subbands. Our method constructs an intermediate saliency map using the power operation and Gaussian blurring for high-frequency wavelet coefficients. After combining three intermediate saliency maps according to the direction of wavelet subband, we find the main directional components using entropy measure. The amount of mutual information of each subband is obtained centering on the subband having the minimum entropy The final saliency map is detected using Minkowski sum based on weights calculated by the mutual information. As a result of the experiment on CAT2000 and ECSSD databases, our method showed good detection results in terms of ROC and AUC with few computation times compared with the conventional methods.
https://doi.org/10.5573/ieie.2017.54.6.72 인용 PDF KSCI

Implementing a Depth Map Generation Algorithm by Convolutional Neural Network (깊이맵 생성 알고리즘의 합성곱 신경망 구현)

Lee, Seungsoo;Kim, Hong Jin;Kim, Manbae
- Journal of Broadcast Engineering
- /
- v.23 no.1
- /
- pp.3-10
- /
- 2018
Depth map has been utilized in a varity of fields. Recently research on generating depth map by artificial neural network (ANN) has gained much interest. This paper validates the feasibility of implementing the ready-made depth map generation by convolutional neural network (CNN). First, for a given image, a depth map is generated by the weighted average of a saliency map as well as a motion history image. Then CNN network is trained by test images and depth maps. The objective and subjective experiments are performed on the CNN and showed that the CNN can replace the ready-made depth generation method.
https://doi.org/10.5909/JBE.2018.23.1.3 인용 PDF KSCI KPUBS

Bottle Label Segmentation Based on Multiple Gradient Information

Chen, Yanjuan;Park, Sang-Cheol;Na, In-Seop;Kim, Soo-Hyung;Lee, Myung-Eun
- International Journal of Contents
- /
- v.7 no.4
- /
- pp.24-29
- /
- 2011
In this paper, we propose a method to segment the bottle label in images taken by mobile phones using multi-gradient approaches. In order to segment the label region of interest-object, the saliency map method and Hough Transformation method are first applied to the original images to obtain the candidate region. The saliency map is used to detect the most salient area based on three kinds of features (color, orientation and illumination features). The Hough Transformation is a technique to isolated features of a particular shape within an image. Therefore, we utilize it to find the left and right border of the bottle. Next, we segment the label based on the gradient information obtained from the structure tensor method and edge method. The experimental results have shown that the proposed method is able to accurately segment the labels as the first step of product label recognition system.
https://doi.org/10.5392/IJoC.2011.7.4.024 인용 PDF KSCI

Multi-view Image Generation using Grid-mesh based Image Domain Warping and Occlusion Region Information (차폐영역 정보와 그리드 메쉬 기반의 영상 워핑을 이용한 다시점 영상 생성)

Lim, Jong-Myeong;Um, Gi-Mun;Shin, Hong-Chang;Lee, Gwangsoon;Hur, Namho;Yoo, Jisang
- Journal of Broadcast Engineering
- /
- v.18 no.6
- /
- pp.859-871
- /
- 2013
In this paper, we propose an algorithm that generates multi-view images by grid-mesh based image domain warping using occlusion mask and various image features obtained from the stereoscopic images. In the proposed algorithm, we first extract image saliency map, line segments and disparity saliency map from stereo images and then get them through a process that improves the quality of extracted features. This process is accomplished in two steps. In the first step, reliability of disparity saliency map on object boundary regions is enhanced by using occlusion information. And in the second step, we enhance the quality of image features in terms of temporal consistency by using temporal consistency information for stereo images. With these enhanced features, multi-view images are generated by grid-mesh based image domain warping technique. Experimental results show that the proposed algorithm performs better than existing algorithms in terms of visual quality.
https://doi.org/10.5909/JBE.2013.18.6.859 인용 PDF KSCI KPUBS HTML

A Saliency-Based Focusing Region Selection Method for Robust Auto-Focusing

Jeon, Jaehwan;Cho, Changhun;Paik, Joonki
- IEIE Transactions on Smart Processing and Computing
- /
- v.1 no.3
- /
- pp.133-142
- /
- 2012
This paper presents a salient region detection algorithm for auto-focusing based on the characteristics of a human's visual attention. To describe the saliency at the local, regional, and global levels, this paper proposes a set of novel features including multi-scale local contrast, variance, center-surround entropy, and closeness to the center. Those features are then prioritized to produce a saliency map. The major advantage of the proposed approach is twofold; i) robustness to changes in focus and ii) low computational complexity. The experimental results showed that the proposed method outperforms the existing low-level feature-based methods in the sense of both robustness and accuracy for auto-focusing.
PDF

Development of Active Stereo Surveillance System with the Human-like Visual Selective Attention (인체의 상향식 선택적 주의 집중 시각 기능을 모방한 능동 스테레오 감시 시스템의 개발)

Jung, Bum-Soo;Lee, Min-Ho
- Journal of Sensor Science and Technology
- /
- v.13 no.2
- /
- pp.144-151
- /
- 2004
In this paper, we propose an active stereo surveillance system with human-like convergence function. The proposed system uses a bottom-up saliency map model with the human-like selective attention visual function to select an interesting region in each camera. and this system compares the landmarks whether the selective region in each camera finds a same region. If the left and right cameras successfully find a same landmarks, the implemented vision system focuses on the landmark. Using the motor encoder information, we can automatically obtain the depth information and resultantly construct a depth map using the depth information. Computer simulation and experimental results show that the proposed convergence method is very effective to implement the active stereo surveillance system.
https://doi.org/10.5369/JSST.2004.13.2.144 인용 PDF KSCI

Building Change Detection Using Deep Learning for Remote Sensing Images

Wang, Chang;Han, Shijing;Zhang, Wen;Miao, Shufeng
- Journal of Information Processing Systems
- /
- v.18 no.4
- /
- pp.587-598
- /
- 2022
To increase building change recognition accuracy, we present a deep learning-based building change detection using remote sensing images. In the proposed approach, by merging pixel-level and object-level information of multitemporal remote sensing images, we create the difference image (DI), and the frequency-domain significance technique is used to generate the DI saliency map. The fuzzy C-means clustering technique pre-classifies the coarse change detection map by defining the DI saliency map threshold. We then extract the neighborhood features of the unchanged pixels and the changed (buildings) from pixel-level and object-level feature images, which are then used as valid deep neural network (DNN) training samples. The trained DNNs are then utilized to identify changes in DI. The suggested strategy was evaluated and compared to current detection methods using two datasets. The results suggest that our proposed technique can detect more building change information and improve change detection accuracy.
https://doi.org/10.3745/JIPS.02.0180 인용 PDF KSCI

Query-based Visual Attention Algorithm for Object Recognition of A Mobile Robot (이동로봇의 물체인식을 위한 질의 기반 시각 집중 알고리즘)

Ryu, Gwang-Geun;Lee, Sang-Hoon;Suh, Il-Hong
- Journal of the Institute of Electronics Engineers of Korea SC
- /
- v.44 no.1
- /
- pp.50-58
- /
- 2007
In this paper, we propose a query-based visual attention algorithm for effective object finding of a vision-based mobile robot. This algorithm is developed by extending conventional bottom-up visual attention algorithms. In our proposed algorithm various conspicuity maps are merged to make a saliency map, where weighting values are determined by query-dependent object properties. The saliency map is then used to find possible attentive location of queried object. To show the validities of our proposed algorithm, several objects are employed to compare performances of our proposed algorithm with those of conventional bottom-up approaches. Here, as one of exemplar query-dependent object property, color property is used.
PDF KSCI

Search Result 100, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)