• Title/Summary/Keyword: visual estimation method

Search Result 255, Processing Time 0.029 seconds

An efficient frame rate up-conversion method with adaptive motion estimation and compensation for mobile projection displays

  • Lee, Jong-Ok;Jang, Seul-Ki;Chen, Qiao Song;Kim, Choon-Woo
    • 한국정보디스플레이학회:학술대회논문집
    • /
    • 2007.08a
    • /
    • pp.810-813
    • /
    • 2007
  • Recently, mobile video communication is getting more and more popular. Visual quality and computational complexity are primary factors affecting performance of video communication. Frame rate up-conversion (FRC) is necessary for achieving high visual quality in mobile projection displays. In this paper, a FRC method using motion compensation based on block matching algorithm (BMA) with adaptive block size is proposed. In order to improve the accuracy of the estimated motion vectors, the motion vector refinement technique is proposed. Experiment results indicate that the proposed technique exhibits better performance with lower hardware complexity compared to the conventional methods.

  • PDF

SD and EEG Evaluation of the Visual Cognition to the Natural and Urban Landscape (SD 및 EEG 기법을 통한 자연 및 도시경관의 시지각적 인지분석)

  • Hwang, Jee-Wook;Hong, Chul-Un;Chong, Woo-Suk
    • Journal of Environmental Science International
    • /
    • v.15 no.4
    • /
    • pp.305-310
    • /
    • 2006
  • The color and structure of urban constructions is a factor of urban landscape and shows their characteristics. Hence the modern buildings deal with their materials and external appearance as an important factor, making up the urban image. But it was nearby impossible to evaluate the value of visual landscape with objective measuring method. Most of all, it depends on the subjective estimation of a few talented or high educated experts with a sense of beauty. Such kinds of estimation can in some cases include arbitrary interpretations. In relation to this kind of problems, it is tried here in this study to analyse the human response of brain wave pattern (EEG) with use of SD method, while the tested persons watch the urban landscape scenery constructed in a visual reality. The tested persons were 20 adult male and female with no color blindness and intact cognitive function. Light source with color filter was used for color environment in a dark soundproof chamber. The signal of EEG is analysed digitally and grouped into the ${\alpha}$ and ${\beta}$ waves. The result showed that relative power of ${\alpha}$ wave ratio increased in the natural landscape scenery with blue and green color. From these results it was possible to evaluate the human response, which is affected by urban and natural color and structure stimulation and it might be useful as an indicator of visual cognition amenity toward the design of urban construction environment.

A study on the estimation of relative shift from aerial image sequences (연속항공영상에서의 상대적 편이 추정에 관한 연구)

  • Hwang, Y.S.;Lee, K.H.
    • Proceedings of the KIEE Conference
    • /
    • 1991.07a
    • /
    • pp.825-828
    • /
    • 1991
  • This paper addresses estimation of the relative shift vector from aerial image sequences. We perform similarity function tests and decide the most appropriate similarity function for the visual navigation system using aerial images. Finally, we propose the maximum variance reference line selection method for reducing the estimation error of the shift vector.

  • PDF

A Novel Covariance Matrix Estimation Method for MVDR Beamforming In Audio-Visual Communication Systems (오디오-비디오 통신 시스템에서 MVDR 빔 형성 기법을 위한 새로운 공분산 행렬 예측 방법)

  • You, Gyeong-Kuk;Yang, Jae-Mo;Lee, Jinkyu;Kang, Hong-Goo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.33 no.5
    • /
    • pp.326-334
    • /
    • 2014
  • This paper proposes a novel covariance matrix estimation scheme for minimum variance distortionless response (MVDR) beamforming. By accurately tracking direction-of-sound source arrival (DoA) information using audio-visual sensors, the covariance matrix is efficiently estimated by adopting a variable forgetting factor. The variable forgetting factor is determined by considering signal-to-interference ratio (SIR). Experimental results verify that the performance of the proposed method is superior to that of the conventional one in terms of interference/noise reduction and speech distortion.

Visual Sensing of the Light Spot of a Laser Pointer for Robotic Applications

  • Park, Sung-Ho;Kim, Dong Uk;Do, Yongtae
    • Journal of Sensor Science and Technology
    • /
    • v.27 no.4
    • /
    • pp.216-220
    • /
    • 2018
  • In this paper, we present visual sensing techniques that can be used to teach a robot using a laser pointer. The light spot of an off-the-shelf laser pointer is detected and its movement is tracked on consecutive images of a camera. The three-dimensional position of the spot is calculated using stereo cameras. The light spot on the image is detected based on its color, brightness, and shape. The detection results in a binary image, and morphological processing steps are performed on the image to refine the detection. The movement of the laser spot is measured using two methods. The first is a simple method of specifying the region of interest (ROI) centered at the current location of the light spot and finding the spot within the ROI on the next image. It is assumed that the movement of the spot is not large on two consecutive images. The second method is using a Kalman filter, which has been widely employed in trajectory estimation problems. In our simulation study of various cases, Kalman filtering shows better results mostly. However, there is a problem of fitting the system model of the filter to the pattern of the spot movement.

KNN-based Image Annotation by Collectively Mining Visual and Semantic Similarities

  • Ji, Qian;Zhang, Liyan;Li, Zechao
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.9
    • /
    • pp.4476-4490
    • /
    • 2017
  • The aim of image annotation is to determine labels that can accurately describe the semantic information of images. Many approaches have been proposed to automate the image annotation task while achieving good performance. However, in most cases, the semantic similarities of images are ignored. Towards this end, we propose a novel Visual-Semantic Nearest Neighbor (VS-KNN) method by collectively exploring visual and semantic similarities for image annotation. First, for each label, visual nearest neighbors of a given test image are constructed from training images associated with this label. Second, each neighboring subset is determined by mining the semantic similarity and the visual similarity. Finally, the relevance between the images and labels is determined based on maximum a posteriori estimation. Extensive experiments were conducted using three widely used image datasets. The experimental results show the effectiveness of the proposed method in comparison with state-of-the-arts methods.

Motion Estimation-based Human Fall Detection for Visual Surveillance

  • Kim, Heegwang;Park, Jinho;Park, Hasil;Paik, Joonki
    • IEIE Transactions on Smart Processing and Computing
    • /
    • v.5 no.5
    • /
    • pp.327-330
    • /
    • 2016
  • Currently, the world's elderly population continues to grow at a dramatic rate. As the number of senior citizens increases, detection of someone falling has attracted increasing attention for visual surveillance systems. This paper presents a novel fall-detection algorithm using motion estimation and an integrated spatiotemporal energy map of the object region. The proposed method first extracts a human region using a background subtraction method. Next, we applied an optical flow algorithm to estimate motion vectors, and an energy map is generated by accumulating the detected human region for a certain period of time. We can then detect a fall using k-nearest neighbor (kNN) classification with the previously estimated motion information and energy map. The experimental results show that the proposed algorithm can effectively detect someone falling in any direction, including at an angle parallel to the camera's optical axis.

Structural Damage Localization for Visual Inspection Using Unmanned Aerial Vehicle with Building Information Modeling Information (UAV와 BIM 정보를 활용한 시설물 외관 손상의 위치 측정 방법)

  • Lee, Yong-Ju;Park, Man-Woo
    • Journal of KIBIM
    • /
    • v.13 no.4
    • /
    • pp.64-73
    • /
    • 2023
  • This study introduces a method of estimating the 3D coordinates of structural damage from the detection results of visual inspection provided in 2D image coordinates using sensing data of UAV and 3D shape information of BIM. This estimation process takes place in a virtual space and utilizes the BIM model, so it is possible to immediately identify which member of the structure the estimated location corresponds to. Difference from conventional structural damage localization methods that require 3D scanning or additional sensor attachment, it is a method that can be applied locally and rapidly. Measurement accuracy was calculated through the distance difference between the measured position measured by TLS (Terrestrial Laser Scanner) and the estimated position calculated by the method proposed in this study, which can determine the applicability of this study and the direction of future research.

Improving Detection Range for Short Baseline Stereo Cameras Using Convolutional Neural Networks and Keypoint Matching (컨볼루션 뉴럴 네트워크와 키포인트 매칭을 이용한 짧은 베이스라인 스테레오 카메라의 거리 센싱 능력 향상)

  • Byungjae Park
    • Journal of Sensor Science and Technology
    • /
    • v.33 no.2
    • /
    • pp.98-104
    • /
    • 2024
  • This study proposes a method to overcome the limited detection range of short-baseline stereo cameras (SBSCs). The proposed method includes two steps: (1) predicting an unscaled initial depth using monocular depth estimation (MDE) and (2) adjusting the unscaled initial depth by a scale factor. The scale factor is computed by triangulating the sparse visual keypoints extracted from the left and right images of the SBSC. The proposed method allows the use of any pre-trained MDE model without the need for additional training or data collection, making it efficient even when considering the computational constraints of small platforms. Using an open dataset, the performance of the proposed method was demonstrated by comparing it with other conventional stereo-based depth estimation methods.

Fast Content-preserving Seam Estimation for Real-time High-resolution Video Stitching (실시간 고해상도 동영상 스티칭을 위한 고속 콘텐츠 보존 시접선 추정 방법)

  • Kim, Taeha;Yang, Seongyeop;Kang, Byeongkeun;Lee, Hee Kyung;Seo, Jeongil;Lee, Yeejin
    • Journal of Broadcast Engineering
    • /
    • v.25 no.6
    • /
    • pp.1004-1012
    • /
    • 2020
  • We present a novel content-preserving seam estimation algorithm for real-time high-resolution video stitching. Seam estimation is one of the fundamental steps in image/video stitching. It is to minimize visual artifacts in the transition areas between images. Typical seam estimation algorithms are based on optimization methods that demand intensive computations and large memory. The algorithms, however, often fail to avoid objects and results in cropped or duplicated objects. They also lack temporal consistency and induce flickering between frames. Hence, we propose an efficient and temporarily-consistent seam estimation algorithm that utilizes a straight line. The proposed method also uses convolutional neural network-based instance segmentation to locate seam at out-of-objects. Experimental results demonstrate that the proposed method produces visually plausible stitched videos with minimal visual artifacts in real-time.