• Title/Summary/Keyword: image pyramid structure


A Hierarchical Stereo Matching Algorithm Using Wavelet Representation (웨이브릿 변환을 이용한 계층적 스테레오 정합)

  • 김영석;이준재;하영호
    • Journal of the Korean Institute of Telematics and Electronics B / v.31B no.8 / pp.74-86 / 1994
  • In this paper, a hierarchical stereo matching algorithm is proposed that estimates disparity in the wavelet-transformed domain using locally adaptive windows and weights. The pyramidal structure obtained by the wavelet transform is used to avoid the loss of information that occurs with the conventional Gaussian or Laplacian pyramids. The wavelet-transformed images are decomposed into a blurred image, horizontal edges, vertical edges, and diagonal edges. The similarity between each wavelet channel of the left and right images determines the relative importance of each primitive and lets the algorithm perform area-based and feature-based matching adaptively. The wavelet transform extracts features at dense resolution while avoiding duplication or loss of information. Meanwhile, the variable window needed to obtain a precise and stable estimate of correspondence is chosen adaptively from the disparities estimated at the coarse resolution and from the LL (low-low) channel of the wavelet-transformed stereo images. A new relaxation algorithm that reduces false matches without blurring disparity edges is also proposed. Experimental results on various images show that the proposed algorithm performs well even when the test images have unfavorable conditions.
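
The four-channel decomposition described in this abstract can be illustrated with a Haar wavelet, the simplest choice. This is only a sketch: the abstract does not say which wavelet the authors used, and the function names `haar_level` and `wavelet_pyramid` are hypothetical. Each level splits the image into a blurred (LL) channel plus horizontal-edge, vertical-edge, and diagonal-edge channels, and the pyramid is built by decomposing the LL channel again.

```python
def haar_level(img):
    """One level of 2-D Haar analysis on a grayscale image (list of rows).

    Assumes even width and height. Averaging normalization (divide by 4)
    keeps LL in the input gray-level range; an orthonormal transform
    would divide by 2 instead.
    """
    ll, horiz, vert, diag = [], [], [], []
    for r in range(0, len(img), 2):
        ll_row, h_row, v_row, d_row = [], [], [], []
        for c in range(0, len(img[0]), 2):
            a, b = img[r][c], img[r][c + 1]          # top-left, top-right
            p, q = img[r + 1][c], img[r + 1][c + 1]  # bottom-left, bottom-right
            ll_row.append((a + b + p + q) / 4)  # blurred image
            h_row.append((a + b - p - q) / 4)   # top vs. bottom: horizontal edges
            v_row.append((a - b + p - q) / 4)   # left vs. right: vertical edges
            d_row.append((a - b - p + q) / 4)   # diagonal edges
        ll.append(ll_row)
        horiz.append(h_row)
        vert.append(v_row)
        diag.append(d_row)
    return ll, horiz, vert, diag


def wavelet_pyramid(img, levels):
    """Repeatedly decompose the LL channel; return (coarsest LL, detail list)."""
    details = []
    current = img
    for _ in range(levels):
        current, horiz, vert, diag = haar_level(current)
        details.append((horiz, vert, diag))
    return current, details
```

Because the four channels together hold exactly as many samples as the input, nothing is duplicated or lost: each 2x2 block can be recovered from its four coefficients (for example, the top-left pixel is the sum of the four channel values).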


Object-Based Video Segmentation Using Spatio-temporal Entropic Thresholding and Camera Panning Compensation (시공간 엔트로피 임계법과 카메라 패닝 보상을 이용한 객체 기반 동영상 분할)

  • 백경환;곽노윤
    • Journal of the Korea Academia-Industrial cooperation Society / v.4 no.3 / pp.126-133 / 2003
  • This paper presents a morphological segmentation method for extracting moving objects from a video sequence using global motion compensation and two-dimensional spatio-temporal entropic thresholding. First, global motion compensation is performed with the camera panning vector estimated in a hierarchical pyramid structure constructed by the wavelet transform. Second, the regions likely to contain the moving object between two consecutive frames are extracted block by block from the global-motion-compensated image using two-dimensional spatio-temporal entropic thresholding. Then a LUT is built that classifies each block as a changed block, an uncertain block, or a stationary block according to the thresholding results. Next, by adaptively selecting the initial search layer and the search range with reference to the LUT, the proposed HBMA carries out fast motion estimation and extracts the object-included region in the hierarchical pyramid structure. Finally, the thresholded gradient image is defined in the object-included region, and the morphological segmentation method is applied to that region pixel by pixel to extract the moving object. Computer simulations show that the proposed method provides relatively good segmentation results for moving objects and, in particular, reasonable results in edge areas with low contrast.


The Design and Implementation of a Reusable Viewer Component

  • Kim, Hong-Gab;Lim, Young-Jae;Kim, Kyung-Ok
    • Proceedings of the KSRS Conference / 2002.10a / pp.66-69 / 2002
  • This article outlines the capabilities of a viewer component called GridViewer and demonstrates its reusability. GridViewer was designed for building the image display part of GIS or remote sensing application software, so it is straightforward to couple it closely with access to very large images. Display is performed through a pyramid structure, which makes it possible to handle very large datasets, up to several gigabytes in size, within the limited capability of a PC. GridViewer is freed from the responsibility of handling the various raster data file formats by accepting grid coverages, which were designed by OGC to promote interoperability between implementations by data vendors and by software vendors providing analysis and grid processing. GridViewer differs from other such viewers in that it allows clients to extend its functions and capabilities using a small set of methods originally implemented in it. We show its reusability and extensibility by using it to develop application programs that perform various functions not originally supported by the GridViewer COM component.
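
A pyramid lets a viewer read only as much data as the screen can show. The sketch below is not GridViewer's API (the abstract does not describe it); `pyramid_level_sizes`, `level_for_scale`, and the 256-pixel `tile` limit are hypothetical names and values chosen for illustration. It assumes each level halves the previous one and that the viewer picks the coarsest level that still supplies at least one image pixel per screen pixel.

```python
import math


def pyramid_level_sizes(width, height, tile=256):
    """Sizes of all levels, halving until the image fits in one tile."""
    sizes = [(width, height)]
    while max(sizes[-1]) > tile:
        w, h = sizes[-1]
        sizes.append(((w + 1) // 2, (h + 1) // 2))  # round up on odd sizes
    return sizes


def level_for_scale(scale, n_levels):
    """Choose a level for a zoom factor.

    scale is screen pixels per full-resolution image pixel, so 0.25 means
    the image is shown at a quarter of its native size.
    """
    if scale >= 1:
        return 0  # zoomed in: only the full-resolution level is sharp enough
    level = int(math.floor(math.log2(1 / scale)))
    return min(level, n_levels - 1)
```

For a 40000 x 30000 image this yields nine levels, and a fully zoomed-out view reads only the 157 x 118 level instead of the full raster.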


A Multiresolution Stereo Matching Based on Genetic Algorithm using Edge Information (에지 정보를 이용한 유전 알고리즘 기반의 다해상도 스테레오 정합)

  • Hong, Seok-Keun;Cho, Seok-Je
    • The KIPS Transactions:PartB / v.17B no.1 / pp.63-68 / 2010
  • In this paper, we propose a multiresolution stereo matching method based on a genetic algorithm using edge information. The proposed approach treats matching as an optimization problem and finds the solution with a genetic algorithm. The cost function is composed of constraints that are commonly used in stereo matching. We define the structure of the chromosomes using edge pixel information from the reference image of the stereo pair. To increase efficiency, we apply the image pyramid method to stereo matching and calculate the initial disparity map at the coarsest resolution. The initial disparity map is then propagated to the next finer resolution, interpolated, and refined. We verify that our approach not only reduces the search time for correspondence but also ensures the validity of the matching.
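
The coarse-to-fine step can be sketched as follows. When resolution doubles, each coarse disparity covers a 2x2 block of fine pixels and its value doubles as well. The refinement here is a plain local search over a small radius with a per-pixel absolute-difference cost; it stands in for the paper's genetic-algorithm refinement, which the abstract does not detail. The names `propagate` and `refine` and the convention that `left[y][x]` matches `right[y][x - d]` are assumptions.

```python
def propagate(disp):
    """Upsample a disparity map by 2 (nearest neighbor) and double its values."""
    fine = []
    for row in disp:
        fine_row = []
        for d in row:
            fine_row += [2 * d, 2 * d]
        fine.append(fine_row)
        fine.append(list(fine_row))
    return fine


def refine(left, right, disp, radius=1):
    """Search d0-radius..d0+radius around each propagated disparity d0."""
    width = len(left[0])
    out = [row[:] for row in disp]
    for y in range(len(left)):
        for x in range(width):
            best_cost = None
            for d in range(disp[y][x] - radius, disp[y][x] + radius + 1):
                if 0 <= x - d < width:  # skip candidates outside the image
                    cost = abs(left[y][x] - right[y][x - d])
                    if best_cost is None or cost < best_cost:
                        best_cost, out[y][x] = cost, d
    return out
```

The search range at each level is only `2 * radius + 1` candidates per pixel, which is where the reduction in search time comes from.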

Automatic Face Extraction with Unification of Brightness Distribution in Candidate Region and Triangle Structure among Facial Features (후보영역의 밝기 분산과 얼굴특징의 삼각형 배치구조를 결합한 얼굴의 자동 검출)

  • 이칠우;최정주
    • Journal of Korea Multimedia Society / v.3 no.1 / pp.23-33 / 2000
  • In this paper, we describe an algorithm that can extract human faces in natural poses from complex backgrounds. The method is based on the idea that a facial region has nearly the same gray level for all pixels within appropriately scaled blocks. Based on this idea, we develop a hierarchical process: first, block image data with a pyramid structure is generated from the input image; next, candidate facial regions in the block image are quickly determined; finally, the detailed facial features (organs) are located. To find the features easily, we introduce a local gray-level transform that emphasizes dark and small regions, and we evaluate geometrical triangle constraints among the facial features. The merit of our method is that it avoids the parameter assignment problem, since the algorithm uses a simple brightness computation; consequently, robust systems that do not depend on specific parameter values can be easily constructed.
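
The first stage, a block image in which candidate regions have nearly uniform brightness, might look like the sketch below. It is an illustration under assumptions, not the authors' procedure: `block_stats`, `uniform_blocks`, and the `max_var` threshold are hypothetical, and the abstract does not state how uniformity is measured. Here each non-overlapping block is summarized by its mean and variance, and low-variance blocks are kept as candidates.

```python
def block_stats(img, b):
    """Mean and variance of every non-overlapping b x b block of img."""
    h, w = len(img), len(img[0])
    means, variances = [], []
    for r in range(0, h - h % b, b):       # ignore a partial last row of blocks
        m_row, v_row = [], []
        for c in range(0, w - w % b, b):   # ignore a partial last column
            px = [img[y][x] for y in range(r, r + b) for x in range(c, c + b)]
            m = sum(px) / len(px)
            m_row.append(m)
            v_row.append(sum((v - m) ** 2 for v in px) / len(px))
        means.append(m_row)
        variances.append(v_row)
    return means, variances


def uniform_blocks(variances, max_var):
    """Block coordinates (row, col) whose gray levels are nearly constant."""
    return [(r, c)
            for r, row in enumerate(variances)
            for c, v in enumerate(row)
            if v <= max_var]
```

Running `block_stats` with several block sizes gives the pyramid of block images; only blocks that pass at a coarse size need to be examined at finer sizes.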


Decision of Road Direction by Polygonal Approximation (다각근사법을 이용한 도로방향 결정)

  • Lim, Young-Cheol;Park, Jong-Gun;Kim, Eui-Sun;Park, Jin-Su;Park, Chang-Seok
    • Proceedings of the KIEE Conference / 1996.07b / pp.1398-1400 / 1996
  • In this paper, a method for deciding the road direction for ALV (Autonomous Land Vehicle) road following by region-based segmentation is presented. Deciding the road direction requires extracting road regions from images in real time to guide the navigation of the ALV on the roadway. Two thresholds that discriminate between road and non-road regions in the image are easily determined using knowledge of the problem region and a polygonal approximation that searches for multiple peaks and valleys in the histogram of a road image. The most likely road region of the binary image is selected from the original image by these steps. The location of the vanishing point, which indicates the direction of the road, can then be obtained by applying the method again to the X-Y profile of the binary road region. The method can reliably steer an ALV along a road even in the presence of fluctuating illumination, bad road surface conditions such as hidden boundaries, shadows, road patches, dirt, and water stains, and unusual road conditions. A pyramid structure also saves time in processing road images, and real-time image processing for ALV navigation is implemented. The efficacy of this approach is demonstrated using several real-world road images.


Visual Model of Pattern Design Based on Deep Convolutional Neural Network

  • Jingjing Ye;Jun Wang
    • KSII Transactions on Internet and Information Systems (TIIS) / v.18 no.2 / pp.311-326 / 2024
  • The rapid development of neural network technology promotes the neural network model driven by big data to overcome the texture effect of complex objects. Due to the limitations in complex scenes, it is necessary to establish custom template matching and apply it to the research of many fields of computational vision technology. The dependence on high-quality small label sample database data is not very strong, and the machine learning system of deep feature connection to complete the task of texture effect inference and speculation is relatively poor. The style transfer algorithm based on neural network collects and preserves the data of patterns, extracts and modernizes their features. Through the algorithm model, it is easier to present the texture color of patterns and display them digitally. In this paper, according to the texture effect reasoning of custom template matching, the 3D visualization of the target is transformed into a 3D model. The high similarity between the scene to be inferred and the user-defined template is calculated by the user-defined template of the multi-dimensional external feature label. The convolutional neural network is adopted to optimize the external area of the object to improve the sampling quality and computational performance of the sample pyramid structure. The results indicate that the proposed algorithm can accurately capture the significant target, achieve more ablation noise, and improve the visualization results. The proposed deep convolutional neural network optimization algorithm has good rapidity, data accuracy and robustness. The proposed algorithm can adapt to the calculation of more task scenes, display the redundant vision-related information of image conversion, enhance the powerful computing power, and further improve the computational efficiency and accuracy of convolutional networks, which has a high research significance for the study of image information conversion.

$L_2$-Norm Pyramid-Based Search Algorithm for Fast VQ Encoding (고속 벡터 양자 부호화를 위한 $L_2$-평균 피라미드 기반 탐색 기법)

  • Song, Byeong-Cheol;Ra, Jong-Beom
    • Journal of the Institute of Electronics Engineers of Korea SP / v.39 no.1 / pp.32-39 / 2002
  • Vector quantization for image compression requires expensive encoding time to find the codeword closest to the input vector. This paper proposes a search algorithm for fast vector quantization encoding. First, we derive a robust condition based on the efficient topological structure of the codebook to dramatically eliminate unnecessary matching operations from the search procedure. Then, we propose a fast search algorithm using the elimination condition. Simulation results show that, with little preprocessing and memory cost, the encoding time of the proposed algorithm is reduced significantly while the encoding quality remains the same as that of the full search algorithm. The proposed algorithm is also found to outperform existing search algorithms.
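
The abstract does not state the elimination condition, so the sketch below shows only the general idea behind norm-based rejection, using a bound that follows from the triangle inequality: for any split of the vectors into sub-blocks, the sum over blocks of the squared differences of block norms never exceeds the squared distance between the vectors. A codeword whose bound already reaches the best distance found so far cannot win and is skipped without a full distance computation. The names `norm_pyramid` and `fast_nearest` are hypothetical, and the vector length is assumed to be a power of two.

```python
import math


def norm_pyramid(v):
    """Level k holds the L2 norms of 2**k equal sub-blocks of v."""
    levels = []
    blocks = 1
    while blocks <= len(v):
        size = len(v) // blocks
        levels.append([math.sqrt(sum(t * t for t in v[i * size:(i + 1) * size]))
                       for i in range(blocks)])
        blocks *= 2
    return levels


def fast_nearest(x, codebook):
    """Return (index, squared distance, number of full distance computations)."""
    px = norm_pyramid(x)
    # In an encoder the codeword pyramids would be precomputed once.
    pyramids = [norm_pyramid(c) for c in codebook]
    best_i, best_d2, full = -1, float("inf"), 0
    for i, c in enumerate(codebook):
        # Coarse levels are cheapest; finer levels give tighter lower bounds.
        if any(sum((a - b) ** 2 for a, b in zip(px[k], pyramids[i][k])) >= best_d2
               for k in range(len(px))):
            continue  # the lower bound rules this codeword out
        full += 1
        d2 = sum((a - b) ** 2 for a, b in zip(x, c))
        if d2 < best_d2:
            best_i, best_d2 = i, d2
    return best_i, best_d2, full
```

Because every rejected codeword is provably no closer than the current best, the result matches an exhaustive search (up to floating-point rounding in the bounds), which is consistent with the abstract's claim that encoding quality is unchanged.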

Vehicle Plate Extraction Algorithm for an Exclusive Bus Lane (버스 전용차선에서의 차량 번호판 추출 알고리즘)

  • 설성욱;이상찬;주재흠;강현인;남기곤
    • Journal of the Institute of Convergence Signal Processing / v.2 no.4 / pp.31-37 / 2001
  • A license plate recognition system for an exclusive bus lane consists of five core parts: vehicle detection, image acquisition, individual character extraction, character recognition, and data transmission. Among them, the accuracy of license plate extraction significantly affects the recognition rate of the whole system, and more exact extraction of the license plate is required under various weather and environmental conditions. Therefore, in this paper we propose a plate extraction algorithm that builds a pyramid structure to reduce the extraction processing time, binarizes the plate's template region using adaptive thresholding, extracts candidate regions containing the plate, and verifies the final region among the candidates using the distribution characteristics of plate characters. In experiments, the proposed method exactly extracted the license plate region from images obtained in an exclusive bus lane under various weather and environmental conditions.


Three Dimensional Volume Reconstruction of an Object from X-ray Images using Uniform and Simultaneous ART (USART 방법에 의한 X선 영상으로부터의 삼차원 물체의 형상 복원)

  • Roh, Young-Jun;Cho, Hyung-Suck;Kim, Hyeong-Cheol;Kim, Jong-Hyung
    • Journal of Institute of Control, Robotics and Systems / v.8 no.1 / pp.21-27 / 2002
  • Inspection and shape measurement of three-dimensional objects are widely needed in industry for quality monitoring and control. A number of visual or optical technologies have been successfully applied to measure three-dimensional surfaces. However, those conventional visual or optical methods have inherent shortcomings such as occlusion and variable surface reflection. An X-ray vision system can be a good solution to these problems, since it can extract volume information that includes both the surface geometry and the inner structure of an object. In an X-ray system, the surface condition of an object, whether Lambertian or specular, does not affect the inherent characteristics of its X-ray images. In this paper, we propose a three-dimensional X-ray imaging method that reconstructs the three-dimensional structure of an object from sets of two-dimensional X-ray images. The method requires two or more X-ray images projected from different views. Once these images are acquired, the simultaneous algebraic reconstruction technique (SART) is usually used. Since existing SART algorithms have several shortcomings, such as slow convergence and convergence that differs across the reconstruction volume of interest, an advanced SART algorithm named USART (uniform SART) is proposed to avoid these shortcomings and improve reconstruction performance. Because each voxel within the volume is equally weighted when the instantaneous value of its internal density is updated, the algorithm achieves uniform convergence over the reconstructed volume. The algorithm is simulated on objects of various shapes, such as a pyramid, a hemisphere, and a BGA model. Based on the simulation results, the performance of the proposed method is compared with that of the conventional SART method.
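
For orientation, the conventional SART update that the paper takes as its baseline can be sketched as below. This is the standard textbook form, not the USART weighting, which the abstract does not specify. The function name `sart` and the tiny 2x2 example are illustrative assumptions; `A[i][j]` is the (nonnegative) weight of voxel `j` in ray `i`, and every ray and voxel is assumed to have a nonzero weight sum.

```python
def sart(A, b, iterations=50, relax=1.0):
    """Standard SART iteration for A x = b, starting from x = 0."""
    m, n = len(A), len(A[0])
    row_sum = [sum(row) for row in A]                            # per-ray weight sum
    col_sum = [sum(A[i][j] for i in range(m)) for j in range(n)]  # per-voxel weight sum
    x = [0.0] * n
    for _ in range(iterations):
        # Residual of every ray, normalized by ray length, all computed
        # from the same x: this is the "simultaneous" part.
        res = [(b[i] - sum(A[i][j] * x[j] for j in range(n))) / row_sum[i]
               for i in range(m)]
        for j in range(n):
            # Back-project the residuals and normalize per voxel.
            x[j] += relax * sum(A[i][j] * res[i] for i in range(m)) / col_sum[j]
    return x
```

As a usage example, a 2x2 image with densities 1, 2, 3, 4 viewed along rows and columns gives four rays; calling `sart` on the resulting row sums (3, 7) and column sums (4, 6) recovers the densities, with the error halving at each iteration in this particular case.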