• Title/Summary/Keyword: affine transformation

Search Result 127, Processing Time 0.022 seconds

Integration of Motion Compensation Algorithm for Predictive Video Coding (예측 비디오 코딩을 위한 통합 움직임 보상 알고리즘)

  • Eum, Ho-Min;Park, Geun-Soo;Song, Moon-Ho
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.36S no.12
    • /
    • pp.85-96
    • /
    • 1999
  • In a number of predictive video compression standards, the motion is compensated by the block-based motion compensation (BMC). The effective motion field used for the prediction by the BMC is obviously discontinuous since one motion vector is used for the entire macro-block. The usage of discontinuous motion field for the prediction causes the blocky artifacts and one obvious approach for eliminating such artifacts is to use a smoothed motion field. The optimal procedure will depend on the type of motion within the video. In this paper, several procedures for the motion vectors are considered. For any interpolation or approaches, however, the motion vectors as provided by the block matching algorithm(BMA) are no longer optimal. The optimum motion vectors(still one per macro-block) must minimize the of the displaced frame difference (DFD). We propose a unified algorithm that computes the optimum motion vectors to minimize the of the DFD using a conjugate gradient search. The proposed algorithm has been implemented and tested for the affine transformation based motion compensation (ATMC), the bilinear transformation based motion compensation (BTMC) and our own filtered motion compensation(FMC). The performance of these different approaches will be compared against the BMC.

  • PDF

Comparison Among Sensor Modeling Methods in High-Resolution Satellite Imagery (고해상도 위성영상의 센서모형과 방법 비교)

  • Kim, Eui Myoung;Lee, Suk Kun
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.26 no.6D
    • /
    • pp.1025-1032
    • /
    • 2006
  • Sensor modeling of high-resolution satellites is a prerequisite procedure for mapping and GIS applications. Sensor models, describing the geometric relationship between scene and object, are divided into two main categories, which are rigorous and approximate sensor models. A rigorous model is based on the actual geometry of the image formation process, involving internal and external characteristics of the implemented sensor. However, approximate models require neither a comprehensive understanding of imaging geometry nor the internal and external characteristics of the imaging sensor, which has gathered a great interest within photogrammetric communities. This paper described a comparison between rigorous and various approximate sensor models that have been used to determine three-dimensional positions, and proposed the appropriate sensor model in terms of the satellite imagery usage. Through the case study of using IKONOS satellite scenes, rigorous and approximate sensor models have been compared and evaluated for the positional accuracy in terms of acquirable number of ground controls. Bias compensated RFM(Rational Function Model) turned out to be the best among compared approximate sensor models, both modified parallel projection and parallel-perspective model were able to be modelled with a small number of controls. Also affine transformation, one of the approximate sensor models, can be used to determine the planimetric position of high-resolution satellites and perform image registration between scenes.

Geometric Correction of Lips Using Lip Information (입술정보를 이용한 입술모양의 기하학적 보정)

  • 황동국;박희정;전병민
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.6C
    • /
    • pp.834-841
    • /
    • 2004
  • There can be lips transformed geometrically in the lip images according to the location or the pose of camera and speaker. This transformation of the lip images changes geometric information of original lip phases. Therefore, for enhancing global lip information by using partial information of lips to correct lip phases transformed geometrically, in this paper we propose a method that can geometrically correct lips. The method is composed of two steps - the feature-deciding step and the correcting step. In the former, it is for us to extract key points and features of source image according to the its lip model and to create that of target image according to the its lip model. In the latter, we decide mapping relation after partition a source and target image based on information extracted in the previous step into each 4 regions. and then, after mapping, we unite corrected sub-images to a result image. As experiment image, we use fames that contain pronunciation on short vowels of the Korean language and use lip symmetry for evaluating the proposed algorithm. In experiment result, the correcting rate of the lower lip than the upper lip and that of lips moving largely than little was highly enhanced.

GEOMETRY OF SATELLITE IMAGES - CALIBRATION AND MATHEMATICAL MODELS

  • JACOBSEN KARSTEN
    • Proceedings of the KSRS Conference
    • /
    • 2005.10a
    • /
    • pp.182-185
    • /
    • 2005
  • Satellite cameras are calibrated before launch in detail and in general, but it cannot be guaranteed that the geometry is not changing during launch and caused by thermal influence of the sun in the orbit. Modem satellite imaging systems are based on CCD-line sensors. Because of the required high sampling rate the length of used CCD-lines is limited. For reaching a sufficient swath width, some CCD-lines are combined to a longer virtual CCD-line. The images generated by the individual CCD-lines do overlap slightly and so they can be shifted in x- and y-direction in relation to a chosen reference image just based on tie points. For the alignment and difference in scale, control points are required. The resulting virtual image has only negligible errors in areas with very large difference in height caused by the difference in the location of the projection centers. Color images can be related to the joint panchromatic scenes just based on tie points. Pan-sharpened images may show only small color shifts in very mountainous areas and for moving objects. The direct sensor orientation has to be calibrated based on control points. Discrepancies in horizontal shift can only be separated from attitude discrepancies with a good three-dimensional control point distribution. For such a calibration a program based on geometric reconstruction of the sensor orientation is required. The approximations by 3D-affine transformation or direct linear transformation (DL n cannot be used. These methods do have also disadvantages for standard sensor orientation. The image orientation by geometric reconstruction can be improved by self calibration with additional parameters for the analysis and compensation of remaining systematic effects for example caused by a not linear CCD-line. The determined sensor geometry can be used for the generation? of rational polynomial coefficients, describing the sensor geometry by relations of polynomials of the ground coordinates X, Y and Z.

  • PDF

Driving Assist System using Semantic Segmentation based on Deep Learning (딥러닝 기반의 의미론적 영상 분할을 이용한 주행 보조 시스템)

  • Kim, Jung-Hwan;Lee, Tae-Min;Lim, Joonhong
    • Journal of IKEEE
    • /
    • v.24 no.1
    • /
    • pp.147-153
    • /
    • 2020
  • Conventional lane detection algorithms have problems in that the detection rate is lowered in road environments having a large change in curvature and illumination. The probabilistic Hough transform method has low lane detection rate since it exploits edges and restrictive angles. On the other hand, the method using a sliding window can detect a curved lane as the lane is detected by dividing the image into windows. However, the detection rate of this method is affected by road slopes because it uses affine transformation. In order to detect lanes robustly and avoid obstacles, we propose driving assist system using semantic segmentation based on deep learning. The architecture for segmentation is SegNet based on VGG-16. The semantic image segmentation feature can be used to calculate safety space and predict collisions so that we control a vehicle using adaptive-MPC to avoid objects and keep lanes. Simulation results with CARLA show that the proposed algorithm detects lanes robustly and avoids unknown obstacles in front of vehicle.

Multiresolution 3D Facial Model Compression (다해상도 3D 얼굴 모델의 압축)

  • 박동희;이종석;이영식;배철수
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2002.05a
    • /
    • pp.602-607
    • /
    • 2002
  • In this paper, we proposed an approach to efficiently compress and transmit multiresoltion 3D lariat models for multimedia and very low bit rate applications. A personal facial model is obtained by a 3D laser digitizer, and further re-quantized at several resolutions according to different scope of applications, such as animation, video game, and video conference. By deforming 2D templates to match and re-quantize a 3D digitized facial model, we obtain its compressed model. In the present study, we create hierarchical 2D lariat wireframe templates are adapted according to facial feature points and the proposed piecewise chainlet affined transformation(PACT) method. The 3D digitized model after requantization are reduced significantly without perceptual loss. Moreover the proposed multiresoulation lariat models possessed of hierarchial data structure are apt to be progressively transmitted and displayed across internet.

  • PDF

Effective Reduction of Horizontal Error in Laser Scanning Information by Strip-Wise Least Squares Adjustments

  • Lee, Byoung-Kil;Yu, Ki-Yun;Pyeon, Moo-Wook
    • ETRI Journal
    • /
    • v.25 no.2
    • /
    • pp.109-120
    • /
    • 2003
  • Though the airborne laser scanning (ALS) technique is becoming more popular in many applications, horizontal accuracy of points scanned by the ALS is not yet satisfactory when compared with the accuracy achieved for vertical positions. One of the major reasons is the drift that occurs in the inertial measurement unit (IMU) during the scanning. This paper presents an algorithm that adjusts for the error that is introduced mainly by the drift of the IMU that renders systematic differences between strips on the same area. For this, we set up an observation equation for strip-wise adjustments and completed it with tie point and control point coordinates derived from the scanned strips and information from aerial photos. To effectively capture the tie points, we developed a set of procedures that constructs a digital surface model (DSM) with breaklines and then performed feature-based matching on strips resulting in a set of reliable tie points. Solving the observation equations by the least squares method produced a set of affine transformation equations with 6 parameters that we used to transform the strips for adjusting the horizontal error. Experimental results after evaluation of the accuracy showed a root mean squared error (RMSE) of the adjusted strip points of 0.27 m, which is significant considering the RMSE before adjustment was 0.77 m.

  • PDF

Motion estimation method using multiple linear regression model (다중선형회귀모델을 이용한 움직임 추정방법)

  • 김학수;임원택;이재철;이규원;박규택
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.34S no.10
    • /
    • pp.98-103
    • /
    • 1997
  • Given the small bit allocation for motion information in very low bit-rate coding, motion estimation using the block matching algorithm(BMA) fails to maintain an acceptable level of prediction errors. The reson is that the motion model, or spatial transformation, assumed in block matching cannot approximate the motion in the real world precisely with a small number of parameters. In order to overcome the drawback of the conventional block matching algorithm, several triangle-based methods which utilize triangular patches insead of blocks have been proposed. To estimate the motions of image sequences, these methods usually have been based on the combination of optical flow equation, affine transform, and iteration. But the compuataional cost of these methods is expensive. This paper presents a fast motion estimation algorithm using a multiple linear regression model to solve the defects of the BMA and the triange-based methods. After describing the basic 2-D triangle-based method, the details of the proposed multiple linear regression model are presented along with the motion estimation results from one standard video sequence, representative of MPEG-4 class A data. The simulationresuls show that in the proposed method, the average PSNR is improved about 1.24 dB in comparison with the BMA method, and the computational cost is reduced about 25% in comparison with the 2-D triangle-based method.

  • PDF

Filtering Feature Mismatches using Multiple Descriptors (다중 기술자를 이용한 잘못된 특징점 정합 제거)

  • Kim, Jae-Young;Jun, Heesung
    • Journal of the Korea Society of Computer and Information
    • /
    • v.19 no.1
    • /
    • pp.23-30
    • /
    • 2014
  • Feature matching using image descriptors is robust method used recently. However, mismatches occur in 3D transformed images, illumination-changed images and repetitive-pattern images. In this paper, we observe that there are a lot of mismatches in the images which have repetitive patterns. We analyze it and propose a method to eliminate these mismatches. MDMF(Multiple Descriptors-based Mismatch Filtering) eliminates mismatches by using descriptors of nearest several features of one specific feature point. In experiments, for geometrical transformation like scale, rotation, affine, we compare the match ratio among SIFT, ASIFT and MDMF, and we show that MDMF can eliminate mismatches successfully.

Profile Design and Implementation of Aerial Photogrammetry WPS for Standard GIS Web Service (With Emphasis on Affine Transformation and Resection) (표준 GIS 웹 서비스를 위한 항공사진측량 WPS의 프로파일 설계 및 구현 (부등각사상변환, 후방교회법 중심으로))

  • Kim, Byung-Jo;Yom, Jae-Hong;Kyung, Min-Ju
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.28 no.3
    • /
    • pp.337-345
    • /
    • 2010
  • In general, Digital Photogrammetry is based on independent workstation system, which is costly and has complex process. In this research, a new approach method regarding Digital Photogrammetry procedure is suggested using Web Processing Service, which is a GIS standard proposed by Open Geospatial Consortium. For the experiment, many Generic Processes were defined through WPS profiling procedure which defines standard unit for Photogrammetry, and with the defined process each server and client S/W module was implemented based on WPS standards. In this paper, many users can be expected to share and reuse unit process in WPS server through the web.