• Title/Summary/Keyword: Frames of reference

Estimation of Drone Velocity with Sum of Absolute Difference between Multiple Frames (다중 프레임의 SAD를 이용한 드론 속도 측정)

  • Nam, Donho;Yeom, Seokwon
    • Journal of the Institute of Convergence Signal Processing / v.20 no.3 / pp.171-176 / 2019
  • Drones are widely used because they can efficiently acquire long-distance videos. In drone operation, the speed (the magnitude of the velocity) can be set, but the moving direction cannot, so accurate information about the drone's movement must be estimated. In this paper, we estimate the velocity of a drone moving at a constant speed and direction. To estimate the drone's velocity, we find the displacement of the target frame that minimizes the sum of absolute differences (SAD) between the reference frame and the target frame. The ground truth of the drone's velocity is calculated using the position of a certain matching point over all frames. In the experiments, a video was obtained from a drone moving at a constant speed at a height of 150 meters. The root mean squared error (RMSE) of the estimated velocities in the x and y directions and the RMSE of the speed were computed to show the reliability of the proposed method.
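The SAD-minimizing displacement search described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `sad` and `estimate_displacement`, the exhaustive integer-pixel search, and the normalization by overlap area are all assumptions made for the example.

```python
def sad(ref, tgt, dx, dy):
    """Mean absolute difference between ref and tgt shifted by (dx, dy).

    Assumption: the sum is divided by the overlap area so that large
    shifts (small overlaps) are not favored automatically.
    """
    h, w = len(ref), len(ref[0])
    total, count = 0, 0
    for y in range(h):
        for x in range(w):
            ty, tx = y + dy, x + dx
            if 0 <= ty < h and 0 <= tx < w:
                total += abs(ref[y][x] - tgt[ty][tx])
                count += 1
    return total / count if count else float("inf")


def estimate_displacement(ref, tgt, max_shift):
    """Return the integer shift (dx, dy) with the smallest SAD cost."""
    best = None
    for dy in range(-max_shift, max_shift + 1):
        for dx in range(-max_shift, max_shift + 1):
            cost = sad(ref, tgt, dx, dy)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2]
```

Given a displacement in pixels, a velocity estimate would follow by scaling with the ground sampling distance and dividing by the time between the two frames; both quantities depend on the camera and flight setup and are not given in the abstract.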

A Modified HE Technique to Enhance Image Contrast for Scaled Image on Small-sized Mobile Display (휴대단말기용 소형 디스플레이의 영상 컨트라스트 향상을 위한 변형된 HE 기법 연구)

  • Chung, Jin-Young;Hossen, Monir;Jeong, Kyung-Hoon;Kang, Dong-Wook;Kim, Ki-Doo
    • Proceedings of the KIEE Conference / 2008.10b / pp.137-138 / 2008
  • This paper proposes a modified image contrast enhancement technique for the small-sized displays of mobile handsets. The sample images are user-interface images scaled up from qVGA ($320{\times}240$) to wVGA ($800{\times}480$), as commonly seen on mobile handsets. Because the display of a mobile handset is relatively small, the goal of this paper is to simplify the image contrast enhancement algorithm, based on the conventional HE (Histogram Equalization) algorithm, and to improve computational efficiency so as to minimize power consumption in a real hardware IC. We adopt the HE technique, which is classical and widely used for image contrast enhancement. First, the input frame image is partitioned into temporal sub-frames, and the gray-level histogram of each sub-frame is analyzed. If the analyzed histogram of a sub-frame deviates substantially from the reference level (meaning that the sub-frame consists mostly of very bright or very dark components), the DHE (Dynamic Histogram Equalization) algorithm is applied. Otherwise, the classical Histogram Linearization (or Global HE) algorithm is applied. We also compare the HE technique with the gamma LUT (Look-Up Table) method, which is known as the simplest technique for enhancing image contrast.
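For orientation, the two baseline techniques named in this abstract, global histogram equalization and a gamma look-up table, can be sketched as below. This is a textbook formulation under the assumption of 8-bit grayscale images stored as lists of rows; it does not reproduce the paper's sub-frame partitioning or its DHE variant, and the function names are hypothetical.

```python
def equalize(image, levels=256):
    """Global histogram equalization via the cumulative histogram (CDF)."""
    hist = [0] * levels
    for row in image:
        for v in row:
            hist[v] += 1

    cdf, running = [], 0
    for count in hist:
        running += count
        cdf.append(running)

    total = cdf[-1]
    cdf_min = next(c for c in cdf if c > 0)
    if total == cdf_min:
        # Single gray level: nothing to spread out, return a copy.
        return [row[:] for row in image]

    # Map each gray level so that the occupied levels span 0..levels-1.
    lut = [
        round(max(0, c - cdf_min) * (levels - 1) / (total - cdf_min))
        for c in cdf
    ]
    return [[lut[v] for v in row] for row in image]


def gamma_lut(gamma, levels=256):
    """Gamma-correction table: out = max * (in / max) ** (1 / gamma)."""
    top = levels - 1
    return [round(top * (i / top) ** (1.0 / gamma)) for i in range(levels)]
```

For example, `equalize([[10, 20], [30, 40]])` spreads the four occupied gray levels evenly over the full range. The gamma table is cheaper because it is fixed per `gamma` and needs no per-image histogram, which is presumably why the abstract calls it the simplest method.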

A New Hardware Architecture of High-Speed Motion Estimator for H.264 Video CODEC (H.264 비디오 코덱을 위한 고속 움직임 예측기의 하드웨어 구조)

  • Lim, Jeong-Hun;Seo, Young-Ho;Choi, Hyun-Jun;Kim, Dong-Wook
    • Journal of Broadcast Engineering / v.16 no.2 / pp.293-304 / 2011
  • In this paper, we propose a new hardware architecture for motion estimation (ME), the most time-consuming unit among the H.264 algorithms, and design it as an intellectual property (IP) core. The proposed ME hardware consists of a buffer, a processing unit (PU) array, a SAD (sum of absolute differences) selector, and a motion vector (MV) generator. The PU array is composed of 16 PUs, and each PU consists of 16 processing elements (PEs). The main characteristics of the proposed hardware are that the current and reference frames are reused to reduce the number of accesses to external memory and that no clock cycles are lost during the SAD operation. The implemented ME hardware occupies 3% of the hardware resources of the Stratix III EP3SE80F1152C2, an FPGA from Altera Inc., and can operate at up to 446.43 MHz. It can therefore process up to 50 1080p frames per second.
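As a rough consistency check on the throughput figures in this abstract, the clock budget per macroblock can be worked out as below. The abstract gives only the clock rate and the frame rate; the 16x16 macroblock size and the padding of 1080 lines up to a multiple of 16 are assumptions based on common H.264 practice, and the function name is hypothetical.

```python
import math


def cycles_per_macroblock(clock_hz, fps, width=1920, height=1080, mb=16):
    """Clock cycles available per macroblock at the given frame rate."""
    mbs_per_row = math.ceil(width / mb)    # 120 for a 1920-pixel width
    mb_rows = math.ceil(height / mb)       # 68, since 1080 is padded to 1088
    mbs_per_second = mbs_per_row * mb_rows * fps
    return clock_hz / mbs_per_second


budget = cycles_per_macroblock(446.43e6, 50)
print(f"{budget:.1f} cycles per macroblock")
```

Under these assumptions the design has a little under 1,100 cycles to finish motion estimation for each macroblock, which is the budget the 16 x 16 processing-element array has to meet.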

Efficient QP-per-frame Assignment Method for Low-delay HEVC Encoder (저지연 HEVC 부호화기를 위한 효율적인 프레임별 양자화 파라미터 할당 방법)

  • Park, Sang-hyo;Jang, Euee S.
    • Journal of Broadcast Engineering / v.21 no.3 / pp.349-356 / 2016
  • In this paper, we propose an efficient method that assigns the quantization parameter (QP) in accordance with the group of pictures (GOP) structure given in an HEVC encoder. Each video frame can have a different QP value depending on the GOP configuration used for HEVC encoding. In particular, low QP values can be assigned to important frames, and high QP values to less important ones. However, efficient QP assignment has not been thoroughly investigated so far. Even the HEVC reference software encoder employs only a monotonic QP assignment method. The proposed method therefore assigns adaptive QP values to each GOP so that the temporal dynamic activity between GOPs can be exploited. In the experiments, the proposed method showed a 7.3% gain in compression performance in terms of BD-rate compared to the HEVC test model (HM) in the low-delay configuration, and it outperformed the existing QP assignment study on average.

Adaptive Frame Rate Up-Conversion Algorithm using the Neighbouring Pixel Information and Bilateral Motion Estimation (이웃하는 블록 정보와 양방향 움직임 예측을 이용한 적응적 프레임 보간 기법)

  • Oh, Hyeong-Chul;Lee, Joo-Hyun;Min, Chang-Ki;Jeong, Je-Chang
    • The Journal of Korean Institute of Communications and Information Sciences / v.35 no.9C / pp.761-770 / 2010
  • In this paper, we propose a new Frame Rate Up-Conversion (FRUC) scheme that increases the frame rate and enhances the decoded video quality at the decoder. The proposed algorithm uses preliminary frames in the forward and backward directions obtained by bilateral prediction. While processing the preliminary frames, an additional interpolation is performed for the occlusion area: if the value calculated for a block against the reference frame is larger than a predetermined threshold, the block is selected as part of the occlusion area. To interpolate the occlusion area, we perform a re-search to obtain the optimal block, considering the number of available neighboring blocks. The experimental results show that the proposed algorithm achieves better PSNR and visual quality than the conventional methods.

VLSI architecture design of CAVLC entropy encoder/decoder for H.264/AVC (H.264/AVC를 위한 CAVLC 엔트로피 부/복호화기의 VLSI 설계)

  • Lee Dae-joon;Jeong Yong-jin
    • The Journal of Korean Institute of Communications and Information Sciences / v.30 no.5C / pp.371-381 / 2005
  • In this paper, we propose an advanced hardware architecture for a CAVLC entropy encoder/decoder engine for real-time video compression. CAVLC (Context-based Adaptive Variable Length Coding) is a lossless compression method in H.264/AVC; it has high compression efficiency but also high computational complexity. The reference memory size is optimized using a partitioned storing method and a memory reuse method, both based on the partiality of memory referencing. Among several encoder/decoder architectures, we choose the one most suitable for mobile devices and improve its performance using parallel processing. The proposed architecture has been verified on an ARM-interfaced emulation board using Altera Excalibur and synthesized in Samsung 0.18 µm CMOS technology. The synthesis result shows that the encoder can process about 300 CIF frames/s at 150 MHz and the decoder about 250 CIF frames/s at 140 MHz. The hardware architectures are being used as core modules in implementing a complete H.264/AVC video encoder/decoder chip for real-time multimedia applications.

Area Changes in the Administrative Boundary Map of Korea by National Geodetic Reference Frames (세계측지계 전환에 따른 우리나라 행정구역도상 면적 변화)

  • Bae, Tae-Suk;Kim, Jeong-Hee;Yoon, Jong-Seong;Jeong, Jae-Joon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.30 no.3 / pp.241-247 / 2012
  • The national geodetic reference frame of Korea switched to the International Terrestrial Reference Frame (ITRF) in 2003. To study the resulting changes in land area, we calculated the entire land area of Korea using the administrative boundaries of the census data provided by Statistics Korea. The standard transformation procedure of the National Geographic Information Institute (NGII) was followed. The Transverse Mercator (TM) projected coordinates were transformed into the GRS80-based world geodetic reference frame, and the ellipsoidal and projected areas were calculated. Provinces that span two projection origins were divided into two polygons and projected using the appropriate origins. After the transformation, all boundaries were shifted in the northwestern direction, resulting in an area decrease of $1.36\,\mathrm{km}^2$ (about 0.0013%) on the projected plane. Moving the boundaries to a higher latitude cancels out the effect of the enlarged ellipsoid. In addition, the rate of change shows that a higher-latitude province is more sensitive to the shift of the boundaries. The data from Statistics Korea differ significantly from those of the Ministry of Land, Transport and Maritime Affairs (MLTM), so it is strongly recommended that the data be integrated and unified.

A Synchronized Playback Method of 3D Model and Video by Extracting Golf Swing Information from Golf Video (골프 동영상으로부터 추출된 스윙 정보를 활용한 3D 모델과 골프 동영상의 동기화 재생)

  • Oh, Hwang-Seok
    • Journal of the Korean Society for Computer Game / v.31 no.4 / pp.61-70 / 2018
  • In this paper, we propose a method for synchronized playback of a 3D reference model and video by extracting golf swing information from a learner's golf video, so that each motion can be compared and analyzed precisely at each position and time in the golf swing, and we present the implementation result. To synchronize the 3D model with the learner's swing video, the learner's golf swing is first filmed, and relative time information is extracted from the video according to the position of the golf club from the address posture to the finishing posture. The 3D reference model is built by rigging a pro golfer's swing motion, captured at 120 frames per second with high-quality motion capture equipment, into a 3D model. By applying the time information from the learner's swing video to this reference model and synchronizing the two, the learner can correct or learn his or her posture by comparing it precisely with the reference model at each position of the golf swing. Synchronized playback can be used to improve the functionality of systems that rely on manual adjustment for comparing and analyzing the reference model and the learner's golf swing. Except for the part where image processing is applied to detect each position of the golf posture, the method of automatically extracting the time information of each position from the video and of synchronized playback is expected to be extensible to general recreational sports.

TLS (Total Least-Squares) within Gauss-Helmert Model: 3D Planar Fitting and Helmert Transformation of Geodetic Reference Frames (가우스-헬머트 모델 전최소제곱: 평면방정식과 측지좌표계 변환)

  • Bae, Tae-Suk;Hong, Chang-Ki;Lim, Soo-Hyeon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.40 no.4 / pp.315-324 / 2022
  • The conventional LESS (LEast-Squares Solution) is calculated under the assumption that there are no errors in the independent variables. However, the coordinates of a point, whether obtained from traditional ground surveying (slant distances, horizontal and/or vertical angles) or from GNSS (Global Navigation Satellite System) positioning, cannot be determined independently, and the components are correlated with each other. Therefore, the TLS (Total Least Squares) adjustment should be applied in all applications related to coordinates. Many approaches have been suggested to solve this problem, resulting in equivalent solutions apart from some restrictions. In this study, we calculated the normal vector of the 3D plane determined by the trace of the VLBI targets based on TLS within the GHM (Gauss-Helmert Model). Another numerical test was conducted for the estimation of the Helmert transformation parameters. Since the errors in the horizontal components are very small compared to the radius of the circle, the final estimates are almost identical. However, the estimated variance components are significantly reduced and show different characteristics depending on the target location. The Helmert transformation parameters are estimated more precisely than in the conventional LESS case. Furthermore, the residuals can be predicted in both reference frames with a much smaller magnitude (in the absolute sense).
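To illustrate what a total-least-squares plane fit means, the sketch below minimizes orthogonal distances from the points to the plane, treating all three coordinates as observed with error. It is not the Gauss-Helmert adjustment used in the paper: it assumes equal, uncorrelated errors in x, y and z, in which case the plane normal is the eigenvector of the scatter matrix with the smallest eigenvalue. The function name, the power iteration on a shifted matrix, and the fixed start vector are choices made for this example.

```python
import math


def fit_plane_tls(points, iterations=500):
    """Orthogonal-distance plane fit; returns (centroid, unit normal)."""
    n = len(points)
    centroid = [sum(p[i] for p in points) / n for i in range(3)]

    # 3x3 scatter matrix of the centered points.
    c = [[0.0] * 3 for _ in range(3)]
    for p in points:
        d = [p[i] - centroid[i] for i in range(3)]
        for i in range(3):
            for j in range(3):
                c[i][j] += d[i] * d[j]

    # The dominant eigenvector of M = trace(C) * I - C is the eigenvector
    # of C with the smallest eigenvalue, i.e. the plane normal.
    trace = c[0][0] + c[1][1] + c[2][2]
    m = [[(trace if i == j else 0.0) - c[i][j] for j in range(3)]
         for i in range(3)]

    # Assumption: the start vector is not orthogonal to the true normal.
    v = [1.0 / math.sqrt(3.0)] * 3
    for _ in range(iterations):
        w = [sum(m[i][j] * v[j] for j in range(3)) for i in range(3)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            break
        v = [x / norm for x in w]
    return centroid, v
```

An ordinary least-squares fit of z on x and y would instead minimize only the vertical residuals, which corresponds to the assumption, criticized in the abstract, that the independent variables are error-free.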

Spatio-temporal Mode Selection Methods of Fast H.264 Using Multiple Reference Frames (다중 참조 영상을 이용한 고속 H.264의 움직임 예측 모드 선택 기법)

  • Kwon, Jae-Hyun;Kang, Min-Jung;Ryu, Chul
    • The Journal of Korean Institute of Communications and Information Sciences / v.33 no.3C / pp.247-254 / 2008
  • H.264 provides better coding efficiency than earlier video coding standards such as H.263 and MPEG-4, owing to the use of multiple reference frames for variable block size motion estimation, quarter-pixel motion estimation and compensation, the $4{\times}4$ integer DCT, rate-distortion optimization, and other tools. However, the many modules used to increase its performance also increase the complexity of H.264, so fast algorithms are needed as a practical approach. In this paper, among many possible approaches, we propose a fast mode decision algorithm that skips variable block size motion estimation and spatial-predictive coding, which account for most of the encoder complexity. This approach takes advantage of the temporal and spatial properties of fast mode selection techniques. Experimental results demonstrate that the proposed approach can reduce encoding time by up to 65% compared with the H.264 standard while maintaining visual quality.