• Title/Summary/Keyword: 복잡한 영상

Search Result 1,870, Processing Time 0.034 seconds

A Temporal Error Concealment Technique Using The Adaptive Boundary Matching Algorithm (적응적 경계 정합을 이용한 시간적 에러 은닉 기법)

  • 김원기;이두수;정제창
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.5C
    • /
    • pp.683-691
    • /
    • 2004
  • To transmit MPEG-2 video on an errorneous channel, a number of error control techniques are needed. Especially, error concealment techniques which can be implemented on receivers independent of transmitters are essential to obtain good video quality. In this paper, prediction of motion vector and an adaptive boundary matching algorithm are presented for temporal error concealment. Before the complex BMA, we perform error concealment by a motion vector prediction using neighboring motion vectors. If the candidate of error concealment is not satisfied, search range and reliable boundary pixels are selected by the temporal activity or motion vectors and a damaged macroblock is concealed by applying an adaptive BMA. This error concealment technique reduces the complexity and maintains a PSNR gain of 0.3∼0.7㏈ compared to conventional BMA.

Study on Fast HEVC Encoding with Hierarchical Motion Vector Clustering (움직임 벡터의 계층적 군집화를 통한 HEVC 고속 부호화 연구)

  • Lim, Jeongyun;Ahn, Yong-Jo;Sim, Donggyu
    • Journal of Broadcast Engineering
    • /
    • v.21 no.4
    • /
    • pp.578-591
    • /
    • 2016
  • In this paper, the fast encoding algorithm in High Efficiency Video Coding (HEVC) encoder was studied. For the encoding efficiency, the current HEVC reference software is divided the input image into Coding Tree Unit (CTU). then, it should be re-divided into CU up to maximum depth in form of quad-tree for RDO (Rate-Distortion Optimization) in encoding precess. But, it is one of the reason why complexity is high in the encoding precess. In this paper, to reduce the high complexity in the encoding process, it proposed the method by determining the maximum depth of the CU using a hierarchical clustering at the pre-processing. The hierarchical clustering results represented an average combination of motion vectors (MV) on neighboring blocks. Experimental results showed that the proposed method could achieve an average of 16% time saving with minimal BD-rate loss at 1080p video resolution. When combined the previous fast algorithm, the proposed method could achieve an average 45.13% time saving with 1.84% BD-rate loss.

Lower Tail Light Learning-based Forward Vehicle Detection System Irrelevant to the Vehicle Types (후미등 하단 학습기반의 차종에 무관한 전방 차량 검출 시스템)

  • Ki, Minsong;Kwak, Sooyeong;Byun, Hyeran
    • Journal of Broadcast Engineering
    • /
    • v.21 no.4
    • /
    • pp.609-620
    • /
    • 2016
  • Recently, there are active studies on a forward collision warning system to prevent the accidents and improve convenience of drivers. For collision evasion, the vehicle detection system is required. In general, existing learning-based vehicle detection methods use the entire appearance of the vehicles from rear-view images, so that each vehicle types should be learned separately since they have distinct rear-view appearance regarding the types. To overcome such shortcoming, we learn Haar-like features from the lower part of the vehicles which contain tail lights to detect vehicles leveraging the fact that the lower part is consistent regardless of vehicle types. As a verification procedure, we detect tail lights to distinguish actual vehicles and non-vehicles. If candidates are too small to detect the tail lights, we use HOG(Histogram Of Gradient) feature and SVM(Support Vector Machine) classifier to reduce false alarms. The proposed forward vehicle detection method shows accuracy of 95% even in the complicated images with many buildings by the road, regardless of vehicle types.

Design and Implementation of 3D Urban Landscape Simulation System Based on Web GIS (Web GIS 기반의 3차원 도시경관 시뮬레이션시스템 설계 및 구현)

  • Jang Mun-Hyun
    • Spatial Information Research
    • /
    • v.13 no.1 s.32
    • /
    • pp.103-117
    • /
    • 2005
  • Information can be visualized as information technology and computers have recently made dramatic improvements. Thanks to the brisk effects of providing information in 3D on the web with the technological developments of Web 3D and virtual reality implementation in the Internet, there has been the trend to present geographic information in images and 3D. In this study, the Web 3D GIS technique was adopted in analyzing the current state of urban landscape and examining and predicting the urban landscape affected by new urban development businesses. As a result, it was expected that the study would be able to help the government agencies to reduce their errors in making urban landscape plans and policies and implementing them to provide objective and visual data, and to come up with a simulation system that could be used to reflect the residents opinions in plans and policies.

  • PDF

Fast mode decision by skipping variable block-based motion estimation and spatial predictive coding in H.264 (H.264의 가변 블록 크기 움직임 추정 및 공간 예측 부호화 생략에 의한 고속 모드 결정법)

  • 한기훈;이영렬
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.40 no.5
    • /
    • pp.417-425
    • /
    • 2003
  • H.264, which is the latest video coding standard of both ITU-T(International Telecommunication Union-Telecommunication standardization sector) and MPEG(Moving Picture Experts Group), adopts new video coding tools such as variable block size motion estimation, multiple reference frames, quarter-pel motion estimation/compensation(ME/MC), 4${\times}$4 Integer DCT(Discrete Cosine Transform), and Rate-Distortion Optimization, etc. These new video coding tools provide good coding of efficiency compared with existing video coding standards as H.263, MPEG-4, etc. However, these new coding tools require the increase of encoder complexity. Therefore, in order to apply H.264 to many real applications, fast algorithms are required for H.264 coding tools. In this paper, when encoder MacroBlock(MB) mode is decided by rate-distortion optimization tool, fast mode decision algorithm by skipping variable block size ME/MC and spatial-predictive coding, which occupies most encoder complexity, is proposed. In terms of computational complexity, the proposed method runs about 4 times as far as JM(Joint Model) 42 encoder of H.264, while the PSNR(peak signal-to-noise ratio)s of the decoded images are maintained.

Automatic Extraction of the Facial Feature Points Using Moving Color (색상 움직임을 이용한 얼굴 특징점 자동 추출)

  • Kim, Nam-Ho;Kim, Hyoung-Gon;Ko, Sung-Jea
    • Journal of the Korean Institute of Telematics and Electronics S
    • /
    • v.35S no.8
    • /
    • pp.55-67
    • /
    • 1998
  • This paper presents an automatic facial feature point extraction algorithm in sequential color images. To extract facial region in the video sequence, a moving color detection technique is proposed that emphasize moving skin color region by applying motion detection algorithm on the skin-color transformed images. The threshold value for the pixel difference detection is also decided according to the transformed pixel value that represents the probability of the desired color information. Eye candidate regions are selected using both of the black/white color information inside the skin-color region and the valley information of the moving skin region detected using morphological operators. Eye region is finally decided by the geometrical relationship of the eyes and color histogram. To decide the exact feature points, the PCA(Principal Component Analysis) is used on each eye and mouth regions. Experimental results show that the feature points of eye and mouth can be obtained correctly irrespective of background, direction and size of face.

  • PDF

Seismic Reflection Tomography by Cell Parameterization (셀 매개변수에 의한 탄성파 반사주시 토모그래피)

  • Seo, Young-Tak;Shin, Chang-Soo;Ko, Seung-Won
    • Geophysics and Geophysical Exploration
    • /
    • v.6 no.2
    • /
    • pp.95-100
    • /
    • 2003
  • In this study, we developed reflection tomography inversion algorithm using Straight Ray Technique (SRT) which can calculate travel time easily and fast for complex geological structure. The inversion process begins by setting the initial velocity model as a constant velocity model that hat only impedance boundaries. The inversion process searches a layer-interface structure model that is able to explain the given data satisfactorily by inverting to minimize data misfit. For getting optimal solution, we used Gauss-Newton method that needed constructing the approximate Hessian matrix. We also applied the Marquart-Levenberg regularization method to this inversion process to prevent solution diverging. The ability of the method to resolve typical target structures was tested in a synthetic salt dome inversion. Using the inverted velocity model, we obtained the migration image close to that of the true velocity model.

A Diamond Web-grid Search Algorithm Combined with Efficient Stationary Block Skip Method for H.264/AVC Motion Estimation (H.264/AVC 움직임 추정을 위한 효율적인 정적 블록 스킵 방법과 결합된 다이아몬드 웹 격자 탐색 알고리즘)

  • Jeong, Chang-Uk;Choi, Jin-Ku;Ikenaga, Takeshi;Goto, Satoshi
    • Journal of Internet Computing and Services
    • /
    • v.11 no.2
    • /
    • pp.49-60
    • /
    • 2010
  • H.264/AVC offers a better encoding efficiency than conventional video standards by adopting many new encoding techniques. However, the advanced coding techniques also add to the overall complexity for H.264/AVC encoder. Accordingly, it is necessary to perform optimization to alleviate the level of complexity for the video encoder. The amount of computation for motion estimation is of particular importance. In this paper, we propose a diamond web-grid search algorithm combined with efficient stationary block skip method which employs full diamond and dodecagon search patterns, and the variable thresholds are used for performing an effective skip of stationary blocks. The experimental results indicate that the proposed technique reduces the computations of the unsymmetrical-cross multi-hexagon-grid search algorithm by up to 12% while maintaining a similar PSNR performance.

Context Aware Feature Selection Model for Salient Feature Detection from Mobile Video Devices (모바일 비디오기기 위에서의 중요한 객체탐색을 위한 문맥인식 특성벡터 선택 모델)

  • Lee, Jaeho;Shin, Hyunkyung
    • Journal of Internet Computing and Services
    • /
    • v.15 no.6
    • /
    • pp.117-124
    • /
    • 2014
  • Cluttered background is a major obstacle in developing salient object detection and tracking system for mobile device captured natural scene video frames. In this paper we propose a context aware feature vector selection model to provide an efficient noise filtering by machine learning based classifiers. Since the context awareness for feature selection is achieved by searching nearest neighborhoods, known as NP hard problem, we apply a fast approximation method with complexity analysis in details. Separability enhancement in feature vector space by adding the context aware feature subsets is studied rigorously using principal component analysis (PCA). Overall performance enhancement is quantified by the statistical measures in terms of the various machine learning models including MLP, SVM, Naïve Bayesian, CART. Summary of computational costs and performance enhancement is also presented.

High Compression Image Coding with BTC Parameters (BTC 파라메타를 이용한 고압축 영상부호화)

  • Shim, Young-Serk;Lee, Hark-Jun
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.26 no.2
    • /
    • pp.140-146
    • /
    • 1989
  • An efficient quantization and encoding of BTC (Block Truncation Coding) parameters {($Y_{\alpha},\;Y_{\beta}),\;P_{{\beta}/{\beta}}$} are investigated, In our algorithm 4${\times}$4 blocks are classified into flat or edge block. While edge block is represented by two approximation level $Y_{\alpha},\;Y_{\beta}$ with label plane $P_{{\beta}/{\beta}}$, flat block is represented by single approximation level Y. The approximation levels Y, $Y_{\alpha}$ and $Y_{\beta}$ are encoded by predictive quatization specially designed, and the label plane $P_{{\beta}/{\beta}}$ is tried to be encoded using stored 32 reference plantes. The performance of the proposed scheme has appeared comparable to much more complex transform coding in terms of SNR, although it requires more study on the representation of small slope in background.

  • PDF