• Title/Summary/Keyword: 부호화 단위

Search Result 269, Processing Time 0.027 seconds

The Efficient Coding Tools based 3-Dimensional Transform in H.264/AVC (H.264/AVC에서 3차원 변환에 기반을 둔 효율적인 동영상 압축 방법)

  • Jo, Jae-Kyu;Cho, Hye-Jeong;Lee, Jin-Ho;Jeong, Se-Yoon;Ahn, Chang-Beom;Oh, Seoung-Jun
    • Journal of Broadcast Engineering
    • /
    • v.15 no.3
    • /
    • pp.434-453
    • /
    • 2010
  • In this paper, we propose 3DTE(3-Dimensional Transform Environment) that is based on 3DT(Dimensional Transform) that performs 2-dimensional integer DCT(Discrete Cosine Transform) based on $4{\times}4$ block and 1-dimensional integer DCT based on $4{\times}1$ block after collecting same frequency coefficients in neighboring $4{\times}4$ block and supports it's additional coding tools for high performance. The transform of 3DT can keep prediction error by using $4{\times}4$ block and reduce spatial redundancy additionally. The proposed 3DTE can provide coding tools to improve the coding efficiency with using 3DT. The performance of 3DTE compared to JM11.0 is average 3.58% and 5.40% bit savings for all test sequences and HD sequences, respectively, with keeping subjective video quality in High profile.

A Composition of Mosaic Images based on MPEG Compressed Information (MPEG 압축 정보를 이용한 모자이크 구성)

  • 설정규;이승희;이준환
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.28 no.1C
    • /
    • pp.47-55
    • /
    • 2003
  • This paper proposes a composition method of mosaic image from the compressed MPEG-2 video stream, in which the displacement between successive frames according to the camera operation is estimated directly from the information involved in the stream. In the proposed method. the approximated optical flow is constructed from motion vectors of macro blocks, and it is used to determine the parameters of the displacements according to the camera operation associated with pan and tilt. The extracted parameters are used to determine the geometric transform of successive video frames in order to construct a mosaic image. The construction of mosaic uses several blending techniques including the one proposed by Nichols in which an analytic weight is used to determine pixel values. Through the experiment, the blending technique based on analytic weights was superior to the others such as averaging and median-based techniques. It provided more smooth changes in background and made use of instantaneous frame information to construct a mosaic. The mosaic in the paper puts the emphasis on the reduction of computation because it is constructed from the motion vectors included in the compressed video without decoding and recalculating exact optical flows. The constructed mosaic can be used in the retrieval of the compressed video as the representative frame of a shot.

A Still Image Compression System with a High Quality Text Compression Capability (고 품질 텍스트 압축 기능을 지원하는 정지영상 압축 시스템)

  • Lee, Je-Myung;Lee, Ho-Suk
    • Journal of KIISE:Software and Applications
    • /
    • v.34 no.3
    • /
    • pp.275-302
    • /
    • 2007
  • We propose a novel still image compression system which supports a high quality text compression function. The system segments the text from the image and compresses the text with a high quality. The system shows 48:1 high compression ratio using context-based adaptive binary arithmetic coding. The arithmetic coding performs the high compression by the codeblocks in the bitplane. The input of the system consists of a segmentation mode and a ROI(Region Of Interest) mode. In segmentation mode, the input image is segmented into a foreground consisting of text and a background consisting of the remaining region. In ROI mode, the input image is represented by the region of interest window. The high quality text compression function with a high compression ratio shows that the proposed system can be comparable with the JPEG2000 products. This system also uses gray coding to improve the compression ratio.

Implementation of UEP using Turbo Codes and EREC Algorithm for Video Transmission (동영상 전송을 위하여 터보코드와 EREC알고리즘을 이용한 UEP설계)

  • 심우성;허도근
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.25 no.7A
    • /
    • pp.994-1004
    • /
    • 2000
  • In this paper, bitstreams are composed of using H.263 for a moving picture coding in the band-limited and error-prone environment such as wireless environment. EREC sub-frames are implemented by applying the proposed EREC algorithm in order to be UEP for the real data parts of implemented bitstreams. Because those are able to do resynchronization with a block unit, propagation of the error can be minimized, and the position of the important bits such as INTRADC and MVD can be known. Class is separated using the position of these important bits, and variable puncturing tables are designed by the class informations and the code rates of turbo codes are differently designed in according to the class. Channel coding used the turbo codes, and an interleaver to be designed in the turbo codes does not eliminate redundancy bits of the important bits in applying variable code rates of EREC sub-frames unit and is always the same at the transmitter and the receiver although being variable frame size. As a result of simulation, UEP with the code rate similar to EEP is obtained a improved result in the side of bit error probability. And the result of applying it to image knows that the subjective and objective quality have been improved by the protection of important bits.

  • PDF

Fundamental Frequency Estimation of Voiced Speech Signals Based on the Inflection Point Detection (변곡점 검출에 기반한 음성의 기본 주파수 추정)

  • Byeonggwan Iem
    • Journal of IKEEE
    • /
    • v.27 no.4
    • /
    • pp.472-476
    • /
    • 2023
  • Fundamental frequency/pitch period are major characteristics of speech signals. They are used in many speech applications like speech coding, speech recognition, speaker identification, and so on. In this paper, some of inflection points are used to estimate the pitch which is the inverse of the fundamental frequency. The inflection points are defined as points where local maxima, local minima or the slope changes occur. The speech signal is preprocessed to remove unnecessary inflection points due to the high frequency components using a low pass filter. Only the inflection points from local maxima are used to get the pitch period. While the existing pitch estimation methods process speech signals in blockwise, the proposed method detects the inflection points in sample and produces the pitch period/fundamental frequency estimates along the time. Computer simulation shows the usefulness of the proposed method as a fundamental frequency estimator.

A Study on Real Time Pitch Alteration of Speech Signal (음성신호의 실시간 피치변경에 관한 연구)

  • 김종국;박형빈;배명진
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.1
    • /
    • pp.82-89
    • /
    • 2004
  • This paper describes how to reduce the effect of an occupation threshold by that the transform of mixture components of HMM parameters is controlled in hierarchical tree structure to prevent from over-adaptation. To reduce correlations between data elements and to remove elements with less variance, we employ PCA (principal component analysis) and ICA (independent component analysis) that would give as good a representation as possible, and decline the effect of over-adaptation. When we set lower occupation threshold and increase the number of transformation function, ordinary WLLR adaptation algorithm represents lower recognition rate than SI models, whereas the proposed MLLR adaptation algorithm represents the improvement of over 2% for the word recognition rate as compared to performance of SI models.

Multi-modal Detection of Anchor Shot in News Video (다중모드 특징을 사용한 뉴스 동영상의 앵커 장면 검출 기법)

  • Yoo, Sung-Yul;Kang, Dong-Wook;Kim, Ki-Doo;Jung, Kyeong-Hoon
    • Journal of Broadcast Engineering
    • /
    • v.12 no.4
    • /
    • pp.311-320
    • /
    • 2007
  • In this paper, an efficient detection algorithm of an anchor shot in news video is presented. We observed the audio visual characteristics of news video and proposed several low level features which are appropriate for detecting an anchor shot in news video. The overall structure of the proposed algorithm is composed of 3 stages: the pause detection, the audio cluster classification, and the matching with motion activity stage. We used the audio features as well as the motion feature in order to improve the indexing accuracy and the simulation results show that the performance of the proposed algorithm is quite satisfactory.

H.264/AVC Fast Motion Estimation using Spatial and Temporal Correlation of Motion Vector (움직임 벡터의 시공간적 관계를 이용한 H.264/AVC 고속 움직임 예측 방법)

  • Moon, Ji-Hee;Ho, Yo-Sung
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2010.07a
    • /
    • pp.335-336
    • /
    • 2010
  • H.264/AVC 비디오 압축 표준은 압축 효율을 높이기 위해 다양한 크기의 블록을 사용하여 화면 사이의 움직임 예측을 수행한다. 세밀한 움직임 예측으로 인해 기존의 동영상 표준보다 압축 효율을 높일 수 있었지만, 복잡도도 증가하는 단점이 있다. 따라서, H.264/AVC의 고속 움직임 추정 기법은 필수적이다. H.264/AVC에서 사용하는 움직임 예측 방법은 고정된 탐색 영역 안에서 모든 정수 화소 단위로 최적의 움직임 벡터를 계산한다. 불필요한 정수 화소까지 움직임을 예측하므로 계산양이 증가한다. 본 논문에서는 움직임 벡터의 시간적 상관도와 공간적 상관도를 이용하여 가변적으로 탐색 영역의 크기를 조절하는 방법과 적응적인 초기 시작점 결정 방법을 제안했다. 현재 매크로블록과 참조 화면 사이의 거리를 고려하여 시간적 상관도와 공간적 상관도의 탐색 영역 비중을 가변적으로 조절했다. 또한 참조 화면과 현재 매크로블록 사이의 거리가 멀어질수록 초기 시작점의 정확도를 높이기 위해 초기 시작점을 예측 움직임 벡터와 이전 참조 화면에서 결정된 최적의 움직임 벡터의 평균으로 결정했다. 제안하는 방법은 기존의 전 영역 탐색 방법과 유사한 부호화 성능을 보이면서 움직임 예측 시간이 평균 53.98% 감소하는 것을 확인할 수 있다.

  • PDF

Reliability Evaluation Method Based on Spatio-Temporal Statistical Characteristics for Motion Compensated Interpolated Frame (움직임 보상 보간 프레임에 대한 시공간적 통계특성에 기초한 블록기반의 신뢰도 평가 방법)

  • Kim, Jin-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.5
    • /
    • pp.28-36
    • /
    • 2013
  • Motion-compensated frame interpolation (MCFI) techniques in video signal processing have many application areas. Frame rate up-conversion (FRUC) or distributed video coding (DVC) technique needs an effective MCFI algorithm. For these applications, it is necessary to develop an effective post-processing technique to improve visual qualities or to reduce virtual channel noises, resulting in the reduced channel bit rate. This paper proposes a reliability evaluation method based on spatio-temporal characteristics for motion-compensated interpolated blocks. The proposed algorithm investigates the temporal matching characteristics for current frame and then is designed in such a way that it can measure temporal characteristics as well as the spatial ones. Through computer simulations, it is shown that the proposed method outperforms the conventional temporal matching method.

Efficient Design of Structured LDPC Codes (구조적 LDPC 부호의 효율적인 설계)

  • Chung Bi-Woong;Kim Joon-Sung;Song Hong-Yeop
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.1C
    • /
    • pp.14-19
    • /
    • 2006
  • The high encoding complexity of LDPC codes can be solved by designing structured parity-check matrix. If the parity-check matrix of LDPC codes is composed of same type of blocks, decoder implementation can be simple, this structure allow structured decoding and required memory for storing the parity-check matrix can be reduced largely. In this parer, we propose a construction algorithm for short block length structured LDPC codes based on girth condition, PEG algorithm and variable node connectivity. The code designed by this algorithm shows similar performance to other codes without structured constraint in low SNR and better performance in high SNR than those by simulation