• Title/Summary/Keyword: DCT Coefficients

Search Result 295, Processing Time 0.024 seconds

Selective Quantization Based on Band Property for Wideband Signal Codec (광대역 신호 압축기를 위한 주파수 대역 특성에 선택적인 양자화 방법)

  • 송재종;박호종;김무영;김도석;김정수
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.7
    • /
    • pp.76-82
    • /
    • 2001
  • In this paper, a novel quantization method for wideband signal codec with 7 kHz bandwidth is proposed. In the transform-based wideband signal codecs, the signal is transformed to frequency domain and the spectral coefficients in each frequency band are quantized based on human perceptual model, followed by Huffman coding. However, the property of each band varies with frequency, and the codec has poor performance when all bands are quantized with the same method. Therefore, a selective quantization method is proposed, which analyzes the band property and selects the quantization domain between frequency domain and time domain based on the quantization efficiency. It is confirmed that the proposed method has better performance than the quantizer of G722.1 codec.

  • PDF

Multiple Description Coding using Whitening Ttansform

  • Park, Kwang-Pyo;Lee, Keun-Young
    • Proceedings of the IEEK Conference
    • /
    • 2002.07b
    • /
    • pp.1003-1006
    • /
    • 2002
  • In the communications systems with diversity, we are commonly faced on needing of new source coding technique, error resilient coding. The error resilient coding addresses the coding algorithm that has the robustness to unreliability of communications channel. In recent years, many error resilient coding techniques were proposed such as data partitioning, resynchronization, error detection, concealment, reference picture selection and multiple description coding (MDC). Especially, the MDC using correlating transform explicitly adds correlation between two descriptions to enable the estimation of one set from the other. However, in the conventional correlating transform method, there is a critical problem that decoder must know statistics of original image. In this paper, we propose an enhanced method, the MDC using whitening transform that is not necessary additional statistical information to decode image because the DCT coefficients to apply whitening transform to an image have uni-variance statistics. Our experimental results show that the proposed method achieves a good trade-off between the coding efficiency and the reconstruction quality. In the proposed method, the PSNR of images reconstructed from two descriptions is about 0.7dB higher than conventional method at the 1.0 BPP and from only one description is about 1,8dB higher at the same rate.

  • PDF

Quadtree Based Infrared Image Compression in Wavelet Transform Domain (웨이브렛 변환 영역에서 쿼드트리 기반 적외선 영상 압축)

  • 조창호;이상효
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.3C
    • /
    • pp.387-397
    • /
    • 2004
  • The wavelet transform providing both of the frequency and spatial information of an image is proved to be very much effective for the compression of images, and recently lot of studies on coding algorithms for images decomposed by the wavelet transform together with the multi-resolution theory are going on. This paper proposes a quadtree decomposition method of image compression applied to the images decomposed by wavelet transform by using the correlations between pixels and '0'data grouping. Since the coefficients obtained by the wavelet transform have high correlations between scales and high concentrations, the quadtree method can reduce the data quantity effectively. the experimental infrared image with 256${\times}$256 size and 8〔bit〕, was used to compare the performances of the existing and the proposed compression methods.

MASS ESTIMATION OF IMPACTING OBJECTS AGAINST A STRUCTURE USING AN ARTIFICIAL NEURAL NETWORK WITHOUT CONSIDERATION OF BACKGROUND NOISE

  • Shin, Sung-Hwan;Park, Jin-Ho;Yoon, Doo-Byung;Choi, Young-Chul
    • Nuclear Engineering and Technology
    • /
    • v.43 no.4
    • /
    • pp.343-354
    • /
    • 2011
  • It is critically important to identify unexpected loose parts in a nuclear reactor pressure vessel, since they may collide with and cause damage to internal structures. Mass estimation can provide key information regarding the kind as well as the location of loose parts. This study proposes a mass estimation method based on an artificial neural network (ANN), which can overcome several unresolved issues involved in other conventional methods. In the ANN model, input parameters are the discrete cosine transform (DCT) coefficients of the auto-power spectrum density (APSD) of the measured impact acceleration signal. The performance of the proposed method is then evaluated through application to a large-sized plate and a 1/8-scaled mockup of a reactor pressure vessel. The results are compared with those obtained using a conventional method, the frequency ratio (FR) method. It is shown that the proposed method is capable of estimating the impact mass with 30% lower relative error than the FR method, thus improving the estimation performance.

A Suboptimum Block Quantization in Image Transform Coding (영상 변환부호화에서의 준최적 블록양자화)

  • 심영석
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.22 no.6
    • /
    • pp.41-45
    • /
    • 1985
  • A suboptimum block quantization method is investigated for efficent transform coding. In our study the following method has appeared as suboptimum. At first, optimum bit allocation is done assuming the varances of the transform coefficients are known. Secndly, a varance estimation algorithm which results from the approximate equations governing the optimum vit alllocation is applied. The better performance of the proposed block quantization method has been confirmed by simulations based on varous pdf assumptions. the results indicate that the proposed method yields overall improvements of about 25% in NMSE for both the symmetric nonuniform and uniform quantizer at the coding rate of 1 bit/pel.

  • PDF

Tracking and Interpretation of Moving Object in MPEG-2 Compressed Domain (MPEG-2 압축 영역에서 움직이는 객체의 추적 및 해석)

  • Mun, Su-Jeong;Ryu, Woon-Young;Kim, Joon-Cheol;Lee, Joon-Hoan
    • The KIPS Transactions:PartB
    • /
    • v.11B no.1
    • /
    • pp.27-34
    • /
    • 2004
  • This paper proposes a method to trace and interpret a moving object based on the information which can be directly obtained from MPEG-2 compressed video stream without decoding process. In the proposed method, the motion flow is constructed from the motion vectors included in compressed video. We calculate the amount of pan, tilt, and zoom associated with camera operations using generalized Hough transform. The local object motion can be extracted from the motion flow after the compensation with the parameters related to the global camera motion. Initially, a moving object to be traced is designated by user via bounding box. After then automatic tracking Is performed based on the accumulated motion flows according to the area contributions. Also, in order to reduce the cumulative tracking error, the object area is reshaped in the first I-frame of a GOP by matching the DCT coefficients. The proposed method can improve the computation speed because the information can be directly obtained from the MPEG-2 compressed video, but the object boundary is limited by macro-blocks rather than pixels. Also, the proposed method is proper for approximate object tracking rather than accurate tracing of an object because of limited information available in the compressed video data.

Post-filtering in Low Bit Rate Moving Picture Coding, and Subjective and Objective Evaluation of Post-filtering (저 전송률 동화상 압축에서 후처리 방법 및 후처리 방법의 주관적 객관적 평가)

  • 이영렬;김윤수;박현욱
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.8B
    • /
    • pp.1518-1531
    • /
    • 1999
  • The reconstructed images from highly compressed MPEG or H.263 data have noticeable image degradations, such as blocking artifacts near the block boundaries, corner outliers at cross points of blocks, and ringing noise near image edges, because the MPEG or H.263 quantizes the transformed coefficients of 8$\times$8 pixel blocks. A post-processing algorithm has been proposed by authors to reduce quantization effects, such as blocking artifacts, corner outliers, and ringing noise, in MPEG-decompressed images. Our signal-adaptive post-processing algorithm reduces the quantization effects adaptively by using both spatial frequency and temporal information extracted from the compressed data. The blocking artifacts are reduced by one-dimensional (1-D) horizontal and vertical low pass filtering (LPF), and the ringing noise is reduced by two-dimensional (2-D) signal-adaptive filtering (SAF). A comparison study of the subjective quality evaluation using modified single stimulus method (MSSM), the objective quality evaluation (PSNR) and the computation complexity analysis between the signal-adaptive post-processing algorithm and the MPEG-4 VM (Verification Model) post-processing algorithm is performed by computer simulation with several MPEG-4 image sequences. According to the comparison study, the subjective image qualities of both algorithms are similar, whereas the PSNR and the comparison complexity analysis of the signal-adaptive post-processing algorithm shows better performance than the VM post-processing algorithm.

  • PDF

Bit-Rate Control Using Histogram Based Rate-Distortion Characteristics (히스토그램 기반의 비트율-왜곡 특성을 이용한 비트율 제어)

  • 홍성훈;유상조;박수열;김성대
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.9B
    • /
    • pp.1742-1754
    • /
    • 1999
  • In this paper, we propose a rate control scheme, using histogram based rate-distortion (R-D) estimation, which produces a consistent picture quality between consecutive frames. The histogram based R-D estimation used in our rate control scheme offers a closed-form mathematical model that enable us to predict the bits and the distortion generated from an encoded frame at a given quantization parameter (QP) and vice versa. The most attractive feature of the R-D estimation is low complexity of computing the R-D data because its major operation is just to obtain a histogram or weighted histogram of DCT coefficients from an input picture. Furthermore, it is accurate enough to be applied to the practical video coding. Therefore, the proposed rate control scheme using this R-D estimation model is appropriate for the applications requiring low delay and low complexity, and controls the output bit-rate ad quality accurately. Our rate control scheme ensures that the video buffer do not underflow and overflow by satisfying the buffer constraint and, additionally, prevents quality difference between consecutive frames from exceeding certain level by adopting the distortion constraint. In addition, a consistent considering the maximum tolerance BER of the voice service. Also in Rician fading channel of K=6 and K=10, considering CLP=$10^{-3}$ as a criterion, it is observed that the performance improment of about 3.5 dB and 1.5 dB is obtained, respectively, in terms of $E_b$/$N_o$ by employing the concatenated FEC code with pilot symbols.

  • PDF

Evaluating Correlation between Geometrical Relationship and Dose Difference Caused by Respiratory Motion Using Statistical Analysis

  • Shin, Dong-Seok;Kang, Seong-Hee;Kim, Dong-Su;Kim, Tae-Ho;Kim, Kyeong-Hyeon;Cho, Min-Seok;Noh, Yu-Yoon;Yoon, Do-Kun;Suh, Tae Suk
    • Progress in Medical Physics
    • /
    • v.27 no.4
    • /
    • pp.203-212
    • /
    • 2016
  • Dose differences between three-dimensional (3D) and four-dimensional (4D) doses could be varied according to the geometrical relationship between a planning target volume (PTV) and an organ at risk (OAR). The purpose of this study is to evaluate the correlation between the overlap volume histogram (OVH), which quantitatively shows the geometrical relationship between the PTV and OAR, and the dose differences. 4D computed tomography (4DCT) images were acquired for 10 liver cancer patients. Internal target volume-based treatment planning was performed. A 3D dose was calculated on a reference phase (end-exhalation). A 4D dose was accumulated using deformation vector fields between the reference and other phase images of 4DCT from deformable image registration, and dose differences between the 3D and 4D doses were calculated. An OVH between the PTV and selected OAR (duodenum) was calculated and quantified on the basis of specific overlap volumes that corresponded to 10%, 20%, 30%, 40%, and 50% of the OAR volume overlapped with the expanded PTV. Statistical analysis was performed to verify the correlation with the OVH and dose difference for the OAR. The minimum mean dose difference was 0.50 Gy from case 3, and the maximum mean dose difference was 4.96 Gy from case 2. The calculated range of the correlation coefficients between the OVH and dose difference was from -0.720 to -0.712, and the R-square range for regression analysis was from 0.506 to 0.518 (p-value <0.05). However, when the 10% overlap volume was applied in the six cases that had OVH value ${\leq}2$, the average percent mean dose differences were $34.80{\pm}12.42%$. Cases with quantified OVH values of 2 or more had mean dose differences of $29.16{\pm}11.36%$. In conclusion, no significant statistical correlation was found between the OVH and dose differences. However, it was confirmed that a higher difference between the 3D and 4D doses could occur in cases that have smaller OVH value.

A Fast Inter Mode Decision Algorithm Considering Quantization Parameter in H.264 (H.264 표준에서 양자화 계수를 고려한 고속 인터모드 결정 방법)

  • Kim, Geun-Yong;Ho, Yo-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.43 no.6 s.312
    • /
    • pp.11-19
    • /
    • 2006
  • The recent video coding standard H.264 employs the rate-distortion optimization (RDO) method for choosing the best coding mode; however, it causes a large amount of encoding time. Thus, in order to reduce the encoding time, we need a fast mode decision algorithm. In this paper, we propose a fast inter mode decision algorithm considering quantization parameter (QP). The occurrence of best modes depends on QP. In order to reflect these characteristics, we consider the coded block pattern (CBP) which has 0 value when all quantized discrete cosine transform (DCT) coefficients are zero. We also use the early SKIP mode decision and early $16{\times}16$ mode decision methods. By computer simulations, we have verified that the proposed algorithm requires less encoding time than the fast inter mode decision method of the H.264 reference software for the Baseline and Main profiles by 19.6% and 18.8%, respectively.