• Title/Summary/Keyword: dctA

Post-filtering in Low Bit Rate Moving Picture Coding, and Subjective and Objective Evaluation of Post-filtering (저 전송률 동화상 압축에서 후처리 방법 및 후처리 방법의 주관적 객관적 평가)

  • 이영렬;김윤수;박현욱
    • The Journal of Korean Institute of Communications and Information Sciences / v.24 no.8B / pp.1518-1531 / 1999
  • Images reconstructed from highly compressed MPEG or H.263 data show noticeable degradations, such as blocking artifacts near block boundaries, corner outliers at the cross points of blocks, and ringing noise near image edges, because MPEG and H.263 quantize the transform coefficients of 8$\times$8 pixel blocks. The authors have proposed a post-processing algorithm that reduces these quantization effects (blocking artifacts, corner outliers, and ringing noise) in MPEG-decompressed images. Our signal-adaptive post-processing algorithm reduces the quantization effects adaptively by using both spatial frequency and temporal information extracted from the compressed data. The blocking artifacts are reduced by one-dimensional (1-D) horizontal and vertical low-pass filtering (LPF), and the ringing noise is reduced by two-dimensional (2-D) signal-adaptive filtering (SAF). The signal-adaptive post-processing algorithm and the MPEG-4 VM (Verification Model) post-processing algorithm are compared by computer simulation with several MPEG-4 image sequences, in terms of subjective quality evaluated with the modified single stimulus method (MSSM), objective quality (PSNR), and computational complexity. According to the comparison, the subjective image quality of the two algorithms is similar, whereas the signal-adaptive post-processing algorithm performs better than the VM post-processing algorithm in PSNR and computational complexity.
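The 1-D horizontal low-pass filtering step mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the filter taps or the number of pixels filtered, so the (1, 2, 1)/4 kernel, the function name `smooth_block_boundaries`, and the choice to filter only the two pixels adjacent to each boundary are assumptions made for illustration, not the authors' algorithm.

```python
def smooth_block_boundaries(row, block=8, taps=(1, 2, 1)):
    """Low-pass filter only the two pixels adjacent to each block boundary.

    Assumed kernel: (1, 2, 1) / 4. Pixels away from boundaries are untouched.
    """
    out = list(row)
    norm = sum(taps)
    last = len(row) - 1
    for b in range(block, len(row), block):   # b is the first pixel of a new block
        for i in (b - 1, b):                  # one pixel on each side of the boundary
            left = row[max(i - 1, 0)]
            right = row[min(i + 1, last)]
            out[i] = (taps[0] * left + taps[1] * row[i] + taps[2] * right) / norm
    return out


# A row with a visible step at the boundary between two 8-pixel blocks.
row = [10] * 8 + [20] * 8
smoothed = smooth_block_boundaries(row)
```

Vertical filtering would apply the same function to each image column. Every output pixel is computed from the unfiltered input, so the result does not depend on the order in which boundaries are visited.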

Robust Feature Extraction Based on Image-based Approach for Visual Speech Recognition (시각 음성인식을 위한 영상 기반 접근방법에 기반한 강인한 시각 특징 파라미터의 추출 방법)

  • Gyu, Song-Min;Pham, Thanh Trung;Min, So-Hee;Kim, Jing-Young;Na, Seung-You;Hwang, Sung-Taek
    • Journal of the Korean Institute of Intelligent Systems / v.20 no.3 / pp.348-355 / 2010
  • Despite advances in speech recognition technology, speech recognition in noisy environments remains a difficult task. To address this problem, researchers have proposed visual speech recognition methods that draw on visual information apart from audio information. However, visual information is also affected by noise, just as audio information is, and this visual noise degrades visual speech recognition. How to extract visual feature parameters that improve visual speech recognition performance is therefore a topic of interest. In this paper, we propose an image-based method of visual feature parameter extraction that improves the recognition performance of an HMM-based visual speech recognizer. For the experiments, we constructed an audio-visual database of 105 speakers, each of whom uttered 62 words. We applied histogram matching, lip folding, RASTA filtering, a linear mask, DCT, and PCA. The experimental results show that the proposed method improves recognition performance by about 21% over the baseline method.

MASS ESTIMATION OF IMPACTING OBJECTS AGAINST A STRUCTURE USING AN ARTIFICIAL NEURAL NETWORK WITHOUT CONSIDERATION OF BACKGROUND NOISE

  • Shin, Sung-Hwan;Park, Jin-Ho;Yoon, Doo-Byung;Choi, Young-Chul
    • Nuclear Engineering and Technology / v.43 no.4 / pp.343-354 / 2011
  • It is critically important to identify unexpected loose parts in a nuclear reactor pressure vessel, since they may collide with and cause damage to internal structures. Mass estimation can provide key information regarding the kind as well as the location of loose parts. This study proposes a mass estimation method based on an artificial neural network (ANN), which can overcome several unresolved issues involved in other conventional methods. In the ANN model, input parameters are the discrete cosine transform (DCT) coefficients of the auto-power spectrum density (APSD) of the measured impact acceleration signal. The performance of the proposed method is then evaluated through application to a large-sized plate and a 1/8-scaled mockup of a reactor pressure vessel. The results are compared with those obtained using a conventional method, the frequency ratio (FR) method. It is shown that the proposed method is capable of estimating the impact mass with 30% lower relative error than the FR method, thus improving the estimation performance.
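As a rough illustration of how DCT coefficients of a spectrum can serve as a compact input vector for a neural network, the sketch below computes an unnormalized DCT-II and keeps the leading coefficients. The function names, the number of retained coefficients, and the sample values are hypothetical; the abstract does not state how many coefficients the authors feed to the ANN or how the APSD is scaled.

```python
import math


def dct_ii(values):
    """Unnormalized DCT-II: X[k] = sum_n x[n] * cos(pi * (n + 0.5) * k / N)."""
    n_total = len(values)
    return [
        sum(x * math.cos(math.pi * (n + 0.5) * k / n_total)
            for n, x in enumerate(values))
        for k in range(n_total)
    ]


def dct_features(apsd, num_coeffs):
    """Keep the first num_coeffs DCT coefficients as a compact feature vector."""
    return dct_ii(apsd)[:num_coeffs]


# Hypothetical APSD samples (made-up numbers, not measured data).
apsd = [0.1, 0.4, 0.9, 0.7, 0.3, 0.2, 0.1, 0.05]
features = dct_features(apsd, num_coeffs=4)
```

The low-order coefficients describe the overall shape of the spectrum, which is why truncating the DCT is a common way to shorten a feature vector.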

CARA: Character Appearance Retrieval and Analysis for TV Programs

  • Jung Byunghee;Park Sungchoon;Kim Kyeongsoo
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2004.11a / pp.237-240 / 2004
  • This paper describes a character retrieval system for TV programs and a set of novel algorithms for detecting and recognizing faces for the system. Our character retrieval system consists of two main components: Face Register and Face Recognizer. The Face Register detects faces in video frames and then guides users to register the detected faces of interest into the database. The Face Recognizer displays the appearance interval of each character on the timeline interface and the list of scenes with the names of the characters that appear in each scene. These two components also provide a function to modify incorrect results, which helps to provide accurate character retrieval services. In the proposed face detection and recognition algorithms, we reduce the computation time without sacrificing recognition accuracy by using the DCT/LDA method for face feature extraction. We also develop the character retrieval system in the form of a plug-in. By plugging our system into a cataloguing system, the metadata about the characters in a video can be generated automatically. Through this system, we can easily realize sophisticated on-demand video services that allow users to search for scenes featuring a specific TV star.

An Adaptive Rate Control Using Piecewise Linear Approximation Model (부분 선형 근사 모델을 이용한 적응적 비트율 제어)

  • 조창형;정제창;최병욱
    • Journal of Broadcast Engineering / v.2 no.2 / pp.194-205 / 1997
  • In video compression standards such as MPEG and H.263, rate control is one of the key components for good coding performance. This paper presents a simple adaptive rate control scheme using a piecewise linear approximation model. While the conventional buffer control approach adjusts the quantization parameter linearly according to the buffer fullness, the proposed approach uses a piecewise linear approximation model derived from the logarithmic relation between the quantization parameter and the bitrate in data compression. In addition, a forward analyzer operating in the spatial domain is used to improve image quality. Simulation results demonstrate that the proposed method performs better than the conventional one and reduces the fluctuation of the PSNR per frame while keeping the quality of the reconstructed frames at a relatively stable level.
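To make the idea of a piecewise linear approximation of a logarithmic rate model concrete, here is a minimal sketch. The model form `rate = a - b * ln(qp)`, the constants `a` and `b`, the breakpoints, and the function names are all assumptions chosen for illustration; the abstract only states that the relation is logarithmic and does not give the authors' parameters or segments.

```python
import math


def log_rate_model(qp, a=100.0, b=25.0):
    """Assumed logarithmic rate model: rate = a - b * ln(qp). a, b are made up."""
    return a - b * math.log(qp)


def piecewise_linear(qp, breakpoints, model=log_rate_model):
    """Approximate model(qp) by linear interpolation between breakpoints."""
    for lo, hi in zip(breakpoints, breakpoints[1:]):
        if lo <= qp <= hi:
            t = (qp - lo) / (hi - lo)
            return (1 - t) * model(lo) + t * model(hi)
    raise ValueError("qp outside the breakpoint range")


# Hypothetical segment boundaries over the H.263 quantizer range 1..31.
breakpoints = [1, 2, 4, 8, 16, 31]
approx = piecewise_linear(6, breakpoints)
exact = log_rate_model(6)
```

Because the assumed curve is convex in `qp`, each linear segment lies on or above it, so the approximation slightly overestimates the rate between breakpoints and matches it exactly at the breakpoints.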

Color Image Watermarking Using Human Visual System (인간시각시스템을 고려한 칼라 영상 워터마킹)

  • Lee, Joo-Shin
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.6 no.2 / pp.65-70 / 2013
  • In this paper, we propose a color image watermarking method based on the human visual system. The color image is transformed from RGB coordinates into HSI coordinates, considering that human vision is less sensitive to chromatic components than to achromatic components. The watermark is embedded in the frequency domain of the chromatic channels by using the discrete cosine transform, and it is extracted from the watermarked image by using the inverse discrete cosine transform. To verify the proposed method, a standard image and a fingerprint image are used as the original image and the watermark image, respectively. Simulation results show that the method satisfies invisibility and robustness against attacks such as image compression.
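A minimal sketch of DCT-domain embedding follows, using a 1-D block for brevity. The additive rule, the coefficient index, the embedding strength, and the non-blind extraction that compares forward-DCT coefficients with those of the original are all assumptions for illustration and may differ from the paper's procedure; the sample values are hypothetical.

```python
import math


def dct(block):
    """Orthonormal 1-D DCT-II."""
    n = len(block)
    out = []
    for k in range(n):
        scale = math.sqrt((1 if k == 0 else 2) / n)
        out.append(scale * sum(x * math.cos(math.pi * (i + 0.5) * k / n)
                               for i, x in enumerate(block)))
    return out


def idct(coeffs):
    """Inverse of dct() (orthonormal DCT-III)."""
    n = len(coeffs)
    return [
        sum(math.sqrt((1 if k == 0 else 2) / n) * c
            * math.cos(math.pi * (i + 0.5) * k / n)
            for k, c in enumerate(coeffs))
        for i in range(n)
    ]


def embed_bit(block, bit, index=3, strength=4.0):
    """Add +/- strength to one mid-frequency coefficient (assumed scheme)."""
    coeffs = dct(block)
    coeffs[index] += strength if bit else -strength
    return idct(coeffs)


def extract_bit(marked, original, index=3):
    """Non-blind extraction: compare the coefficient against the original."""
    return dct(marked)[index] > dct(original)[index]


# Hypothetical chroma samples from one row of a block.
chroma = [52, 55, 61, 66, 70, 61, 64, 73]
marked = embed_bit(chroma, bit=1)
```

With an orthonormal transform of length 8, changing one coefficient by `strength` moves each sample by at most `strength / 2`, which is how the strength parameter trades invisibility against robustness.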

ABSOLUTE ESTIMATION METHOD OF MOSQUITO NOISE FOR A POST FILTERLING

  • Kashimura, Youhei;Sagara, Naoya;Sugiyama, Kenji
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2009.01a / pp.612-617 / 2009
  • In DCT coding, degradations called block artifacts and mosquito noise appear in reconstructed pictures. They should be reduced in post-processing after decoding, without excessive processing. However, estimation of mosquito noise is rare because of its difficulty. To estimate the mosquito noise level, we extract blocks in which mosquito noise is likely to occur. The mosquito noise level is calculated at a selected side of the block; in this processing, only the sides of high-probability blocks are used. A block value is then obtained by averaging, and finally the picture value is calculated by averaging the block values. The estimation method is evaluated using MPEG-4 decoded pictures by comparing the quantization scale used in coding with the estimated mosquito noise level. The results indicate that the proposed method gives a largely reasonable selection of mosquito blocks and a largely reasonable absolute level. Further, an adaptive filter is controlled by the estimated mosquito noise level. The high quality of the decoded picture is maintained, and the mosquito noise is reduced effectively in degraded pictures.

An Adaptive Control for the Propagation Errors Incurred by DCT Coefficient-Dropping Transcoder

  • Kim, Jin-Soo;Kim, Jae-Gon;Seo, Kwang-Deok;Yun, Mong-Han
    • ETRI Journal / v.29 no.5 / pp.559-568 / 2007
  • This paper presents a new distortion control scheme with a simple estimation model for the propagation errors incurred by dropping some parts of the bitstream in a frame dropping-coefficient dropping (FD-CD) transcoder. The primary goal of this paper is to facilitate bit-rate conversions and rate-distortion controls in the compressed domain without introducing a full decoding and reencoding system in the pixel domain. First, the error propagation behavior over several frame sequences due to coefficient dropping is investigated on the basis of statistical and empirical properties. Then, such properties are used to develop a simple estimation model for the CD distortion accounting for the characteristics of the underlying coded-frame. Finally, the proposed estimation model allows us to determine the amount of coefficient dropping and to effectively allocate rate-distortions into coded-frames. Experimental results show that the proposed estimation model accurately describes the characteristics of propagation errors adaptively in the compressed domain and can be easily applied to distortion control over different kinds of video sequences.
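The basic coefficient-dropping operation can be sketched as follows: coefficients after a chosen breakpoint in scan order are discarded, and the energy of the discarded coefficients gives the distortion introduced in that block. The function name, the breakpoint, and the sample coefficients are hypothetical. The sketch covers only the distortion within a single block; the propagation of errors across frames, which is the subject of the paper's estimation model, is not modeled here.

```python
def drop_coefficients(coeffs, keep):
    """Zero every coefficient after the first `keep` entries in scan order.

    Returns the truncated list and the squared error introduced in this block,
    which for an orthonormal transform equals the energy of the dropped
    coefficients (Parseval's relation).
    """
    kept = list(coeffs[:keep]) + [0] * (len(coeffs) - keep)
    distortion = sum(c * c for c in coeffs[keep:])
    return kept, distortion


# Hypothetical quantized coefficients of one block, already in scan order.
block = [10, 5, 3, 1, 0, 0, 0, 0]
kept, distortion = drop_coefficients(block, keep=2)
```

Raising `keep` lowers the distortion at the cost of more bits, which is the trade-off a rate-distortion controller has to balance per frame.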

A Study on a Compensation of Decoded Video Quality and an Enhancement of Encoding Speed

  • Sir, Jaechul;Yoon, Sungkyu;Lim, Younghwan
    • Journal of the Korea Computer Graphics Society / v.6 no.3 / pp.35-40 / 2000
  • H.26x compression techniques have two problems. One is the compression time of the encoding process, and the other is the degradation of decoded video quality caused by a high compression rate. Transferring moving pictures in real time requires very high compression. In this case, much of the original video data is lost, which degrades quality. In particular, a degradation called the blocking artifact may be produced. The blocking artifact is produced by DCT-based coding techniques because they operate without considering the correlation between pixels across block boundaries, so it appears as a discontinuity between adjacent blocks. This paper describes methods for compensating the quality of H.26x decoded data and for increasing encoding speed for real-time operation. The goal of the quality compensation is not to make the decoded video identical to the original video but to make it look better to the human eye. We suggest an algorithm that reduces blocking artifacts and cleans up the decoded video in the decoder. To increase encoding speed, we adopt a new four-step search algorithm. As the experimental results show, the quality compensation provides better video quality by reducing blocking artifacts, and the new four-step search algorithm with an $MMX^{TM}$ implementation improves encoding speed from 2.5 fps to 17 fps.

Robust video watermarking algorithm for H.264/AVC based on JND model

  • Zhang, Weiwei;Li, Xin;Zhang, Yuzhao;Zhang, Ru;Zheng, Lixin
    • KSII Transactions on Internet and Information Systems (TIIS) / v.11 no.5 / pp.2741-2761 / 2017
  • For the purpose of copyright protection for digital video, a novel H.264/AVC watermarking algorithm based on a JND model is proposed. First, according to the characteristics of the human visual system, a new and more accurate JND model is proposed to determine the watermark embedding strength by considering luminance masking, contrast masking, and the spatial frequency sensitivity function. Second, a new embedding strategy for H.264/AVC watermarking is proposed based on an analysis of the drift error of the energy distribution. We argue that more robustness can be achieved if watermarks are embedded in the middle- and high-frequency components of the $4{\times}4$ integer DCT, since these components are more stable than the DC and low-frequency components when drift error occurs. Finally, according to the different characteristics of the middle- and high-frequency components, the watermarks are embedded using a different algorithm for each. Experimental results demonstrate that the proposed watermarking algorithm not only meets the imperceptibility and robustness requirements but also has a high embedding capacity.
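For readers unfamiliar with the $4{\times}4$ integer DCT, the sketch below applies the H.264/AVC forward core transform matrix to a block and lists candidate middle- and high-frequency positions. The core matrix is the standard one; the scaling and quantization stages are omitted, and the band split by `row + column >= threshold` along with the function names are assumptions for illustration, not the paper's selection rule.

```python
# Forward core transform matrix of the H.264/AVC 4x4 integer DCT.
CF = [
    [1, 1, 1, 1],
    [2, 1, -1, -2],
    [1, -1, -1, 1],
    [1, -2, 2, -1],
]


def matmul(a, b):
    """Multiply two 4x4 integer matrices."""
    return [[sum(a[i][k] * b[k][j] for k in range(4)) for j in range(4)]
            for i in range(4)]


def transpose(m):
    """Return the transpose of a matrix given as a list of rows."""
    return [list(col) for col in zip(*m)]


def forward_4x4(block):
    """Core transform Y = CF * X * CF^T (scaling and quantization omitted)."""
    return matmul(matmul(CF, block), transpose(CF))


def mid_high_positions(threshold=2):
    """Assumed band split: positions with row + column >= threshold."""
    return [(i, j) for i in range(4) for j in range(4) if i + j >= threshold]


# A flat block has energy only in the DC coefficient at position (0, 0).
flat = [[1] * 4 for _ in range(4)]
coeffs = forward_4x4(flat)
```

Because the transform uses only integer arithmetic, encoder and decoder compute identical coefficients, which matters when a watermark has to survive re-encoding.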