• Title/Summary/Keyword: model quantization

Search Result 227, Processing Time 0.026 seconds

Forward rate control of MPEG-2 video based on distortion-rate estimation (왜곡-비트율 추정에 근거한 MPEG-2 비디오의 순방향 비트율 제어)

  • 홍성훈;김성대;최재각;홍성용
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.23 no.8
    • /
    • pp.2010-2024
    • /
    • 1998
  • In video coding, it is important to improve the average picture quality as well as to maintain cosistent picture quality between consecutive pictures. In this paper, we propose a distortion-rate estimation method for MPEG-2 video and a forward rate control method, using the proposed estimation result, to be able to obtain the improved and consistent picture quality of CBR (Constant Bit Rate) encoded MPEG-2 video. The proposed distortion-rate estimation enable us to predict the distortion and the bits generated from an encoded picture at a given quantization step size and vice versa. The most attactive features of proposed distortion-rate estimation are its accuracy and low computational complexity enough to be applied to the practical video coding. In addition, the proposed rate control first determined a quantization parameter per frame by following procedure: distortion-rate estimation, target bit allocation, distortion constraint and VBV(Video Buffer Verification) constraint. And then this quantization parameter is applied to the encoding so that improved and consisten picture quality can be obtained. Furthermore the proposed rate control method can solve the error propagation problem caused by scene change or anchor picture degradation by using the B-picture skipping and the guarantee of the minimum bit allocation for the anchor picture. Experimental results, comparing the proposed forward rate control method with TM5 method, show that the proposed method makes more improed and consistent picture quality than TM5.

  • PDF

Millimeter Wave Energy Transfer based on Beam Steering (밀리미터파를 이용한 빔 조향 기반의 에너지 전송 기술)

  • Han, Yonggue;Jung, Sangwon;Lee, Chungyong
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.4
    • /
    • pp.10-15
    • /
    • 2017
  • Feedback burden of a full-digital energy beamforming, which is known as the optimal precoding scheme for radio frequency (RF) energy transfer, is huge because it uses a vector quantization for a channel feedback. To reduce the feedback burden, we consider a beam steering based wireless energy transfer, which uses a scalar quantization. Researches related to the beam steering based wireless energy transfer have been studied in special channel model with an assumption of full channel state information at the transmitter. In this paper, we analyze the beam steering scheme compared with the full-digital energy beamforming for practical channel models with channel estimation errors. According to characteristics of the millimeter wave channel, the number of antennas of the base station and the user, the distance between them, and channel estimation errors, we simulate the performance of the beam steering scheme and analyze reasons why.

Exaggerated Cartooning using a Reference Image (참조 이미지를 이용한 과장된 카투닝)

  • Han, Myoung-Hun;Seo, Sang-Hyun;Ryoo, Seung-Taek;Yoon, Kyung-Hyun
    • Journal of the Korea Computer Graphics Society
    • /
    • v.17 no.1
    • /
    • pp.33-38
    • /
    • 2011
  • This paper proposes the method of image cartooning, that makes cartoon-like images of a target, using reference images. We deform a target image using pre-defined reference images. For this deformation, we extract feature points from the target image by Active Appearance Model(AAM) and apply the warping method to the target using feature points of target and feature points of reference image as a basis of warping function. We create simplified cartoon-like images by abstraction of the deformed target image and drawing of edges and quantization of luminance of the abstracted image. Two main concept of cartoon(exaggeration and simplification) is inhered in this method when we use a exaggerated cartoon image as a reference image. It is possible for this method to create various results by control of warping and change of reference image.

Efficient QP-per-frame Assignment Method for Low-delay HEVC Encoder (저지연 HEVC 부호화기를 위한 효율적인 프레임별 양자화 파라미터 할당 방법)

  • Park, Sang-hyo;Jang, Euee S.
    • Journal of Broadcast Engineering
    • /
    • v.21 no.3
    • /
    • pp.349-356
    • /
    • 2016
  • In this paper, we propose an efficient assignment method that assigns quantization parameter (QP) in accordance with group of picture (GOP) structure given in HEVC encoder. Each video frames can have difference QP values based on given GOP configuration for HEVC encoding. Particularly, for important frames we can assign low QP values, and vice versa. However, there has not been thorough investigation on efficient QP assignment method by far. Even in HEVC reference software encoder, only monotonic QP assignment method is employed. Thus, the proposed method assign adaptive QP values to each GOP so that temporal dynamic activity between GOPs can be exploited. Through the experiment, the proposed method showed a 7.3% gain of compression performance in terms of BD-rate compared to HEVC test model (HM) in low-delay configuration, and outperformed the existing QP assignment study on average.

A Video Encoding Scheme using Adaptive Spatial Resolution Control for Mobile Video Applications (모바일 비디오 응용을 위한 적응적 공간 해상도 제어 인코딩 기법)

  • Lee, Hee-Jung;Lee, Yong-Hee;Lee, Jong-Hun;Shin, Heon-Shik
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.34 no.7C
    • /
    • pp.654-662
    • /
    • 2009
  • Video streams for mobile video streaming can be encoded to fit the available network bandwidth by controlling three factors: temporal resolution, spatial resolution, and picture quality. The controlling of picture quality by modifying the quantization parameter (QP) is most widely used. In this paper, we demonstrate that reducing the spatial resolution adaptively can be more efficient in terms of picture quality and energy consumption in low bit-rate environment, and present a model to find the optimal spatial resolution for the available bandwidth. Adaptive spatial resolution control scheme is especially effective when the bandwidth between the video server and the mobile device varies considerably with time, and when the mobile device is sensitive to energy consumption. Our scheme can improve the picture quality by approximately O.5dB and reduce energy consumption by more than 50% compared to the conventional video coding in low bit-rate environment.

Enhanced Pre echo Control Algorithm for MPEG Audio Coders (MPEG 오디오 부호화기를 위한 향상된 프리 에코 컨트롤 알고리듬)

  • Lee Chang-Joon;Lee Jae-Seong;Park Young-Cheol
    • Journal of Broadcast Engineering
    • /
    • v.11 no.2 s.31
    • /
    • pp.191-199
    • /
    • 2006
  • This paper presents an efficient pre echo control scheme for MPEG Audio coders based on the psychoacoustic model II (PAM-II). Pre echo control is the final step for the calculation of masking threshold in the PAM II. It is to minimize the spread of quantization error over the processing frame. In the conventional encoders, pre echo is reduced by restricting the estimated masking threshold not to exceed the one obtained in the previous frame. The conventional method performs pre echo control not only for short blocks but also for long blocks, which lowers the masking threshold in long blocks and, in turn, increases the quantization noise level of corresponding blocks. This paper proposes an efficient pre echo control process. The test result shows a mean enhancement of more than 0.4 especially for complex signals on the ITU R 5 point audio impairment scale.

Speech Recognition Based on VQ/NN using Fuzzy (Fuzzy를 이용한 VQ/NN에 기초를 둔 음성 인식)

  • Ann, Tae-Ock
    • The Journal of the Acoustical Society of Korea
    • /
    • v.15 no.6
    • /
    • pp.5-11
    • /
    • 1996
  • This paper is the study for recognizing single vowels of speaker-independent, and we suppose a method of speech recognition using VQ(Vector Quantization)/NN(Neural Network). This method makes a VQ codebook, which is used for obtaining the observation sequence, and then claculates the probability value by comparing each codeword with the data, finally uses these probability values for the input value of the neural network. Korean signle vowels are selected for our recognition experiment, and ten male speakers pronounced eight single vowels ten times. We compare the performance of our method with those of fuzzy VQ/HMM and conventional VQ/NN According to the experiment result, the recognition rate by VQ/NN is 92.3%, by VQ/HMM using fuzzy is 93.8% and by VQ/NN using fuzzy is 95.7%. Therefore, it is shown that recognition rate of speech recognition by fuzzy VQ/NN is better than those of fuzzy VQ/HMM and conventional VQ/HMM because of its excellent learning ability.

  • PDF

A Study on Game Contents Classification Service Method using Image Region Segmentation (칼라 영상 객체 분할을 이용한 게임 콘텐츠 분류 서비스 방안에 관한 연구)

  • Park, Chang Min
    • Journal of Service Research and Studies
    • /
    • v.5 no.2
    • /
    • pp.103-110
    • /
    • 2015
  • Recently, Classification of characters in a 3D FPS game has emerged as a very significant issue. In this study, We propose the game character Classification method using Image Region Segmentation of the extracting meaningful object in a simple operation. In this method, first used a non-linear RGB color model and octree color quantization scheme. The input image represented a less than 20 quantized color and uses a small number of meaningful color histogram. And then, the image divided into small blocks, calculate the degree of similarity between the color histogram intersection and adjacent block in block units. Because, except for the block boundary according to the texture and to extract only the boundaries of the object block. Set a region by these boundary blocks as a game object and can be used for FPS game play. Through experiment, we obtain accuracy of more than 80% for Classification method using each feature. Thus, using this property, characters could be classified effectively and it draws the game more speed and strategic actions as a result.

A Study on Discrete Hidden Markov Model for Vibration Monitoring and Diagnosis of Turbo Machinery (터보회전기기의 진동모니터링 및 진단을 위한 이산 은닉 마르코프 모델에 관한 연구)

  • Lee, Jong-Min;Hwang, Yo-ha;Song, Chang-Seop
    • The KSFM Journal of Fluid Machinery
    • /
    • v.7 no.2 s.23
    • /
    • pp.41-49
    • /
    • 2004
  • Condition monitoring is very important in turbo machinery because single failure could cause critical damages to its plant. So, automatic fault recognition has been one of the main research topics in condition monitoring area. We have used a relatively new fault recognition method, Hidden Markov Model(HMM), for mechanical system. It has been widely used in speech recognition, however, its application to fault recognition of mechanical signal has been very limited despite its good potential. In this paper, discrete HMM(DHMM) was used to recognize the faults of rotor system to study its fault recognition ability. We set up a rotor kit under unbalance and oil whirl conditions and sampled vibration signals of two failure conditions. DHMMS of each failure condition were trained using sampled signals. Next, we changed the setup and the rotating speed of the rotor kit. We sampled vibration signals and each DHMM was applied to these sampled data. It was found that DHMMs trained by data of one rotating speed have shown good fault recognition ability in spite of lack of training data, but DHMMs trained by data of four different rotating speeds have shown better robustness.

Coronary Artery Stenosis Quantification for Computed Tomography Angiography Based on Modified Student's t-Mixture Model

  • Sun, Qiaoyu;Yang, Guanyu;Shu, Huazhong;Shi, Daming
    • ETRI Journal
    • /
    • v.39 no.5
    • /
    • pp.662-671
    • /
    • 2017
  • Coronary artery disease (CAD) is a major cause of death in the world. As a non-invasive imaging modality, computed tomography angiography (CTA) is now usually used in clinical practice for CAD diagnosis. Precise quantification of coronary stenosis is of great interest for diagnosis and treatment planning. In this paper, a novel cluster method based on a Modified Student's t-Mixture Model is applied to separate the region of vessel lumen from other tissues. Then, the area of the vessel lumen in each slice is computed and the estimated value of it is fitted with a curve. Finally, the location and the level of the most stenoses are captured by comparing the calculated and fitted areas of the vessel. The proposed method has been applied to 17 clinical CTA datasets and the results have been compared with reference standard degrees of stenosis defined by an expert. The results of the experiment indicate that the proposed method can accurately quantify the stenosis of the coronary artery in CTA.