• Title/Summary/Keyword: minimum mean square error

Search Result 290, Processing Time 0.025 seconds

Joint Overlapped Block Motion Compensation Using Eight-Neighbor Block Motion Vectors for Frame Rate Up-Conversion

  • Li, Ran;Wu, Minghu;Gan, Zongliang;Cui, Ziguan;Zhu, Xiuchang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.10
    • /
    • pp.2448-2463
    • /
    • 2013
  • The traditional block-based motion compensation methods in frame rate up-conversion (FRUC) only use a single uniquely motion vector field. However, there will always be some mistakes in the motion vector field whether the advanced motion estimation (ME) and motion vector analysis (MA) algorithms are performed or not. Once the motion vector field has many mistakes, the quality of the interpolated frame is severely affected. In order to solve the problem, this paper proposes a novel joint overlapped block motion compensation method (8J-OBMC) which adopts motion vectors of the interpolated block and its 8-neighbor blocks to jointly interpolate the target block. Since the smoothness of motion filed makes the motion vectors of 8-neighbor blocks around the interpolated block quite close to the true motion vector of the interpolated block, the proposed compensation algorithm has the better fault-tolerant capability than traditional ones. Besides, the annoying blocking artifacts can also be effectively suppressed by using overlapped blocks. Experimental results show that the proposed method is not only robust to motion vectors estimated wrongly, but also can to reduce blocking artifacts in comparison with existing popular compensation methods.

Time-Synchronization Method for Dubbing Signal Using SOLA (SOLA를 이용한 더빙 신호의 시간축 동기화)

  • 이기승;지철근;차일환;윤대희
    • Journal of Broadcast Engineering
    • /
    • v.1 no.2
    • /
    • pp.85-95
    • /
    • 1996
  • The purpose of this paper Is to propose a dubbed signal time-synchroniztion technique based on the SOLA(Synchronized Over-Lap and Add) method which has been widely used to modify the time scale of speech signal. In broadcasting audio recording environments, the high degree of background noise requires dubbing process. Since the time difference between the original and the dubbed signal ranges about 200mili seconds, process is required to make the dubbed signal synchronize to the corresponding image. The proposed method finds he starting point of the dubbing signal using the short-time energy of the two signals. Thereafter, LPC cepstrum analysis and DTW(Dynamic Time Warping) process are applied to synchronize phoneme positions of the two signals. After determining the matched point by the minimum mean square error between orignal and dubbed LPC cepstrums, the SOLA method is applied to the dubbed signal, to maintain the consistency of the corresponding phase. Effectiveness of proposed method is verified by comparing the waveforms and the spectrograms of the original and the time synchronized dubbing signal.

  • PDF

Blind frequency offset estimation method in OFDM systems (OFDM에서 블라인드 주파수 옵셋 추정 방법)

  • Jeon, Hyoung-Goo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.4
    • /
    • pp.823-832
    • /
    • 2011
  • In this paper, an efficient blind carrier frequency offset (CFO) estimation method in orthogonal frequency division multiplexing (OFDM) systems is proposed. In the proposed method, we obtain two time different received OFDM symbols by using both the cyclic prefix and oversampling technique, and a cost function is defined by using the two OFDM symbols. We show that the cost function can be approximately expressed as a cosine function. Using a property of the cosine function, a formular for estimating the CFO is derived. The estimator of the CFO requires three independent cost function values calculated at three different points of frequency offset. The proposed method is very efficient in computational complexity since no searching operation for the minimum cost value is required. The proposed method reduces 97% of the amount of FFT computation, compared with the ML method. Unlike the conventional methods such as the ML method and the MUSIC] method, the accuracy of the proposed method is independent of the searching resolution since the closed form solution exists. The computer simulation shows that the performance of the proposed method is superior to those of the MUSIC and the ML method.

Forecast and verification of perceived temperature using a mesoscale model over the Korean Peninsula during 2007 summer (중규모 수치 모델 자료를 이용한 2007년 여름철 한반도 인지온도 예보와 검증)

  • Byon, Jae-Young;Kim, Jiyoung;Choi, Byoung-Cheol;Choi, Young-Jean
    • Atmosphere
    • /
    • v.18 no.3
    • /
    • pp.237-248
    • /
    • 2008
  • A thermal index which considers metabolic heat generation of human body is proposed for operational forecasting. The new thermal index, Perceived Temperature (PT), is forecasted using Weather Research and Forecasting (WRF) mesoscale model and validated. Forecasted PT shows the characteristics of diurnal variation and topographic and latitudinal effect. Statistical skill scores such as correlation, bias, and RMSE are employed for objective verification of PT and input meteorological variables which are used for calculating PT. Verification result indicates that the accuracy of air temperature and wind forecast is higher in the initial forecast time, while relative humidity is improved as the forecast time increases. The forecasted PT during 2007 summer is lower than PT calculated by observation data. The predicted PT has a minimum Root-Mean-Square-Error (RMSE) of $7-8^{\circ}C$ at 9-18 hour forecast. Spatial distribution of PT shows that it is overestimated in western region, while PT in middle-eastern region is underestimated due to strong wind and low temperature forecast. Underestimation of wind speed and overestimation of relative humidity have caused higher PT than observation in southern region. The predicted PT from the mesoscale model gives appropriate information as a thermal index forecast. This study suggests that forecasted PT is applicable to the prediction of health warning based on the relationship between PT and mortality.

Dynamic Electrical Impedance Tomography with Internal Electrodes (내부 전극을 이용한 동적 전기 임피던스 단층촬영법)

  • Kang, Suk-In;Kim, Kyung-Youn
    • Journal of IKEEE
    • /
    • v.5 no.2 s.9
    • /
    • pp.153-163
    • /
    • 2001
  • Electrical impedance tomography(EIT) is a relatively new imaging modality in which the internal impedivity distribution is reconstructed based on the known sets of injected currents and measured voltages on the surface of the object. We describe a dynamic EIT imaging technique for the case where the resistivity distribution inside the object changes rapidly within the time taken to acquire a full set of independent measurement data. In doing so, the inverse problem is treated as the state estimation problem and the unknown state (resistivity) is estimated with the aid of extended Kalman filter in a minimum mean square error sense. In particular, additional electrodes are attached to the known internal structure of the object to enhance the reconstruction performance and modified Tikhonov regularization technique is employed to mitigate the ill-posedness of the inverse problem. Computer simulations are provided to illustrate the reconstruction performance of the proposed algorithm.

  • PDF

Iterative Reduction of Blocking Artifact in Block Transform-Coded Images Using Wavelet Transform (웨이브렛 변환을 이용한 블록기반 변환 부호화 영상에서의 반복적 블록화 현상 제거)

  • 장익훈;김남철
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.24 no.12B
    • /
    • pp.2369-2381
    • /
    • 1999
  • In this paper, we propose an iterative algorithm for reducing the blocking artifact in block transform-coded images by using a wavelet transform. In the proposed method, an image is considered as a set of one-dimensional horizontal and vertical signals and one-dimensional wavelet transform is utilized in which the mother wavelet is the first order derivative of a Gaussian like function. The blocking artifact is reduced by removing the blocking component, that causes the variance at the block boundary position in the first scale wavelet domain to be abnormally higher than those at the other positions, using a minimum mean square error (MMSE) filter in the wavelet domain. This filter minimizes the MSE between the ideal blocking component-free signal and the restored signal in the neighborhood of block boundaries in the wavelet domain. It also uses local variance in the wavelet domain for pixel adaptive processing. The filtering and the projection onto a convex set of quantization constraint are iteratively performed in alternating fashion. Experimental results show that the proposed method yields not only a PSNR improvement of about 0.56-1.07 dB, but also subjective quality nearly free of the blocking artifact and edge blur.

  • PDF

Interference Aware Receiver Filtering for Wireless Ad Hoc Networks (무선 애드혹 네트워크에서의 간섭 제어 수신 기법)

  • Shin, Sungpil;Lee, Byungju;Park, Sunho;Shim, Byonghyo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.50 no.3
    • /
    • pp.9-15
    • /
    • 2013
  • Recent works on ad hoc network study have shown that achievable throughput can be made to scale linearly with the number of receive antennas even if the transmitter has only a single antenna. In this paper, we propose a non-parametric linear minimum mean square error (MMSE) receiver for achieving further gain in performance when the channel state information at receiver (CSIR) of interferers is imperfect. The key feature to make our approach effective is to exploit the autocorrelation of the received signal. In fact, by incorporating the desired channel information on top of the observations including interference and noise only, the proposed method achieves large fraction of the optimal MMSE transmission capacity without transmission rate loss. From the SINR analysis as well as transmission capacity simulations in realistic ad hoc network system, we show that the proposed non-parametric linear MMSE receiver brings substantial performance gain over existing multiple receive antenna algorithms.

Voice Personality Transformation Using an Optimum Classification and Transformation (최적 분류 변환을 이용한 음성 개성 변환)

  • 이기승
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.5
    • /
    • pp.400-409
    • /
    • 2004
  • In this paper. a voice personality transformation method is proposed. which makes one person's voice sound like another person's voice. To transform the voice personality. vocal tract transfer function is used as a transformation parameter. Comparing with previous methods. the proposed method makes transformed speech closer to target speaker's voice in both subjective and objective points of view. Conversion between vocal tract transfer functions is implemented by classification of entire vector space followed by linear transformation for each cluster. LPC cepstrum is used as a feature parameter. A joint classification and transformation method is proposed, where optimum clusters and transformation matrices are simultaneously estimated in the sense of a minimum mean square error criterion. To evaluate the performance of the proposed method. transformation rules are generated from 150 sentences uttered by three male and on female speakers. These rules are then applied to another 150 sentences uttered by the same speakers. and objective evaluation and subjective listening tests are performed.

Voice personality transformation using an orthogonal vector space conversion (직교 벡터 공간 변환을 이용한 음성 개성 변환)

  • Lee, Ki-Seung;Park, Kun-Jong;Youn, Dae-Hee
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.33B no.1
    • /
    • pp.96-107
    • /
    • 1996
  • A voice personality transformation algorithm using orthogonal vector space conversion is proposed in this paper. Voice personality transformation is the process of changing one person's acoustic features (source) to those of another person (target). In this paper, personality transformation is achieved by changing the LPC cepstrum coefficients, excitation spectrum and pitch contour. An orthogonal vector space conversion technique is proposed to transform the LPC cepstrum coefficients. The LPC cepstrum transformation is implemented by principle component decomposition by applying the Karhunen-Loeve transformation and minimum mean-square error coordinate transformation(MSECT). Additionally, we propose a pitch contour modification method to transform the prosodic characteristics of any speaker. To do this, reference pitch patterns for source and target speaker are firstly built up, and speaker's one. The experimental results show the effectiveness of the proposed algorithm in both subjective and objective evaluations.

  • PDF

Voice Personality Transformation Using a Probabilistic Method (확률적 방법을 이용한 음성 개성 변환)

  • Lee Ki-Seung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.3
    • /
    • pp.150-159
    • /
    • 2005
  • This paper addresses a voice personality transformation algorithm which makes one person's voices sound as if another person's voices. In the proposed method, one person's voices are represented by LPC cepstrum, pitch period and speaking rate, the appropriate transformation rules for each Parameter are constructed. The Gaussian Mixture Model (GMM) is used to model one speaker's LPC cepstrums and conditional probability is used to model the relationship between two speaker's LPC cepstrums. To obtain the parameters representing each probabilistic model. a Maximum Likelihood (ML) estimation method is employed. The transformed LPC cepstrums are obtained by using a Minimum Mean Square Error (MMSE) criterion. Pitch period and speaking rate are used as the parameters for prosody transformation, which is implemented by using the ratio of the average values. The proposed method reveals the superior performance to the previous VQ-based method in subjective measures including average cepstrum distance reduction ratio and likelihood increasing ratio. In subjective test. we obtained almost the same correct identification ratio as the previous method and we also confirmed that high qualify transformed speech is obtained, which is due to the smoothly evolving spectral contours over time.