• Title/Summary/Keyword: Error variance estimate

Search Result 113, Processing Time 0.025 seconds

VARIANCE ESTIMATION OF ERROR IN THE REGRESSION MODEL AT A POINT

  • Oh, Jong-Chul
    • Journal of applied mathematics & informatics
    • /
    • v.13 no.1_2
    • /
    • pp.501-508
    • /
    • 2003
  • Although the estimate of regression function is important, some have focused the variance estimation of error term in regression model. Different variance estimators perform well under different conditions. In many practical situations, it is rather hard to assess which conditions are approximately satisfied so as to identify the best variance estimator for the given data. In this article, we suggest SHM estimator compared to LS estimator, which is common estimator using in parametric multiple regression analysis. Moreover, a combined estimator of variance, VEM, is suggested. In the simulation study it is shown that VEM performs well in practice.

Selection of Data-adaptive Polynomial Order in Local Polynomial Nonparametric Regression

  • Jo, Jae-Keun
    • Communications for Statistical Applications and Methods
    • /
    • v.4 no.1
    • /
    • pp.177-183
    • /
    • 1997
  • A data-adaptive order selection procedure is proposed for local polynomial nonparametric regression. For each given polynomial order, bias and variance are estimated and the adaptive polynomial order that has the smallest estimated mean squared error is selected locally at each location point. To estimate mean squared error, empirical bias estimate of Ruppert (1995) and local polynomial variance estimate of Ruppert, Wand, Wand, Holst and Hossjer (1995) are used. Since the proposed method does not require fitting polynomial model of order higher than the model order, it is simpler than the order selection method proposed by Fan and Gijbels (1995b).

  • PDF

Bootstrap of LAD Estimate in Infinite Variance AR(1) Processes

  • Kang, Hee-Jeong
    • Journal of the Korean Statistical Society
    • /
    • v.26 no.3
    • /
    • pp.383-395
    • /
    • 1997
  • This paper proves that the standard bootstrap approximation for the least absolute deviation (LAD) estimate of .beta. in AR(1) processes with infinite variance error terms is asymptotically valid in probability when the bootstrap resample size is much smaller than the original sample size. The theoretical validity results are supported by simulation studies.

  • PDF

Estimation of error variance in nonparametric regression under a finite sample using ridge regression

  • Park, Chun-Gun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.6
    • /
    • pp.1223-1232
    • /
    • 2011
  • Tong and Wang's estimator (2005) is a new approach to estimate the error variance using least squares method such that a simple linear regression is asymptotically derived from Rice's lag- estimator (1984). Their estimator highly depends on the setting of a regressor and weights in small sample sizes. In this article, we propose a new approach via a local quadratic approximation to set regressors in a small sample case. We estimate the error variance as the intercept using a ridge regression because the regressors have the problem of multicollinearity. From the small simulation study, the performance of our approach with some existing methods is better in small sample cases and comparable in large cases. More research is required on unequally spaced points.

A New Approach for Autofocusing in Microscopy

  • Tsomko, Elena;Kim, Hyoung-Joong;Han, Hyoung-Seok
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 2008.08a
    • /
    • pp.186-189
    • /
    • 2008
  • In order to estimate cell images, high-performance electron microscopes are used nowadays. In this paper, we propose a new simple, fast and efficient method for real-time automatic focusing in electron microscopes. The proposed algorithm is based on the prediction-error variance, and demonstrates its feasibility by using extensive experiments. This method is fast, easy to implement, accurate, and not demanding on computation time.

  • PDF

Variance components for two-way nested design data

  • Choi, Jaesung
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.3
    • /
    • pp.275-282
    • /
    • 2018
  • This paper discusses the use of projections for the sums of squares in the analyses of variance for two-way nested design data. The model for this data is assumed to only have random effects. Two different sizes of experimental units are required for a given experimental situation, since nesting is assumed to occur both in the treatment structure and in the design structure. So, variance components are coming from the sources of random effects of treatment factors and error terms in different sizes of experimental units. The model for this type of experimental situation is a random effects model with more than one error terms and therefore estimation of variance components are concerned. A projection method is used for the calculation of sums of squares due to random components. Squared distances of projections instead of using the usual reductions in sums of squares that show how to use projections to estimate the variance components associated with the random components in the assumed model. Expectations of quadratic forms are obtained by the Hartley's synthesis as a means of calculation.

A Study on Stochastic Estimation of Monthly Runoff by Multiple Regression Analysis (다중회귀분석에 의한 하천 월 유출량의 추계학적 추정에 관한 연구)

  • 김태철;정하우
    • Magazine of the Korean Society of Agricultural Engineers
    • /
    • v.22 no.3
    • /
    • pp.75-87
    • /
    • 1980
  • Most hydro]ogic phenomena are the complex and organic products of multiple causations like climatic and hydro-geological factors. A certain significant correlation on the run-off in river basin would be expected and foreseen in advance, and the effect of each these causual and associated factors (independant variables; present-month rainfall, previous-month run-off, evapotranspiration and relative humidity etc.) upon present-month run-off(dependent variable) may be determined by multiple regression analysis. Functions between independant and dependant variables should be treated repeatedly until satisfactory and optimal combination of independant variables can be obtained. Reliability of the estimated function should be tested according to the result of statistical criterion such as analysis of variance, coefficient of determination and significance-test of regression coefficients before first estimated multiple regression model in historical sequence is determined. But some error between observed and estimated run-off is still there. The error arises because the model used is an inadequate description of the system and because the data constituting the record represent only a sample from a population of monthly discharge observation, so that estimates of model parameter will be subject to sampling errors. Since this error which is a deviation from multiple regression plane cannot be explained by first estimated multiple regression equation, it can be considered as a random error governed by law of chance in nature. This unexplained variance by multiple regression equation can be solved by stochastic approach, that is, random error can be stochastically simulated by multiplying random normal variate to standard error of estimate. Finally hybrid model on estimation of monthly run-off in nonhistorical sequence can be determined by combining the determistic component of multiple regression equation and the stochastic component of random errors. Monthly run-off in Naju station in Yong-San river basin is estimated by multiple regression model and hybrid model. And some comparisons between observed and estimated run-off and between multiple regression model and already-existing estimation methods such as Gajiyama formula, tank model and Thomas-Fiering model are done. The results are as follows. (1) The optimal function to estimate monthly run-off in historical sequence is multiple linear regression equation in overall-month unit, that is; Qn=0.788Pn+0.130Qn-1-0.273En-0.1 About 85% of total variance of monthly runoff can be explained by multiple linear regression equation and its coefficient of determination (R2) is 0.843. This means we can estimate monthly runoff in historical sequence highly significantly with short data of observation by above mentioned equation. (2) The optimal function to estimate monthly runoff in nonhistorical sequence is hybrid model combined with multiple linear regression equation in overall-month unit and stochastic component, that is; Qn=0. 788Pn+0. l30Qn-1-0. 273En-0. 10+Sy.t The rest 15% of unexplained variance of monthly runoff can be explained by addition of stochastic process and a bit more reliable results of statistical characteristics of monthly runoff in non-historical sequence are derived. This estimated monthly runoff in non-historical sequence shows up the extraordinary value (maximum, minimum value) which is not appeared in the observed runoff as a random component. (3) "Frequency best fit coefficient" (R2f) of multiple linear regression equation is 0.847 which is the same value as Gaijyama's one. This implies that multiple linear regression equation and Gajiyama formula are theoretically rather reasonable functions.

  • PDF

Approximate Variance of Least Square Estimators for Regression Coefficient under Inclusion Probability Proportional to Size Sampling (포함확률비례추출에서 회귀계수 최소제곱추정량의 근사분산)

  • Kim, Kyu-Seong
    • Communications for Statistical Applications and Methods
    • /
    • v.19 no.1
    • /
    • pp.23-32
    • /
    • 2012
  • This paper deals with the bias and variance of regression coefficient estimators in a finite population. We derive approximate formulas for the bias, variance and mean square error of two estimators when we select a fixed-size inclusion probability proportional to the size sample and then estimate regression coefficients by the ordinary least square estimator as well as the weighted least square estimator based on the selected sample data. Necessary and sufficient conditions for the comparison of the two estimators in terms of variance and mean square error are suggested. In addition, a simple example is introduced to numerically compare the variance and mean square error of the two estimators.

The restricted maximum likelihood estimation of a censored regression model

  • Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • v.24 no.3
    • /
    • pp.291-301
    • /
    • 2017
  • It is well known in a small sample that the maximum likelihood (ML) approach for variance components in the general linear model yields estimates that are biased downward. The ML estimate of residual variance tends to be downwardly biased. The underestimation of residual variance, which has implications for the estimation of marginal effects and asymptotic standard error of estimates, seems to be more serious in some limited dependent variable models, as shown by some researchers. An alternative frequentist's approach may be restricted or residual maximum likelihood (REML), which accounts for the loss in degrees of freedom and gives an unbiased estimate of residual variance. In this situation, the REML estimator is derived in a censored regression model. A small sample the REML is shown to provide proper inference on regression coefficients.

Estimation of the Mean and Variance for Normal Distributions whose Both Sides are Truncated

  • Hong, Chong-Sun;Choi, Yun-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.9 no.1
    • /
    • pp.249-259
    • /
    • 2002
  • In order to estimate the mean and variance for a Normal distribution which is truncated at both right and left sides, maximum likelihood estimators based on the entire sample from the original distribution are compared with the sample mean and variance of the censored sample which is the data remaining after truncation using simulation. We found that, surprisingly, the mean squared error of the mean based on the censored data Is smaller than that of the full sample estimators.