• 제목/요약/키워드: Error variance estimate

검색결과 113건 처리시간 0.029초

VARIANCE ESTIMATION OF ERROR IN THE REGRESSION MODEL AT A POINT

  • Oh, Jong-Chul
    • Journal of applied mathematics & informatics
    • /
    • 제13권1_2호
    • /
    • pp.501-508
    • /
    • 2003
  • Although the estimate of regression function is important, some have focused the variance estimation of error term in regression model. Different variance estimators perform well under different conditions. In many practical situations, it is rather hard to assess which conditions are approximately satisfied so as to identify the best variance estimator for the given data. In this article, we suggest SHM estimator compared to LS estimator, which is common estimator using in parametric multiple regression analysis. Moreover, a combined estimator of variance, VEM, is suggested. In the simulation study it is shown that VEM performs well in practice.

Selection of Data-adaptive Polynomial Order in Local Polynomial Nonparametric Regression

  • Jo, Jae-Keun
    • Communications for Statistical Applications and Methods
    • /
    • 제4권1호
    • /
    • pp.177-183
    • /
    • 1997
  • A data-adaptive order selection procedure is proposed for local polynomial nonparametric regression. For each given polynomial order, bias and variance are estimated and the adaptive polynomial order that has the smallest estimated mean squared error is selected locally at each location point. To estimate mean squared error, empirical bias estimate of Ruppert (1995) and local polynomial variance estimate of Ruppert, Wand, Wand, Holst and Hossjer (1995) are used. Since the proposed method does not require fitting polynomial model of order higher than the model order, it is simpler than the order selection method proposed by Fan and Gijbels (1995b).

  • PDF

Bootstrap of LAD Estimate in Infinite Variance AR(1) Processes

  • Kang, Hee-Jeong
    • Journal of the Korean Statistical Society
    • /
    • 제26권3호
    • /
    • pp.383-395
    • /
    • 1997
  • This paper proves that the standard bootstrap approximation for the least absolute deviation (LAD) estimate of .beta. in AR(1) processes with infinite variance error terms is asymptotically valid in probability when the bootstrap resample size is much smaller than the original sample size. The theoretical validity results are supported by simulation studies.

  • PDF

Estimation of error variance in nonparametric regression under a finite sample using ridge regression

  • Park, Chun-Gun
    • Journal of the Korean Data and Information Science Society
    • /
    • 제22권6호
    • /
    • pp.1223-1232
    • /
    • 2011
  • Tong and Wang's estimator (2005) is a new approach to estimate the error variance using least squares method such that a simple linear regression is asymptotically derived from Rice's lag- estimator (1984). Their estimator highly depends on the setting of a regressor and weights in small sample sizes. In this article, we propose a new approach via a local quadratic approximation to set regressors in a small sample case. We estimate the error variance as the intercept using a ridge regression because the regressors have the problem of multicollinearity. From the small simulation study, the performance of our approach with some existing methods is better in small sample cases and comparable in large cases. More research is required on unequally spaced points.

A New Approach for Autofocusing in Microscopy

  • ;김형중;한형석
    • 한국정보통신설비학회:학술대회논문집
    • /
    • 한국정보통신설비학회 2008년도 정보통신설비 학술대회
    • /
    • pp.186-189
    • /
    • 2008
  • In order to estimate cell images, high-performance electron microscopes are used nowadays. In this paper, we propose a new simple, fast and efficient method for real-time automatic focusing in electron microscopes. The proposed algorithm is based on the prediction-error variance, and demonstrates its feasibility by using extensive experiments. This method is fast, easy to implement, accurate, and not demanding on computation time.

  • PDF

Variance components for two-way nested design data

  • Choi, Jaesung
    • Communications for Statistical Applications and Methods
    • /
    • 제25권3호
    • /
    • pp.275-282
    • /
    • 2018
  • This paper discusses the use of projections for the sums of squares in the analyses of variance for two-way nested design data. The model for this data is assumed to only have random effects. Two different sizes of experimental units are required for a given experimental situation, since nesting is assumed to occur both in the treatment structure and in the design structure. So, variance components are coming from the sources of random effects of treatment factors and error terms in different sizes of experimental units. The model for this type of experimental situation is a random effects model with more than one error terms and therefore estimation of variance components are concerned. A projection method is used for the calculation of sums of squares due to random components. Squared distances of projections instead of using the usual reductions in sums of squares that show how to use projections to estimate the variance components associated with the random components in the assumed model. Expectations of quadratic forms are obtained by the Hartley's synthesis as a means of calculation.

다중회귀분석에 의한 하천 월 유출량의 추계학적 추정에 관한 연구 (A Study on Stochastic Estimation of Monthly Runoff by Multiple Regression Analysis)

  • 김태철;정하우
    • 한국농공학회지
    • /
    • 제22권3호
    • /
    • pp.75-87
    • /
    • 1980
  • Most hydro]ogic phenomena are the complex and organic products of multiple causations like climatic and hydro-geological factors. A certain significant correlation on the run-off in river basin would be expected and foreseen in advance, and the effect of each these causual and associated factors (independant variables; present-month rainfall, previous-month run-off, evapotranspiration and relative humidity etc.) upon present-month run-off(dependent variable) may be determined by multiple regression analysis. Functions between independant and dependant variables should be treated repeatedly until satisfactory and optimal combination of independant variables can be obtained. Reliability of the estimated function should be tested according to the result of statistical criterion such as analysis of variance, coefficient of determination and significance-test of regression coefficients before first estimated multiple regression model in historical sequence is determined. But some error between observed and estimated run-off is still there. The error arises because the model used is an inadequate description of the system and because the data constituting the record represent only a sample from a population of monthly discharge observation, so that estimates of model parameter will be subject to sampling errors. Since this error which is a deviation from multiple regression plane cannot be explained by first estimated multiple regression equation, it can be considered as a random error governed by law of chance in nature. This unexplained variance by multiple regression equation can be solved by stochastic approach, that is, random error can be stochastically simulated by multiplying random normal variate to standard error of estimate. Finally hybrid model on estimation of monthly run-off in nonhistorical sequence can be determined by combining the determistic component of multiple regression equation and the stochastic component of random errors. Monthly run-off in Naju station in Yong-San river basin is estimated by multiple regression model and hybrid model. And some comparisons between observed and estimated run-off and between multiple regression model and already-existing estimation methods such as Gajiyama formula, tank model and Thomas-Fiering model are done. The results are as follows. (1) The optimal function to estimate monthly run-off in historical sequence is multiple linear regression equation in overall-month unit, that is; Qn=0.788Pn+0.130Qn-1-0.273En-0.1 About 85% of total variance of monthly runoff can be explained by multiple linear regression equation and its coefficient of determination (R2) is 0.843. This means we can estimate monthly runoff in historical sequence highly significantly with short data of observation by above mentioned equation. (2) The optimal function to estimate monthly runoff in nonhistorical sequence is hybrid model combined with multiple linear regression equation in overall-month unit and stochastic component, that is; Qn=0. 788Pn+0. l30Qn-1-0. 273En-0. 10+Sy.t The rest 15% of unexplained variance of monthly runoff can be explained by addition of stochastic process and a bit more reliable results of statistical characteristics of monthly runoff in non-historical sequence are derived. This estimated monthly runoff in non-historical sequence shows up the extraordinary value (maximum, minimum value) which is not appeared in the observed runoff as a random component. (3) "Frequency best fit coefficient" (R2f) of multiple linear regression equation is 0.847 which is the same value as Gaijyama's one. This implies that multiple linear regression equation and Gajiyama formula are theoretically rather reasonable functions.

  • PDF

포함확률비례추출에서 회귀계수 최소제곱추정량의 근사분산 (Approximate Variance of Least Square Estimators for Regression Coefficient under Inclusion Probability Proportional to Size Sampling)

  • 김규성
    • Communications for Statistical Applications and Methods
    • /
    • 제19권1호
    • /
    • pp.23-32
    • /
    • 2012
  • 본 논문은 유한모집단에서 회귀계수추정량의 근사편향과 근사분산을 다루고 있다. 유한모집단에서 고정크기 포함확률비례표본을 추출하고 이 표본에서 조사된 데이터에 기초하여 회귀계수를 일반최소제곱추정량과 가중최소제곱추정량으로 추정할 때 두 추정량의 편향, 분산 그리고 평균제곱오차의 근사식을 유도하였다. 그리고 두 추정량의 효율을 비교하기 위하여 두 추정량의 분산을 비교하는 필요충분조건을 제시하였다. 또한 수치적인 비교를 위하여 간단한 예제를 소개하였다.

The restricted maximum likelihood estimation of a censored regression model

  • Lee, Seung-Chun
    • Communications for Statistical Applications and Methods
    • /
    • 제24권3호
    • /
    • pp.291-301
    • /
    • 2017
  • It is well known in a small sample that the maximum likelihood (ML) approach for variance components in the general linear model yields estimates that are biased downward. The ML estimate of residual variance tends to be downwardly biased. The underestimation of residual variance, which has implications for the estimation of marginal effects and asymptotic standard error of estimates, seems to be more serious in some limited dependent variable models, as shown by some researchers. An alternative frequentist's approach may be restricted or residual maximum likelihood (REML), which accounts for the loss in degrees of freedom and gives an unbiased estimate of residual variance. In this situation, the REML estimator is derived in a censored regression model. A small sample the REML is shown to provide proper inference on regression coefficients.

Estimation of the Mean and Variance for Normal Distributions whose Both Sides are Truncated

  • Hong, Chong-Sun;Choi, Yun-Young
    • Communications for Statistical Applications and Methods
    • /
    • 제9권1호
    • /
    • pp.249-259
    • /
    • 2002
  • In order to estimate the mean and variance for a Normal distribution which is truncated at both right and left sides, maximum likelihood estimators based on the entire sample from the original distribution are compared with the sample mean and variance of the censored sample which is the data remaining after truncation using simulation. We found that, surprisingly, the mean squared error of the mean based on the censored data Is smaller than that of the full sample estimators.