• Title/Summary/Keyword: Regression estimators

Search Result 226, Processing Time 0.02 seconds

Censored varying coefficient regression model using Buckley-James method

  • Shim, Jooyong;Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1167-1177
    • /
    • 2017
  • The censored regression using the pseudo-response variable proposed by Buckley and James has been one of the most well-known models. Recently, the varying coefficient regression model has received a great deal of attention as an important tool for modeling. In this paper we propose a censored varying coefficient regression model using Buckley-James method to consider situations where the regression coefficients of the model are not constant but change as the smoothing variables change. By using the formulation of least squares support vector machine (LS-SVM), the coefficient estimators of the proposed model can be easily obtained from simple linear equations. Furthermore, a generalized cross validation function can be easily derived. In this paper, we evaluated the proposed method and demonstrated the adequacy through simulate data sets and real data sets.

Prediction of extreme PM2.5 concentrations via extreme quantile regression

  • Lee, SangHyuk;Park, Seoncheol;Lim, Yaeji
    • Communications for Statistical Applications and Methods
    • /
    • v.29 no.3
    • /
    • pp.319-331
    • /
    • 2022
  • In this paper, we develop a new statistical model to forecast the PM2.5 level in Seoul, South Korea. The proposed model is based on the extreme quantile regression model with lasso penalty. Various meteorological variables and air pollution variables are considered as predictors in the regression model, and the lasso quantile regression performs variable selection and solves the multicollinearity problem. The final prediction model is obtained by combining various extreme lasso quantile regression estimators and we construct a binary classifier based on the model. Prediction performance is evaluated through the statistical measures of the performance of a binary classification test. We observe that the proposed method works better compared to the other classification methods, and predicts 'very bad' cases of the PM2.5 level well.

EVALUATION OF PARAMETER ESTIMATION METHODS FOR NONLINEAR TIME SERIES REGRESSION MODELS

  • Kim, Tae-Soo;Ahn, Jung-Ho
    • Journal of applied mathematics & informatics
    • /
    • v.27 no.1_2
    • /
    • pp.315-326
    • /
    • 2009
  • The unknown parameters in regression models are usually estimated by using various existing methods. There are several existing methods, such as the least squares method, which is the most common one, the least absolute deviation method, the regression quantile method, and the asymmetric least squares method. For the nonlinear time series regression models, which do not satisfy the general conditions, we will compare them in two ways: 1) a theoretical comparison in the asymptotic sense and 2) an empirical comparison using Monte Carlo simulation for a small sample size.

  • PDF

Comparison of Jump-Preserving Smoothing and Smoothing Based on Jump Detector

  • Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods
    • /
    • v.16 no.3
    • /
    • pp.519-528
    • /
    • 2009
  • This paper deals with nonparametric estimation of discontinuous regression curve. Quite number of researches about this topic have been done. These researches are classified into two categories, the indirect approach and direct approach. The major goal of the indirect approach is to obtain good estimates of jump locations, whereas the major goal of the direct approach is to obtain overall good estimate of the regression curve. Thus it seems that two approaches are quite different in nature, so people say that the comparison of two approaches does not make much sense. Therefore, a thorough comparison of them is lacking. However, even though the main issue of the indirect approach is the estimation of jump locations, it is too obvious that we have an estimate of regression curve as the subsidiary result. The point is whether the subsidiary result of the indirect approach is as good as the main result of the direct approach. The performance of two approaches is compared through a simulation study and it turns out that the indirect approach is a very competitive tool for estimating discontinuous regression curve itself.

A Quantitative Model for the Projection of Health Expenditure (의료비 결정요인 분석을 위한 계량적 모형 고안)

  • Kim, Han-Joong;Lee, Young-Doo;Nam, Chung-Mo
    • Journal of Preventive Medicine and Public Health
    • /
    • v.24 no.1 s.33
    • /
    • pp.29-36
    • /
    • 1991
  • A multiple regression analysis using ordinary least square (OLS) is frequently used for the projection of health expenditure as well as for the identification of factors affecting health care costs. Data for the analysis often have mixed characteristics of time series and cross section. Parameters as a result of OLS estimation, in this case, are no longer the best linear unbiased estimators (BLUE) because the data do not satisfy basic assumptions of regression analysis. The study theoretically examined statistical problems induced when OLS estimation was applied with the time series cross section data. Then both the OLS regression and time series cross section regression (TSCS regression) were applied to the same empirical da. Finally, the difference in parameters between the two estimations were explained through residual analysis.

  • PDF

On Convex Combination of Local Constant Regression

  • Mun, Jung-Won;Kim, Choong-Rak
    • Communications for Statistical Applications and Methods
    • /
    • v.13 no.2
    • /
    • pp.379-387
    • /
    • 2006
  • Local polynomial regression is widely used because of good properties such as such as the adaptation to various types of designs, the absence of boundary effects and minimax efficiency Choi and Hall (1998) proposed an estimator of regression function using a convex combination idea. They showed that a convex combination of three local linear estimators produces an estimator which has the same order of bias as a local cubic smoother. In this paper we suggest another estimator of regression function based on a convex combination of five local constant estimates. It turned out that this estimator has the same order of bias as a local cubic smoother.

Wavelet Estimation of Regression Functions with Errors in Variables

  • Kim, Woo-Chul;Koo, Ja-Yong
    • Communications for Statistical Applications and Methods
    • /
    • v.6 no.3
    • /
    • pp.849-860
    • /
    • 1999
  • This paper addresses the issue of estimating regression function with errors in variables using wavelets. We adopt a nonparametric approach in assuming that the regression function has no specific parametric form, To account for errors in covariates deconvolution is involved in the construction of a new class of linear wavelet estimators. using the wavelet characterization of Besov spaces the question of regression estimation with Besov constraint can be reduced to a problem in a space of sequences. Rates of convergence are studied over Besov function classes $B_{spq}$ using $L_2$ error measure. It is shown that the rates of convergence depend on the smoothness s of the regression function and the decay rate of characteristic function of the contaminating error.

  • PDF

Regression discontinuity for survival data

  • Youngjoo Cho
    • Communications for Statistical Applications and Methods
    • /
    • v.31 no.1
    • /
    • pp.155-178
    • /
    • 2024
  • Regression discontinuity (RD) design is one of the most widely used methods in causal inference for estimation of treatment effect when the treatment is created by a cutpoint from the covariate of interest. There has been little attention to RD design, although it provides a very useful tool for analysis of treatment effect for censored data. In this paper, we define the causal effect for survival function in RD design when the treatment is assigned deterministically by the covariate of interest. We propose estimators of this causal effect for survival data by using transformation, which leads unbiased estimator of the survival function with local linear regression. Simulation studies show the validity of our approach. We also illustrate our proposed method using the prostate, lung, colorectal and ovarian (PLCO) dataset.

A FRAMEWORK TO UNDERSTAND THE ASYMPTOTIC PROPERTIES OF KRIGING AND SPLINES

  • Furrer Eva M.;Nychka Douglas W.
    • Journal of the Korean Statistical Society
    • /
    • v.36 no.1
    • /
    • pp.57-76
    • /
    • 2007
  • Kriging is a nonparametric regression method used in geostatistics for estimating curves and surfaces for spatial data. It may come as a surprise that the Kriging estimator, normally derived as the best linear unbiased estimator, is also the solution of a particular variational problem. Thus, Kriging estimators can also be interpreted as generalized smoothing splines where the roughness penalty is determined by the covariance function of a spatial process. We build off the early work by Silverman (1982, 1984) and the analysis by Cox (1983, 1984), Messer (1991), Messer and Goldstein (1993) and others and develop an equivalent kernel interpretation of geostatistical estimators. Given this connection we show how a given covariance function influences the bias and variance of the Kriging estimate as well as the mean squared prediction error. Some specific asymptotic results are given in one dimension for Matern covariances that have as their limit cubic smoothing splines.

A Dual Problem of Calibration of Design Weights Based on Multi-Auxiliary Variables

  • Al-Jararha, J.
    • Communications for Statistical Applications and Methods
    • /
    • v.22 no.2
    • /
    • pp.137-146
    • /
    • 2015
  • Singh (2013) considered the dual problem to the calibration of design weights to obtain a new generalized linear regression estimator (GREG) for the finite population total. In this work, we have made an attempt to suggest a way to use the dual calibration of the design weights in case of multi-auxiliary variables; in other words, we have made an attempt to give an answer to the concern in Remark 2 of Singh (2013) work. The same idea is also used to generalize the GREG estimator proposed by Deville and S$\ddot{a}$rndal (1992). It is not an easy task to find the optimum values of the parameters appear in our approach; therefore, few suggestions are mentioned to select values for such parameters based on a random sample. Based on real data set and under simple random sampling without replacement design, our approach is compared with other approaches mentioned in this paper and for different sample sizes. Simulation results show that all estimators have negligible relative bias, and the multivariate case of Singh (2013) estimator is more efficient than other estimators.