• Title/Summary/Keyword: regression estimation

Search Result 2,211, Processing Time 0.022 seconds

A study on semi-supervised kernel ridge regression estimation (준지도 커널능형회귀모형에 관한 연구)

  • Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.341-353
    • /
    • 2013
  • In many practical machine learning and data mining applications, unlabeled data are inexpensive and easy to obtain. Semi-supervised learning try to use such data to improve prediction performance. In this paper, a semi-supervised regression method, semi-supervised kernel ridge regression estimation, is proposed on the basis of kernel ridge regression model. The proposed method does not require a pilot estimation of the label of the unlabeled data. This means that the proposed method has good advantages including less number of parameters, easy computing and good generalization ability. Experiments show that the proposed method can effectively utilize unlabeled data to improve regression estimation.

Population Distribution Estimation Using Regression-Kriging Model (Regression-Kriging 모형을 이용한 인구분포 추정에 관한 연구)

  • Kim, Byeong-Sun;Ku, Cha-Yong;Choi, Jin-Mu
    • Journal of the Korean Geographical Society
    • /
    • v.45 no.6
    • /
    • pp.806-819
    • /
    • 2010
  • Population data has been essential and fundamental in spatial analysis and commonly aggregated into political boundaries. A conventional method for population distribution estimation was a regression model with land use data, but the estimation process has limitation because of spatial autocorrelation of the population data. This study aimed to improve the accuracy of population distribution estimation by adopting a Regression-Kriging method, namely RK Model, which combines a regression model with Kriging for the residuals. RK Model was applied to a part of Seoul metropolitan area to estimate population distribution based on the residential zones. Comparative results of regression model and RK model using RMSE, MAE, and G statistics revealed that RK model could substantially improve the accuracy of population distribution. It is expected that RK model could be adopted actively for further population distribution estimation.

통계학의 비모수 추정에 관한 역사적 고찰

  • 이승우
    • Journal for History of Mathematics
    • /
    • v.16 no.3
    • /
    • pp.95-100
    • /
    • 2003
  • The recent surge of interest in the more technical aspects of nonparametric density estimation and nonparametric regression estimation has brought the subject into public view. In this paper, we investigate the general concept of the nonparametric density estimation, the nonparametric regression estimation and its performance criteria.

  • PDF

Estimation of Jump Points in Nonparametric Regression

  • Park, Dong-Ryeon
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.6
    • /
    • pp.899-908
    • /
    • 2008
  • If the regression function has jump points, nonparametric estimation method based on local smoothing is not statistically consistent. Therefore, when we estimate regression function, it is quite important to know whether it is reasonable to assume that regression function is continuous. If the regression function appears to have jump points, then we should estimate first the location of jump points. In this paper, we propose a procedure which can do both the testing hypothesis of discontinuity of regression function and the estimation of the number and the location of jump points simultaneously. The performance of the proposed method is evaluated through a simulation study. We also apply the procedure to real data sets as examples.

Trajectory Estimation of Center of Plantar Foot Pressure Using Gaussian Process Regression (가우시안 프로세스 회귀를 이용한 족저압 중심 궤적 추정)

  • Choi, Yuna;Lee, Daehun;Choi, Youngjin
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.3
    • /
    • pp.296-302
    • /
    • 2022
  • This paper proposes a center of plantar foot pressure (CoP) trajectory estimation method based on Gaussian process regression, with the aim to show robust results regardless of the regions and numbers of FSRs of the insole sensor. This method can bring an interpolation between the measurement points inside the wearable insole sensor, and two experiments are conducted for performance evaluation. For this purpose, the input data used in the experiment are generated in three types (13 FSRs, 8 FSRs, 5 FSRs) according to the regions and numbers of FSRs. First, the estimation results of the CoP trajectory are compared using Gaussian process regression and weighted mean. As a result of each method, the estimation results of the two methods were similar in the case of 13 FSRs data. On the other hand, in the case of the 8 and 5 FSRs data, the weighted mean varies depending on the regions and numbers of FSRs, but the estimation results of Gaussian process regression showed similar results in spite of reducing the regions and numbers. Second, the estimation results of the CoP trajectory based on Gaussian process regression during several gait cycles are analyzed. In five gait cycles, the previous cycle and the current estimation results are compared, and it was confirmed that similar trajectories appeared in all. In this way, the method of estimating the CoP trajectory based on Gaussian process regression showed robust results, and stability was confirmed by yielding similar results in several gait cycles.

A study on robust regression estimators in heteroscedastic error models

  • Son, Nayeong;Kim, Mijeong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.5
    • /
    • pp.1191-1204
    • /
    • 2017
  • Weighted least squares (WLS) estimation is often easily used for the data with heteroscedastic errors because it is intuitive and computationally inexpensive. However, WLS estimator is less robust to a few outliers and sometimes it may be inefficient. In order to overcome robustness problems, Box-Cox transformation, Huber's M estimation, bisquare estimation, and Yohai's MM estimation have been proposed. Also, more efficient estimations than WLS have been suggested such as Bayesian methods (Cepeda and Achcar, 2009) and semiparametric methods (Kim and Ma, 2012) in heteroscedastic error models. Recently, Çelik (2015) proposed the weight methods applicable to the heteroscedasticity patterns including butterfly-distributed residuals and megaphone-shaped residuals. In this paper, we review heteroscedastic regression estimators related to robust or efficient estimation and describe their properties. Also, we analyze cost data of U.S. Electricity Producers in 1955 using the methods discussed in the paper.

M-quantile kernel regression for small area estimation (소지역 추정을 위한 M-분위수 커널회귀)

  • Shim, Joo-Yong;Hwang, Chang-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.749-756
    • /
    • 2012
  • An approach widely used for small area estimation is based on linear mixed models. However, when the functional form of the relationship between the response and the input variables is not linear, it may lead to biased estimators of the small area parameters. In this paper we propose M-quantile kernel regression for small area mean estimation allowing nonlinearities in the relationship between the response and the input variables. Numerical studies are presented that show the sample properties of the proposed estimation method.

Generalized nonlinear percentile regression using asymmetric maximum likelihood estimation

  • Lee, Juhee;Kim, Young Min
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.6
    • /
    • pp.627-641
    • /
    • 2021
  • An asymmetric least squares estimation method has been employed to estimate linear models for percentile regression. An asymmetric maximum likelihood estimation (AMLE) has been developed for the estimation of Poisson percentile linear models. In this study, we propose generalized nonlinear percentile regression using the AMLE, and the use of the parametric bootstrap method to obtain confidence intervals for the estimates of parameters of interest and smoothing functions of estimates. We consider three conditional distributions of response variables given covariates such as normal, exponential, and Poisson for three mean functions with one linear and two nonlinear models in the simulation studies. The proposed method provides reasonable estimates and confidence interval estimates of parameters, and comparable Monte Carlo asymptotic performance along with the sample size and quantiles. We illustrate applications of the proposed method using real-life data from chemical and radiation epidemiological studies.

Development of Statistical Model and Neural Network Model for Tensile Strength Estimation in Laser Material Processing of Aluminum Alloy (알루미늄 합금의 레이저 가공에서 인장 강도 예측을 위한 회귀 모델 및 신경망 모델의 개발)

  • Park, Young-Whan;Rhee, Se-Hun
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.24 no.4 s.193
    • /
    • pp.93-101
    • /
    • 2007
  • Aluminum alloy which is one of the light materials has been tried to apply to light weight vehicle body. In order to do that, welding technology is very important. In case of the aluminum laser welding, the strength of welded part is reduced due to porosity, underfill, and magnesium loss. To overcome these problems, laser welding of aluminum with filler wire was suggested. In this study, experiment about laser welding of AA5182 aluminum alloy with AA5356 filler wire was performed according to process parameters such as laser power, welding speed and wire feed rate. The tensile strength was measured to find the weldability of laser welding with filler wire. The models to estimate tensile strength were suggested using three regression models and one neural network model. For regression models, one was the multiple linear regression model, another was the second order polynomial regression model, and the other was the multiple nonlinear regression model. Neural network model with 2 hidden layers which had 5 and 3 nodes respectively was investigated to find the most suitable model for the system. Estimation performance was evaluated for each model using the average error rate. Among the three regression models, the second order polynomial regression model had the best estimation performance. For all models, neural network model has the best estimation performance.

Fuzzy Local Linear Regression Analysis

  • Hong, Dug-Hun;Kim, Jong-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.2
    • /
    • pp.515-524
    • /
    • 2007
  • This paper deals with local linear estimation of fuzzy regression models based on Diamond(1998) as a new class of non-linear fuzzy regression. The purpose of this paper is to introduce a use of smoothing in testing for lack of fit of parametric fuzzy regression models.

  • PDF