• Title/Abstract/Keywords: generalized cross validation function

Search results: 42 items (processing time: 0.022 s)

Claims Reserving via Kernel Machine

  • Kim, Mal-Suk;Park, He-Jung;Hwang, Chang-Ha;Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 19, No. 4
    • /
    • pp.1419-1427
    • /
    • 2008
  • This paper presents a kernel Poisson regression that can be applied to claims reserving, where the row effect is assumed to be a nonlinear function of the row index. The paper concentrates on the chain-ladder technique, within the framework of the chain-ladder linear model. It is shown that the proposed method can provide better reserve estimates than the Poisson model. A cross validation function is introduced to choose the optimal hyperparameters in the procedure. Experimental results are then presented which indicate the performance of the proposed model.


Varying coefficient model with errors in variables

  • 손인석;심주용
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 28, No. 5
    • /
    • pp.971-980
    • /
    • 2017
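  • Varying coefficient regression models have attracted much attention in many scientific fields because, by modeling the dynamic changes of the regression coefficients, they allow an easy interpretation of the relationship between the dependent variable and the input variables and can also estimate the variability of the regression coefficients. In this paper we propose a varying coefficient errors-in-variables model that effectively accounts for errors in both the input and output variables. Since the varying coefficients are nonlinear functions of unknown form of the smoothing variable, a kernel method is used to estimate them. We also propose a generalized cross validation method for finding the optimal values of the hyperparameters that affect the performance of the proposed model. The proposed method is evaluated through numerical studies using simulated and real data.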

Variable selection for multiclassification by LS-SVM

  • Hwang, Hyung-Tae
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 21, No. 5
    • /
    • pp.959-965
    • /
    • 2010
  • For multiclassification, it is often the case that some variables are not important while some variables are more important than others. We propose a novel algorithm for selecting such relevant variables for multiclassification. This algorithm is based on the multiclass least squares support vector machine (LS-SVM) and uses the results of multiclass LS-SVM with the one-vs-all method. Experimental results are then presented which indicate the performance of the proposed method.

IRF-k kriging of electrical resistivity data for estimating the extent of saltwater intrusion in a coastal aquifer system

  • Shim B. O.;Chung S. Y.;Kim H. J.;Sung I. H.
    • Korean Society of Earth and Exploration Geophysicists: Conference Proceedings
    • /
    • Korean Society of Earth and Exploration Geophysicists, 2003, Proceedings of the international symposium on the fusion technology
    • /
    • pp.352-361
    • /
    • 2003
  • We have evaluated the extent of saltwater intrusion from electrical resistivity distribution in a coastal aquifer system in the southeastern part of Busan, Korea. This aquifer system is divided into four layers according to the hydrogeologic characteristics and the horizontal extent of intruded saltwater is determined at each layer through the geostatistical interpretation of electrical resistivity data. In order to define the statistical structure of electrical resistivity data, variogram analysis is carried out to obtain best generalized covariance models. IRF-k (intrinsic random function of order k) kriging is performed with covariance models to produce the plane of spatial mean resistivities. The kriged estimates are evaluated by cross validation to show a good agreement with the true values and the statistics of cross validation represented low errors for the estimates. In the resistivity contour maps more than 5 m below the surface, we can see a dominant direction of saltwater intrusion beginning from the east side. The area of saltwater intrusion increases with depth. The northeast side has low resistivities less than 5 ohm-m due to the presence of saline water in the depth range of 20 m through 70 m. These results show that the application of geostatistical technique to electrical resistivity data is useful for assessing saltwater intrusion in a coastal aquifer system.


Semiparametric kernel logistic regression with longitudinal data

  • Shim, Joo-Yong;Seok, Kyung-Ha
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 23, No. 2
    • /
    • pp.385-392
    • /
    • 2012
  • Logistic regression is a well-known binary classification method in the field of statistical learning. Mixed-effect regression models are widely used for the analysis of correlated data such as those found in longitudinal studies. We consider kernel extensions with semiparametric fixed effects and parametric random effects for logistic regression. The estimation is performed through the penalized likelihood method based on the kernel trick, and our focus is on efficient computation and effective hyperparameter selection. For the selection of optimal hyperparameters, cross-validation techniques are employed. Numerical results are then presented to indicate the performance of the proposed procedure.

Variable selection in the kernel Cox regression

  • Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 22, No. 4
    • /
    • pp.795-801
    • /
    • 2011
  • In machine learning and statistics it is often the case that some variables are not important, while some variables are more important than others. We propose a novel algorithm for selecting such relevant variables in the kernel Cox regression. We employ the weighted version of ANOVA decomposition kernels to choose the optimal subset of relevant variables in the kernel Cox regression. Experimental results are then presented which indicate the performance of the proposed method.

MULTI-PARAMETER TIKHONOV REGULARIZATION PROBLEM WITH MULTIPLE RIGHT HAND SIDES

  • Oh, SeYoung;Kwon, SunJoo
    • Journal of the Chungcheong Mathematical Society
    • /
    • Vol. 33, No. 4
    • /
    • pp.505-516
    • /
    • 2020
  • This study shows that image deblurring problems can be transformed into the multi-parameter Tikhonov type with multiple right hand sides. Also, this paper proposes the extension of the global generalized cross validation to obtain an appropriate choice of the regularization parameters for this problem. The experimental results of using the preconditioned Gl-CGLS algorithm were analyzed.
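
As a rough illustration of how a generalized cross validation (GCV) function guides the choice of a regularization parameter, the sketch below treats the simplest case: a single Tikhonov parameter and a single right-hand side. It is not the multi-parameter, global GCV extension described in the abstract. The names `gcv_score` and `select_lambda` and the grid search are assumptions made for this example.

```python
import numpy as np

def gcv_score(X, y, lam):
    """GCV value for min ||X b - y||^2 + lam * ||b||^2 (single parameter).

    GCV(lam) = n * ||(I - A) y||^2 / trace(I - A)^2,
    where A = X (X^T X + lam I)^{-1} X^T is the influence (hat) matrix.
    """
    n = X.shape[0]
    # Thin SVD: A = U diag(f) U^T with filter factors f = s^2 / (s^2 + lam).
    U, s, _ = np.linalg.svd(X, full_matrices=False)
    f = s**2 / (s**2 + lam)
    fitted = U @ (f * (U.T @ y))
    rss = np.sum((y - fitted) ** 2)
    # trace(I - A) = n - sum(f)
    return n * rss / (n - f.sum()) ** 2

def select_lambda(X, y, grid):
    """Return the grid value with the smallest GCV score, plus all scores."""
    scores = np.array([gcv_score(X, y, lam) for lam in grid])
    return grid[int(np.argmin(scores))], scores
```

A typical call would be `best, scores = select_lambda(X, y, np.logspace(-3, 3, 13))`. The SVD form avoids building the n-by-n hat matrix for every candidate value.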

Mixed effects least squares support vector machine for survival data analysis

  • Hwang, Chang-Ha;Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 23, No. 4
    • /
    • pp.739-748
    • /
    • 2012
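  • The least squares support vector machine (LS-SVM) is a statistical technique widely used in classification and nonlinear regression. In this paper we propose an LS-SVM that can be applied when survival data are observed for each group. To apply randomly right-censored data to a nonlinear regression model, the proposed model uses weights obtained from the Kaplan-Meier estimate of the censoring distribution, and it includes a random effects term to represent the variation between groups. The generalized cross validation function is used to find the optimal values of the penalty constant and the kernel parameters, and in simulation studies the superiority of the proposed method is shown by comparing its performance with that of an LS-SVM without the random effects term.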
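
The abstract does not give the exact form of the weights. One common construction from a Kaplan-Meier estimate of the censoring distribution is the inverse-probability-of-censoring weight, w_i = δ_i / Ĝ(t_i−), where δ_i = 1 marks an observed event and Ĝ is the estimated survival function of the censoring time. The sketch below assumes that form; the function name `km_censoring_weights` is hypothetical, and ties between events and censorings are handled in the simplest way.

```python
import numpy as np

def km_censoring_weights(times, delta):
    """Weights delta_i / G(t_i-) from a Kaplan-Meier fit to the censoring times.

    times : observed times (event or censoring, whichever came first)
    delta : 1 if the event was observed, 0 if the observation was censored
    """
    times = np.asarray(times, dtype=float)
    delta = np.asarray(delta, dtype=int)
    uniq = np.unique(times)  # sorted distinct observed times
    # Number still at risk and number censored at each distinct time.
    at_risk = np.array([(times >= t).sum() for t in uniq])
    censored = np.array([((times == t) & (delta == 0)).sum() for t in uniq])
    # Kaplan-Meier product for the censoring survival function G(t).
    G = np.cumprod(1.0 - censored / at_risk)
    # Left limit G(t-): the value just before each distinct time.
    G_left = np.concatenate(([1.0], G[:-1]))
    idx = np.searchsorted(uniq, times)
    # Censored observations get weight 0; events are up-weighted by 1 / G(t-).
    return np.where(delta == 1, 1.0 / G_left[idx], 0.0)
```

For example, `km_censoring_weights([1, 2, 3, 4], [1, 0, 1, 1])` gives zero weight to the censored second observation and raises the weights of the later events so that they compensate for it.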

Variable selection in censored kernel regression

  • Choi, Kook-Lyeol;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 24, No. 1
    • /
    • pp.201-209
    • /
    • 2013
  • For censored regression, it is often the case that some input variables are not important, while some input variables are more important than others. We propose a novel algorithm for selecting such important input variables for censored kernel regression, which is based on penalized regression with a weighted quadratic loss function for the censored data, where the weight is computed from the empirical survival function of the censoring variable. We employ the weighted version of ANOVA decomposition kernels to choose the optimal subset of important input variables. Experimental results are then presented which indicate the performance of the proposed variable selection method.

Processing parallel-disk viscometry data in the presence of wall slip

  • Leong, Yee-Kwong;Campbell, Graeme R.;Yeow, Y. Leong;Withers, John W.
    • Korea-Australia Rheology Journal
    • /
    • Vol. 20, No. 2
    • /
    • pp.51-58
    • /
    • 2008
  • This paper describes a two-step Tikhonov regularization procedure for converting the steady shear data generated by parallel-disk viscometers, in the presence of wall slip, into a shear stress-shear rate function and a wall shear stress-slip velocity function. If the material under test has a yield stress or a critical wall shear stress below which no slip is observed, the method will also provide an estimate of these stresses. Amplification of measurement noise is kept under control by the introduction of two separate regularization parameters, and Generalized Cross Validation is used to guide the selection of these parameters. The performance of this procedure is demonstrated by applying it to the parallel-disk data of an oil-in-water emulsion, a foam, and a mayonnaise.