• Title/Summary/Keyword: 최소제곱교차타당성

Search Result 6, Processing Time 0.022 seconds

Bandwidth selections based on cross-validation for estimation of a discontinuity point in density (교차타당성을 이용한 확률밀도함수의 불연속점 추정의 띠폭 선택)

  • Huh, Jib
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.765-775
    • /
    • 2012
  • The cross-validation is a popular method to select bandwidth in all types of kernel estimation. The maximum likelihood cross-validation, the least squares cross-validation and biased cross-validation have been proposed for bandwidth selection in kernel density estimation. In the case that the probability density function has a discontinuity point, Huh (2012) proposed a method of bandwidth selection using the maximum likelihood cross-validation. In this paper, two forms of cross-validation with the one-sided kernel function are proposed for bandwidth selection to estimate the location and jump size of the discontinuity point of density. These methods are motivated by the least squares cross-validation and the biased cross-validation. By simulated examples, the finite sample performances of two proposed methods with the one of Huh (2012) are compared.

Analysis of market share attraction data using LS-SVM (최소제곱 서포트벡터기계를 이용한 시장점유율 자료 분석)

  • Park, Hye-Jung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.20 no.5
    • /
    • pp.879-886
    • /
    • 2009
  • The purpose of this article is to present the application of Least Squares Support Vector Machine in analyzing the existing structure of brand. We estimate the parameters of the Market Share Attraction Model using a non-parametric technique for function estimation called Least Squares Support Vector Machine, which allows us to perform even nonlinear regression by constructing a linear regression function in a high dimensional feature space. Estimation by Least Squares Support Vector Machine technique makes it a good candidate for solving the Market Share Attraction Model. To illustrate the performance of the proposed method, we use the car sales data in South Korea's car market.

  • PDF

Estimation of nonlinear GARCH-M model (비선형 평균 일반화 이분산 자기회귀모형의 추정)

  • Shim, Joo-Yong;Lee, Jang-Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.5
    • /
    • pp.831-839
    • /
    • 2010
  • Least squares support vector machine (LS-SVM) is a kernel trick gaining a lot of popularities in the regression and classification problems. We use LS-SVM to propose a iterative algorithm for a nonlinear generalized autoregressive conditional heteroscedasticity model in the mean (GARCH-M) model to estimate the mean and the conditional volatility of stock market returns. The proposed method combines a weighted LS-SVM for the mean and unweighted LS-SVM for the conditional volatility. In this paper, we show that nonlinear GARCH-M models have a higher performance than the linear GARCH model and the linear GARCH-M model via real data estimations.

Mixed effects least squares support vector machine for survival data analysis (생존자료분석을 위한 혼합효과 최소제곱 서포트벡터기계)

  • Hwang, Chang-Ha;Shim, Joo-Yong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.4
    • /
    • pp.739-748
    • /
    • 2012
  • In this paper we propose a mixed effects least squares support vector machine (LS-SVM) for the censored data which are observed from different groups. We use weights by which the randomly right censoring is taken into account in the nonlinear regression. The weights are formed with Kaplan-Meier estimates of censoring distribution. In the proposed model a random effects term representing inter-group variation is included. Furthermore generalized cross validation function is proposed for the selection of the optimal values of hyper-parameters. Experimental results are then presented which indicate the performance of the proposed LS-SVM by comparing with a standard LS-SVM for the censored data.

A study on semi-supervised kernel ridge regression estimation (준지도 커널능형회귀모형에 관한 연구)

  • Seok, Kyungha
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.2
    • /
    • pp.341-353
    • /
    • 2013
  • In many practical machine learning and data mining applications, unlabeled data are inexpensive and easy to obtain. Semi-supervised learning try to use such data to improve prediction performance. In this paper, a semi-supervised regression method, semi-supervised kernel ridge regression estimation, is proposed on the basis of kernel ridge regression model. The proposed method does not require a pilot estimation of the label of the unlabeled data. This means that the proposed method has good advantages including less number of parameters, easy computing and good generalization ability. Experiments show that the proposed method can effectively utilize unlabeled data to improve regression estimation.

MCP, Kernel Density Estimation and LoCoH Analysis for the Core Area Zoning of the Red-crowned Crane's Feeding Habitat in Cheorwon, Korea (철원지역 두루미 취식지의 핵심지역 설정을 위한 MCP, 커널밀도측정법(KDE)과 국지근린지점외곽연결(LoCoH) 분석)

  • Yoo, Seung-Hwa;Lee, Ki-Sup;Park, Chong-Hwa
    • Korean Journal of Environment and Ecology
    • /
    • v.27 no.1
    • /
    • pp.11-21
    • /
    • 2013
  • We tried to find out the core feeding site of the Red-crowned Crane(Grus japonensis) in Cheorwon, Korea by using analysis techniques which are MCP(minimum convex polygon), KDE(kernel density estimation), LoCoH(local nearest-neighbor convex-hull). And, We discussed the difference and meaning of result among analysis methods. We choose the data of utilization distribution from distribution map of Red-crowned Crane in Cheorwon, Korea at $17^{th}$ February 2012. Extent of the distribution area was $140km^2$ by MCP analysis. Extents of core feeding area of the Red-crowned Crane were $33.3km^2$($KDE_{1000m}$), $25.7km^2$($KDE_{CVh}$), $19.7km^2$($KDE_{LSCVh}$), according to the 1000m, CVh, LSCVh in value of bandwidth. Extent, number and shape complexity of the core area has decreased, and size of each core area have decreased as small as the bandwidth size(default:1000m, CVh: 554.6m, LSCVh: 329.9). We would suggest the CVh value in KDE analysis as a proper bandwidth value for the Red-crowned crane's core area zoning. Extent of the distribution range and core area have increased and merged into the large core area as a increasing of k value in LoCoH analysis. Proper value for the selecting core area of Red-crowned Crane's distribution was k=24, and extent of the core area was $18.2km^2$, 16.5% area of total distribution area. Finally, the result of LoCoH analysis, we selected two core area, and number of selected core area was smaller than selected area of KDE analysis. Exact value of bandwidth have not been used in studies using KDE analysis in most articles and presentations of the Korea. As a result, it is needed to clarify the exact using bandwidth value in KDE studies.