• Title/Summary/Keyword: Selection model

Search Result 4,043, Processing Time 0.027 seconds

On the Model Selection Criteria in Normal Distributions

  • Chung, Han-Yeong;Lee, Kee-Won
    • Journal of the Korean Statistical Society
    • /
    • v.21 no.2
    • /
    • pp.93-110
    • /
    • 1992
  • A model selection approach is used to find out whether the mean and the variance of a unique sample are different from the pre-specified values. Normal distribution is selected as an approximating model. Kullback-Leibler discrepancy comes out as a natural measure of discrepancy between the operating model and the approximating model. Several estimates of selection criterion are computed including AIC, TIC, and a coupleof bootstrap estimator of the selection criterion are considered according to the way of resampling. It is shown that a closed form expression is available for the parametric bootstrap estimated cirterion. A Monte Carlo study is provided to give a formal comparison when the operating family itself is normally distributed.

  • PDF

Bayesian Inference and Model Selection for Software Growth Reliability Models using Gibbs Sampler (몬테칼로 깁스방법을 적용한 소프트웨어 신뢰도 성장모형에 대한 베이지안 추론과 모형선택에 관한 연구)

  • 김희철;이승주
    • Journal of Korean Society for Quality Management
    • /
    • v.27 no.3
    • /
    • pp.125-141
    • /
    • 1999
  • Bayesian inference and model selection method for software reliability growth models are studied. Software reliability growth models are used in testing stages of software development to model the error content and time intervals between software failures. In this paper, we could avoid the multiple integration by the use of Gibbs sampling, which is a kind of Markov Chain Monte Carlo method to compute the posterior distribution. Bayesian inference and model selection method for Jelinski-Moranda and Goel-Okumoto and Schick-Wolverton models in software reliability with Poisson prior information are studied. For model selection, we explored the relative error.

  • PDF

Investment Performance of Markowitz's Portfolio Selection Model in the Korean Stock Market (한국 주식시장에서 비선형계획법을 이용한 마코위츠의 포트폴리오 선정 모형의 투자 성과에 관한 연구)

  • Kim, Seong-Moon;Kim, Hong-Seon
    • Korean Management Science Review
    • /
    • v.26 no.2
    • /
    • pp.19-35
    • /
    • 2009
  • This paper investigated performance of the Markowitz's portfolio selection model with applications to Korean stock market. We chose Samsung-Group-Funds and KOSPI index for performance comparison with the Markowitz's portfolio selection model. For the most recent one and a half year period between March 2007 and September 2008, KOSPI index almost remained the same with only 0.1% change, Samsung-Group-Funds showed 20.54% return, and Markowitz's model, which is composed of the same 17 Samsung group stocks, achieved 52% return. We performed sensitivity analysis on the duration of financial data and the frequency of portfolio change in order to maximize the return of portfolio. In conclusion, according to our empirical research results with Samsung-Group-Funds, investment by Markowitz's model, which periodically changes portfolio by using nonlinear programming with only financial data, outperformed investment by the fund managers who possess rich experiences on stock trading and actively change portfolio by the minute-by-minute market news and business information.

Feature selection in the semivarying coefficient LS-SVR

  • Hwang, Changha;Shim, Jooyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.28 no.2
    • /
    • pp.461-471
    • /
    • 2017
  • In this paper we propose a feature selection method identifying important features in the semivarying coefficient model. One important issue in semivarying coefficient model is how to estimate the parametric and nonparametric components. Another issue is how to identify important features in the varying and the constant effects. We propose a feature selection method able to address this issue using generalized cross validation functions of the varying coefficient least squares support vector regression (LS-SVR) and the linear LS-SVR. Numerical studies indicate that the proposed method is quite effective in identifying important features in the varying and the constant effects in the semivarying coefficient model.

Least absolute deviation estimator based consistent model selection in regression

  • Shende, K.S.;Kashid, D.N.
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.3
    • /
    • pp.273-293
    • /
    • 2019
  • We consider the problem of model selection in multiple linear regression with outliers and non-normal error distributions. In this article, the robust model selection criterion is proposed based on the robust estimation method with the least absolute deviation (LAD). The proposed criterion is shown to be consistent. We suggest proposed criterion based algorithms that are suitable for a large number of predictors in the model. These algorithms select only relevant predictor variables with probability one for large sample sizes. An exhaustive simulation study shows that the criterion performs well. However, the proposed criterion is applied to a real data set to examine its applicability. The simulation results show the proficiency of algorithms in the presence of outliers, non-normal distribution, and multicollinearity.

Penalized variable selection for accelerated failure time models

  • Park, Eunyoung;Ha, Il Do
    • Communications for Statistical Applications and Methods
    • /
    • v.25 no.6
    • /
    • pp.591-604
    • /
    • 2018
  • The accelerated failure time (AFT) model is a linear model under the log-transformation of survival time that has been introduced as a useful alternative to the proportional hazards (PH) model. In this paper we propose variable-selection procedures of fixed effects in a parametric AFT model using penalized likelihood approaches. We use three popular penalty functions, least absolute shrinkage and selection operator (LASSO), adaptive LASSO and smoothly clipped absolute deviation (SCAD). With these procedures we can select important variables and estimate the fixed effects at the same time. The performance of the proposed method is evaluated using simulation studies, including the investigation of impact of misspecifying the assumed distribution. The proposed method is illustrated with a primary biliary cirrhosis (PBC) data set.

Model Selection in Artificial Neural Network

  • Kim, Byung Joo
    • International journal of advanced smart convergence
    • /
    • v.7 no.4
    • /
    • pp.57-65
    • /
    • 2018
  • Artificial neural network is inspired by the biological neural network. For simplicity, in computer science, it is represented as a set of layers. Many research has been made in evaluating the number of neurons in the hidden layer but still, none was accurate. Several methods are used until now which do not provide the exact formula for calculating the number of thehidden layer as well as the number of neurons in each hidden layer. In this paper model selection approach was presented. Proposed model is based on geographical analysis of decision boundary. Proposed model selection method is useful when we know the distribution of the training data set. To evaluate the performance of the proposed method we compare it to the traditional architecture on IRIS classification problem. According to the experimental result on Iris data proposed method is turned out to be a powerful one.

Efficient estimation and variable selection for partially linear single-index-coefficient regression models

  • Kim, Young-Ju
    • Communications for Statistical Applications and Methods
    • /
    • v.26 no.1
    • /
    • pp.69-78
    • /
    • 2019
  • A structured model with both single-index and varying coefficients is a powerful tool in modeling high dimensional data. It has been widely used because the single-index can overcome the curse of dimensionality and varying coefficients can allow nonlinear interaction effects in the model. For high dimensional index vectors, variable selection becomes an important question in the model building process. In this paper, we propose an efficient estimation and a variable selection method based on a smoothing spline approach in a partially linear single-index-coefficient regression model. We also propose an efficient algorithm for simultaneously estimating the coefficient functions in a data-adaptive lower-dimensional approximation space and selecting significant variables in the index with the adaptive LASSO penalty. The empirical performance of the proposed method is illustrated with simulated and real data examples.

Comparing Bayesian model selection with a frequentist approach using iterative method of smoothing residuals

  • Koo, Hanwool;Shafieloo, Arman;Keeley, Ryan E.;L'Huillier, Benjamin
    • The Bulletin of The Korean Astronomical Society
    • /
    • v.46 no.1
    • /
    • pp.48.2-48.2
    • /
    • 2021
  • We have developed a frequentist approach for model selection which determines consistency of a cosmological model and the data using the distribution of likelihoods from the iterative smoothing method. Using this approach, we have shown how confidently we can distinguish different models without comparison with one another. In this current work, we compare our approach with conventional Bayesian approach based on estimation of Bayesian evidence using nested sampling for the purpose of model selection. We use simulated future Roman (formerly WFIRST)-like type Ia supernovae data in our analysis. We discuss limits of the Bayesian approach for model selection and display how our proposed frequentist approach, if implemented appropriately, can perform better in falsification of individual models.

  • PDF

A new sample selection model for overdispersed count data (과대산포 가산자료의 새로운 표본선택모형)

  • Jo, Sung Eun;Zhao, Jun;Kim, Hyoung-Moon
    • The Korean Journal of Applied Statistics
    • /
    • v.31 no.6
    • /
    • pp.733-749
    • /
    • 2018
  • Sample selection arises as a result of the partial observability of the outcome of interest in a study. Heckman introduced a sample selection model to analyze such data and proposed a full maximum likelihood estimation method under the assumption of normality. Recently sample selection models for binomial and Poisson response variables have been proposed. Based on the theory of symmetry-modulated distribution, we extend these to a model for overdispersed count data. This type of data with no sample selection is often modeled using negative binomial distribution. Hence we propose a sample selection model for overdispersed count data using the negative binomial distribution. A real data application is employed. Simulation studies reveal that our estimation method based on profile log-likelihood is stable.