• Title/Summary/Keyword: 일반화 모형

Search Result 528, Processing Time 0.028 seconds

A credit classification method based on generalized additive models using factor scores of mixtures of common factor analyzers (공통요인분석자혼합모형의 요인점수를 이용한 일반화가법모형 기반 신용평가)

  • Lim, Su-Yeol;Baek, Jang-Sun
    • Journal of the Korean Data and Information Science Society
    • /
    • v.23 no.2
    • /
    • pp.235-245
    • /
    • 2012
  • Logistic discrimination is an useful statistical technique for quantitative analysis of financial service industry. Especially it is not only easy to be implemented, but also has good classification rate. Generalized additive model is useful for credit scoring since it has the same advantages of logistic discrimination as well as accounting ability for the nonlinear effects of the explanatory variables. It may, however, need too many additive terms in the model when the number of explanatory variables is very large and there may exist dependencies among the variables. Mixtures of factor analyzers can be used for dimension reduction of high-dimensional feature. This study proposes to use the low-dimensional factor scores of mixtures of factor analyzers as the new features in the generalized additive model. Its application is demonstrated in the classification of some real credit scoring data. The comparison of correct classification rates of competing techniques shows the superiority of the generalized additive model using factor scores.

Generalized Weighted Linear Models Based on Distribution Functions - A Frequentist Perspective (분포함수를 기초로 일반화가중선형모형)

  • 여인권
    • The Korean Journal of Applied Statistics
    • /
    • v.17 no.3
    • /
    • pp.489-498
    • /
    • 2004
  • In this paper, a new form of linear models referred to as generalized weighted linear models is proposed. The proposed models assume that the relationship between the response variable and explanatory variables can be modelled by a distribution function of the response mean and a weighted linear combination of distribution functions of covariates. This form addresses a structural problem of the link function in the generalized linear models in which the parameter space may not be consistent with the space derived from linear predictors. The maximum likelihood estimation with Lagrange's undetermined multipliers is used to estimate the parameters and resampling method is applied to compute confidence intervals and to test hypotheses.

Maximum likelihood estimation of Logistic random effects model (로지스틱 임의선형 혼합모형의 최대우도 추정법)

  • Kim, Minah;Kyung, Minjung
    • The Korean Journal of Applied Statistics
    • /
    • v.30 no.6
    • /
    • pp.957-981
    • /
    • 2017
  • A generalized linear mixed model is an extension of a generalized linear model that allows random effect as well as provides flexibility in developing a suitable model when observations are correlated or when there are other underlying phenomena that contribute to resulting variability. We describe maximum likelihood estimation methods for logistic regression models that include random effects - the Laplace approximation, Gauss-Hermite quadrature, adaptive Gauss-Hermite quadrature, and pseudo-likelihood. Applications are provided with social science problems by analyzing the effect of mental health and life satisfaction on volunteer activities from Korean welfare panel data; in addition, we observe that the inclusion of random effects in the model leads to improved analyses with more reasonable inferences.

Forecasting attendance in the Korean professional baseball league using GARCH models (일반화 자기회귀 조건부 이분산 모형을 이용한 한국프로야구 관중수의 예측)

  • Lee, Jang-Taek;Bang, So-Young
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.6
    • /
    • pp.1041-1049
    • /
    • 2010
  • In Korean professional baseball, attendance is the largest source of revenue for development of professional baseball and the highest concern of professional baseball teams. So, if there is demand forecasting model, it will be helpful for pennant chasers to work out the strategies for drawing attendance. For this reason, this research intends to suggest the model which estimates Korean professional baseball's attendance and uses all usable variables which have an effect on attendance in limited circumstances. We supposed that dependent variable is attendance as well as several independent variables and error term are homoscedastic variance. And then, we compared the models which assume conditional heteroscedastic variance like GARCH and EGARCH with GARCH-t models which use the assumption that error term's distribution follows student-t distribution. In result of that, we could confirm that the models which were made by using GARCH(1,1)-t made estimates the most accurately among the several models considered.

수정된 FS방법을 이용한 일반화된 지수생존모형의 추정

  • 하일도;조건호
    • Proceedings of the Korea Society for Industrial Systems Conference
    • /
    • 1999.05a
    • /
    • pp.205-209
    • /
    • 1999
  • 일반화된 지수생존모형(generalized exponential survival model)을 고려하여 이 모형의 모수를 추정하는 수정된 FS(modified Fisher scoring)방법을 제안한다. 이를 위해 우도방정식(likelihood equation)을 유도하고 초기추정치 (initial estimate)를 포함한 추정알고리즘(estimating algorithm)을 개발한다.

  • PDF

A Bayesian Prediction of the Generalized Pareto Model (일반화 파레토 모형에서의 베이지안 예측)

  • Huh, Pan;Sohn, Joong Kweon
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.1069-1076
    • /
    • 2014
  • Rainfall weather patterns have changed due to global warming and sudden heavy rainfalls have become more frequent. Economic loss due to heavy rainfall has increased. We study the generalized Pareto distribution for modelling rainfall in Seoul based on data from 1973 to 2008. We use several priors including Jeffrey's noninformative prior and Gibbs sampling method to derive Bayesian posterior predictive distributions. The probability of heavy rainfall has increased over the last ten years based on estimated posterior predictive distribution.

Estimating GARCH models using kernel machine learning (커널기계 기법을 이용한 일반화 이분산자기회귀모형 추정)

  • Hwang, Chang-Ha;Shin, Sa-Im
    • Journal of the Korean Data and Information Science Society
    • /
    • v.21 no.3
    • /
    • pp.419-425
    • /
    • 2010
  • Kernel machine learning is gaining a lot of popularities in analyzing large or high dimensional nonlinear data. We use this technique to estimate a GARCH model for predicting the conditional volatility of stock market returns. GARCH models are usually estimated using maximum likelihood (ML) procedures, assuming that the data are normally distributed. In this paper, we show that GARCH models can be estimated using kernel machine learning and that kernel machine has a higher predicting ability than ML methods and support vector machine, when estimating volatility of financial time series data with fat tail.

Automatic order selection procedure for count time series models (계수형 시계열 모형을 위한 자동화 차수 선택 알고리즘)

  • Ji, Yunmi;Seong, Byeongchan
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.2
    • /
    • pp.147-160
    • /
    • 2020
  • In this paper, we study an algorithm that automatically determines the orders of past observations and conditional mean values that play an important role in count time series models. Based on the orders of the ARIMA model, the algorithm constitutes the order candidates group for time series generalized linear models and selects the final model based on information criterion among the combinations of the order candidates group. To evaluate the proposed algorithm, we perform small simulations and empirical analysis according to underlying models and time series as well as compare forecasting performances with the ARIMA model. The results of the comparison confirm that the time series generalized linear model offers better performance than the ARIMA model for the count time series analysis. In addition, the empirical analysis shows better performance in mid and long term forecasting than the ARIMA model.

The estimation of winning rate in Korean professional baseball league (한국 프로야구의 승률 추정)

  • Kim, Soon-Kwi;Lee, Young-Hoon
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.3
    • /
    • pp.653-661
    • /
    • 2016
  • In this paper, we provide a suitable optimal exponent in the generalized Pythagorean theorem and propose to use the logistic model & the probit model to estimate the winning rate in Korean professional baseball league. Under a criterion of root-mean-square-error (RMSE), the efficiencies of the proposed models have been compared with those of the Pythagorean theorem. We use the team historic win-loss records of Korean professional baseball league from 1982 to the first half of 2015, and the proposed methods show slight outperformances over the generalized Pythagorean method under the criterion of RMSE.

Hurdle Model for Longitudinal Zero-Inflated Count Data Analysis (영과잉 경시적 가산자료 분석을 위한 허들모형)

  • Jin, Iktae;Lee, Keunbaik
    • The Korean Journal of Applied Statistics
    • /
    • v.27 no.6
    • /
    • pp.923-932
    • /
    • 2014
  • The Hurdle model can to analyze zero-inflated count data. This model is a mixed model of the logit model for a binary component and a truncated Poisson model of a truncated count component. We propose a new hurdle model with a general heterogeneous random effects covariance matrix to analyze longitudinal zero-inflated count data using modified Cholesky decomposition. This decomposition factors the random effects covariance matrix into generalized autoregressive parameters and innovation variance. The parameters are modeled using (generalized) linear models and estimated with a Bayesian method. We use these methods to carefully analyze a real dataset.