• Title/Summary/Keyword: 회귀 모형

Search Result 3,339, Processing Time 0.032 seconds

A Comparative Study on the Genetic Algorithm and Regression Analysis in Urban Population Surface Modeling (도시인구분포모형 개발을 위한 GA모형과 회귀모형의 적합성 비교연구)

  • Choei, Nae-Young
    • Spatial Information Research
    • /
    • v.18 no.5
    • /
    • pp.107-117
    • /
    • 2010
  • Taking the East-Hwasung area as the case, this study first builds gridded population data based on the municipal population survey raw data, and then measures, by way of GIS tools, the major urban spatial variables that are thought to influence the composition of the regional population. For the purpose of comparison, the urban models based on the Genetic Algorithm technique and the regression technique are constructed using the same input variables. The findings indicate that the GA output performed better in differentiating the effective variables among the pilot model variables, and predicted as much consistent and meaningful coefficient estimates for the explanatory variables as the regression models. The study results indicate that GA technique could be a very useful and supplementary research tool in understanding the urban phenomena.

Penalized quantile regression tree (벌점화 분위수 회귀나무모형에 대한 연구)

  • Kim, Jaeoh;Cho, HyungJun;Bang, Sungwan
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.7
    • /
    • pp.1361-1371
    • /
    • 2016
  • Quantile regression provides a variety of useful statistical information to examine how covariates influence the conditional quantile functions of a response variable. However, traditional quantile regression (which assume a linear model) is not appropriate when the relationship between the response and the covariates is a nonlinear. It is also necessary to conduct variable selection for high dimensional data or strongly correlated covariates. In this paper, we propose a penalized quantile regression tree model. The split rule of the proposed method is based on residual analysis, which has a negligible bias to select a split variable and reasonable computational cost. A simulation study and real data analysis are presented to demonstrate the satisfactory performance and usefulness of the proposed method.

Adaptive lasso in sparse vector autoregressive models (Adaptive lasso를 이용한 희박벡터자기회귀모형에서의 변수 선택)

  • Lee, Sl Gi;Baek, Changryong
    • The Korean Journal of Applied Statistics
    • /
    • v.29 no.1
    • /
    • pp.27-39
    • /
    • 2016
  • This paper considers variable selection in the sparse vector autoregressive (sVAR) model where sparsity comes from setting small coefficients to exact zeros. In the estimation perspective, Davis et al. (2015) showed that the lasso type of regularization method is successful because it provides a simultaneous variable selection and parameter estimation even for time series data. However, their simulations study reports that the regular lasso overestimates the number of non-zero coefficients, hence its finite sample performance needs improvements. In this article, we show that the adaptive lasso significantly improves the performance where the adaptive lasso finds the sparsity patterns superior to the regular lasso. Some tuning parameter selections in the adaptive lasso are also discussed from the simulations study.

The study on the determinants of the number of job changes (중소기업 청년인턴 이직횟수 결정요인 분석)

  • Park, Sungik;Ryu, Jangsoo;Kim, Jonghan;Cho, Jangsik
    • Journal of the Korean Data and Information Science Society
    • /
    • v.26 no.2
    • /
    • pp.387-397
    • /
    • 2015
  • In this paper, the determinants of the number of job changes in the SMEs (small and medium enterprises) youth-intern project is analysed, utilizing SMEs youth-intern DB and employment insurance DB. Since the number of job changes are count data which take integer values other than negative values, general linear regression analysis becomes inappropriate. Therefore, four models such as Poisson regression model, zero inflated Poisson regression model, negative binomial regression model and zero inflated negative binomial regression model are tried to fit count data. A zero inflated negative binomial regression model is selected to be the best model. Major results are the followings. First, the number of job changes is shown to be significantly smaller in the treatment group than in the control group. Second, the number of job changes turns out to be significantly smaller in the young-age group than in the old-age group. Third, it is also shown that the number of job changes of man is significantly greater than that of woman. Lastly, the number of job changes in the bigger firm is shown to be significantly less than that of the smaller firm.

Applying regional regression analysis of the hydrologic model parameters for assessing climate change impacts in the ungaged watershed (미계측 유역의 기후변화 영향평가를 위한 수문모형 매개변수의 지역회귀분석 적용)

  • Kim, Youngil;Seo, Seung Beom;Kim, Sung Jin;Kim, Young-Oh
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2017.05a
    • /
    • pp.219-219
    • /
    • 2017
  • 상대적으로 유역의 관측 자료가 충분하지 못하거나 검증되지 않았을 경우 미계측 유역으로 정의되며 수문모형의 매개변수 검정을 할 수 없으므로 다른 방법을 고안해야 한다. 이를 위해 기존 연구에서는 지역적 특성을 고려한 지역회기분석을 통해 미계측 유역의 유량을 산정하였는데, 대부분 유역의 특성과 연 평균 유출량 자료의 관계를 이용한 회귀식으로 실시간 유량의 변화를 고려하기 어려웠다. 본 연구에서는 개념적 강우-유출모형으로 많이 사용되고 있는 개념적 수문모형인 GR4J의 매개변수에 대해 미계측 유역의 특성을 고려한 변수들을 이용하여 회귀식을 구하고 그 적용성을 평가하였다. 이를 통해 미계측 유역의 유량 시계열 자료를 생성할 수 있었다. 또한 IPCC에서 발간한 AR5의 RCP 4.5 시나리오를 적용하여 미래 유출량을 산정하였다. 우선 지역회귀분석을 적용하기 위해 수문모형을 이용한 계측 유역의 유출량을 구하였으며 22개의 전국 댐 상류 지점을 기준으로 SCE 알고리즘을 이용하여 GR4J의 최적 매개변수를 구하고 각 유역별로 물리적, 지형적, 기상학적 특성을 고려하여 11개의 변수를 선택하였다. 각 변수간 다중공선성(Multicollinearity)를 고려하기 위해 VIF(Variation Inflation Factor) test를 적용하여 최종 7개의 변수를 선정하고 단계별 회귀방법(Stepwise regression)을 이용하여 GR4J의 매개변수별 회귀식을 생성하였다.

  • PDF

Generally non-linear regression model containing standardized lift for association number estimation (연관성 규칙 수의 추정을 위한 일반적인 비선형 회귀모형에서의 표준화 향상도 활용 방안)

  • Park, Hee Chang
    • Journal of the Korean Data and Information Science Society
    • /
    • v.27 no.3
    • /
    • pp.629-638
    • /
    • 2016
  • Among data mining techniques, the association rule is one of the most used in the real fields because it clearly displays the relationship between two or more items in large databases by quantifying the relationship between the items. There are three primary quality measures for association rule; support, confidence, and lift. We evaluate association rules using these measures. The approach taken in the previous literatures as to estimation of association rule number has been one of a determination function method or a regression modeling approach. In this paper, we proposed a few of non-linear regression equations useful in estimating the number of rules and also evaluated the estimated association rules using the quality measures. Furthermore we assessed their usefulness as compared to conventional regression models using the values of regression coefficients, F statistics, adjusted coefficients of determination and variation inflation factor.

An Empirical Study on the Estimation of Housing Sales Price using Spatiotemporal Autoregressive Model (시공간자기회귀(STAR)모형을 이용한 부동산 가격 추정에 관한 연구)

  • Chun, Hae Jung;Park, Heon Soo
    • Korea Real Estate Review
    • /
    • v.24 no.1
    • /
    • pp.7-14
    • /
    • 2014
  • This study, as the temporal and spatial data for the real price apartment in Seoul from January 2006 to June 2013, empirically compared and analyzed the estimation result of apartment price using OLS by hedonic price model for the problem of space-time correlation, temporal autoregressive model (TAR) considering temporal effect, spatial autoregressive model (SAR) spatial effect and spatiotemporal autoregressive model (STAR) spatiotemporal effect. As a result, the adjusted R-square of STAR model was increased by 10% compared that of OLS model while the root mean squares error (RMSE) was decreased by 18%. Considering temporal and spatial effect, it is observed that the estimation of apartment price is more correct than the existing model. As the result of analyzing STAR model, the apartment price is affected as follows; area for apartment(-), years of apartment(-), dummy of low-rise(-), individual heating (-), city gas(-), dummy of reconstruction(+), stairs(+), size of complex(+). The results of other analysis method were the same. When estimating the price of real estate using STAR model, the government officials can improve policy efficiency and make reasonable investment based on the objective information by grasping trend of real estate market accurately.

Simple principal component analysis using Lasso (라소를 이용한 간편한 주성분분석)

  • Park, Cheolyong
    • Journal of the Korean Data and Information Science Society
    • /
    • v.24 no.3
    • /
    • pp.533-541
    • /
    • 2013
  • In this study, a simple principal component analysis using Lasso is proposed. This method consists of two steps. The first step is to compute principal components by the principal component analysis. The second step is to regress each principal component on the original data matrix by Lasso regression method. Each of new principal components is computed as the linear combination of original data matrix using the scaled estimated Lasso regression coefficient as the coefficients of the combination. This method leads to easily interpretable principal components with more 0 coefficients by the properties of Lasso regression models. This is because the estimator of the regression of each principal component on the original data matrix is the corresponding eigenvector. This method is applied to real and simulated data sets with the help of an R package for Lasso regression and its usefulness is demonstrated.

비선형회귀분석에서 편잔차그림에 대한 연구

  • 강명욱;김정혜
    • Communications for Statistical Applications and Methods
    • /
    • v.5 no.3
    • /
    • pp.571-580
    • /
    • 1998
  • 선형회귀분석에서 새로운 변수가 모형에 추가될 때 변수변환의 필요성과 적절한 변환의 형태를 진단하는 기능이 있다고 알려져 있는 편잔차그림과 덧편잔차그림을 비선형회귀모형에 적용하고 이 그림들이 기능을 제대로 수행할 수 있는 조건을 알아보았다.

  • PDF

선형모형에서 오차의 대칭성에 대한 검정과 회귀계수의 추정에 관한 연구

  • 김순옥
    • Communications for Statistical Applications and Methods
    • /
    • v.2 no.1
    • /
    • pp.13-21
    • /
    • 1995
  • 선형모형에서 오차가 대칭인 분포를 따르는지 또는 한쪽으로 치우친(skewed distribution)분포를 따르는지 검정하는 문제를 다루었다. 또 이러한 검정과정을 분석의 예비단계로 하는 회귀계수의 추정방법에 대해서 연구하고, 모의실험을 통해서 회귀계수 추정법들의 효율을 비교하였다.

  • PDF