• Title/Summary/Keyword: 다변량 모형

Search Result 267, Processing Time 0.03 seconds

Generalized Linear Mixed Model for Multivariate Multilevel Binomial Data (다변량 다수준 이항자료에 대한 일반화선형혼합모형)

  • Lim, Hwa-Kyung;Song, Seuck-Heun;Song, Ju-Won;Cheon, Soo-Young
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.6
    • /
    • pp.923-932
    • /
    • 2008
  • We are likely to face complex multivariate data which can be characterized by having a non-trivial correlation structure. For instance, omitted covariates may simultaneously affect more than one count in clustered data; hence, the modeling of the correlation structure is important for the efficiency of the estimator and the computation of correct standard errors, i.e., valid inference. A standard way to insert dependence among counts is to assume that they share some common unobservable variables. For this assumption, we fitted correlated random effect models considering multilevel model. Estimation was carried out by adopting the semiparametric approach through a finite mixture EM algorithm without parametric assumptions upon the random coefficients distribution.

Selection of Input Nodes in Artificial Neural Network for Bankruptcy Prediction by Integrated Link Weight Analysis (통합 연결강도모형에 의한 부도예측용 인공신경망 모형 입력노드 선정에 관한 연구)

  • 이웅규
    • Proceedings of the Korea Inteligent Information System Society Conference
    • /
    • 2001.06a
    • /
    • pp.359-368
    • /
    • 2001
  • 본 연구에서는 부도예측용 인공신경망의 입력노드 선정을 위한 휴리스틱으로 연결강도분석 접근법을 제안한다. 연결강도분석은 학습이 끝난 인공신경망에서 입력노드와 은닉노드와 연결된 가중치의 절대값 즉, 연결강도를 분석하여 입력변수를 선정하는 접근법으로, 본 연구에서는 약체연결뉴론제거법, 강체연결뉴론선택법 그리고 이 두 기법을 통합한 통합 연결강도 모형을 제안하여 각각 의사결정 트리 및 다변량판별분석에 의해 선정된 입력변수를 이용한 인공신경망 모형과 예측율을 비교한다. 실험 결과 본 연구에서 제안하고 있는 방법론이 의사결정트리나 다다변량판별분석 기법 보다 높은 예측율을 보여 주었다. 특히 두 기법의 통합연결강도 모형의 경우에는 다른 단일 기법보다 높은 예측율을 보이고 있다.

  • PDF

A Study on Generation of Stochastic Rainfall Variation using Multivariate Monte Carlo method (다변량 Monte Carlo 기법을 이용한 추계학적 강우 변동 생성기법에 관한 연구)

  • Ahn, Ki-Hong;Han, Kun-Yeun
    • Journal of the Korean Society of Hazard Mitigation
    • /
    • v.9 no.3
    • /
    • pp.127-133
    • /
    • 2009
  • In this study, dimensionless-cumulative rainfall curves were generated by multivariate Monte Carlo method. For generation of rainfall curve rainfall storms were divided and made into dimensionless type since it was required to remove the spatial and temporal variances as well as differences in rainfall data. The dimensionless rainfall curves were divided into 4 types, and log-ratio method was introduced to overcome the limitations that elements of dimensionless-cumulative rainfall curve should always be more than zero and the sum total should be one. Orthogonal transformation by Johnson system and the constrained non-normal multivariate Monte Carlo simulation were introduced to analyse the rainfall characteristics. The generative technique in stochastic rainfall variation using multivariate Monte Carlo method will contribute to the design and evaluation of hydrosystems and can use the establishment of the flood disaster prevention system.

Evaluation of Agricultural Drought Prevention Ability Based on EOF Analysis and Multi-variate Time Series Model (EOF 해석 및 다변량시계열 모형을 이용한 농업가뭄 대비능력의 평가)

  • Yoo Chul-Sang;Kim Dae-Ha;Kim Sang-Dan
    • Journal of Korea Water Resources Association
    • /
    • v.39 no.7 s.168
    • /
    • pp.617-626
    • /
    • 2006
  • In this study 3-month SPI data from 59 stations over the Korean peninsula are analyzed by deriving and spatially characterizing the EOFs. Also, the coefficient time series of EOF are applied to the multi-variate time series model to generate the time series of 10,000 years, to average them to estimate the areal average, and to decide the maximum drought severity for given return periods. Finally, the drought prevention ability is evaluated by considering the effective storage of dam within the basin and the size of agricultural area. Especially for the return period of 30 years, only the Han river basin has the potential to overcome the drought. Other river basins like the Youngsan river basin, which has a large portion of agricultural area but less water storage, are found to be very vulnerable to the rainfall-sensitive agricultural drought.

Parameter Regionalization of Semi-Distributed Runoff Model Using Multivariate Statistical Analysis (다변량 통계분석을 이용한 준분포형 유출모형 매개변수 지역화)

  • Lee, Byong-Ju;Jung, Il-Won;Bae, Deg-Hyo
    • Journal of Korea Water Resources Association
    • /
    • v.42 no.2
    • /
    • pp.149-160
    • /
    • 2009
  • The objective of this study is to suggest parameter regionalization scheme which is integrated two multivariate statistical methods: principal components analysis(PCA) and hierarchical cluster analysis(HCA). This technique is to apply semi-distributed rainfall-runoff model on ungauged catchments. 7 catchment characteristics (area, mean altitude, mean slope, ratio of forest, water content at saturation, field capacity and wilting point) are estimated for 109 mid-sized sub-basins. The first two components from PCA results account for 82.11% of the total variance in the dataset. Component 1 is related to the location of the catchments relevant to the altitude and Component 2 is connected with the area of these. 103 ungauged catchments are clustered using HCA as the following 6 groups: Goesan 23, Andong 6, Imha 5, Hapcheon 21, Yongdam 4, Seomjin 44. SWAT model is used to simulate runoff and the parameters of the model on the 6 gauged basins are estimated. The model parameters were regionalized for Soyang, Chungju and Daecheong dam basins which are assumed as ungauged ones. The model efficiency coefficients of the simulated inflows for these three dams were at least 0.8. These results also mean that goodness of fit is high to the observed inflows. This research will contribute to estimate and analyze hydrologic components on the ungauged catchments.

Completion of the Missing Rainfall Data by a Multi-regression method (다중회귀분석을 이용한 강우량 결측치 보정)

  • Lee, Myoung-Woo;Lee, Bong-Hee;Kim, Hung-Soo;Shim, Myung-Pil
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2006.05a
    • /
    • pp.775-779
    • /
    • 2006
  • 강우자료의 구축은 수문해석에 있어 가장 기본적이며 중요한 단계라 할 수 있다. 하지만 수문 관측 자료의 경우 결측치가 존재하여 그에 대한 보정이 필요한 경우가 종종 발생하게 된다. 따라서 수문자료의 분석을 수행하기에 앞서 우선 자료에 대한 검정을 실시하고, 결측치가 존재할 경우는 이를 보정하여 분석을 수행하여야 한다. 본 연구에서는 다변량통계기법의 하나인 다중회귀분석을 이용하여 강우 결측치를 보정하였다. 본 연구에서는 다중공선성과 자기상관에 대하여 고려한 다중회귀모형을 구성하였다. 모형의 구성시 모든 결측지점에 적용이 가능하지 않아 일반성이 떨어짐을 확인 할 수 있었지만, 모형이 구성될 경우 통계적 적합도와 유의수준을 확인 할 수 있는 장점이 있었으며, 다중회귀모형이 구성되는 경우 좋은 보정 결과를 주는 것을 확인 할 수 있었다.

  • PDF

Penalized least distance estimator in the multivariate regression model (다변량 선형회귀모형의 벌점화 최소거리추정에 관한 연구)

  • Jungmin Shin;Jongkyeong Kang;Sungwan Bang
    • The Korean Journal of Applied Statistics
    • /
    • v.37 no.1
    • /
    • pp.1-12
    • /
    • 2024
  • In many real-world data, multiple response variables are often dependent on the same set of explanatory variables. In particular, if several response variables are correlated with each other, simultaneous estimation considering the correlation between response variables might be more effective way than individual analysis by each response variable. In this multivariate regression analysis, least distance estimator (LDE) can estimate the regression coefficients simultaneously to minimize the distance between each training data and the estimates in a multidimensional Euclidean space. It provides a robustness for the outliers as well. In this paper, we examine the least distance estimation method in multivariate linear regression analysis, and furthermore, we present the penalized least distance estimator (PLDE) for efficient variable selection. The LDE technique applied with the adaptive group LASSO penalty term (AGLDE) is proposed in this study which can reflect the correlation between response variables in the model and can efficiently select variables according to the importance of explanatory variables. The validity of the proposed method was confirmed through simulations and real data analysis.

An application to Multivariate Zero-Inflated Poisson Regression Model

  • Kim, Kyung-Moo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.14 no.2
    • /
    • pp.177-186
    • /
    • 2003
  • The Zero-Inflated Poisson regression is a model for count data with exess zeros. When the correlated response variables are intrested, we have to extend the univariate zero-inflated regression model to multivariate model. In this paper, we study and simulate the multivariate zero-inflated regression model. A real example was applied to this model. Regression parameters are estimated by using MLE's. We also compare the fitness of multivariate zero-inflated Poisson regression model with the decision tree model.

  • PDF

A Study on the Two-Phased Hybrid Neural Network Approach to an Effective Decision-Making (효과적인 의사결정을 위한 2단계 하이브리드 인공신경망 접근방법에 관한 연구)

  • Lee, Geon-Chang
    • Asia pacific journal of information systems
    • /
    • v.5 no.1
    • /
    • pp.36-51
    • /
    • 1995
  • 본 논문에서는 비구조적인 의사결정문제를 효과적으로 해결하기 위하여 감독학습 인공신경망 모형과 비감독학습 인공신경망 모형을 결합한 하이브리드 인공신경망 모형인 HYNEN(HYbrid NEural Network) 모형을 제안한다. HYNEN모형은 주어진 자료를 클러스터화 하는 CNN(Clustering Neural Network)과 최종적인 출력을 제공하는 ONN(Output Neural Network)의 2단계로 구성되어 있다. 먼저 CNN에서는 주어진 자료로부터 적정한 퍼지규칙을 찾기 위하여 클러스터를 구성한다. 그리고 이러한 클러스터를 지식베이스로하여 ONN에서 최종적인 의사결정을 한다. CNN에서는 SOFM(Self Organizing Feature Map)과 LVQ(Learning Vector Quantization)를 클러스터를 만든 후 역전파학습 인공신경망 모형으로 이를 학습한다. ONN에서는 역전파학습 인공신경망 모형을 이용하여 각 클러스터의 내용을 학습한다. 제안된 HYNEN 모형을 우리나라 기업의 도산자료에 적용하여 그 결과를 다변량 판별분석법(MDA:Multivariate Discriminant Analysis)과 ACLS(Analog Concept Learning System) 퍼지 ARTMAP 그리고 기존의 역전파학습 인공신경망에 의한 실험결과와 비교하였다.

  • PDF