• Title/Summary/Keyword: multivariate modeling

Search Result 115, Processing Time 0.031 seconds

A Study on the Stochastic Modeling for Stream Flow Generation (하천유량의 모의발생을 위한 추계학적 모형의 적용에 관한 연구)

  • Lee, Joo-Heon
    • Journal of the Korean Society of Hazard Mitigation
    • /
    • v.1 no.2 s.2
    • /
    • pp.115-121
    • /
    • 2001
  • The purpose of the synthetic generation of monthly river flows based on the short term observed data by means of stochastic models is to provide abundant input data to the water resources systems of which the system performance and operation policy are to be determined beforehand. In this study, a multivariate autoregressive model has been applied to generate monthly flows of the multi sites considering the correlations between each site. The model performance was examined using statistical comparisons between the historical and generated monthly series such as mean, variance, skewness and correlation coefficients. The results of this study showed that the modeled generated flows were statistically similar to the historical flows.

  • PDF

Development of MKDE-ebd for Estimation of Multivariate Probabilistic Distribution Functions (다변량 확률분포함수의 추정을 위한 MKDE-ebd 개발)

  • Kang, Young-Jin;Noh, Yoojeong;Lim, O-Kaung
    • Journal of the Computational Structural Engineering Institute of Korea
    • /
    • v.32 no.1
    • /
    • pp.55-63
    • /
    • 2019
  • In engineering problems, many random variables have correlation, and the correlation of input random variables has a great influence on reliability analysis results of the mechanical systems. However, correlated variables are often treated as independent variables or modeled by specific parametric joint distributions due to difficulty in modeling joint distributions. Especially, when there are insufficient correlated data, it becomes more difficult to correctly model the joint distribution. In this study, multivariate kernel density estimation with bounded data is proposed to estimate various types of joint distributions with highly nonlinearity. Since it combines given data with bounded data, which are generated from confidence intervals of uniform distribution parameters for given data, it is less sensitive to data quality and number of data. Thus, it yields conservative statistical modeling and reliability analysis results, and its performance is verified through statistical simulation and engineering examples.

Absorbtion Spectroscopy, Molecular Dynamics Calculations, and Multivariate Curve Resolution on the Phthalocyanine Aggregation

  • Ajloo, Davood;Ghadamgahi, Maryam;Shaheri, Freshte;Zarei, Kobra
    • Bulletin of the Korean Chemical Society
    • /
    • v.35 no.5
    • /
    • pp.1440-1448
    • /
    • 2014
  • Co(II)-tetrasulfonated phthalocyanine (CoTSP) is known to be aggregated to dimer at high concentration levels in water. A study on the aggregation of CoTSP using multivariate curve resolution analysis of the visible absorbance spectra over a concentration range of 30, 40 and 50 ${\mu}M$ in the presence of dimethyl sulfoxide (DMSO), dimethyl formamide (DMF), acetonitrile (AN) and ethanol (EtOH) in the concentration range of 0 to 3.57 M is conducted. A hard modeling-based multivariate curve resolution method was applied to determine the dissociation constants of the CoTSP aggregates at various temperatures ranging from 25, 45 and $65^{\circ}C$ and in the presence of various co-solvents. Dissociation constant for aggregation was increased and then decrease by temperature and concentration of phthalocyanine, respectively. Utilizing the vant Hoff relation, the enthalpy and entropy of the dissociation equilibriums were calculated. For the dissociation of both aggregates, the enthalpy and entropy changes were positive and negative, respectively. Molecular dynamics simulation of cosolvent effect on CoTSP aggregation was done to confirm spectroscopy results. Results of radial distribution function (RDF), root mean square deviation (RMSD) and distance curves confirmed more effect of polar solvent to decrease monomer formation.

Volatility Analysis for Multivariate Time Series via Dimension Reduction (차원축소를 통한 다변량 시계열의 변동성 분석 및 응용)

  • Song, Eu-Gine;Choi, Moon-Sun;Hwang, S.Y.
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.6
    • /
    • pp.825-835
    • /
    • 2008
  • Multivariate GARCH(MGARCH) has been useful in financial studies and econometrics for modeling volatilities and correlations between components of multivariate time series. An obvious drawback lies in that the number of parameters increases rapidly with the number of variables involved. This thesis tries to resolve the problem by using dimension reduction technique. We briefly review both factor models for dimension reduction and the MGARCH models including EWMA (Exponentially weighted moving-average model), DVEC(Diagonal VEC model), BEKK and CCC(Constant conditional correlation model). We create meaningful portfolios obtained after reducing dimension through statistical factor models and fundamental factor models and in turn these portfolios are applied to MGARCH. In addition, we compare portfolios by assessing MSE, MAD(Mean absolute deviation) and VaR(Value at Risk). Various financial time series are analyzed for illustration.

Change points detection for nonstationary multivariate time series

  • Yeonjoo Park;Hyeongjun Im;Yaeji Lim
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.4
    • /
    • pp.369-388
    • /
    • 2023
  • In this paper, we develop the two-step procedure that detects and estimates the position of structural changes for multivariate nonstationary time series, either on mean parameters or second-order structures. We first investigate the presence of mean structural change by monitoring data through the aggregated cumulative sum (CUSUM) type statistic, a sequential procedure identifying the likely position of the change point on its trend. If no mean change point is detected, the proposed method proceeds to scan the second-order structural change by modeling the multivariate nonstationary time series with a multivariate locally stationary Wavelet process, allowing the time-localized auto-correlation and cross-dependence. Under this framework, the estimated dynamic spectral matrices derived from the local wavelet periodogram capture the time-evolving scale-specific auto- and cross-dependence features of data. We then monitor the change point from the lower-dimensional approximated space of the spectral matrices over time by applying the dynamic principal component analysis. Different from existing methods requiring prior information on the type of changes between mean and covariance structures as an input for the implementation, the proposed algorithm provides the output indicating the type of change and the estimated location of its occurrence. The performance of the proposed method is demonstrated in simulations and the analysis of two real finance datasets.

Scientific and Technical Visualization for Ocean Process Simulations (해양과정시뮬레이션의 과학기술적가시화)

  • Choi Byung Ho
    • 한국전산유체공학회:학술대회논문집
    • /
    • 1999.05a
    • /
    • pp.1-10
    • /
    • 1999
  • This paper briefly introduces the work done up to 1998 during the past twenty years for numerical modeling of ocean process focussing on the neighbouring seas of Korean Peninsula. Modeling of global ocean dynamics has also been performed as a pathway to understand the regional ocean dynamics. The ocean simulation produces a vast amount of multidimensional multivariate dataset therefore adoption of scientific and technical visualization techniques were essential to properly understand the physics involved.

  • PDF

Comparison study of modeling covariance matrix for multivariate longitudinal data (다변량 경시적 자료 분석을 위한 공분산 행렬의 모형화 비교 연구)

  • Kwak, Na Young;Lee, Keunbaik
    • The Korean Journal of Applied Statistics
    • /
    • v.33 no.3
    • /
    • pp.281-296
    • /
    • 2020
  • Repeated outcomes from the same subjects are referred to as longitudinal data. Analysis of the data requires different methods unlike cross-sectional data analysis. It is important to model the covariance matrix because the correlation between the repeated outcomes must be considered when estimating the effects of covariates on the mean response. However, the modeling of the covariance matrix is tricky because there are many parameters to be estimated, and the estimated covariance matrix should be positive definite. In this paper, we consider analysis of multivariate longitudinal data via two modeling methodologies for the covariance matrix for multivariate longitudinal data. Both methods describe serial correlations of multivariate longitudinal outcomes using a modified Cholesky decomposition. However, the two methods consider different decompositions to explain the correlation between simultaneous responses. The first method uses enhanced linear covariance models so that the covariance matrix satisfies a positive definiteness condition; in addition, and principal component analysis and maximization-minimization algorithm (MM algorithm) were used to estimate model parameters. The second method considers variance-correlation decomposition and hypersphere decomposition to model covariance matrix. Simulations are used to compare the performance of the two methodologies.

Application of machine learning models for estimating house price (단독주택가격 추정을 위한 기계학습 모형의 응용)

  • Lee, Chang Ro;Park, Key Ho
    • Journal of the Korean Geographical Society
    • /
    • v.51 no.2
    • /
    • pp.219-233
    • /
    • 2016
  • In social science fields, statistical models are used almost exclusively for causal explanation, and explanatory modeling has been a mainstream until now. In contrast, predictive modeling has been rare in the fields. Hence, we focus on constructing the predictive non-parametric model, instead of the explanatory model. Gangnam-gu, Seoul was chosen as a study area and we collected single-family house sales data sold between 2011 and 2014. We applied non-parametric models proposed in machine learning area including generalized additive model(GAM), random forest, multivariate adaptive regression splines(MARS) and support vector machines(SVM). Models developed recently such as MARS and SVM were found to be superior in predictive power for house price estimation. Finally, spatial autocorrelation was accounted for in the non-parametric models additionally, and the result showed that their predictive power was enhanced further. We hope that this study will prompt methodology for property price estimation to be extended from traditional parametric models into non-parametric ones.

  • PDF

Repetitive model refinement for structural health monitoring using efficient Akaike information criterion

  • Lin, Jeng-Wen
    • Smart Structures and Systems
    • /
    • v.15 no.5
    • /
    • pp.1329-1344
    • /
    • 2015
  • The stiffness of a structure is one of several structural signals that are useful indicators of the amount of damage that has been done to the structure. To accurately estimate the stiffness, an equation of motion containing a stiffness parameter must first be established by expansion as a linear series model, a Taylor series model, or a power series model. The model is then used in multivariate autoregressive modeling to estimate the structural stiffness and compare it to the theoretical value. Stiffness assessment for modeling purposes typically involves the use of one of three statistical model refinement approaches, one of which is the efficient Akaike information criterion (AIC) proposed in this paper. If a newly added component of the model results in a decrease in the AIC value, compared to the value obtained with the previously added component(s), it is statistically justifiable to retain this new component; otherwise, it should be removed. This model refinement process is repeated until all of the components of the model are shown to be statistically justifiable. In this study, this model refinement approach was compared with the two other commonly used refinement approaches: principal component analysis (PCA) and principal component regression (PCR) combined with the AIC. The results indicate that the proposed AIC approach produces more accurate structural stiffness estimates than the other two approaches.

Bivariate regional frequency analysis of extreme rainfalls in Korea (이변량 지역빈도해석을 이용한 우리나라 극한 강우 분석)

  • Shin, Ju-Young;Jeong, Changsam;Ahn, Hyunjun;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association
    • /
    • v.51 no.9
    • /
    • pp.747-759
    • /
    • 2018
  • Multivariate regional frequency analysis has advantages of regional and multivariate framework as adopting a large number of regional dataset and modeling phenomena that cannot be considered in the univariate frequency analysis. To the best of our knowledge, the multivariate regional frequency analysis has not been employed for hydrological variables in South Korea. Applicability of the multivariate regional frequency analysis should be investigated for the hydrological variable in South Korea in order to improve our capacity to model the hydrological variables. The current study focused on estimating parameters of regional copula and regional marginal models, selecting the most appropriate distribution models, and estimating regional multivariate growth curve in the multivariate regional frequency analysis. Annual maximum rainfall and duration data observed at 71 stations were used for the analysis. The results of the current study indicate that Frank and Gumbel copula models were selected as the most appropriate regional copula models for the employed regions. Several distributions, e.g. Gumbel and log-normal, were the representative regional marginal models. Based on relative root mean square error of the quantile growth curves, the multivariate regional frequency analysis provided more stable and accurate quantiles than the multivariate at-site frequency analysis, especially for long return periods. Application of regional frequency analysis in bivariate rainfall-duration analysis can provide more stable quantile estimation for hydraulic infrastructure design criteria and accurate modelling of rainfall-duration relationship.