• Title/Summary/Keyword: multivariate modeling

Search Result 115, Processing Time 0.022 seconds

A Study on Statistical Modeling of Spatial Land-use Change Prediction (토지이용 공간변화 예측의 통계학적 모형에 관한 연구)

  • 김의홍
    • Spatial Information Research
    • /
    • v.5 no.2
    • /
    • pp.177-183
    • /
    • 1997
  • S1he concept of a class in the land-use classification system can be equally applied to a class in the land-use-change classification. The maximum likelihood method using linear discriminant function and Markov transition matrix method were integrated to a synthetic modeling effort in order to project spatial allocation of land-use-change and quantitative assignment of that prediction as a whole. The algorithm of both the multivariate discriminant function and the Markov chain matrix were discussed and the test of synthetic model on the study area was resulted in the projection of '90 year as well as '95 year land -use classification. The accuracy and the issue of modeling improvement were discussed eventually.

  • PDF

Assessing the Impact of Socio-economic Variables on Breast Cancer Treatment Outcome Disparity

  • Cheung, Min Rex
    • Asian Pacific Journal of Cancer Prevention
    • /
    • v.14 no.12
    • /
    • pp.7133-7136
    • /
    • 2013
  • Background: We studied Surveillance, Epidemiology and End Results (SEER) breast cancer data of Georgia USA to analyze the impact of socio-economic factors on the disparity of breast cancer treatment outcome. Materials and Methods: This study explored socio-economic, staging and treatment factors that were available in the SEER database for breast cancer from Georgia registry diagnosed in 2004-2009. An area under the receiver operating characteristic curve (ROC) was computed for each predictor to measure its discriminatory power. The best biological predictors were selected to be analyzed with socio-economic factors. Survival analysis, Kolmogorov-Smirnov 2-sample tests and Cox proportional hazard modeling were used for univariate and multivariate analyses of time to breast cancer specific survival data. Results: There were 34,671 patients included in this study, 99.3% being females with breast cancer. This study identified race and education attainment of county of residence as predictors of poor outcome. On multivariate analysis, these socio-economic factors remained independently prognostic. Overall, race and education status of the place of residence predicted up to 10% decrease in cause specific survival at 5 years. Conclusions: Socio-economic factors are important determinants of breast cancer outcome and ensuring access to breast cancer treatment may eliminate disparities.

A Multivariate Model Development For Stream Flow Generation (다변량 모형에 의한 하천유량의 모의 발생)

  • 정상만
    • Water for future
    • /
    • v.24 no.4
    • /
    • pp.67-72
    • /
    • 1991
  • Various modeling approaches to study along term behavior of streamflow or groundwater storagge have been conducted. In this study, a Multivariate AR (1) Model has been applied to generate monthly flows of the one key station which has historical flows using monthly flows of the three subordinate stations. The Model performance was examined using statistical comparisons between the historical and generated monthly series such as mean, various, skewness. Also, the correlation coefficients(lag-zero, and lag-one)between the two monthly flows were compared. The results showed that the modeled generated flows were statistically similar to the historical flows.

  • PDF

Value at Risk of portfolios using copulas

  • Byun, Kiwoong;Song, Seongjoo
    • Communications for Statistical Applications and Methods
    • /
    • v.28 no.1
    • /
    • pp.59-79
    • /
    • 2021
  • Value at Risk (VaR) is one of the most common risk management tools in finance. Since a portfolio of several assets, rather than one asset portfolio, is advantageous in the risk diversification for investment, VaR for a portfolio of two or more assets is often used. In such cases, multivariate distributions of asset returns are considered to calculate VaR of the corresponding portfolio. Copulas are one way of generating a multivariate distribution by identifying the dependence structure of asset returns while allowing many different marginal distributions. However, they are used mainly for bivariate distributions and are not widely used in modeling joint distributions for many variables in finance. In this study, we would like to examine the performance of various copulas for high dimensional data and several different dependence structures. This paper compares copulas such as elliptical, vine, and hierarchical copulas in computing the VaR of portfolios to find appropriate copula functions in various dependence structures among asset return distributions. In the simulation studies under various dependence structures and real data analysis, the hierarchical Clayton copula shows the best performance in the VaR calculation using four assets. For marginal distributions of single asset returns, normal inverse Gaussian distribution was used to model asset return distributions, which are generally high-peaked and heavy-tailed.

EPB-TBM performance prediction using statistical and neural intelligence methods

  • Ghodrat Barzegari;Esmaeil Sedghi;Ata Allah Nadiri
    • Geomechanics and Engineering
    • /
    • v.37 no.3
    • /
    • pp.197-211
    • /
    • 2024
  • This research studies the effect of geotechnical factors on EPB-TBM performance parameters. The modeling was performed using simple and multivariate linear regression methods, artificial neural networks (ANNs), and Sugeno fuzzy logic (SFL) algorithm. In ANN, 80% of the data were randomly allocated to training and 20% to network testing. Meanwhile, in the SFL algorithm, 75% of the data were used for training and 25% for testing. The coefficient of determination (R2) obtained between the observed and estimated values in this model for the thrust force and cutterhead torque was 0.19 and 0.52, respectively. The results showed that the SFL outperformed the other models in predicting the target parameters. In this method, the R2 obtained between observed and predicted values for thrust force and cutterhead torque is 0.73 and 0.63, respectively. The sensitivity analysis results show that the internal friction angle (φ) and standard penetration number (SPT) have the greatest impact on thrust force. Also, earth pressure and overburden thickness have the highest effect on cutterhead torque.

Forensic Classification of Latent Fingerprints Applying Laser-induced Plasma Spectroscopy Combined with Chemometric Methods (케모메트릭 방법과 결합된 레이저 유도 플라즈마 분광법을 적용한 유류 지문의 법의학적 분류 연구)

  • Yang, Jun-Ho;Yoh, Jai-Ick
    • Korean Journal of Optics and Photonics
    • /
    • v.31 no.3
    • /
    • pp.125-133
    • /
    • 2020
  • An innovative method for separating overlapping latent fingerprints, using laser-induced plasma spectroscopy (LIPS) combined with multivariate analysis, is reported in the current study. LIPS provides the capabilities of real-time analysis and high-speed scanning, as well as data regarding the chemical components of overlapping fingerprints. These spectra provide valuable chemical information for the forensic classification and reconstruction of overlapping latent fingerprints, by applying appropriate multivariate analysis. This study utilizes principal-component analysis (PCA) and partial-least-squares (PLS) techniques for the basis classification of four types of fingerprints from the LIPS spectra. The proposed method is successfully demonstrated through a classification example of four distinct latent fingerprints, using discrimination such as soft independent modeling of class analogy (SIMCA) and partial-least-squares discriminant analysis (PLS-DA). This demonstration develops an accuracy of more than 85% and is proven to be sufficiently robust. In addition, by laser-scanning analysis at a spatial interval of 125 ㎛, the overlapping fingerprints were separated as two-dimensional forms.

Missing Value Estimation and Sensor Fault Identification using Multivariate Statistical Analysis (다변량 통계 분석을 이용한 결측 데이터의 예측과 센서이상 확인)

  • Lee, Changkyu;Lee, In-Beum
    • Korean Chemical Engineering Research
    • /
    • v.45 no.1
    • /
    • pp.87-92
    • /
    • 2007
  • Recently, developments of process monitoring system in order to detect and diagnose process abnormalities has got the spotlight in process systems engineering. Normal data obtained from processes provide available information of process characteristics to be used for modeling, monitoring, and control. Since modern chemical and environmental processes have high dimensionality, strong correlation, severe dynamics and nonlinearity, it is not easy to analyze a process through model-based approach. To overcome limitations of model-based approach, lots of system engineers and academic researchers have focused on statistical approach combined with multivariable analysis such as principal component analysis (PCA), partial least squares (PLS), and so on. Several multivariate analysis methods have been modified to apply it to a chemical process with specific characteristics such as dynamics, nonlinearity, and so on.This paper discusses about missing value estimation and sensor fault identification based on process variable reconstruction using dynamic PCA and canonical variate analysis.

Multilevel and Multivariate Structural Equation Models for Activity Participation and Travel Behavior (다수준 다변량 구조방정식을 이용한 활동참여와 통행행태 분석에 관한 연구)

  • 최연숙;정진혁
    • Journal of Korean Society of Transportation
    • /
    • v.21 no.4
    • /
    • pp.145-154
    • /
    • 2003
  • Multilevel and Multivariate Structural Equation Mpdel is applied to handle the hierarchical nature of the data and explain complex relationship among socioeconomic factors of individuals and household, activity participation, and travel behavior using Puset Sound Transportation Panel data. From analysis, variations of individual activity participation and travel behavior can be divided into two categories : Within-household variation and Between-households variation. Empirical results show that the interdependency index(p) of variables for household members within a household is between 0.13 and 0.33 indicating high interdependency. These results suggest that Multilevel and Multivariate SEM approach is an appropriate modeling methodology and gives additional information for activity participation and travel behavior. Also most of personal and household characteristics influence on activity participation and travel behavior within a household as well as between households.

Prediction of Water Storage Rate for Agricultural Reservoirs Using Univariate and Multivariate LSTM Models (단변량 및 다변량 LSTM을 이용한 농업용 저수지의 저수율 예측)

  • Sunguk Joh;Yangwon Lee
    • Korean Journal of Remote Sensing
    • /
    • v.39 no.5_4
    • /
    • pp.1125-1134
    • /
    • 2023
  • Out of the total 17,000 reservoirs in Korea, 13,600 small agricultural reservoirs do not have hydrological measurement facilities, making it difficult to predict water storage volume and appropriate operation. This paper examined univariate and multivariate long short-term memory (LSTM) modeling to predict the storage rate of agricultural reservoirs using remote sensing and artificial intelligence. The univariate LSTM model used only water storage rate as an explanatory variable, and the multivariate LSTM model added n-day accumulative precipitation and date of year (DOY) as explanatory variables. They were trained using eight years data (2013 to 2020) for Idong Reservoir, and the predictions of the daily water storage in 2021 were validated for accuracy assessment. The univariate showed the root-mean square error (RMSE) of 1.04%, 2.52%, and 4.18% for the one, three, and five-day predictions. The multivariate model showed the RMSE 0.98%, 1.95%, and 2.76% for the one, three, and five-day predictions. In addition to the time-series storage rate, DOY and daily and 5-day cumulative precipitation variables were more significant than others for the daily model, which means that the temporal range of the impacts of precipitation on the everyday water storage rate was approximately five days.

Analysis of Korean GDP by unobserved components model (비관측요인모형을 이용한 한국의 국내총생산 분석)

  • Seong, Byeong-Chan;Lee, Seung-Kyung
    • Journal of the Korean Data and Information Science Society
    • /
    • v.22 no.5
    • /
    • pp.829-837
    • /
    • 2011
  • Since Harvey (1989), many approaches for applying unobserved components (UC) models to both univariate and multivariate time series analysis have been developed. However, practitioners still tend to use traditional methods such as exponential smoothing or ARIMA models for modeling and predicting time series data. It is well known that the UC model combines the flexibility of ARIMA models and the easy interpretability of exponential smoothing models by using unobserved components such as trend, cycle, season, and irregular components. This study reviews the UC model and compares its relative performances with those of the other models in modeling and predicting the real gross domestic products (GDP) in Korea. We conclude that the optimal model is the UC model on basis of root mean squared error.