• Title/Summary/Keyword: multivariate data analysis

FAULT DETECTION, MONITORING AND DIAGNOSIS OF SEQUENCING BATCH REACTOR FOR INTEGRATED WASTEWATER TREATMENT MANAGEMENT SYSTEM

  • Yoo, Chang-Kyoo;Vanrolleghem, Peter A.;Lee, In-Beum
    • Environmental Engineering Research / v.11 no.2 / pp.63-76 / 2006
  • Multivariate analysis and batch monitoring of a pilot-scale sequencing batch reactor (SBR) are described for an integrated wastewater treatment management system, in which a batch-wise multiway independent component analysis (MICA) method is used to extract meaningful hidden information from non-Gaussian wastewater treatment data. The three-way batch data of the SBR are unfolded batch-wise, and a non-Gaussian multivariate monitoring method is then used to capture the non-Gaussian characteristics of normal batches in a biological wastewater treatment plant (WWTP). The method is successfully applied to an 80 L SBR for biological wastewater treatment, which is characterized by a variety of error sources with non-Gaussian characteristics. The batch-wise multivariate monitoring of the pilot-scale SBR showed stronger monitoring performance on this WWTP application than the conventional method, because it can extract non-Gaussian source signals that are mutually independent and can capture the cross-correlation of the variables.
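As a minimal sketch of the batch-wise unfolding step mentioned in this abstract, the snippet below rearranges a three-way array of shape (batches, variables, time points) into a two-way matrix with one row per batch, then autoscales each column. The array layout, the column ordering (all variables at the first time point, then the second, and so on), the toy numbers, and the function names `unfold_batchwise` and `autoscale` are illustrative assumptions, not details taken from the paper. The ICA step itself is not shown.

```python
import numpy as np

def unfold_batchwise(x):
    """Unfold a (batches, variables, times) array into (batches, times * variables)."""
    x = np.asarray(x, dtype=float)
    n_batches, n_vars, n_times = x.shape
    # Put time ahead of variables so each row reads:
    # all variables at t=0, then all variables at t=1, ...
    return x.transpose(0, 2, 1).reshape(n_batches, n_times * n_vars)

def autoscale(matrix):
    """Center each column and scale it to unit variance across batches."""
    mean = matrix.mean(axis=0)
    std = matrix.std(axis=0, ddof=1)
    std[std == 0] = 1.0  # leave constant columns unscaled
    return (matrix - mean) / std

# Toy data (made up): 2 batches, 2 variables, 3 time points.
batches = np.array([
    [[1, 2, 3], [10, 20, 30]],
    [[4, 5, 6], [40, 50, 60]],
])
unfolded = unfold_batchwise(batches)   # shape (2, 6)
scaled = autoscale(unfolded)           # each column now has zero mean
```

Unfolding this way keeps each batch as a single observation, so a monitoring model fitted to the rows describes batch-to-batch variation around the average trajectory.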

A Comparison of Univariate and Multivariate AR Models for Monthly River Flow Series (월유량에 대한 일변량 및 다변량 AR모형의 비교)

  • 이원환;심재현
    • Water for future / v.23 no.1 / pp.99-107 / 1990
  • Statistical analysis based on past hydrologic data is required to set up water resources development plans and to design hydraulic structures rationally. Because hydrologic events involve random factors, stochastic analysis is necessary. In this paper, stochastic models of the same order (multivariate AR(1) and AR(2) models, and univariate AR(1) and AR(2) models) are applied to monthly runoff data to compare their statistical characteristics. A further purpose of the paper is to compare the monthly series generated by the univariate and multivariate models. Comparison of the simulated series shows that the multivariate models, which account for temporal and spatial collinearity, predict monthly flow in the South Han River basin better than the univariate models.
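To make the univariate/multivariate distinction concrete, here is a rough sketch that fits an AR(1) model to a single series and a multivariate AR(1) model to two series at once. It assumes the mean-removed forms z_t = phi * z_(t-1) + e_t and z_t = A z_(t-1) + e_t, fitted by ordinary least squares on synthetic data. The estimator, the coefficient values, and the names `fit_ar1` and `fit_var1` are assumptions for illustration and may differ from the procedure used in the paper.

```python
import numpy as np

def fit_ar1(series):
    """Least-squares AR(1) coefficient for one mean-removed series."""
    z = np.asarray(series, dtype=float)
    z = z - z.mean()
    return float(z[1:] @ z[:-1] / (z[:-1] @ z[:-1]))

def fit_var1(series_matrix):
    """Least-squares coefficient matrix A in z_t = A z_(t-1) + e_t.

    series_matrix has shape (time, stations).
    """
    z = np.asarray(series_matrix, dtype=float)
    z = z - z.mean(axis=0)
    # Solve z[:-1] @ A.T ~= z[1:] in the least-squares sense.
    a_transposed, *_ = np.linalg.lstsq(z[:-1], z[1:], rcond=None)
    return a_transposed.T

# Synthetic two-station series (not real flow data).
rng = np.random.default_rng(0)
a_true = np.array([[0.6, 0.2],
                   [0.0, 0.5]])
flows = np.zeros((5000, 2))
for t in range(1, len(flows)):
    flows[t] = a_true @ flows[t - 1] + rng.normal(size=2)

a_hat = fit_var1(flows)          # recovers the cross-station term a_true[0, 1]
phi_hat = fit_ar1(flows[:, 0])   # single-station fit ignores station 2
```

The off-diagonal entries of the fitted matrix are what the univariate model cannot represent: they carry the influence of one station's previous value on another station's current value.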

Regional Geological Mapping by Principal Component Analysis of the Landsat TM Data in a Heavily Vegetated Area (식생이 무성한 지역에서의 Principal Component Analysis 에 의한 Landsat TM 자료의 광역지질도 작성)

  • 朴鍾南;徐延熙
    • Korean Journal of Remote Sensing / v.4 no.1 / pp.49-60 / 1988
  • Principal component analysis (PCA) was applied to a multivariate data set of Landsat TM data for regional geological mapping in the heavily vegetated and topographically rugged Chungju area. The multivariate data set was selected by statistical analysis based on the magnitude of the regression sum of squares in multiple regression, and it includes R1/2/R3/4, R2/3, R5/7/R4/3, R1/2, R3/4, R4/3, and R4/5. The PCA results show that some of the later principal components (PC 3 and PC 5 in this study) are geologically more significant than the earlier major components, PC 1 and PC 2. The first two components, which account for 96% of the total information in the data set, mainly represent vegetation reflectance and topographic effects. Although the remaining components represent only 3% of the total information, which statistically suggests that their information is unstable, the geological significance of PC 3 and PC 5 in this study implies that applying the technique in more favorable areas should lead to much better results.
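The point that a low-variance component can still carry the signal of interest can be illustrated with a small sketch. It runs PCA by eigendecomposition of the covariance matrix on synthetic data in which one dominant factor (standing in for vegetation and topography) swamps a weak factor (standing in for a geological signal). The data, the factor weights, and the function name `pca` are assumptions made for illustration. No Landsat data or band ratios from the study are used.

```python
import numpy as np

def pca(data):
    """PCA of a (samples, features) array via the covariance matrix.

    Returns the explained-variance ratios (descending), the component
    vectors as columns, and the component scores.
    """
    x = np.asarray(data, dtype=float)
    centered = x - x.mean(axis=0)
    cov = np.cov(centered, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)      # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]           # reorder to descending
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    ratio = eigvals / eigvals.sum()
    scores = centered @ eigvecs
    return ratio, eigvecs, scores

# Synthetic stand-in for a pixels-by-features table (made-up numbers).
rng = np.random.default_rng(0)
n = 500
strong = rng.normal(size=n)   # dominant shared factor
weak = rng.normal(size=n)     # weak factor of interest
pixels = np.column_stack([
    5.0 * strong + 0.1 * rng.normal(size=n),
    4.0 * strong + 0.1 * rng.normal(size=n),
    3.0 * strong + 0.5 * weak + 0.1 * rng.normal(size=n),
])

ratio, components, scores = pca(pixels)
# ratio[0] is close to 1 here, yet the weak factor only shows up
# in the later, low-variance component scores.
```

Inspecting the later score columns rather than discarding them by explained variance alone is the practice the abstract argues for.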

Assessment of Water Quality using Multivariate Statistical Techniques: A Case Study of the Nakdong River Basin, Korea

  • Park, Seongmook;Kazama, Futaba;Lee, Shunhwa
    • Environmental Engineering Research / v.19 no.3 / pp.197-203 / 2014
  • This study estimated the spatial and seasonal variation of water quality to characterize the Nakdong River basin, Korea. Altogether, 11 parameters (discharge, water temperature, dissolved oxygen, 5-day biochemical oxygen demand, chemical oxygen demand, pH, suspended solids, electrical conductivity, total nitrogen, total phosphorus, and total organic carbon) at 22 sites for the period 2003-2011 were analyzed using multivariate statistical techniques (cluster analysis, principal component analysis, and factor analysis). Hierarchical cluster analysis grouped the whole river basin into three zones, i.e., relatively less polluted (LP), medium polluted (MP), and highly polluted (HP), based on the similarity of water quality characteristics. The results of factor analysis/principal component analysis explained up to 83.0%, 81.7%, and 82.7% of the total variance in the water quality data of the LP, MP, and HP zones, respectively. The rotated components of PCA obtained from factor analysis indicate that the parameters responsible for water quality variations were mainly related to discharge and total pollution loads (non-point pollution sources) in the LP, MP, and HP zones; organic and nutrient pollution in the LP and HP zones; and temperature, DO, and TN in the LP zone. This study demonstrates the usefulness of multivariate statistical techniques for the analysis and interpretation of multi-parameter, multi-location, and multi-year data sets.

Clustering Technique for Multivariate Data Analysis

  • Lee, Jin-Ki
    • Journal of the military operations research society of Korea / v.6 no.2 / pp.89-127 / 1980
  • The multivariate analysis techniques of cluster analysis are examined in this article. The theory and applications of the techniques and the related computer software are discussed, and sample jobs are included. A hierarchical cluster analysis algorithm, available in the IMSL software package, is applied to a set of data extracted from a group of subjects in order to partition a collection of 26 attributes of a weapon system into six clusters of superattributes. A nonhierarchical clustering procedure was applied to a data set on tanks consisting of twenty-four observations of ten attributes. The cluster analysis shows that the tanks cluster somewhat naturally by nationality. Principal component analysis and discriminant analysis show that tank weight is the single most important discriminator among nationalities, although these results are not shown in this article because of space restrictions. This article is part of a master's thesis in operations research.

A Resetting Scheme for Process Parameters using the Mahalanobis-Taguchi System

  • Park, Chang-Soon
    • The Korean Journal of Applied Statistics / v.25 no.4 / pp.589-603 / 2012
  • The Mahalanobis-Taguchi system (MTS) is a statistical tool for classifying normal and abnormal groups in multivariate data structures. In addition to the classification itself, the MTS includes a method for selecting the variables useful for classification. This method is especially efficient when the abnormal group data are scattered without a specific directionality. When feedback adjustment of the process input variables based on measurements of the process output is not practical, a reset procedure can be an alternative. This article proposes a reset procedure using the MTS, together with a contribution-based method for identifying which input variables to reset. Identifying the root-cause parameters with the existing dimension-reduced contribution tends to be difficult because of the variety of correlation structures in multivariate data. However, decisions improve when it is used together with the location-centered contribution and the individual-parameter contribution.
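The distance at the core of the MTS can be sketched briefly. The snippet below estimates the mean and covariance from a normal group and computes the squared Mahalanobis distance of each observation divided by the number of variables, a scaling commonly used in MTS so that the normal group averages about 1. The synthetic data and the names `fit_normal_group` and `scaled_mahalanobis` are assumptions for illustration. The variable-selection step and the contribution measures proposed in the article are not shown.

```python
import numpy as np

def fit_normal_group(normal_data):
    """Estimate the mean vector and inverse covariance of the normal group."""
    x = np.asarray(normal_data, dtype=float)
    mean = x.mean(axis=0)
    cov_inv = np.linalg.inv(np.cov(x, rowvar=False))
    return mean, cov_inv

def scaled_mahalanobis(points, mean, cov_inv):
    """Squared Mahalanobis distance divided by the number of variables."""
    d = np.atleast_2d(np.asarray(points, dtype=float)) - mean
    # Row-wise quadratic form d_i' * cov_inv * d_i.
    return np.einsum("ij,jk,ik->i", d, cov_inv, d) / d.shape[1]

# Synthetic normal group: variables 1 and 2 are strongly correlated.
rng = np.random.default_rng(0)
n = 200
base = rng.normal(size=n)
normal = np.column_stack([
    base + 0.1 * rng.normal(size=n),
    base + 0.1 * rng.normal(size=n),
    rng.normal(size=n),
])

mean, cov_inv = fit_normal_group(normal)
md_normal = scaled_mahalanobis(normal, mean, cov_inv)

# A point that breaks the correlation between variables 1 and 2.
md_abnormal = scaled_mahalanobis([3.0, -3.0, 0.0], mean, cov_inv)
```

Because the distance uses the inverse covariance, a point can be flagged even when each of its coordinates lies within the normal range individually, which is why correlation structure matters when tracing which input variables to reset.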

A Classification of Regional Pattern Analysis for the Planning in Chungbuk using Multivariate Analysis (다변량분석법을 이용한 충청북도 읍면단위 농촌계획 수립을 위한 지역유형구분 분석)

  • Yoon, Seong-Soo;Joo, Ho-Gil
    • Journal of Korean Society of Rural Planning / v.11 no.2 s.27 / pp.35-41 / 2005
  • The basic concept of rural planning needs to shift from an economy based on production and sales to one based on the experience of natural resources and traditional culture. Multivariate analysis is required to set the development direction for rural districts. This study examines methods for classifying rural villages using existing data, with the aim of applying the results to establish the principal viewpoint for development. The classification methods used are PCA, CA, and a combination of the two, together with a revised method for localization of the rural district. Using these methods, we classify regional patterns for rural district planning in Chungbuk Province.

Artificial Neural Networks for Interest Rate Forecasting based on Structural Change : A Comparative Analysis of Data Mining Classifiers

  • Oh, Kyong-Joo
    • Journal of the Korean Data and Information Science Society / v.14 no.3 / pp.641-651 / 2003
  • This study proposes hybrid models for interest rate forecasting that use structural changes (or change points). The basic concept of the proposed models is to obtain significant intervals delimited by change points, to identify them as change-point groups, and to reflect them in interest rate forecasting. Each model is composed of three phases. The first phase detects successive structural changes in the U.S. Treasury bill rate dataset. The second phase forecasts the change-point groups with data mining classifiers. The final phase forecasts interest rates with backpropagation neural networks (BPN). Based on this structure, we propose three hybrid models that differ in the data mining classifier: (1) a multivariate discriminant analysis (MDA)-supported model, (2) a case-based reasoning (CBR)-supported model, and (3) a BPN-supported model. We then compare these models with a neural network model alone and determine which of the three classifiers (MDA, CBR, and BPN) performs better. For interest rate forecasting, this study thus examines the predictive ability of hybrid models that reflect structural change.

Application of functional ANOVA and functional MANOVA (단변량 및 다변량 함수 데이터에 대한 분산분석의 활용)

  • Kim, Mijeong
    • The Korean Journal of Applied Statistics / v.35 no.5 / pp.579-591 / 2022
  • Functional data are collected in various fields, and it is often necessary to test whether groups of functional data differ. In this case, point-wise ANOVA is not appropriate, and an integrated result should be presented instead of point-wise results. Various methods for functional analysis of variance have been proposed, and they have recently been implemented in the R package fdANOVA. In this paper, I first explain ANOVA and multivariate ANOVA, and then introduce recently proposed methods of analysis of variance for univariate and multivariate functional data. I also describe how to use the R package fdANOVA. The package is used to test the equality of weekly temperatures in Seoul and Busan through univariate functional data ANOVA, and to test the equality of multivariate functional data corresponding to handwritten images using multivariate functional data ANOVA.

Bivariate regional frequency analysis of extreme rainfalls in Korea (이변량 지역빈도해석을 이용한 우리나라 극한 강우 분석)

  • Shin, Ju-Young;Jeong, Changsam;Ahn, Hyunjun;Heo, Jun-Haeng
    • Journal of Korea Water Resources Association / v.51 no.9 / pp.747-759 / 2018
  • Multivariate regional frequency analysis combines the advantages of the regional and multivariate frameworks: it draws on a large number of regional data sets and models phenomena that cannot be considered in univariate frequency analysis. To the best of our knowledge, multivariate regional frequency analysis has not been employed for hydrological variables in South Korea. Its applicability to hydrological variables in South Korea should be investigated in order to improve our capacity to model them. The current study focused on estimating the parameters of regional copula and regional marginal models, selecting the most appropriate distribution models, and estimating the regional multivariate growth curve in multivariate regional frequency analysis. Annual maximum rainfall and duration data observed at 71 stations were used for the analysis. The results indicate that the Frank and Gumbel copula models were selected as the most appropriate regional copula models for the regions studied. Several distributions, e.g., Gumbel and log-normal, were the representative regional marginal models. Based on the relative root mean square error of the quantile growth curves, multivariate regional frequency analysis provided more stable and accurate quantiles than multivariate at-site frequency analysis, especially for long return periods. Applying regional frequency analysis to bivariate rainfall-duration analysis can provide more stable quantile estimates for hydraulic infrastructure design criteria and more accurate modelling of the rainfall-duration relationship.