• Title/Summary/Keyword: Multivariate Statistical Analysis

A Study on Forest Land Classification Using Multivariate Statistical Methods : A Case Study at Mt. Kwanak (다변수통계방법을 이용한 산지분류에 관한 연구)

  • 정순오
    • Journal of the Korean Institute of Landscape Architecture / v.13 no.1 / pp.43-66 / 1985
  • Korea needs sound and rational public policies on the conservation and use of forest land and other natural resources because national land development has accelerated in recent years. Unfortunately, there is no systematic planning system to support this need. In general, forest land use planning requires suitability analysis based on an efficient land classification system. The goal of this study was to classify forest land using multivariate statistical methods. A case study was carried out in the winter of 1983 on a mountainous area more than 100 m above sea level at Mt. Kwanak in Anyang-city, Kyung-gi-do (province). The study area covered 19.80 km$^2$ and was divided into 1,383 Operational Taxonomic Units (OTUs) by a 120 m $\times$ 120 m grid. Fourteen descriptors were identified and quantified for each OTU from existing national land data: elevation, slope, aspect, terrain form, geologic material, surface soil permeability, topsoil type, depth of the solum, soil acidity, forest cover type, stand size class, stand age class, stand density class, and simple forest soil capability class. For this study, a FORTRAN IV program was written to input and output map data, and the statistics packages SPSS and BMD were used to perform the multivariate statistical analysis. The fourteen variables were analyzed to investigate the characteristics of their frequency distributions and to estimate the correlation coefficients among them. Principal component analysis was carried out to find the dimensions of forest land characteristics, and factor scores were used to select proper samples of OTUs throughout the study area. To develop forest land classes based on 102 surrogates, cluster and discriminant analyses of the principal descriptor variable matrix were undertaken. The results obtained through this series of multivariate statistical analyses were as follows: 1) Principal component analysis proved to be a useful tool for data selection and for identifying the principal descriptor variables, which represented the characteristics of forest land and facilitated the selection of samples.

Automatic Electrofacies Classification from Well Logs Using Multivariate Statistical Techniques (다변량 통계 기법을 이용한 물리검층 자료로부터의 암석물리학상 결정)

  • Lim Jong-Se;Kim Jungwhan;Kang Joo-Myung
    • Geophysics and Geophysical Exploration / v.1 no.3 / pp.170-175 / 1998
  • A systematic methodology is developed for predicting lithology through electrofacies classification of wireline log data. Multivariate statistical techniques are adopted to segment well log measurements and group the segments into electrofacies types. To account for the contribution of each log and to reduce the computational dimension, the multivariate logs are transformed into a single variable through principal components analysis. The resulting principal component logs are segmented using the statistical zonation method to enhance the quality and efficiency of the interpreted results. Hierarchical cluster analysis is then used to group the segments into electrofacies. The optimal number of groups is determined on the basis of the ratio of within-group variance to total variance and of core data. The technique is applied to wells in the Korea Continental Shelf. The field application demonstrates that lithology predicted from the electrofacies classification agrees reliably with the core and cuttings data. This methodology for electrofacies determination can support reservoir characterization, which in turn helps reservoir management.
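The group-count criterion mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the segment scores and the candidate groupings below are made-up values, and the function name `within_to_total_variance_ratio` is hypothetical. The ratio is computed from sums of squares, which equals the variance ratio when both variances use the same divisor.

```python
def within_to_total_variance_ratio(values, labels):
    """Ratio of the within-group sum of squares to the total sum of squares."""
    overall_mean = sum(values) / len(values)
    total_ss = sum((v - overall_mean) ** 2 for v in values)

    # Collect the values that belong to each group label.
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)

    # Sum the squared deviations of each value from its own group mean.
    within_ss = 0.0
    for members in groups.values():
        group_mean = sum(members) / len(members)
        within_ss += sum((v - group_mean) ** 2 for v in members)
    return within_ss / total_ss


# Hypothetical first-principal-component scores of six log segments.
segment_scores = [1.0, 2.0, 3.0, 11.0, 12.0, 13.0]

# Hypothetical groupings, e.g. obtained by cutting a dendrogram at 1, 2, 3 groups.
candidate_groupings = {
    1: [0, 0, 0, 0, 0, 0],
    2: [0, 0, 0, 1, 1, 1],
    3: [0, 0, 0, 1, 1, 2],
}

ratios = {
    k: within_to_total_variance_ratio(segment_scores, labels)
    for k, labels in candidate_groupings.items()
}
for k, ratio in ratios.items():
    print(k, round(ratio, 4))
```

In this toy case the ratio drops sharply from one group to two and only slightly from two to three, which is the kind of flattening one would look for when picking the number of groups. The abstract also relies on core data for that decision, which this sketch does not model.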

Diagnosis of Observations after Fit of Multivariate Skew t-Distribution: Identification of Outliers and Edge Observations from Asymmetric Data

  • Kim, Seung-Gu
    • The Korean Journal of Applied Statistics / v.25 no.6 / pp.1019-1026 / 2012
  • This paper presents a method for identifying "edge observations" located on a boundary area constructed by a truncation variable, as well as outliers, after fitting a multivariate skew $t$-distribution (MST) to asymmetric data. Detecting edge observations is important in data analysis because it provides information on a certain critical area of the observation space. The proposed method is applied to the Australian Institute of Sport (AIS) dataset, which is well known for its asymmetry in data space.

Development of Real-Time Water Quality Abnormality Warning System for Using Multivariate Statistical Method (다변량 통계기법을 활용한 실시간 수질이상 유무 판단 시스템 개발)

  • Heo, Tae-Young;Jeon, Hang-Bae;Park, Sang-Min;Lee, Young-Joo
    • Journal of Korean Society of Environmental Engineers / v.37 no.3 / pp.137-144 / 2015
  • The purpose of this study is to develop a warning system that detects water quality abnormality in real time using a multivariate statistical approach. Among multivariate data analysis methods, we applied principal component analysis, which exploits the correlation between water quality parameters, within a real-time algorithm for determining abnormality in water quality. We applied our approach to real field data and demonstrated the use of the algorithm for real-time monitoring of water quality abnormality. In addition, combining our approach with the Korea Meteorological Administration database identified heavy rain due to climate change as one of the most important factors explaining water quality abnormality.
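One common way to turn principal component analysis into an abnormality check is to measure how far a new reading falls from the principal axis learned on normal data (the squared prediction error). The sketch below follows that generic idea and is not the algorithm of the paper: the two-parameter readings are made-up values, every function name is hypothetical, and the threshold of three times the largest training error is an arbitrary assumption.

```python
import math


def column_means(rows):
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]


def covariance_matrix(rows, means):
    n, p = len(rows), len(means)
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    return [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(p)]
            for i in range(p)]


def first_principal_axis(cov, iterations=200):
    """Leading eigenvector of cov by power iteration.

    Assumes the uniform start vector is not orthogonal to the leading axis.
    """
    p = len(cov)
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        w = [sum(cov[i][j] * v[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v


def squared_prediction_error(sample, means, axis):
    """Squared distance between a sample and its projection on the axis."""
    centered = [s - m for s, m in zip(sample, means)]
    score = sum(c * a for c, a in zip(centered, axis))
    residual = [c - score * a for c, a in zip(centered, axis)]
    return sum(r * r for r in residual)


# Hypothetical "normal" readings of two correlated water quality parameters.
training = [(1.0, 2.1), (2.0, 3.9), (3.0, 6.1), (4.0, 7.9), (5.0, 10.1), (6.0, 11.9)]
means = column_means(training)
axis = first_principal_axis(covariance_matrix(training, means))

# Assumed rule: flag anything beyond 3x the largest error seen in training.
threshold = 3.0 * max(squared_prediction_error(r, means, axis) for r in training)


def is_abnormal(sample):
    return squared_prediction_error(sample, means, axis) > threshold


print(is_abnormal((2.5, 5.0)))   # follows the usual relation between the parameters
print(is_abnormal((3.5, 12.0)))  # breaks the usual relation between the parameters
```

A real-time system would presumably re-estimate the model as new data arrive and use a statistically derived control limit rather than this fixed multiplier.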

Development of MKDE-ebd for Estimation of Multivariate Probabilistic Distribution Functions (다변량 확률분포함수의 추정을 위한 MKDE-ebd 개발)

  • Kang, Young-Jin;Noh, Yoojeong;Lim, O-Kaung
    • Journal of the Computational Structural Engineering Institute of Korea / v.32 no.1 / pp.55-63 / 2019
  • In engineering problems, many random variables are correlated, and the correlation of input random variables has a great influence on the reliability analysis results of mechanical systems. However, correlated variables are often treated as independent or modeled by specific parametric joint distributions because joint distributions are difficult to model. When correlated data are insufficient, modeling the joint distribution correctly becomes even more difficult. In this study, multivariate kernel density estimation with bounded data is proposed to estimate various types of highly nonlinear joint distributions. Because it combines the given data with bounded data, which are generated from confidence intervals of uniform distribution parameters for the given data, it is less sensitive to data quality and to the amount of data. Thus, it yields conservative statistical modeling and reliability analysis results, and its performance is verified through statistical simulation and engineering examples.
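For orientation, plain multivariate kernel density estimation, the building block this abstract extends, can be sketched as follows. This is not MKDE-ebd: the step that generates bounded data from confidence intervals is not reproduced, the sample values are made up, the function names are hypothetical, and the product Gaussian kernel with Scott's rule bandwidths is an assumed, common default.

```python
import math
import statistics


def scott_bandwidths(data):
    """Per-dimension bandwidths from Scott's rule: h_k = s_k * n ** (-1 / (d + 4))."""
    n, dim = len(data), len(data[0])
    factor = n ** (-1.0 / (dim + 4))
    return [statistics.stdev(row[k] for row in data) * factor for k in range(dim)]


def gaussian_product_kde(data, bandwidths):
    """Return f(point), a KDE built from a product of 1-D Gaussian kernels."""
    n, dim = len(data), len(bandwidths)
    # Normalizing constant so that the estimate integrates to one.
    norm = n * math.prod(h * math.sqrt(2.0 * math.pi) for h in bandwidths)

    def density(point):
        total = 0.0
        for sample in data:
            exponent = sum(((point[k] - sample[k]) / bandwidths[k]) ** 2
                           for k in range(dim))
            total += math.exp(-0.5 * exponent)
        return total / norm

    return density


# Hypothetical small sample of two correlated input variables.
samples = [(1.0, 1.2), (1.5, 1.4), (2.0, 2.1), (2.5, 2.4), (3.0, 3.1)]
bandwidths = scott_bandwidths(samples)
density = gaussian_product_kde(samples, bandwidths)

print(density((2.0, 2.0)))   # inside the cloud of samples
print(density((10.0, -5.0)))  # far from every sample
```

With only five samples the estimate depends heavily on the bandwidth choice, which is the small-data sensitivity that the abstract says the proposed method is meant to reduce.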

Note on response dimension reduction for multivariate regression

  • Yoo, Jae Keun
    • Communications for Statistical Applications and Methods / v.26 no.5 / pp.519-526 / 2019
  • Response dimension reduction in a sufficient dimension reduction (SDR) context was widely ignored until Yoo and Cook (Computational Statistics and Data Analysis, 53, 334-343, 2008) established a theory for it and developed an estimation approach. Recent research in SDR shows that a semi-parametric approach can outperform conventional non-parametric SDR methods. Yoo (Statistics: A Journal of Theoretical and Applied Statistics, 52, 409-425, 2018) developed a semi-parametric approach for response reduction in the context of Yoo and Cook (2008), and Yoo (Journal of the Korean Statistical Society, 2019) completes the semi-parametric approach by proposing an unstructured method. This paper theoretically discusses and provides insightful remarks on three versions of semi-parametric approaches that can be useful for statistical practitioners. It is also possible to avoid numerical instability by presenting the results for an orthogonal transformation of the response variables.

Multivariate Statistical Analysis Approach to Predict the Reactor Properties and the Product Quality of a Direct Esterification Reactor for PET Synthesis (다변량 통계분석법을 이용한 PET 중합공정 중 직접 에스테르화 반응기의 거동 및 생산제품 예측)

  • Kim Sung Young;Chung Chang Bock;Choi Soo Hyoung;Lee Bomsock
    • Journal of Institute of Control, Robotics and Systems / v.11 no.6 / pp.550-557 / 2005
  • Two multivariate statistical analysis methods, multiple linear regression (MLR) and partial least squares (PLS), have been applied to predict the reactor properties and the product quality of a direct esterification reactor for polyethylene terephthalate (PET) synthesis. Both methods were applied to a data set that includes the flow rate of water vapor, the flow rate of EG vapor, the concentration of acid end groups of the product, and other operating conditions such as temperature, pressure, reaction time, and feed monomer mole ratio. Their regression and prediction abilities have also been compared. The reliability of the predictions is critically compared with actual plant data and with the results of other mathematical models. This paper shows that the PLS method can be used for reasonably accurate prediction of the product quality of a direct esterification reactor in the PET synthesis process.

A Study on the Estimation of Coefficients K and n Using Multivariate Data Analysis (다변량 통계기법을 이용한 K및 n의 산정에 관한 연구)

  • 백용진;최재성;배동명;김경진
    • Transactions of the Korean Society for Noise and Vibration Engineering / v.13 no.8 / pp.583-590 / 2003
  • To estimate in advance the vibration level of the ground next to a dwelling, a multivariate statistical analysis was performed on experimental data acquired from a variety of construction sites, and a new model for estimating the values of K and n, applicable to damage diagnosis, is proposed. The results may be summarized as follows. First, $K_{95}$ and n showed high correlation at $P \leq 0.05$. In particular, the correlation coefficients with $W_{max}$ and S were higher for $K_{95}$ than for n, indicating that $K_{95}$ is generally associated with source conditions. Second, factor analysis identified two major sources in each fraction. These sources accounted for at least 73% of the variance of $K_{95}$. Third, a multiple regression model for estimating $K_{95}$ was developed from Fac1, which depends on the source conditions, and Fac2, which depends on the transmission conditions. The value of n can be determined from its correlation with $K_{95}$.

A Comparison Study of Multivariate Binary and Continuous Outcomes

  • Pak, Dae-Woo;Cho, Hyung-Jun
    • The Korean Journal of Applied Statistics / v.25 no.4 / pp.605-612 / 2012
  • Multivariate data with multiple outcomes are often generated in various fields. The outcomes can be a mix of continuous and discrete variables. Because of this complexity, such data are often handled by applying regression analysis to each outcome separately, even though the outcomes are associated with each other. This univariate approach results in low efficiency of the parameter estimates. We study the efficiency gains of multivariate approaches relative to the univariate approach for mixed data that include continuous and binary outcomes. All approaches yield consistent parameter estimates with complete data. By jointly estimating parameters using multivariate methods, it is generally possible to obtain more accurate estimates than with a univariate approach. The association between continuous and binary outcomes creates a gap in efficiency between the multivariate and univariate approaches. We provide guidance for analyzing mixed data.