• Title/Summary/Keyword: Multivariate statistical analysis (다변량통계분석)


A Study of Influence Factors for Reservoir Evaporation Using Multivariate Statistical Analysis (다변량 통계분석을 이용한 저수지증발량 영향인자에 관한 연구)

  • Lee, Kyungsu;Kwak, Sunghyun;Seo, Yong Jae;Lyu, Siwan
    • Proceedings of the Korea Water Resources Association Conference / 2017.05a / pp.237-240 / 2017
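  • Rising temperatures are being observed around the world as a result of global warming, a representative sign of change in the global climate system. Meteorological and hydrological factors such as temperature, precipitation, wind speed, and evaporation will influence one another and change in complex ways, and the magnitude of these changes is expected to grow. Factors affecting evaporation fall into three broad groups: meteorological factors such as solar radiation, temperature, wind, atmospheric pressure, and humidity; characteristics of the evaporating surface; and water quality factors. Although these factors have long been known, their complex interactions are not easy to understand precisely. To identify the meteorological factors that affect evaporation in dam basins, this study analyzed changes in meteorological data (air temperature, precipitation, wind speed, relative humidity, atmospheric pressure, solar radiation, sunshine duration, and total cloud cover) observed from 2008 to 2016 at Andong Dam and Namgang Dam in the Nakdong River system, and used factor analysis, a multivariate statistical technique, to group the factors highly correlated with evaporation. At both Andong Dam and Namgang Dam, evaporation, air temperature, and atmospheric pressure were grouped into the same factor and showed high correlation, while precipitation, sunshine duration, solar radiation, and total cloud cover were grouped together in one factor. With additional analysis of domestic evaporation measurement sites, multivariate regression equations and artificial neural networks built on the influencing factors are expected to make it possible to estimate evaporation at sites where it is not measured.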


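A Study on the Effect of Stratum Characteristics on Groundwater Quality in an Oil-Contaminated Area (유류오염 지역내 지층 특성이 지하수 수질에 미치는 영향 연구)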

  • Go Gyeong-Seok;O In-Suk;Kim Eul-Yeong;Lee Gwang-Sik;Yang Jae-Ha;Lee Gang-Geun
    • Proceedings of the Korean Society of Soil and Groundwater Environment Conference / 2006.04a / pp.419-423 / 2006
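  • Hydrogeological, hydrogeochemical, and microbial analyses of an oil-leak site were used to examine how stratum characteristics affect groundwater flow, water quality, and microbial properties. The analyses confirmed a relatively permeable layer, from several tens of centimeters to about 2 m thick, at depths of $1.8{\sim}3.5$ m below the surface. Because of this layer, a marked difference in hydraulic head was observed between the upper and lower groundwater zones, and oil leaked from the oil pipeline is judged to have migrated along this permeable layer, contaminating the soil and groundwater in this interval. Groundwater quality showed different ionic compositions and isotopic signatures depending on these stratum characteristics, which was also confirmed by multivariate statistical analysis. Microbial DGGE analysis showed similar patterns, confirming that the hydrogeochemical, hydrogeological, and microbial characteristics are closely correlated.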


Application of Multivariate Statistical Analysis Technique in Landfill Investigation (매립물 특성 조사를 위한 다변량 통계분석 기법의 응용)

  • Kwon, Byung-Doo;Kim, Cha-Soup
    • Journal of the Korean earth science society / v.18 no.6 / pp.515-521 / 1997
  • To investigate the nature of the waste materials in the Nanjido Landfill, we conducted a multivariate statistical analysis of a geophysical data set comprising magnetic, gravity, Landsat TM thermal band, and surface depression measurement data. Because these data sets respond differently to depth, we transformed the observed total-field magnetic data and gravity data into residual reduced-to-pole (RTP) magnetic anomalies and three-dimensional density anomalies, respectively, and used only the information about the shallow upper part of the landfill in the subsequent processing. For the statistical analysis at the depression measurement points, the magnetic, density, and Landsat data values at these points were determined by interpolation. Because the multivariate statistical analysis technique uses a clustering algorithm to classify the data set, and because we measured the dissimilarity between objects by Euclidean distance, standardization was applied before the distance calculation to eliminate scaling effects caused by the different measurement units of each data set. A hierarchical grouping technique was used to construct the dendrogram. The optimum number of statistical groups (clusters), classified on the basis of geophysical and geotechnical characteristics, appeared to be six on the resulting dendrogram. The results of this study suggest that the dimensions and nature of multicomponent waste landfills can be identified by applying the multivariate statistical analysis technique to integrated geophysical data sets.


Identification of the out-of-control variable based on Hotelling's $T^2$ statistic (호텔링 T2의 이상신호 원인 식별)

  • Lee, Sungim
    • The Korean Journal of Applied Statistics / v.31 no.6 / pp.811-823 / 2018
  • The multivariate control chart based on Hotelling's $T^2$ statistic is a powerful tool in statistical process control for identifying an out-of-control process. It is used to monitor multiple process characteristics simultaneously. An out-of-control signal on the $T^2$ chart indicates a shift in the mean vector. However, because the signal is multivariate, it is difficult to interpret its cause. In this paper, we review methods of signal interpretation based on the Mason, Young, and Tracy (MYT) decomposition of the $T^2$ statistic. We also provide an example of how to implement it using R software and present simulation studies comparing the performance of these methods.
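
The idea behind the MYT decomposition can be illustrated with a minimal sketch. The paper itself uses R; the Python code below is not the authors' implementation, and the function names `hotelling_t2` and `myt_terms`, the reference mean and covariance, and the observation are all hypothetical. It relies on the standard identity that each conditional term equals the difference between the $T^2$ values of two nested sub-vectors, for one chosen variable ordering.

```python
import numpy as np

def hotelling_t2(x, mean, cov):
    """Hotelling's T^2 of one observation against a reference mean and covariance."""
    d = np.asarray(x, dtype=float) - np.asarray(mean, dtype=float)
    return float(d @ np.linalg.solve(np.asarray(cov, dtype=float), d))

def myt_terms(x, mean, cov, order=None):
    """MYT terms for one variable ordering.

    The first term is the unconditional T^2 of the first variable; each later
    term is T^2 of the first j variables minus T^2 of the first j-1 variables.
    """
    x = np.asarray(x, dtype=float)
    mean = np.asarray(mean, dtype=float)
    cov = np.asarray(cov, dtype=float)
    order = list(range(x.size)) if order is None else list(order)
    terms, previous = [], 0.0
    for j in range(1, len(order) + 1):
        idx = order[:j]
        current = hotelling_t2(x[idx], mean[idx], cov[np.ix_(idx, idx)])
        terms.append(current - previous)
        previous = current
    return terms

# Hypothetical in-control parameters: variables 0 and 1 are positively correlated.
mean = np.array([0.0, 0.0, 0.0])
cov = np.array([[1.0, 0.8, 0.0],
                [0.8, 1.0, 0.0],
                [0.0, 0.0, 1.0]])
x = np.array([1.0, -1.0, 0.5])  # variables 0 and 1 move in opposite directions

t2 = hotelling_t2(x, mean, cov)
terms = myt_terms(x, mean, cov)
print(t2, terms)
```

In this made-up example each variable is within one standard deviation of its mean, yet the conditional term for variable 1 given variable 0 dominates, because the observation contradicts the positive correlation between the two. The terms always sum to the overall $T^2$, and a different ordering gives a different but equally valid set of terms.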

Use of Multivariate Statistical Approaches for Decoding Chemical Evolution of Groundwater near Underground Storage Caverns (다변량통계기법을 이용한 지하저장시설 주변의 지하수질 변동에 관한 연구)

  • Lee, Jeonghoon
    • Journal of the Korean earth science society / v.35 no.4 / pp.225-236 / 2014
  • Multivariate statistical analyses have been extensively applied to hydrochemical measurements to analyze and interpret the data. This study examines anthropogenic factors obtained by applying correspondence analysis (CA) and principal component analysis (PCA) to a hydrogeochemical data set. The goal was to synthesize the hydrogeochemical information using these multivariate statistical techniques by incorporating hydrogeochemical speciation results calculated with WATEQ4F, a commonly used program included in NETPATH. The selected case study was a set of LPG underground storage caverns located in southeastern Korea. The highly alkaline groundwaters in this study area are an analogue for the repository system. High pH, speciation of Al, and possible precipitation of calcite characterize these groundwaters. Available groundwater quality monitoring data were used to confirm the statistical models. The present study focused on understanding the hydrogeochemical attributes and establishing the phase changes that occur when two anthropogenic effects (i.e., disinfection activity and cement pore water) are introduced in the study area. Comparisons between the two statistical results and the findings of previous investigations highlight the descriptive capabilities of PCA using calculated saturation indices and of CA as exploratory tools in hydrogeochemical research.

cDNA Microarray Analysis Using Hotelling's $T^2$ Statistic (Hotelling의 T$^{2}$ 통계량을 이용한 cDNA 마이크로어레이 분석)

  • Kim, Byeong-Su;Lee, Seon-Ho;Kim, In-Yeong;Kim, Sang-Cheol;Ra, Seon-Yeong;Jeong, Hyeon-Cheol
    • Proceedings of the Korean Statistical Society Conference / 2003.05a / pp.295-297 / 2003
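  • In this discussion, we use Hotelling's $T^2$ statistic, a multivariate analysis method, to identify significant gene sets in cDNA microarray analysis, and we evaluate how much more efficient it is than an approach based on the univariate t-statistic when these gene sets are used to classify test data into two groups.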


KCYP data analysis using Bayesian multivariate linear model (베이지안 다변량 선형 모형을 이용한 청소년 패널 데이터 분석)

  • Insun, Lee;Keunbaik, Lee
    • The Korean Journal of Applied Statistics / v.35 no.6 / pp.703-724 / 2022
  • Although longitudinal studies mainly produce multivariate longitudinal data, most existing statistical models analyze univariate longitudinal data and are therefore limited in explaining complex correlations properly. This paper describes various methods of modeling the covariance matrix to explain these complex correlations, reviewing the modified Cholesky decomposition, the modified Cholesky block decomposition, and the hypersphere decomposition. We then analyze the Korean children and youth panel (KCYP) data using the Bayesian method. The KCYP data are multivariate longitudinal data with three response variables: school adaptation, academic achievement, and dependence on mobile phones. Several models are compared under the assumption that the correlation structure and the innovation standard deviation structure differ. In the most suitable model, all explanatory variables are significant for school adaptation and academic achievement, and only household income is insignificant when mobile phone dependence is the response variable.
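
As a rough illustration of the modified Cholesky decomposition named in the abstract, the sketch below factors a covariance matrix $\Sigma$ so that $T \Sigma T^{\top} = D$, with $T$ unit lower triangular and $D$ diagonal. This is not the paper's Bayesian model: the function name `modified_cholesky` and the AR(1)-style example matrix are assumptions chosen only to make the structure visible.

```python
import numpy as np

def modified_cholesky(sigma):
    """Return (T, D) such that T @ sigma @ T.T = D.

    T is unit lower triangular; its below-diagonal entries are the negatives of
    the generalized autoregressive parameters. D is diagonal and holds the
    innovation variances.
    """
    sigma = np.asarray(sigma, dtype=float)
    C = np.linalg.cholesky(sigma)   # sigma = C @ C.T, with C lower triangular
    d = np.diag(C)                  # innovation standard deviations
    L = C / d                       # scale column j by 1/d[j] to get a unit diagonal
    T = np.linalg.inv(L)
    D = np.diag(d ** 2)
    return T, D

# Hypothetical covariance for three time points: sigma[i, j] = 0.5 ** |i - j|.
times = np.arange(3)
sigma = 0.5 ** np.abs(times[:, None] - times[None, :])

T, D = modified_cholesky(sigma)
print(np.round(T, 3))
print(np.round(np.diag(D), 3))
```

The appeal of this parameterization is that the entries of $T$ are unconstrained and the diagonal of $D$ only needs to be positive, so both can be modeled with regressions while the implied covariance matrix stays positive definite.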

Statistical Outliers in Florida Counties at the Presidential Election 2000 (2000년 미국대선 플로리다주의 투표결과 분석)

  • 김현철
    • The Korean Journal of Applied Statistics / v.15 no.1 / pp.21-32 / 2002
  • We searched for outliers in the vote data of the State of Florida from the 2000 presidential election using multivariate regression analysis. We found several outliers, including Palm Beach County. This means that, to claim that the "Butterfly Ballot" made Palm Beach County an outlier, we should analyze the number of disqualified double-punched ballots as well as the votes.

A Comparative Study of Covariance Matrix Estimators in High-Dimensional Data (고차원 데이터에서 공분산행렬의 추정에 대한 비교연구)

  • Lee, DongHyuk;Lee, Jae Won
    • The Korean Journal of Applied Statistics / v.26 no.5 / pp.747-758 / 2013
  • The covariance matrix is important in multivariate statistical analysis, and the sample covariance matrix is commonly used as its estimator. In high-dimensional data the dimension is larger than the sample size; the sample covariance matrix may therefore be unsuitable, since it is known to perform poorly and may not even be invertible. A number of covariance matrix estimators have recently been proposed following three different approaches: shrinkage, thresholding, and modified Cholesky decomposition. We compare the performance of these newly proposed estimators in various situations.
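
The invertibility problem and the shrinkage idea can be sketched as follows. This is a simplified illustration, not one of the estimators compared in the paper: the function name `shrink_covariance`, the scaled-identity target, and the fixed intensity `alpha=0.2` are assumptions, whereas published shrinkage estimators typically choose the intensity from the data.

```python
import numpy as np

def shrink_covariance(X, alpha):
    """Linear shrinkage of the sample covariance toward a scaled identity.

    alpha in [0, 1] is a fixed, hand-picked intensity (an assumption of this sketch).
    """
    X = np.asarray(X, dtype=float)
    S = np.cov(X, rowvar=False)                  # rows are samples, columns are variables
    p = S.shape[0]
    target = (np.trace(S) / p) * np.eye(p)       # identity scaled to the average variance
    return (1.0 - alpha) * S + alpha * target

# Simulated high-dimensional data: fewer samples (n) than variables (p).
rng = np.random.default_rng(0)
n, p = 10, 30
X = rng.standard_normal((n, p))

S = np.cov(X, rowvar=False)
S_shrunk = shrink_covariance(X, alpha=0.2)

rank_S = np.linalg.matrix_rank(S)                # at most n - 1, so S is singular
min_eig = np.linalg.eigvalsh(S_shrunk).min()     # strictly positive after shrinkage
print(rank_S, min_eig)
```

Because the sample covariance of `n` observations has rank at most `n - 1`, it cannot be inverted when `p > n`; adding a positive multiple of the identity lifts every eigenvalue above zero.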

Application of Statistical Analysis to Analyze the Spatial Distribution of Earthquake-induced Strain Data (지진유발 변형률 데이터의 분포 특성 분석을 위한 응용통계기법의 적용)

  • Kim, Bo-Ram;Chae, Byung-Gon;Kim, Yongje;Seo, Yong-Seok
    • The Journal of Engineering Geology / v.23 no.4 / pp.353-361 / 2013
  • To analyze the distribution of earthquake-induced strain data in rock masses, statistical analysis was performed on four-directional strain data obtained from a ground movement monitoring system installed in Korea. Strain data related to the 2011 Tohoku-oki earthquake and two aftershocks of >M7.0 in 2011 were used in x-MR control chart analysis, a type of univariate statistical analysis that can detect an abnormal distribution. The analysis revealed different dispersion times for each measurement orientation. In a more comprehensive analysis, the strain data were re-evaluated using multivariate statistical analysis (MSA) considering correlations among the various data from the different measurement orientations. $T^2$ and Q-statistics, based on principal component analysis, were used to analyze the time-series strain data in real-time. The procedures were performed with 99.9%, 99.0%, and 95.0% control limits. It is possible to use the MSA data to successfully detect an abnormal distribution caused by earthquakes because the dispersion time using the 99.9% control limit is concurrent with or earlier than that from the x-MR analysis. In addition, the dispersion using the 99.0% and 95.0% control limits detected an abnormal distribution in advance. This finding indicates the potential use of MSA for recognizing abnormal distributions of strain data.
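
A minimal sketch of PCA-based $T^2$ and Q-statistics is given below, under stated assumptions. It does not reproduce the authors' procedure or data: the four simulated channels, the helper names `fit_pca_monitor` and `t2_and_q`, the choice of one retained component, and the empirical 99th-percentile limit (a stand-in for a proper control limit) are all hypothetical.

```python
import numpy as np

def fit_pca_monitor(X_ref, k):
    """Fit a PCA monitoring model on in-control reference data, keeping k components."""
    X_ref = np.asarray(X_ref, dtype=float)
    mu = X_ref.mean(axis=0)
    sd = X_ref.std(axis=0, ddof=1)
    Z = (X_ref - mu) / sd                         # standardize each channel
    eigvals, eigvecs = np.linalg.eigh(np.cov(Z, rowvar=False))
    keep = np.argsort(eigvals)[::-1][:k]          # indices of the k largest eigenvalues
    return mu, sd, eigvecs[:, keep], eigvals[keep]

def t2_and_q(X, mu, sd, P, lam):
    """T^2 (variation inside the PCA subspace) and Q (squared residual) per row of X."""
    Z = (np.atleast_2d(np.asarray(X, dtype=float)) - mu) / sd
    scores = Z @ P
    t2 = np.sum(scores ** 2 / lam, axis=1)
    residual = Z - scores @ P.T
    q = np.sum(residual ** 2, axis=1)
    return t2, q

# Simulated reference data: four channels driven by one common signal plus small noise.
rng = np.random.default_rng(0)
common = rng.standard_normal(200)
X_ref = common[:, None] + 0.1 * rng.standard_normal((200, 4))

mu, sd, P, lam = fit_pca_monitor(X_ref, k=1)
t2_ref, q_ref = t2_and_q(X_ref, mu, sd, P, lam)
q_limit = np.percentile(q_ref, 99.0)              # empirical stand-in for a control limit

# A new reading whose channels disagree with the learned correlation pattern.
x_new = np.array([3.0, -3.0, 3.0, -3.0])
t2_new, q_new = t2_and_q(x_new, mu, sd, P, lam)
print(t2_new[0], q_new[0], q_limit)
```

The two statistics are complementary: $T^2$ flags unusually large movement along the retained components, while Q flags readings that break the correlation structure among channels, which a separate univariate chart for each channel can miss.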