• Title/Summary/Keyword: Multivariate Statistical Analysis

검색결과 632건 처리시간 0.026초

A Case Study on the Compatibility Analysis of Measurement Systems in Automobile Body Assembly

  • Lee, Myung-Duk;Lim, Ik-Sung;Sung, Chun-Ja
    • International Journal of Reliability and Applications
    • /
    • 제9권1호
    • /
    • pp.7-15
    • /
    • 2008
  • The dimensional measurement equipment, such as Coordinate Measurement Machine (CMM), Optical Coordinate Measurement Machine (OCMM), and Checking Fixture (CF), take multiple dimensional measurements for each part in an automobile industry. Measurements are also recorded under different measurement systems to see if the responses differ significantly over these systems. Each measurement system (CMM, OCMM, and CF) will be considered as different treatments. This set-up provides massive amounts of process data which are multivariate in nature. Therefore, the multivariate statistical analysis is required to analyze data that are dependent on each other. This research provides step by step methodology for the evaluation procedure of the compatibility of measurement systems and clarify a systematic analyzation among the different measurement system's compatibility followed by number of case studies for each methodologies provided.

  • PDF

LOF를 이용한 ICA 기반 통계적 공정관리의 성능 개선 방법론 (The Use of Local Outlier Factor(LOF) for Improving Performance of Independent Component Analysis(ICA) based Statistical Process Control(SPC))

  • 이재신;강복영;강석호
    • 한국경영과학회지
    • /
    • 제36권1호
    • /
    • pp.39-55
    • /
    • 2011
  • Process monitoring has been emphasized for the monitoring of complex system such as chemical processing industries to achieve the efficiency enhancement, quality management, safety improvement. Recently, ICA (Independent Component Analysis) based MSPC (Multivariate Statistical Process Control) was widely used in process monitoring approaches. Moreover, DICA (Dynamic ICA) has been introduced to consider the system dynamics. However, the existing approaches show the limitation that their performances are strongly dependent on the statistical distributions of control variables. To improve the limitation, we propose a novel approach for process monitoring by integrating DICA and LOF (Local Outlier Factor). In this paper, we aim to improve the fault detection rate with the proposed method. LOF detects local outliers by using density of surrounding space so that its performance is regardless of data distribution. Therefore, the proposed method not only can consider the system dynamics but can also assure robust performance regardless of the statistical distributions of control variables. Comparison experiments were conducted on the widely used benchmark dataset, Tennessee Eastman process (TE process), and showed the improved performance than existing approaches.

Canonical Correlation: Permutation Tests and Regression

  • Yoo, Jae-Keun;Kim, Hee-Youn;Um, Hye-Yeon
    • Communications for Statistical Applications and Methods
    • /
    • 제19권3호
    • /
    • pp.471-478
    • /
    • 2012
  • In this paper, we present a permutation test to select the number of pairs of canonical variates in canonical correlation analysis. The existing chi-squared test is known to be limited to normality in use. We compare the existing test with the proposed permutation test and study their asymptotic behaviors through numerical studies. In addition, we connect canonical correlation analysis to regression and we we show that certain inferences in regression can be done through canonical correlation analysis. A regression analysis of real data through canonical correlation analysis is illustrated.

Statistical analysis of metagenomics data

  • Calle, M. Luz
    • Genomics & Informatics
    • /
    • 제17권1호
    • /
    • pp.6.1-6.9
    • /
    • 2019
  • Understanding the role of the microbiome in human health and how it can be modulated is becoming increasingly relevant for preventive medicine and for the medical management of chronic diseases. The development of high-throughput sequencing technologies has boosted microbiome research through the study of microbial genomes and allowing a more precise quantification of microbiome abundances and function. Microbiome data analysis is challenging because it involves high-dimensional structured multivariate sparse data and because of its compositional nature. In this review we outline some of the procedures that are most commonly used for microbiome analysis and that are implemented in R packages. We place particular emphasis on the compositional structure of microbiome data. We describe the principles of compositional data analysis and distinguish between standard methods and those that fit into compositional data analysis.

적외선 분광분석과 다변량 통계에 기반한 바이오디젤 품질분석 (Analysis of biodiesel quality based on infrared spectroscopy and multivariate statistics)

  • 김혜실;조현우;유준
    • 분석과학
    • /
    • 제25권4호
    • /
    • pp.214-222
    • /
    • 2012
  • ASTM (American Society for Testing and Materials) D6751-10은 바이오디젤의 품질 규격 뿐 아니라 분석방법 또한 제시하고 있다. 하지만 ASTM 표준에 따른 바이오디젤 및 포함된 여러 불순물의 품질 분석은 경제적, 시간적으로 부담이 크다. 본 연구는 적외선 분광분석법(infrared spectroscopy)과 다변량 통계분석법 중 하나인 PLS (partial least square method)를 이용하여 1회 측정만으로 바이오 디젤 및 불순물들의 농도를 분석하는 시스템을 개발하고자 하였다. 특히, 적외선을 이용한 분석에서 생기는 각 물질의 스펙트럼에 대한 산란 보정, 노이즈 감소 등을 위해 SNV, MSC, OSC, Savitzky-Golay 등의 4가지 전처리 방법의 성능을 비교하였다. 품질 분석에 필요한 바이오 디젤 검량 모델을 PLS로 모델링 결과, Savitzky-Golay 전처리를 하였을 때 정확도가 가장 우수함을 알았다.

함수형 ARCH 분석 및 다변량 변동성을 통한 일중 로그 수익률 시간 간격 선택 (Functional ARCH analysis for a choice of time interval in intraday return via multivariate volatility)

  • 김다희;윤재은;황선영
    • 응용통계연구
    • /
    • 제33권3호
    • /
    • pp.297-308
    • /
    • 2020
  • 본 논문에서는 고빈도 함수적 ARCH 모형을 소개하고 근사모형으로써 다변량 변동성 모형을 고려하였다. 이를 기반으로 함수형 변동성 분석에서 중요한 요소인 일중 로그 수익률의 적절한 시간 간격을 찾아보았다. 또한 함수적 ARCH 모형에서 l-시차 후 변동성 예측식을 제시하고 고빈도 KOSPI 자료에 적합하여 예시하였다.

다변량 통계 분석법의 연속 적용에 의한 서부 지리산 천연림의 산림 피복형 분류 (The Classification of Forest Cover Types by Consecutive Application of Multivariate Statistical Analysis in the Natural Forest of Western Mt. Jiri)

  • 정상훈;김지홍
    • 한국산림과학회지
    • /
    • 제102권3호
    • /
    • pp.407-414
    • /
    • 2013
  • 본 연구는 다변량 통계 분석법을 이용하여 지리산 서부 천연림을 대상으로 산림 피복형을 분류하기 위해 실시하였다. 점표본법에 의한 식생자료를 바탕으로, 수종-표본점 곡선, 계층적 군집분석, 지표종분석, 다중판별분석 등의 다변량 통계 분석법을 이용하여 식생자료를 분석하였다. 수종-표본점 곡선에서는 산림 피복형 분류에서 전혀 영향력이 없는 수종들을 예외값으로 제거하였다. 예외값을 제외한 산림식생정보를 바탕으로 계층적 군집분석을 이용하여 연구대상지를 2~10개의 클러스터로 분류하였으며, 지표종분석을 통해 연구대상지의 적정 클러스터 수는 7개인 것으로 파악되었다. 이를 통계적으로 검증하기 위해 다중판별분석을 실시하였고, 91.3%가 정확하게 분류되어, 연구대상지 산림 피복형의 개수는 7개가 적당한 것으로 나타났다. 각 클러스터 상층의 우점수종 비율에 따라 신갈나무순림, 중생혼합림, 신갈나무-졸참나무림, 구상나무-신갈나무림, 들메나무림, 졸참나무림, 서어나무림으로 산림 피복형을 명명하였다.

Issues Related to the Use of Time Series in Model Building and Analysis: Review Article

  • Wei, William W.S.
    • Communications for Statistical Applications and Methods
    • /
    • 제22권3호
    • /
    • pp.209-222
    • /
    • 2015
  • Time series are used in many studies for model building and analysis. We must be very careful to understand the kind of time series data used in the analysis. In this review article, we will begin with some issues related to the use of aggregate and systematic sampling time series. Since several time series are often used in a study of the relationship of variables, we will also consider vector time series modeling and analysis. Although the basic procedures of model building between univariate time series and vector time series are the same, there are some important phenomena which are unique to vector time series. Therefore, we will also discuss some issues related to vector time models. Understanding these issues is important when we use time series data in modeling and analysis, regardless of whether it is a univariate or multivariate time series.

Common Feature Analysis of Economic Time Series: An Overview and Recent Developments

  • Centoni, Marco;Cubadda, Gianluca
    • Communications for Statistical Applications and Methods
    • /
    • 제22권5호
    • /
    • pp.415-434
    • /
    • 2015
  • In this paper we overview the literature on common features analysis of economic time series. Starting from the seminal contributions by Engle and Kozicki (1993) and Vahid and Engle (1993), we present and discuss the various notions that have been proposed to detect and model common cyclical features in macroeconometrics. In particular, we analyze in details the link between common cyclical features and the reduced-rank regression model. We also illustrate similarities and differences between the common features methodology and other popular types of multivariate time series modelling. Finally, we discuss some recent developments in this area, such as the implications of common features for univariate time series models and the analysis of common autocorrelation in medium-large dimensional systems.

오미자(Schisandra chinensis)의 국내 산지별 화학적마커 선정을 위한 LC/MS 기반의 대사체학 접근법 (LC/MS-based metabolomics approach for selection of chemical markers by domestic production region of Schisandra chinensis)

  • 김인선;오선민;송하은;김두영;윤다혜;이대영;류형원
    • Journal of Applied Biological Chemistry
    • /
    • 제66권
    • /
    • pp.467-476
    • /
    • 2023
  • 오미자(Schisandra chinensis)는 오미자과에 속하는 낙엽활엽덩굴식물로 한국, 일본, 중국, 대만 등 동아시아에 널리 분포한다. 오미자에 함유된 주요 성분에는 리그난 화합물뿐만 아니라 트리테르페노이드 화합물도 포함되어 있는 것으로 보고되었다. 한국 산지별 오미자의 특성을 구별하기 위해 대사산물 프로파일링과 다변량 통계 분석 기법인 PCA을 수행하여 판별식을 설정하였고, 그 결과 triterpenoids 16종, lignan 9종, flavonoid, phenylpropanoid, fatty acid 각 1종을 동정하였다. 또한 다변량 통계분석을 통해 OPLS-DA의 s-plot 모델을 적용하여 단양, 문경, 거창, 평창의 4개 그룹을 구분하는 것을 확인하였고, lanostane, cycloartane, 그리고 schiartane triterpenoid, dibenzocyclooctadiene lignan 이 각각 화학적마커로 동정하였다.