• Title/Summary/Keyword: multivariate data analysis

Search Result 1,405, Processing Time 0.024 seconds

Design of Intelligent Material Quality Control System based on Pattern Analysis using Artificial Neural Network (인공 신경망의 패턴분석에 근거한 지능적 부품품질 관리시스템의 설계)

  • 이장희;유성진;박상찬
    • Journal of Korean Society for Quality Management
    • /
    • v.29 no.4
    • /
    • pp.38-53
    • /
    • 2001
  • In resolving industrial quality control problems, a vector of multiple quality characteristic variables is involved rather than a single variable. However, it is not guaranteed that a multivariate control chart based on statistical methods can monitor abnormal signal in case that small changes of relationship between each variables causes abnormal production process. Hence a quality control system for real-time monitoring of the multi-dimensional quality characteristic vector under a multivariate normal process is needed to enhance tile production system quality performance. A pattern analysis approach based on self-organizing map (SOM), an unsupervised learning technique of neural network, is applied to the design of such a quality control system. In this study we present a new material quality control system based on pattern analysis approach and illustrate the effectiveness of proposed system using actual electronic company material data.

  • PDF

Abnormality Detection to Non-linear Multivariate Process Using Supervised Learning Methods (지도학습기법을 이용한 비선형 다변량 공정의 비정상 상태 탐지)

  • Son, Young-Tae;Yun, Deok-Kyun
    • IE interfaces
    • /
    • v.24 no.1
    • /
    • pp.8-14
    • /
    • 2011
  • Principal Component Analysis (PCA) reduces the dimensionality of the process by creating a new set of variables, Principal components (PCs), which attempt to reflect the true underlying process dimension. However, for highly nonlinear processes, this form of monitoring may not be efficient since the process dimensionality can't be represented by a small number of PCs. Examples include the process of semiconductors, pharmaceuticals and chemicals. Nonlinear correlated process variables can be reduced to a set of nonlinear principal components, through the application of Kernel Principal Component Analysis (KPCA). Support Vector Data Description (SVDD) which has roots in a supervised learning theory is a training algorithm based on structural risk minimization. Its control limit does not depend on the distribution, but adapts to the real data. So, in this paper proposes a non-linear process monitoring technique based on supervised learning methods and KPCA. Through simulated examples, it has been shown that the proposed monitoring chart is more effective than $T^2$ chart for nonlinear processes.

The Methodological Aspects of Forecasting and the Analysis of Macroeconomic Indicators

  • VYBOROVA, Elena Nikolaevna
    • East Asian Journal of Business Economics (EAJBE)
    • /
    • v.10 no.2
    • /
    • pp.31-42
    • /
    • 2022
  • Purpose - The main research goals by macroeconomic analysis is to assess the effectiveness of state regulation, the sustainability of development, and the financial stability of the state. Research design, Data, and methodology - The research were analyzed using the methods of multivariate statistics and application of the software package Stat graphics. The volume of data from the 1995 to the 2021 was analyzed by Russian Federation. The scale of research on Belarus: to be analyzed the amount of data from the 2015 by 2021, on Kazakhstan - from the 19941, on Kyrgyzstan - from the 2002, on Tajikistan - from the 2008, on Armenia - from the 2021, on Japan - since the 1970, on China - since the 1950, on South Korea - since the 1953. Result - The methods of multivariate statistics was demonstrated exact of result in forecasting of macroeconomic indicators. The most of tendency with the accurate results of are described using the second-degree polynomials. In the most research of country there are the macroeconomic proportion are broken. Conclusion - In the countries studied, the monetary aggregates have a significant growth rate. The shares with a substantial monetary stock and the speed of its growth are divided in the two groups: having placements in the real sectors of the economy and not having received the same result of development from the growth of the monetary stock.

Constructing Simultaneous Confidence Intervals for the Difference of Proportions from Multivariate Binomial Distributions

  • Jeong, Hyeong-Chul;Kim, Dae-Hak
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.1
    • /
    • pp.129-140
    • /
    • 2009
  • In this paper, we consider simultaneous confidence intervals for the difference of proportions between two groups taken from multivariate binomial distributions in a nonparametric way. We briefly discuss the construction of simultaneous confidence intervals using the method of adjusting the p-values in multiple tests. The features of bootstrap simultaneous confidence intervals using non-pooled samples are presented. We also compute confidence intervals from the adjusted p-values of multiple tests in the Westfall (1985) style based on a pooled sample. The average coverage probabilities of the bootstrap simultaneous confidence intervals are compared with those of the Bonferroni simultaneous confidence intervals and the Sidak simultaneous confidence intervals. Finally, we give an example that shows how the proposed bootstrap simultaneous confidence intervals can be utilized through data analysis.

A study on the fuzzy based inference using multivariate human sensibility database (다변량해석기법에 의한 감성 데이터베이스를 활용한 감성공학적 퍼지추론에 관한 연구)

  • 한성배;양선모;정기원;김형범;박정호;이순요
    • Proceedings of the Korean Operations and Management Science Society Conference
    • /
    • 1996.04a
    • /
    • pp.407-410
    • /
    • 1996
  • This paper presents how to build a human sensibility database by multivariate method. And, we discribe a fuzzy based inference system which converts human sensibility data to design factors using the human sensibility database. We are able to obtain the values of multiple correlation coeffcient, partial correlation coefficient, and categories by the quantification theory which is multivariate analysis. So, the human sensibility database is constructed from those values. The inference system will be more useful, if the human sensibility database and graphic design factor database were integrated.

  • PDF

Comparison of Forecasting Performance in Multivariate Nonstationary Seasonal Time Series Models (다변량 비정상 계절형 시계열모형의 예측력 비교)

  • Seong, Byeong-Chan
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.1
    • /
    • pp.13-21
    • /
    • 2011
  • This paper studies the analysis of multivariate nonstationary time series with seasonality. Three types of multivariate time series models are considered: seasonal cointegration model, nonseasonal cointegration model with seasonal dummies, and vector autoregressive model in seasonal differences that are compared for forecasting performances using Korean macro-economic time series data. The cointegration models produce smaller forecast errors in short horizons; however, when longer forecasting periods are considered the vector autoregressive model appears preferable.

The Evaluation of Water Quality in Coastal Sea of Kunsan Using Statistic Analysis (통계분석기법을 이용한 군산연안해역의 수질평가)

  • Lee, Nam-Do;Kim, Jong-Gu
    • Journal of Environmental Science International
    • /
    • v.16 no.3
    • /
    • pp.369-376
    • /
    • 2007
  • This study was conducted to evaluate water quality in coastal sea of Kunsan using multivariate analysis. The analysis data in Coastal Sea of Kunsan use of surveyed data by the NFRDI from April 2000 to November 2002. Twelve water Quality parameter were determined on each sample. The results was summarized as follow ; Water quality in coastal sea of Kunsan could be explained up to 62.782% by four factors which were included in loading of nitrogen-nutrients by Keum river(24.688%), suspended solids variation (12.180%), seasonal climate variation (18.367%) and variation of DIP (10.546%). To analyze spatially and monthly variation by factor score, it was divided by inner area and outer area spatially, and spring and summer monthly. The result of time series analysis by factor score, inner area of Kunsan coastal sea(St.1 and St. 2) was the most affected by nitrogen-nutrient and suspended solids due to runoff by Keum river. It could be suggested from these results that it is important to reduce tile pollution loads from Kuem river for the control of the water quality in coastal sea of Kunsan.

The Comparison of Singular Value Decomposition and Spectral Decomposition

  • Shin, Yang-Gyu
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.4
    • /
    • pp.1135-1143
    • /
    • 2007
  • The singular value decomposition and the spectral decomposition are the useful methods in the area of matrix computation for multivariate techniques such as principal component analysis and multidimensional scaling. These techniques aim to find a simpler geometric structure for the data points. The singular value decomposition and the spectral decomposition are the methods being used in these techniques for this purpose. In this paper, the singular value decomposition and the spectral decomposition are compared.

  • PDF

Principles of Multivariate Data Visualization

  • Huh, Moon Yul;Cha, Woon Ock
    • Communications for Statistical Applications and Methods
    • /
    • v.11 no.3
    • /
    • pp.465-474
    • /
    • 2004
  • Data visualization is the automation process and the discovery process to data sets in an effort to discover underlying information from the data. It provides rich visual depictions of the data. It has distinct advantages over traditional data analysis techniques such as exploring the structure of large scale data set both in the sense of number of observations and the number of variables by allowing great interaction with the data and end-user. We discuss the principles of data visualization and evaluate the characteristics of various tools of visualization according to these principles.

Method for predicting the diagnosis of mastitis in cows using multivariate data and Recurrent Neural Network (다변량 데이터와 순환 신경망을 이용한 젖소의 유방염 진단예측 방법)

  • Park, Gicheol;Lee, Seonghun;Park, Jaehwa
    • Journal of Software Assessment and Valuation
    • /
    • v.17 no.1
    • /
    • pp.75-82
    • /
    • 2021
  • Mastitis in cows is a major factor that hinders dairy productivity of farms, and many attempts have been made to solve it. However, research on mastitis has been limited to diagnosis rather than prediction, and even this is mostly using a single sensor. In this study, a predictive model was developed using multivariate data including biometric data and environmental data. The data used for the analysis were collected from robot milking machines and sensors installed in farmhouses in Chungcheongnam-do, South Korea. The recurrent neural network model using three weeks of data predicts whether or not mastitis is diagnosed the next day. As a result, mastitis was predicted with an accuracy of 82.9%. The superiority of the model was confirmed by comparing the performance of various data collection periods and various models.