• Title/Summary/Keyword: Multivariate Process

Search Result 297, Processing Time 0.036 seconds

Application of multivariate statistics towards the geochemical evaluation of fluoride enrichment in groundwater at Shilabati river bank, West Bengal, India

  • Ghosh, Arghya;Mondal, Sandip
    • Environmental Engineering Research
    • /
    • v.24 no.2
    • /
    • pp.279-288
    • /
    • 2019
  • To obtain insightful knowledge of geochemical process controlling fluoride enrichment in groundwater of the villages near Shilabati river bank, West Bengal, India, multivariate statistical techniques were applied to a subgroup of the dataset generated from major ion analysis of groundwater samples. Water quality analysis of major ion chemistry revealed elevated levels of fluoride concentration in groundwater. Factor analysis (FA) of fifteen hydrochemical parameters demonstrated that fluoride occurrence was due to the weathering and dissolution of fluoride-bearing minerals in the aquifer. A strong positive loading (> 0.75) of fluoride with pH and bicarbonate for FA indicates an alkaline dominated environment responsible for leaching of fluoride from the source material. Mineralogical analysis of soli sediment exhibits the presence of fluoride-bearing minerals in underground geology. Hierarchical cluster analysis (HCA) was carried out to isolate the sampling sites according to groundwater quality. With HCA the sampling sites were isolated into three clusters. The occurrence of abundant fluoride in the higher elevated area of the observed three different clusters revealed that there was more contact opportunity of recharging water with the minerals present in the aquifer during infiltration through the vadose zone.

An Effective Multivariate Control Framework for Monitoring Cloud Systems Performance

  • Hababeh, Ismail;Thabain, Anton;Alouneh, Sahel
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.1
    • /
    • pp.86-109
    • /
    • 2019
  • Cloud computing systems' performance is still a central focus of research for determining optimal resource utilization. Running several existing benchmarks simultaneously serves to acquire performance information from specific cloud system resources. However, the complexity of monitoring the existing performance of computing systems is a challenge requiring an efficient and interactive user directing performance-monitoring system. In this paper, we propose an effective multivariate control framework for monitoring cloud systems performance. The proposed framework utilizes the hardware cloud systems performance metrics, collects and displays the performance measurements in terms of meaningful graphics, stores the graphical information in a database, and provides the data on-demand without requiring a third party software. We present performance metrics in terms of CPU usage, RAM availability, number of cloud active machines, and number of running processes on the selected machines that can be monitored at a high control level by either using a cloud service customer or a cloud service provider. The experimental results show that the proposed framework is reliable, scalable, precise, and thus outperforming its counterparts in the field of monitoring cloud performance.

A Study on the Node Split in Decision Tree with Multivariate Target Variables (다변량 목표변수를 갖는 의사결정나무의 노드분리에 관한 연구)

  • Kim, Seong-Jun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.386-390
    • /
    • 2003
  • Data mining is a process of discovering useful patterns for decision making from an amount of data. It has recently received much attention in a wide range of business and engineering fields. Classifying a group into subgroups is one of the most important subjects in data mining. Tree-based methods, known as decision trees, provide an efficient way to finding the classification model. The primary concern in tree learning is to minimize a node impurity, which is evaluated using a target variable in the data set. However, there are situations where multiple target variable should be taken into account, for example, such as manufacturing process monitoring, marketing science, and clinical and health analysis. The purpose of this article is to present some methods for measuring the node impurity, which are applicable to data sets with multivariate target variables. For illustration, a numerical cxample is given with discussion.

Differential Metabolomics Analysis of Ginseng (Panax ginseng) by Processing Time (가공시간에 따른 인삼의 대사체학 분석)

  • Choi, Moon-Young;Kim, Kyung-Min;Choi, Min-Suk;Heo, Yun-Seok;Lee, Hae-Na;Lee, Choong-Woo;Kwon, Sung-Won
    • Journal of Pharmaceutical Investigation
    • /
    • v.38 no.1
    • /
    • pp.23-29
    • /
    • 2008
  • Red ginseng is made of white ginseng through the steaming and drying procedure. In this process, the amounts of toxic elements of ginseng are decreased and those of effective components, ginsenosides are increased. In order to identify the components alteration of white ginseng by processing time, we applied HPLC-based metabolomics approach combined with the principal component analysis (PCA) multivariate analysis. White ginsengs were steamed at 0, 1, 2, 4, 8 and 16 h, respectively and followed by drying process at moderate temperature. Then the steamed ginsengs and the commercial red ginsengs were analyzed by HPLC. On the basis of HPLC results, PCA multivariate analysis was applied for evaluating the quality of red ginseng, which showed the processed ginsengs are grouped by processed time because less polar ginsenosides were increased in proportion as the steaming time was increased. The purchased red ginsengs were distributed in the range of $0{\sim}1$ hour steaming time. This pilot experiment suggests that HPLC-based metabolomics approach is able to allow the quality of herbal medicines to be controlled with a simple and economic method.

Diagnostic Classification Based on Nonlinear Representation and Filtering of Process Measurement Data (공정측정데이터의 비선형표현과 전처리를 활용한 분류기반 진단)

  • Cho, Hyun-Woo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.16 no.5
    • /
    • pp.3000-3005
    • /
    • 2015
  • Reliable monitoring and diagnosis of industrial processes is quite important for in terms of quality and safety. The goal of fault diagnosis is to find process variables responsible for causing specific abnormalities of the process. This work presents a classification-based diagnostic scheme based on nonlinear representation of process data. The use of a nonlinear kernel technique is able to reduce the size of the data considered and provides efficient and reliable representation of the measurement data. As a filtering stage a preprocessing is performed to eliminate unwanted parts of the data with enhanced performance. The case study of an industrial batch process has shown that the performance of the scheme outperformed other methods. In addition, the use of a nonlinear representation technique and filtering improved the diagnosis performance in the case study.

Application of Multivariate Statistical Analysis Technique in Landfill Investigation (매립물 특성 조사를 위한 다변량 통계분석 기법의 응용)

  • Kwon, Byung-Doo;Kim, Cha-Soup
    • Journal of the Korean earth science society
    • /
    • v.18 no.6
    • /
    • pp.515-521
    • /
    • 1997
  • To investigate the nature of the waste materials in the Nanjido Landfill, we have conducted multivariate statistical analysis of geophysical data set comprised of magnetic, gravity, LandSat TM thermal band and surface depression measurement data. Because these data sets show different responses to the depth, we have transformed the observed total field magnetic data and gravity data to the residual reduced-to-pole(RTP) magnetic anomalies and the three dimensional density anomalies, respectively, and utilized the informations about the upper shallow part of the landfills only in the following process. For the statistical analysis at the points of depression measurement, the magnetic, density and LandSat data values at these points are determined by interpolation process. Since the multivarite statistical analysis technique utilizes a clustering algorithm for classification of data set and we have measured the dissimilarity between objects by using Euclidean distance, standardization was applied prior to distance calculation in order to eliminate any scaling effects due to different measurement unit of each data set. The hierarchial grouping technique was used to construct the dendrogram. The optimum number of statistical groups(clusters), which are classified on the basis of geophysical and geotechnical characteristics, appeared to be six on the resulting dendrogram. The result of this study suggests that the dimension and nature of the multicomponent waste landfills can be identified by application of the multivarite statistical analysis technique to integrated geophysical data sets.

  • PDF

Economic Evaluation of Measurement System by Principal Component Analysis (주성분 분석을 이용한 측정시스템의 경제적 평가)

  • Kang, Chung-Oh;Byun, Jai-Hyun
    • Journal of Korean Institute of Industrial Engineers
    • /
    • v.24 no.2
    • /
    • pp.211-221
    • /
    • 1998
  • It is very important to have a satisfactory measurement system, since it is useless to try to improve the manufacturing process without an adequate measurement system. Therefore, evaluation of the measurement system is the first step for the quality improvement of the manufacturing process. To estimate the measurement error we must conduct a controlled gage repeatability and reproducibility(gage R&R) study. Many manufacturers use a gage or instrument to measure multiple dimensions for the overall quality of the manufactured parts. In this case, it is necessary to estimate the gage R&R for multiple dimensions. When a gage measures a large number of dimensions of a part, it is very time-consuming and costly to measure all the dimensions. In this paper we propose the use of the principal component analysis method to identify a few principal components out of the original multivariate measurement capability to explain most of the measurement system variation pattern.

  • PDF

A NOTE ON THE GENERALIZED HEAT CONTENT FOR LÉVY PROCESSES

  • Cygan, Wojciech;Grzywny, Tomasz
    • Bulletin of the Korean Mathematical Society
    • /
    • v.55 no.5
    • /
    • pp.1463-1481
    • /
    • 2018
  • Let $X=\{X_t\}_{t{\geq}0}$ be a $L{\acute{e}}vy$ process in ${\mathbb{R}}^d$ and ${\Omega}$ be an open subset of ${\mathbb{R}}^d$ with finite Lebesgue measure. The quantity $H_{\Omega}(t)={\int_{\Omega}}{\mathbb{P}}^x(X_t{\in}{\Omega})$ dx is called the heat content. In this article we consider its generalized version $H^{\mu}_g(t)={\int_{\mathbb{R}^d}}{\mathbb{E}^xg(X_t){\mu}(dx)$, where g is a bounded function and ${\mu}$ a finite Borel measure. We study its asymptotic behaviour at zero for various classes of $L{\acute{e}}vy$ processes.

Empirical process optimization through response surface experiments and model building

  • PARK, SUNG H.
    • Journal of Korean Society for Quality Management
    • /
    • v.8 no.1
    • /
    • pp.3-7
    • /
    • 1980
  • In many industrial processes, there are more than two responses (i.e., yield, percent impurity, etc.) of interest, and it is desirable to determine the optimal levels of the factors (i.e., temperature, pressure, etc.) that influence the responses. Suppose the response relationships are assumed to be approximated by second-order polynomial regression models. The problems considered in this paper is, first, to propose how to select polynomial terms to fit the multivariate regression surfaces for a given set of data, and, second, to propose how to analyze the data to obtain an optimal operating condition for the factors. The proposed techniques were applied for empirical process optimization in a tire company in Korea. This case is presented as an illustration.

  • PDF

On Statistical Estimation of Multivariate (Vector-valued) Process Capability Indices with Bootstraps)

  • Cho, Joong-Jae;Park, Byoung-Sun;Lim, Soo-Duck
    • Communications for Statistical Applications and Methods
    • /
    • v.8 no.3
    • /
    • pp.697-709
    • /
    • 2001
  • In this paper we study two vector-valued process capability indices $C_{p}$=($C_{px}$, $C_{py}$ ) and C/aub pm/=( $C_{pmx}$, $C_{pmy}$) considering process capability indices $C_{p}$ and $C_{pm}$ . First, two asymptotic distributions of plug-in estimators $C_{p}$=($C_{px}$, $C_{py}$ ) and $C_{pm}$ =) $C_{pmx}$, $C_{pmy}$) are derived.. With the asymptotic distributions, we propose asymptotic confidence regions for our indices. Next, obtaining the asymptotic distributions of two bootstrap estimators $C_{p}$=($C_{px}$, $C_{py}$ )and $C_{pm}$ =( $C_{pmx}$, $C_{pmy}$) with our bootstrap algorithm, we will provide the consistency of our bootstrap for statistical inference. Also, with the consistency of our bootstrap, we propose bootstrap asymptotic confidence regions for our indices. (no abstract, see full-text)see full-text)e full-text)

  • PDF