• Title/Abstract/Keywords: Multivariate statistical technique


Fault Detection & SPC of Batch Process using Multi-way Regression Method

  • 우경섭;이창준;한경훈;고재욱;윤인섭
    • Korean Chemical Engineering Research / Vol. 45, No. 1 / pp. 32-38 / 2007
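  • Statistical process control techniques were applied to batch processes to build a system that diagnoses the process state more quickly and easily from ordinary batch process data. Using data from a semiconductor etching process, a representative batch process, and from a semi-batch styrene-butadiene rubber production process, we built models that identify the relationship between the process variables and the process state. The model outputs were used to construct statistical process control charts, and faults were identified by analyzing how the process evolved over time. The multi-way batch data were first unfolded into two-way form, and multivariate regression methods such as Support Vector Regression and Partial Least Squares were used for modeling. Error charts and variable contribution charts were also used to compute the magnitude and type of each fault and the contribution of each variable to the faulty data. The results showed that the approach performs well, diagnosing not only whether and when a fault occurred but also its magnitude and cause.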
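The unfolding step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes batch-wise unfolding, where each batch becomes one row holding all time steps and variables side by side; the abstract does not say which unfolding direction the authors used. The function name `unfold_batchwise` and the sample numbers are hypothetical.

```python
from typing import List

def unfold_batchwise(data: List[List[List[float]]]) -> List[List[float]]:
    """Unfold a (batch x time x variable) array into a (batch x time*variable) matrix.

    Assumption: batch-wise unfolding; the source abstract does not state the direction.
    """
    unfolded = []
    for batch in data:
        row: List[float] = []
        for time_step in batch:
            # Append all variables measured at this time step to the batch's row.
            row.extend(time_step)
        unfolded.append(row)
    return unfolded

# Hypothetical data: 2 batches, 3 time steps, 2 process variables.
batches = [
    [[1.0, 10.0], [2.0, 20.0], [3.0, 30.0]],
    [[1.5, 11.0], [2.5, 21.0], [3.5, 31.0]],
]
X = unfold_batchwise(batches)
print(len(X), len(X[0]))
```

The resulting two-way matrix `X` is the kind of input a regression model such as PLS or SVR would then be fitted on.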

A Propose of New Classification Indication about Work of Art through Numeric and Multivariate Data Analysis - Focused on the Specialist -

  • 서명애;이상복
    • 품질경영학회지 (Journal of the Korean Society for Quality Management) / Vol. 35, No. 4 / pp. 67-77 / 2007
  • This paper attempts a new way of interpreting works of art. Until now, a work of art has been interpreted with respect to the intention of the artist who made it. Critics have classified works by period and region and interpreted them in terms of the philosophical thought of the time, so interpretation has rested on the individual subjectivity of the artist and of the viewer. In this paper, by contrast, we combine various criteria for appreciating a work of art and try to present the closeness among works in a new way. The aim is to make assessments that are otherwise bound to individual subjectivity more objective through statistical techniques. The introduction surveys culture and art, and the main body discusses the interpretation of works of art. We define six categories and explain each criterion and its assessment method. Finally, we propose a new interpretation based on the closeness obtained from a multivariate analysis of the assessment results.

Neural-based Blind Modeling of Mini-mill ASC Crown

  • Lee, Gang-Hwa;Lee, Dong-Il;Lee, Seung-Joon;Lee, Suk-Gyu;Kim, Shin-Il;Park, Hae-Doo;Park, Seung-Gap
    • 한국지능시스템학회논문지 (Journal of the Korean Institute of Intelligent Systems) / Vol. 12, No. 6 / pp. 577-582 / 2002
  • A neural network can be trained to approximate an arbitrary nonlinear function of multivariate data such as the mini-mill crown values in Automatic Shape Control. The trained weights of the network can evaluate or generalize process data outside the training vectors. Sometimes blind modeling of the process data is necessary for comparison with the scattered analytical model of the mini-mill process in isolated electro-mechanical forms. To come up with a viable model, we propose a blind neural-based range-division domain-clustering piecewise-linear modeling scheme. The basic ideas are: 1) dividing the range of the target data, 2) clustering the corresponding input space vectors, 3) training the neural network with clustered prototypes to smooth out the convergence, and 4) solving the resulting matrix equations with a pseudo-inverse to alleviate the ill-conditioning problem. The simulation results support the effectiveness of the proposed scheme, which opens a new approach to data analysis. Compared with statistical regression, the proposed scheme obtains better modeling error uniformity and considerably reduces the magnitudes of the errors, giving approximately 10-fold better performance.

Study on New Classification Indication about Work of Art through Multi-variate Data Analysis;On Focused Specialist

  • 서명애;이상복
    • 한국품질경영학회:학술대회논문집 (Korean Society for Quality Management: Conference Proceedings) / 2006 Fall Conference of the Korean Society for Quality Management / pp. 251-259 / 2006
  • When a work of art is evaluated apart from the artist's intention, the evaluation cannot escape the limits of the viewer's own values. This paper attempts a new way of interpreting works of art. Until now, a work of art has been interpreted with respect to the intention of the artist who made it. Critics have classified works by period and region and interpreted them in terms of the philosophical thought of the time, so interpretation has rested on the individual subjectivity of the artist and of the viewer. In this paper, by contrast, we combine various criteria for appreciating a work of art and try to present the closeness among works in a new way. The aim is to make assessments that are otherwise bound to individual subjectivity more objective through statistical techniques. The introduction surveys culture and art, and the main body discusses the interpretation of works of art. We define six categories and explain each criterion and its assessment method. Finally, we propose a new interpretation based on the closeness obtained from a multivariate analysis of the assessment results. We expect this research to offer a meaningful basis for moving beyond simply viewing a work of art toward understanding it.


Representing variables in the latent space

  • 허명회
    • 응용통계연구 (The Korean Journal of Applied Statistics) / Vol. 30, No. 4 / pp. 555-566 / 2017
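  • When the number of variables p in multivariate data is large, conventional dimension reduction such as principal component analysis may not be effective. Effective visualization requires a reduced space of about two or three dimensions, but the latent dimension of the observations can be much larger than that. This paper proposes a two-step procedure: partition the analysis variables across multiple latent dimensions and explore each part with dimension-reduction methods, then visualize the relationships among the parts. The R package ClustOfVar is used for the "latent-variable variable clustering" that partitions the variables across latent dimensions. The principal component analysis biplot is used to visualize each variable cluster, and the technique of embedding supplementary variables is used to visualize the relationship between each variable cluster and external latent variables or external variables.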

Liquid Chromatography-Mass Spectrometry-Based Chemotaxonomic Classification of Aspergillus spp. and Evaluation of the Biological Activity of Its Unique Metabolite, Neosartorin

  • Lee, Mee Youn;Park, Hye Min;Son, Gun Hee;Lee, Choong Hwan
    • Journal of Microbiology and Biotechnology / Vol. 23, No. 7 / pp. 932-941 / 2013
  • This work aimed to classify Aspergillus (8 species, 28 strains) by using a secondary metabolite profile-based chemotaxonomic classification technique. Secondary metabolites were analyzed by liquid chromatography ion-trap mass spectrometry (LC-IT-MS) and multivariate statistical analysis. Most strains were generally well separated from each section. A. lentulus was discriminated from the other seven species (A. fumigatus, A. fennelliae, A. niger, A. kawachii, A. flavus, A. oryzae, and A. sojae) by partial least-squares discriminant analysis (PLS-DA) with five discriminant metabolites: 4,6-dihydroxymellein, fumigatin, 5,8-dihydroxy-9-octadecenoic acid, cyclopiazonic acid, and neosartorin. Among them, neosartorin was identified as an A. lentulus-specific compound that showed anticancer activity, as well as antibacterial effects on Staphylococcus epidermidis. This study showed that metabolite-based chemotaxonomic classification is an effective tool for the classification of Aspergillus spp. with species-specific activity.

Discovering Relationships between Skin Type and Life Style Using Data Mining Techniques: A Case Study of Korea

  • Kim, Taeheung;Ha, Jihyun;Lee, Jong-Seok;Oh, Younhak;Cho, Yong Ju
    • Industrial Engineering and Management Systems / Vol. 15, No. 1 / pp. 110-121 / 2016
  • With the growing interest in skincare and maintenance, there are increasing numbers of studies on the classification of skin type and the factors influencing each type. This study presents a novel methodology by using data mining, for the determination of the relationships between skin type, lifestyle, and patterns of cosmetic utilization. Eight skin-specific factors, which are moisture, sebum in U-zone (both cheeks), sebum in T-zone (forehead, nose, and chin), pore, melanin, wrinkle, acne, hemoglobin, were measured in 1,246 subjects living in South Korea, in conjunction with a questionnaire survey analyzing their lifestyles and pattern of cosmetic utilization. Using various multivariate statistical methods and data mining techniques, we classified the skin types based on the skin-specific values, determined the relationship between skin type and lifestyle, and accordingly sorted the subjects into clusters. Logistic regression analysis revealed gender-related differences in the skin; therefore, separate analyses were performed for males and females. Using the Gaussian Mixture Modeling (GMM) technique, we classified the subjects based on skin type (two male and four female). Using the ANOVA and decision tree techniques, we attempted to characterize the relationship between each skin type and the lifestyles of the subjects. Menstruation, eating habits, stress, and smoking were identified as the major factors affecting the skin.

Parameter Regionalization of Semi-Distributed Runoff Model Using Multivariate Statistical Analysis

  • 이병주;정일원;배덕효
    • 한국수자원학회논문집 (Journal of Korea Water Resources Association) / Vol. 42, No. 2 / pp. 149-160 / 2009
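  • This study proposes a parameter regionalization technique that couples two multivariate statistical methods, principal component analysis and hierarchical cluster analysis, as a way to apply a semi-distributed rainfall-runoff model to ungauged basins. Seven basin characteristics (basin area, mean elevation, mean slope, forest area ratio, saturated soil water content, field capacity, and permanent wilting point) were extracted for 109 mid-sized basins, and principal component analysis showed that the first and second components explain 82.11% of the total data. The first component was found to be related to basin location and the second to basin size. Cluster analysis on these component scores classified the 103 ungauged basins into 6 gauged basins: 23 ungauged basins were grouped with Goesan Dam, 6 with Andong Dam, 5 with Imha Dam, 21 with Hapcheon Dam, 4 with Yongdam Dam, and 44 with Seomjingang Dam. The SWAT model was selected as the runoff model, and its parameters were estimated for the 6 gauged basins. To evaluate the applicability of the regionalization, runoff was simulated with the regionalized parameters for the basins upstream of the Soyang, Chungju, and Daecheong dams, which were treated as ungauged. The model efficiency coefficient was 0.8 or higher, indicating a very good fit to the observations. These results confirm that parameter regionalization based on multivariate statistical analysis can be used for runoff simulation in ungauged basins.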
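The idea of grouping ungauged basins with gauged ones in principal component score space can be sketched as follows. This is a simplified stand-in, not the authors' method: it assigns each ungauged basin to the nearest gauged basin by Euclidean distance instead of running hierarchical cluster analysis. The component scores and the names `basin_A` and `basin_B` are hypothetical.

```python
import math
from typing import Dict, Tuple

Score = Tuple[float, float]  # (first component score, second component score)

def assign_to_gauged(ungauged: Dict[str, Score],
                     gauged: Dict[str, Score]) -> Dict[str, str]:
    """Map each ungauged basin to the gauged basin closest in score space.

    Assumption: nearest-neighbour assignment replaces the hierarchical
    clustering described in the abstract.
    """
    assignment = {}
    for name, score in ungauged.items():
        # Pick the gauged basin with the smallest Euclidean distance.
        assignment[name] = min(gauged, key=lambda g: math.dist(score, gauged[g]))
    return assignment

# Hypothetical component scores, for illustration only.
gauged_scores = {"Goesan": (0.0, 0.0), "Andong": (3.0, 1.0)}
ungauged_scores = {"basin_A": (0.5, 0.2), "basin_B": (2.5, 1.5)}

groups = assign_to_gauged(ungauged_scores, gauged_scores)
print(groups)
```

Each ungauged basin would then borrow the runoff model parameters estimated for the gauged basin it is grouped with.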

MEAT SPECIATION USING A HIERARCHICAL APPROACH AND LOGISTIC REGRESSION

  • Arnalds, Thosteinn;Fearn, Tom;Downey, Gerard
    • 한국근적외분광분석학회:학술대회논문집 (Korean Society of Near Infrared Spectroscopy: Conference Proceedings) / Korean Society of Near Infrared Spectroscopy, NIR-2001 (2001) / pp. 1245-1245 / 2001
  • Food adulteration is a serious consumer fraud and a matter of concern to food processors and regulatory agencies. A range of analytical methods has been investigated to facilitate the detection of adulterated or mislabelled foods and food ingredients, but most of these require sophisticated equipment and highly qualified staff and are time-consuming. Regulatory authorities and the food industry require a screening technique that will facilitate fast and relatively inexpensive monitoring of food products with a high level of accuracy. Near infrared spectroscopy has been investigated for its potential in a number of authenticity issues including meat speciation (McElhinney, Downey & Fearn (1999) JNIRS, 7(3), 145-154; Downey, McElhinney & Fearn (2000) Appl. Spectrosc. 54(6), 894-899). This report describes further analysis of these spectral sets using a hierarchical approach and binary decisions solved using logistic regression. The sample set comprised 230 homogenized meat samples, i.e. chicken (55), turkey (54), pork (55), beef (32) and lamb (34), purchased locally as whole cuts of meat over a 10-12 week period. NIR reflectance spectra were recorded over the wavelength range 400-2498 nm at 2 nm intervals on a NIR Systems 6500 scanning monochromator. The problem was defined as a series of binary decisions, i.e. is the meat red or white? is the red meat beef or lamb? is the white meat pork or poultry? etc. Each of these decisions was made using an individual binary logistic model based on scores derived from principal component or partial least squares (PLS1 and PLS2) analysis. The results obtained were equal to or better than previous reports using factorial discriminant analysis, K-nearest neighbours and PLS2 regression. This new approach using a combination of exploratory and logistic analyses also appears to have advantages of transparency and the use of inherent structure in the spectral data. Additionally, it allows for the use of different data transforms and multivariate regression techniques at each decision step.
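The series of binary decisions described in the abstract above can be sketched as a small decision hierarchy in which every node is its own logistic model. The coefficients, the two-element score vectors, and the final chicken-versus-turkey split are assumptions made for illustration; the abstract only names the first three questions and ends with "etc.".

```python
import math
from typing import Dict, Sequence, Tuple

def logistic(scores: Sequence[float], weights: Sequence[float], bias: float) -> float:
    """Probability of the first-named class under a binary logistic model."""
    z = bias + sum(w * s for w, s in zip(weights, scores))
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical (weights, bias) per decision; real values would be fitted
# on principal component or PLS scores of the spectra.
MODELS: Dict[str, Tuple[Sequence[float], float]] = {
    "red_vs_white": ([2.0, 0.0], 0.0),
    "beef_vs_lamb": ([0.0, 1.5], 0.0),
    "pork_vs_poultry": ([0.0, -1.5], 0.0),
    "chicken_vs_turkey": ([1.0, 1.0], 0.5),  # assumed final split
}

def classify(scores: Sequence[float]) -> str:
    """Walk the hierarchy, answering one binary question per level."""
    def yes(name: str) -> bool:
        weights, bias = MODELS[name]
        return logistic(scores, weights, bias) >= 0.5

    if yes("red_vs_white"):
        return "beef" if yes("beef_vs_lamb") else "lamb"
    if yes("pork_vs_poultry"):
        return "pork"
    return "chicken" if yes("chicken_vs_turkey") else "turkey"

print(classify([1.0, 1.0]), classify([-1.0, -1.0]))
```

Because each node is a separate model, a different data transform or score type could be used at each step, which is the flexibility the abstract points to.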


MEAT SPECIATION USING A HIERARCHICAL APPROACH AND LOGISTIC REGRESSION

  • Arnalds, Thosteinn;Fearn, Tom;Downey, Gerard
    • 한국근적외분광분석학회:학술대회논문집 (Korean Society of Near Infrared Spectroscopy: Conference Proceedings) / Korean Society of Near Infrared Spectroscopy, NIR-2001 (2001) / pp. 1152-1152 / 2001
  • Food adulteration is a serious consumer fraud and a matter of concern to food processors and regulatory agencies. A range of analytical methods has been investigated to facilitate the detection of adulterated or mislabelled foods and food ingredients, but most of these require sophisticated equipment and highly qualified staff and are time-consuming. Regulatory authorities and the food industry require a screening technique that will facilitate fast and relatively inexpensive monitoring of food products with a high level of accuracy. Near infrared spectroscopy has been investigated for its potential in a number of authenticity issues including meat speciation (McElhinney, Downey & Fearn (1999) JNIRS, 7(3), 145-154; Downey, McElhinney & Fearn (2000) Appl. Spectrosc. 54(6), 894-899). This report describes further analysis of these spectral sets using a hierarchical approach and binary decisions solved using logistic regression. The sample set comprised 230 homogenized meat samples, i.e. chicken (55), turkey (54), pork (55), beef (32) and lamb (34), purchased locally as whole cuts of meat over a 10-12 week period. NIR reflectance spectra were recorded over the wavelength range 400-2498 nm at 2 nm intervals on a NIR Systems 6500 scanning monochromator. The problem was defined as a series of binary decisions, i.e. is the meat red or white? is the red meat beef or lamb? is the white meat pork or poultry? etc. Each of these decisions was made using an individual binary logistic model based on scores derived from principal component or partial least squares (PLS1 and PLS2) analysis. The results obtained were equal to or better than previous reports using factorial discriminant analysis, K-nearest neighbours and PLS2 regression. This new approach using a combination of exploratory and logistic analyses also appears to have advantages of transparency and the use of inherent structure in the spectral data. Additionally, it allows for the use of different data transforms and multivariate regression techniques at each decision step.
