• 제목/요약/키워드: Multivariate statistical technique

검색결과 77건 처리시간 0.025초

의사결정나무를 이용한 다변량 공정관리 절차 (Multivariate process control procedure using a decision tree learning technique)

  • 정광영;이재헌
    • Journal of the Korean Data and Information Science Society
    • /
    • 제26권3호
    • /
    • pp.639-652
    • /
    • 2015
  • 현대의 제조공정은 컴퓨터의 발전과 통신 및 네트워크의 발달로 컴퓨터통합제조가 가능해졌다. 이로 인해 고품질 제품의 고속 생산공정이 확대되고, 공정에서 실시간으로 전송되는 다양한 품질변수들의 데이터 축적 또한 가능하게 되었다. 이를 관리하기 위해서는 다변량 통계적 공정관리 절차가 필요하다. 전통적으로 사용하는 다변량 관리도는 이상상태 발생시 이상신호를 주지만, 이상원인이 어떠한 변수에 어떠한 영향을 주는지에 대한 정보를 제공하지 않는다는 단점이 있다. 이를 보완하기 위해 데이터마이닝과 기계학습 기법을 이용할 수 있다. 이 논문에서는 의사결정나무 학습 기법을 이용한 다변량 공정관리 절차를 소개하고, 이변량인 경우 모의실험을 통하여 그 효율을 살펴보았다. 모의실험 결과를 살펴볼 때, 상관계수에 따라 이상상태 탐지 능력은 비슷한 것으로 나타났고, 이상상태에 대한 분류 정확도는 상관계수와 이상원인의 형태에 따라 차이가 있지만 기존의 다변량 관리도에서는 제공하지 않는 이상원인의 정보를 제공하는 장점이 있음을 알 수 있다.

EXPERIMENTAL ANALYSIS OF DRIVING PATTERNS AND FUEL ECONOMY FOR PASSENGER CARS IN SEOUL

  • Sa, J.-S.;Chung, N.-H.;Sunwoo, M.-H.
    • International Journal of Automotive Technology
    • /
    • 제4권2호
    • /
    • pp.101-108
    • /
    • 2003
  • There are a lot of factors that influence automotive fuel economy such as average trip time per kilometer, average trip speed, the number of times of vehicle stationary, and so forth. These factors depend on road conditions and traffic environment. In this study, various driving data were measured and recorded during road tests in Seoul. The accumulated road test mileage is around 1,300 kilometers. The objective of the study is to identify the driving patterns of the Seoul metropolitan area and to analyze the fuel economy based on these driving patterns. The driving data which was acquired through road tests was analysed statistically in order to obtain the driving characteristics via modal analysis, speed analysis, and speed-acceleration analysis. Moreover, the driving data was analyzed by multivariate statistical techniques including correlation analysis, principal component analysis, and multiple linear regression analysis in order to obtain the relationships between influencing factors on fuel economy. The analyzed results show that the average speed is around 29.2 km/h, and the average fuel economy is 10.23 km/L. The vehicle speed of the Seoul metropolitan area is slower, and the stop-and-go operation is more frequent than FTP-75 test mode which is used for emission and fuel economy tests. The average trip time per kilometer is one of the most important factors in fuel consumption, and the increase of the average speed is desirable for reducing emissions and fuel consumption.

Application of metabolic profiling for biomarker discovery

  • Hwang, Geum-Sook
    • 한국응용약물학회:학술대회논문집
    • /
    • 한국응용약물학회 2007년도 Proceedings of The Convention
    • /
    • pp.19-27
    • /
    • 2007
  • An important potential of metabolomics-based approach is the possibility to develop fingerprints of diseases or cellular responses to classes of compounds with known common biological effect. Such fingerprints have the potential to allow classification of disease states or compounds, to provide mechanistic information on cellular perturbations and pathways and to identify biomarkers specific for disease severity and drug efficacy. Metabolic profiles of biological fluids contain a vast array of endogenous metabolites. Changes in those profiles resulting from perturbations of the system can be observed using analytical techniques, such as NMR and MS. $^1H$ NMR was used to generate a molecular fingerprint of serum or urinary sample, and then pattern recognition technique was applied to identity molecular signatures associated with the specific diseases or drug efficiency. Several metabolites that differentiate disease samples from the control were thoroughly characterized by NMR spectroscopy. We investigated the metabolic changes in human normal and clinical samples using $^1H$ NMR. Spectral data were applied to targeted profiling and spectral binning method, and then multivariate statistical data analysis (MVDA) was used to examine in detail the modulation of small molecule candidate biomarkers. We show that targeted profiling produces robust models, generates accurate metabolite concentration data, and provides data that can be used to help understand metabolic differences between healthy and disease population. Such metabolic signatures could provide diagnostic markers for a disease state or biomarkers for drug response phenotypes.

  • PDF

PROCESS ANALYSIS OF AUTOMOTIVE PARTS USING GRAPHICAL MODELLING

  • IRIKURA Norio;KUZUYA Kazuyoshi;NISHINA Ken
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 1998년도 The 12th Asia Quality Management Symposium* Total Quality Management for Restoring Competitiveness
    • /
    • pp.295-300
    • /
    • 1998
  • Recently graphical modelling is being studied as a useful process analysis tool for exploratory causal analysis. Graphical modelling is a presentation method that uses graphs to describe statistical models of the structures of multivariate data. This paper describes an application of this graphical modeling with two cases from the automotive parts industry. One case is the unbalance problem of the pulley, an automotive generator part. There is multivariate data of the product from each of the processes which are connected in the series. By means of exploratory causal analysis between the variables using graphical modeling, the key processes which causes the variation of the final characteristics and their mechanism of the causal relationship have become clear. Another case is, also, the unbalanced problem of automotive starter parts which consists of many parts and is manufactured by complex machinery and assembling process. By means of the similar technique, the key processes are obtained easily and the results are reasonable from technical knowledge.

  • PDF

A LOOK FOR DESIGN FACTORS OF PACKAGES BY MULTIVARIATE ANALYSIS METHODS

  • Yamarai Yasushi;Ihara Masamori
    • 한국품질경영학회:학술대회논문집
    • /
    • 한국품질경영학회 1998년도 The 12th Asia Quality Management Symposium* Total Quality Management for Restoring Competitiveness
    • /
    • pp.316-321
    • /
    • 1998
  • In order to detect causal relationships between latent traits of sensual impressions for a color and physical characteristics constructing it, it is a common practice first to extract latent factors by a factor analysis method and secondly to clarify the causal relationships by a regression analysis method. This paper presents a multivariate statistical technique to detect the influence of the physical characteristics to the latent factors simultaneously which treats the physical characteristics as experimental factors in a $L_{27}$ factorial design and analysis the effects of the factors to the latent trait scores by an ANOVA.

  • PDF

인공 신경망의 패턴분석에 근거한 지능적 부품품질 관리시스템의 설계 (Design of Intelligent Material Quality Control System based on Pattern Analysis using Artificial Neural Network)

  • 이장희;유성진;박상찬
    • 품질경영학회지
    • /
    • 제29권4호
    • /
    • pp.38-53
    • /
    • 2001
  • In resolving industrial quality control problems, a vector of multiple quality characteristic variables is involved rather than a single variable. However, it is not guaranteed that a multivariate control chart based on statistical methods can monitor abnormal signal in case that small changes of relationship between each variables causes abnormal production process. Hence a quality control system for real-time monitoring of the multi-dimensional quality characteristic vector under a multivariate normal process is needed to enhance tile production system quality performance. A pattern analysis approach based on self-organizing map (SOM), an unsupervised learning technique of neural network, is applied to the design of such a quality control system. In this study we present a new material quality control system based on pattern analysis approach and illustrate the effectiveness of proposed system using actual electronic company material data.

  • PDF

투사지향방법에 의한 판별분석의 모의실험분석 (A simulation study on projection pursuit discriminant analysis)

  • 안윤기;이성석
    • 응용통계연구
    • /
    • 제5권1호
    • /
    • pp.103-111
    • /
    • 1992
  • 다변량 통계분석기법중 하나로 제기된 투사지향방법은 다변량자료를 관심있는 일차원 또는 이차원의 자료로의 선형투사를 찾아 나가는 방법이다. 이 방법은 다변량 자료가 갖는 차원의 문제를 해결해 줄 수 있는 유용한 기법으로 제시되었다. 본 연구에서는 투사지향방법을 이용하여 추정한 다변량 확률밀도함수를 사용한 새로운 비모수적인 판별분석방법을 제시하고, 이를 기존의 모수적 판별분석방법중 실제적으로 많이 사용되는 선형판별함수방법, 그리고 기존의 비모수적 판별분석방법중 계산상의 편리성이 많은 K-최인접방법과 컴퓨터 시뮬레이션을 통하여 비교분석하였다.

  • PDF

식생이 무성한 지역에서의 Principal Component Analysis 에 의한 Landsat TM 자료의 광역지질도 작성 (Regional Geological Mapping by Principal Component Analysis of the Landsat TM Data in a Heavily Vegetated Area)

  • 朴鍾南;徐延熙
    • 대한원격탐사학회지
    • /
    • 제4권1호
    • /
    • pp.49-60
    • /
    • 1988
  • Principal Component Analysis (PCA) was applied for regional geological mapping to a multivariate data set of the Landsat TM data in the heavily vegetated and topographically rugged Chungju area. The multivariate data set selection was made by statistical analysis based on the magnitude of regression of squares in multiple regression, and it includes R1/2/R3/4, R2/3, R5/7/R4/3, R1/2, R3/4. R4/3. AND R4/5. As a result of application of PCA, some of later principal components (in this study PC 3 and PC 5) are geologically more significant than earlier major components, PC 1 and PC 2 herein. The earlier two major components which comprise 96% of the total information of the data set, mainly represent reflectance of vegetation and topographic effects, while though the rest represent 3% of the total information which statistically indicates the information unstable, geological significance of PC3 and PC5 in the study implies that application of the technique in more favorable areas should lead to much better results.

한방 처방의 통계적 연구( II ) -인삼배합 한방처방의 통계적 연구- (Statistical Studies on the Formularies of Oriental Medicine(II) -Statistical Analyses of Ginseng Prescription-)

  • 홍문화
    • 생약학회지
    • /
    • 제3권4호
    • /
    • pp.187-197
    • /
    • 1972
  • In spite of the fact that the system of oriental medicine still remains in the realm of 'unproven-method of treatment', no one can deny that the oriental medicine is a rich source of idea and motivation for the discovery of new drug from natural sources. However, non-scientific, mystic hypothetical system of oriental medicine refuses to be revealed scientifically. For the purpose of drawing useful parameters for inductive reasoning of the system, a new approach which comprises statistical analyses of prescription was attempted in this study. One hundred and thirty two ginseng-compounds prescription in 'Bang-Yak-Hap-Pyon', one of the most popular formularies of oriental medicine in Korea, were analysed by multivariate analysis technique. The results revealed ginseng from many points of view, e.g., therapeutic indications, dose, and compatibility, etc. Among these, the most striking coincidence with scientific achievements of modern pharmacology, is the fact that the oriental medicine has characterized ginseng already from remote ancient times as neither a specific curative nor an aphrodisiac, but a non-specific adaptogenic drug for general infirmity.

  • PDF

Prediction of ultimate load capacity of concrete-filled steel tube columns using multivariate adaptive regression splines (MARS)

  • Avci-Karatas, Cigdem
    • Steel and Composite Structures
    • /
    • 제33권4호
    • /
    • pp.583-594
    • /
    • 2019
  • In the areas highly exposed to earthquakes, concrete-filled steel tube columns (CFSTCs) are known to provide superior structural aspects such as (i) high strength for good seismic performance (ii) high ductility (iii) enhanced energy absorption (iv) confining pressure to concrete, (v) high section modulus, etc. Numerous studies were reported on behavior of CFSTCs under axial compression loadings. This paper presents an analytical model to predict ultimate load capacity of CFSTCs with circular sections under axial load by using multivariate adaptive regression splines (MARS). MARS is a nonlinear and non-parametric regression methodology. After careful study of literature, 150 comprehensive experimental data presented in the previous studies were examined to prepare a data set and the dependent variables such as geometrical and mechanical properties of circular CFST system have been identified. Basically, MARS model establishes a relation between predictors and dependent variables. Separate regression lines can be formed through the concept of divide and conquers strategy. About 70% of the consolidated data has been used for development of model and the rest of the data has been used for validation of the model. Proper care has been taken such that the input data consists of all ranges of variables. From the studies, it is noted that the predicted ultimate axial load capacity of CFSTCs is found to match with the corresponding experimental observations of literature.