• Title/Summary/Keyword: 주성분 회귀 모델

Search Result 43, Processing Time 0.034 seconds

Performance Comparison of Data Mining Approaches for Prediction Models of Near Infrared Spectroscopy Data (근적외선 분광 데이터 예측 모형을 위한 데이터 마이닝 기법의 성능비교)

  • Baek, Seung Hyun
    • Journal of the Korea Safety Management & Science
    • /
    • v.15 no.4
    • /
    • pp.311-315
    • /
    • 2013
  • 본 논문에서는 주성분 회귀법과 부분최소자승 회귀법을 비교하여 보여준다. 이 비교의 목적은 선형형태를 보유한 근적외선 분광 데이터의 분석에 사용할 수 있는 적합한 예측 방법을 찾기 위해서이다. 두 가지 데이터 마이닝 방법론인 주성분 회귀법과 부분최소자승 회귀법이 비교되어 질 것이다. 본 논문에서는 부분최소자승 회귀법은 주성분 회귀법과 비교했을 때 약간 나은 예측능력을 가진 결과를 보여준다. 주성분 회귀법에서 50개의 주성분이 모델을 생성하기 위해서 사용지만 부분최소자승 회귀법에서는 12개의 잠재요소가 사용되었다. 평균제곱오차가 예측능력을 측정하는 도구로 사용되었다. 본 논문의 근적외선 분광데이터 분석에 따르면 부분최소자승회귀법이 선형경향을 가진 데이터의 예측에 가장 적합한 모델로 판명되었다.

Analysis on Correlation between AE Parameters and Stress Intensity Factor using Principal Component Regression and Artificial Neural Network (주성분 회귀분석 및 인공신경망을 이용한 AE변수와 응력확대계수와의 상관관계 해석)

  • Kim, Ki-Bok;Yoon, Dong-Jin;Jeong, Jung-Chae;Park, Phi-Iip;Lee, Seung-Seok
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.21 no.1
    • /
    • pp.80-90
    • /
    • 2001
  • The aim of this study is to develop the methodology which enables to identify the mechanical properties of element such as stress intensity factor by using the AE parameters. Considering the multivariate and nonlinear properties of AE parameters such as ringdown count, rise time, energy, event duration and peak amplitude from fatigue cracks of machine element the principal component regression(PCR) and artificial neural network(ANN) models for the estimation of stress intensity factor were developed and validated. The AE parameters were found to be very significant to estimate the stress intensity factor. Since the statistical values including correlation coefficients, standard mr of calibration, standard error of prediction and bias were stable, the PCR and ANN models for stress intensity factor were very robust. The performance of ANN model for unknown data of stress intensity factor was better than that of PCR model.

  • PDF

Design of Regression Model and Pattern Classifier by Using Principal Component Analysis (주성분 분석법을 이용한 회귀다항식 기반 모델 및 패턴 분류기 설계)

  • Roh, Seok-Beom;Lee, Dong-Yoon
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.6
    • /
    • pp.594-600
    • /
    • 2017
  • The new design methodology of prediction model and pattern classification, which is based on the dimension reduction algorithm called principal component analysis, is introduced in this paper. Principal component analysis is one of dimension reduction techniques which are used to reduce the dimension of the input space and extract some good features from the original input variables. The extracted input variables are applied to the prediction model and pattern classifier as the input variables. The introduced prediction model and pattern classifier are based on the very simple regression which is the key point of the paper. The structural simplicity of the prediction model and pattern classifier leads to reducing the over-fitting problem. In order to validate the proposed prediction model and pattern classifier, several machine learning data sets are used.

A Study on the Prediction of Fuel Consumption of a Ship Using the Principal Component Analysis (주성분 분석기법을 이용한 선박의 연료소비 예측에 관한 연구)

  • Kim, Young-Rong;Kim, Gujong;Park, Jun-Bum
    • Journal of Navigation and Port Research
    • /
    • v.43 no.6
    • /
    • pp.335-343
    • /
    • 2019
  • As the regulations of ship exhaust gas have been strengthened recently, many measures are under consideration to reduce fuel consumption. Among them, research has been performed actively to develop a machine-learning model that predicts fuel consumption by using data collected from ships. However, many studies have not considered the methodology of the main parameter selection for the model or the processing of the collected data sufficiently, and the reckless use of data may cause problems such as multicollinearity between variables. In this study, we propose a method to predict the fuel consumption of the ship by using the principal component analysis to solve these problems. The principal component analysis was performed on the operational data of the 13K TEU container ship and the fuel consumption prediction model was implemented by regression analysis with extracted components. As the R-squared value of the model for the test data was 82.99%, this model would be expected to support the decision-making of operators in the voyage planning and contribute to the monitoring of energy-efficient operation of ships during voyages.

Simultaneous Determination of Anionic and Nonionic Surfactants Using Multivariate Calibration Method (다변량 분석법에 의한 Anionic Surfactant와 Nonionic Surfactant의 동시정량)

  • Sang Hak Lee;Soon Nam Kwon;Bum Mok Son
    • Journal of the Korean Chemical Society
    • /
    • v.47 no.1
    • /
    • pp.19-25
    • /
    • 2003
  • A spectrophotometric method for the simultaneous determination of anionic and nonionic surfactant based on the application of multivariate calibration method such as principal component regression(PCR) and partial least squares(PLS) has been studied. The calibration models in PCR and PLS were obtained from the spectral data in the range of 400~700 nm for each standard of a calibration set of 26 standards, each containing different amounts of two surfactants. The relative standard error of prediction(RSEP$_{\alpha}$) was obtained to assess the model goodness in quantifying each analyte in a 5 validation samples which containing different amounts of two surfactants.

Traffic Volume Dependent Displacement Estimation Model for Gwangan Bridge Using Monitoring Big Data (교량 모니터링 빅데이터를 이용한 광안대교의 교통량 의존 변위 추정 모델)

  • Park, Ji Hyun;Shin, Sung Woo;Kim, Soo Yong
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.38 no.2
    • /
    • pp.183-191
    • /
    • 2018
  • In this study a traffic volume dependent displacement estimation model for Gwangan Bridge was developed using bridge monitoring big data. Traffic volume data for four different vehicle types and the vertical displacement data in the central position of the Gwangan Bridge were used to develop and validate the estimation model. Two statistical estimation models were developed using multiple regression analysis (MRA) and principal component analysis (PCA). Estimation performance of those two models were compared with actual values. The results show that both the MRA and the PCA based models are successfully estimating the vertical displacement of Gwangan Bridge. Based on the results, it is concluded that the developed model can effectively be used to predict the traffic volume dependent displacement behavior of Gwangan Bridge.

Improving Estimation Ability of Software Development Effort Using Principle Component Analysis (주성분분석을 이용한 소프트웨어 개발노력 추정능력 향상)

  • Lee, Sang-Un
    • The KIPS Transactions:PartD
    • /
    • v.9D no.1
    • /
    • pp.75-80
    • /
    • 2002
  • Putnam develops SLIM (Software LIfecycle Management) model based upon the assumption that the manpower utilization during software project development is followed by a Rayleigh distribution. To obtain the manpower distribution, we have to be estimate the total development effort and difficulty ratio parameter. We need a way to accurately estimate these parameters early in the requirements and specification phase before investment decisions have to be made. Statistical tests show that system attributes are highly correlation (redundant) so that Putnam discards one and get a parameter estimator from the other attributes. But, different statistical method has different system attributes and presents different performance. To select the principle system attributes, this paper uses the principle component analysis (PCA) instead of Putnam's method. The PCA's results improve a 9.85 percent performance more than the Putnam's result. Also, this model seems to be simple and easily realize.

Degradation-Based Remaining Useful Life Analysis for Predictive Maintenance in a Steel Galvanizing Kettle (철강 도금로의 예지보전을 위한 열화 기반 잔존수명 분석)

  • Shin, Joon Ho;Kim, Chang Ouk
    • Journal of the Korea Convergence Society
    • /
    • v.10 no.12
    • /
    • pp.271-280
    • /
    • 2019
  • Smart factory, a critical part of digital transformation, enables data-driven decision making using monitoring, analysis and prediction. Predictive maintenance is a key element of smart factory and the need is increasing. The purpose of this study is to analyze the degradation characteristics of a galvanizing kettle for the steel plating process and to predict the remaining useful life(RUL) for predictive maintenance. Correlation analysis, multiple regression, principal component regression were used for analyzing factors of the process. To identify the trend of degradation, a proposed rolling window was used. It was observed the degradation trend was dependent on environmental temperature as well as production factors. It is expected that the proposed method in this study will be an example to identify the trend of degradation of the facility and enable more consistent predictive maintenance.

Analysis and Classification of Acoustic Emission Signals During Wood Drying Using the Principal Component Analysis (주성분 분석을 이용한 목재 건조 중 발생하는 음향방출 신호의 해석 및 분류)

  • Kang, Ho-Yang;Kim, Ki-Bok
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.23 no.3
    • /
    • pp.254-262
    • /
    • 2003
  • In this study, acoustic emission (AE) signals due to surface cracking and moisture movement in the flat-sawn boards of oak (Quercus Variablilis) during drying under the ambient conditions were analyzed and classified using the principal component analysis. The AE signals corresponding to surface cracking showed higher in peak amplitude and peak frequency, and shorter in rise time than those corresponding to moisture movement. To reduce the multicollinearity among AE features and to extract the significant AE parameters, correlation analysis was performed. Over 99% of the variance of AE parameters could be accounted for by the first to the fourth principal components. The classification feasibility and success rate were investigated in terms of two statistical classifiers having six independent variables (AE parameters) and six principal components. As a result, the statistical classifier having AE parameters showed the success rate of 70.0%. The statistical classifier having principal components showed the success rate of 87.5% which was considerably than that of the statistical classifier having AE parameters.