• Title/Summary/Keyword: PLS regression (PLS 회귀)

Visualizing (X,Y) Data by Partial Least Squares Method (PLS 기법에 의한 (X,Y) 자료의 시각화)

  • Huh, Myung-Hoe;Lee, Yong-Goo;Yi, Seong-Keun
    • The Korean Journal of Applied Statistics / v.20 no.2 / pp.345-355 / 2007
  • PLS methods are suited to regressing q-variate Y variables on p-variate X variables even when the X variables are multicollinear. Consequently, they are useful for analyzing datasets with fewer observations than variables, such as near-infrared (NIR) spectroscopy data in chemometrics. In this study, we propose two methods for visualizing the p-variate X variables and the q-variate Y variables that can be used in connection with PLS analysis.
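
The claim that PLS remains usable when variables outnumber observations can be illustrated with a minimal sketch. It assumes NumPy, a single response (PLS1), and simulated data; the function name `pls1_fit` and all numbers are hypothetical, and this is a textbook NIPALS-style fit, not the visualization methods proposed in the paper.

```python
import numpy as np

def pls1_fit(X, y, n_components):
    """Fit single-response PLS (NIPALS-style PLS1) on centered X and y.

    Returns beta such that the fitted values are X @ beta.
    """
    Xr, yr = X.copy(), y.copy()            # residual (deflated) copies
    W, P, q = [], [], []
    for _ in range(n_components):
        w = Xr.T @ yr
        w = w / np.linalg.norm(w)          # weight: direction of maximal covariance with y
        t = Xr @ w                         # score vector
        tt = t @ t
        p = Xr.T @ t / tt                  # X loading
        c = (yr @ t) / tt                  # y loading
        Xr = Xr - np.outer(t, p)           # deflate X
        yr = yr - c * t                    # deflate y
        W.append(w)
        P.append(p)
        q.append(c)
    W, P, q = np.column_stack(W), np.column_stack(P), np.array(q)
    return W @ np.linalg.solve(P.T @ W, q)

# Simulated data (an assumption): 8 observations, 30 collinear predictors
# driven by 2 latent factors, so X'X is singular and OLS is not unique.
rng = np.random.default_rng(0)
n, p = 8, 30
T = rng.normal(size=(n, 2))
X = T @ rng.normal(size=(2, p)) + 0.05 * rng.normal(size=(n, p))
y = T @ np.array([1.0, -0.5]) + 0.05 * rng.normal(size=n)
X, y = X - X.mean(axis=0), y - y.mean()

# Residual sum of squares for 1, 2 and 3 PLS components.
rss = [float(np.sum((y - X @ pls1_fit(X, y, m)) ** 2)) for m in (1, 2, 3)]
print("rank of X'X:", np.linalg.matrix_rank(X.T @ X), "of", p)
print("RSS by number of components:", rss)
```

The number of components is the tuning parameter here; in practice it would be chosen by cross-validation rather than by the training residuals printed above.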

Moisture Content Measurement Technique for Powdered Food using NIR Reflectance Spectroscopy (근적외선 반사스펙트럼을 이용한 분말식품의 저함수율 측정 기술)

  • 모창연;노상하
    • Proceedings of the Korean Society for Agricultural Machinery Conference / 2002.02a / pp.444-449 / 2002
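  • 1. The absorbance of the samples tended to increase over the entire wavelength range as particle size increased. In the visible range, color had a greater effect than particle size. 2. The correlation between sample moisture content and absorbance at each wavelength was weak in the 550-950 nm range, with correlation coefficients of 0.53 or less, and high near 1430 nm, a water absorption band, at 0.85-0.87. 3. Using the reflectance spectra of each sample, moisture content prediction models based on PLS regression and MLR were developed for three wavelength ranges. For all samples, the PLS regression model predicted better than the MLR model. 4. An analysis of preprocessing effects in the PLS regression model indicated that preprocessing such as smoothing, derivatives, MSC, and SNV is needed to correct for differences in absorbance caused by sample particle size. 5. For PLS regression models developed to predict moisture content across all samples, predictive performance was $R^2$=0.9986, SEP=0.2166 in the 400-2500 nm range; $R^2$=0.9985, SEP=0.2233 in the 900-1700 nm range; and $R^2$=0.9838, SEP=0.7405 in the 550-950 nm range. When regression models were developed separately for each sample type, SEP was smaller than in the results above. 6. These results suggest that, with real-time spectrometers currently on the market, an online moisture content measuring device that is independent of sample particle size can be developed.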
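
Of the preprocessing steps named in item 4, SNV (standard normal variate) is the simplest to sketch: each spectrum is centered and scaled by its own mean and standard deviation. The absorbance values below are hypothetical and the offset model for the particle-size effect is an assumption made for illustration, not data from the study.

```python
from statistics import mean, stdev

def snv(spectrum):
    """Standard normal variate: center and scale one spectrum by its own statistics."""
    m = mean(spectrum)
    s = stdev(spectrum)          # sample standard deviation (n - 1 denominator)
    return [(a - m) / s for a in spectrum]

# Hypothetical absorbance values. The second spectrum is the first with a
# multiplicative and an additive offset, mimicking a particle-size effect.
fine = [0.20, 0.25, 0.40, 0.90, 0.45, 0.30]
coarse = [1.5 * a + 0.1 for a in fine]

fine_snv = snv(fine)
coarse_snv = snv(coarse)

# After SNV the two spectra coincide up to floating-point rounding.
print(max(abs(a - b) for a, b in zip(fine_snv, coarse_snv)))
```

SNV removes only per-spectrum offset and scale; wavelength-dependent scatter would still call for MSC or derivative preprocessing.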

AI Technology Analysis using Partial Least Square Regression

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information / v.25 no.3 / pp.109-115 / 2020
  • In this paper, we propose an artificial intelligence (AI) technology analysis using a partial least squares (PLS) regression model. AI technology now affects most areas of society, so it is necessary to understand it. To analyze AI technology, we collect AI-related patent documents from patent databases worldwide and extract AI technology keywords from them using text mining techniques. We then analyze the AI keyword data with a PLS regression model. This model is based on the partial least squares technique used in advanced analyses in fields such as bioinformatics, social science, and engineering. To show the performance of the proposed method, we conduct experiments on AI patent documents and illustrate how our research can be applied to real problems. The approach is applicable not only to AI technology but also to other technological fields, and contributes to understanding various other technologies through PLS regression analysis.

Predicting Future Terrestrial Vegetation Productivity Using PLS Regression (PLS 회귀분석을 이용한 미래 육상 식생의 생산성 예측)

  • CHOI, Chul-Hyun;PARK, Kyung-Hun;JUNG, Sung-Gwan
    • Journal of the Korean Association of Geographic Information Studies / v.20 no.1 / pp.42-55 / 2017
  • Since the phases and patterns of the climate adaptability of vegetation can differ greatly from region to region, an intensive pixel-scale approach is required. In this study, Partial Least Squares (PLS) regression on a satellite image-based vegetation index is conducted to assess the effect of climate factors on vegetation productivity and to predict the future productivity of forest vegetation in South Korea. The results indicate that the mean temperature of the wettest quarter (Bio8), the mean temperature of the driest quarter (Bio9), and the precipitation of the driest month (Bio14) had the greatest influence on vegetation productivity. The predicted 2050 EVI under the future climate change scenario declined on average, especially in high-elevation zones. The results of this study can be used in monitoring the productivity of climate-sensitive vegetation and in estimating changes in forest carbon storage under climate change.

Expressions for Shrinkage Factors of PLS Estimator

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society / v.17 no.4 / pp.1169-1180 / 2006
  • Partial least squares regression (PLS) is a biased, non-least squares regression method and is an alternative to ordinary least squares regression (OLS) when predictors are highly collinear or outnumber observations. One way to understand the properties of biased regression methods is to examine how their estimators shrink the OLS estimator. In this paper, we introduce an expression for the shrinkage factor of PLS, develop a new shrinkage expression, and then prove the equivalence of the two representations. We use two near-infrared (NIR) data sets to show the general behavior of the shrinkage and, in particular, the eigendirections along which PLS expands the OLS coefficients.
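
The idea of a shrinkage factor can be checked numerically. The sketch below is not the paper's expressions: it simply expresses each estimator in the eigenvector basis of X'X and divides by the corresponding OLS component. It assumes NumPy, simulated data, a single response, and the well-known Krylov-subspace form of PLS1; the function names and the ridge constant are hypothetical.

```python
import numpy as np

def pls1_krylov(X, y, m):
    """m-component PLS1 coefficients via the Krylov-subspace form (centered X, y)."""
    S, s = X.T @ X, X.T @ y
    basis, k = [], s / np.linalg.norm(s)
    for _ in range(m):
        basis.append(k)
        k = S @ k
        k = k / np.linalg.norm(k)          # rescaling only; the spanned space is unchanged
    Q, _ = np.linalg.qr(np.column_stack(basis))
    return Q @ np.linalg.solve(Q.T @ S @ Q, Q.T @ s)

def shrinkage_factors(X, y, beta):
    """Eigenvalues of X'X and the ratio of beta to the OLS estimator along each eigenvector."""
    S = X.T @ X
    lam, V = np.linalg.eigh(S)
    beta_ols = np.linalg.solve(S, X.T @ y)
    return lam, (V.T @ beta) / (V.T @ beta_ols)

# Simulated, full-rank data (an assumption) so that OLS exists for comparison.
rng = np.random.default_rng(0)
X = rng.normal(size=(30, 4))
y = X @ np.array([1.0, -2.0, 0.5, 3.0]) + 0.1 * rng.normal(size=30)
X, y = X - X.mean(axis=0), y - y.mean()

lam, f_pls1 = shrinkage_factors(X, y, pls1_krylov(X, y, 1))
_, f_pls4 = shrinkage_factors(X, y, pls1_krylov(X, y, 4))

k_ridge = 5.0                                # hypothetical ridge constant
beta_ridge = np.linalg.solve(X.T @ X + k_ridge * np.eye(4), X.T @ y)
_, f_ridge = shrinkage_factors(X, y, beta_ridge)

print("eigenvalues:        ", lam)
print("PLS, 1 component:   ", f_pls1)
print("PLS, 4 components:  ", f_pls4)
print("ridge:              ", f_ridge)
```

Ridge factors equal λ/(λ + k) and always lie below 1, whereas the one-component PLS factor along the largest eigendirection exceeds 1, which is the "expansion" the abstract refers to. With as many components as predictors, PLS coincides with OLS and every factor is 1.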

Simultaneous Determination of Anionic and Nonionic Surfactants Using Multivariate Calibration Method (다변량 분석법에 의한 Anionic Surfactant와 Nonionic Surfactant의 동시정량)

  • Sang Hak Lee;Soon Nam Kwon;Bum Mok Son
    • Journal of the Korean Chemical Society / v.47 no.1 / pp.19-25 / 2003
  • A spectrophotometric method for the simultaneous determination of anionic and nonionic surfactants, based on multivariate calibration methods such as principal component regression (PCR) and partial least squares (PLS), has been studied. The PCR and PLS calibration models were obtained from spectral data in the range of 400-700 nm for a calibration set of 26 standards, each containing different amounts of the two surfactants. The relative standard error of prediction (RSEP$_{\alpha}$) was used to assess how well the models quantify each analyte in 5 validation samples containing different amounts of the two surfactants.
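
The abstract does not state the formula for the relative standard error of prediction. A minimal sketch, assuming one definition commonly used in multivariate calibration, RSEP(%) = 100 · sqrt(Σ(ĉᵢ − cᵢ)² / Σcᵢ²), is given below; the function name and the five validation concentrations are hypothetical.

```python
import math

def rsep(predicted, reference):
    """Relative standard error of prediction in percent (assumed definition)."""
    num = sum((p - r) ** 2 for p, r in zip(predicted, reference))
    den = sum(r ** 2 for r in reference)
    return 100.0 * math.sqrt(num / den)

# Hypothetical reference and predicted concentrations for 5 validation samples.
reference = [1.0, 2.0, 3.0, 4.0, 5.0]
predicted = [1.1, 1.9, 3.2, 3.9, 5.1]

print(f"RSEP = {rsep(predicted, reference):.2f} %")
```

Computing this per analyte gives one figure for each surfactant, so the two calibration methods can be compared analyte by analyte.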

Predicting Site Quality by Partial Least Squares Regression Using Site and Soil Attributes in Quercus mongolica Stands (신갈나무 임분의 입지 및 토양 속성을 이용한 부분최소제곱 회귀의 지위추정 모형)

  • Choonsig Kim;Gyeongwon Baek;Sang Hoon Chung;Jaehong Hwang;Sang Tae Lee
    • Journal of Korean Society of Forest Science / v.112 no.1 / pp.23-31 / 2023
  • Predicting forest productivity is essential to evaluate sustainable forest management or to enhance forest ecosystem services. Ordinary least squares (OLS) and partial least squares (PLS) regression models were used to develop predictive models for forest productivity (site index) from the site characteristics and soil profile, along with soil physical and chemical properties, of 112 Quercus mongolica stands. The adjusted coefficients of determination (adjusted $R^2$) in the regression models were higher for the site characteristics and soil profile of B horizon ($R^2$=0.32) and of A horizon ($R^2$=0.29) than for the soil physical and chemical properties of B horizon ($R^2$=0.21) and A horizon ($R^2$=0.09). The PLS models ($R^2$=0.20-0.32) were better predictors of site index than the OLS models ($R^2$=0.09-0.31). These results suggest that the regression models for Q. mongolica can be applied to predict the forest productivity, but new variables may need to be developed to enhance the explanatory power of regression models.
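
For reference, the adjusted coefficient of determination penalizes the ordinary $R^2$ for the number of predictors. A minimal sketch of the standard formula follows; only the sample size of 112 comes from the abstract, while the raw $R^2$ and the predictor count are hypothetical.

```python
def adjusted_r2(r2, n_obs, n_predictors):
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - p - 1)."""
    return 1.0 - (1.0 - r2) * (n_obs - 1) / (n_obs - n_predictors - 1)

# 112 stands as in the abstract; R^2 = 0.35 and 6 predictors are made-up inputs.
print(adjusted_r2(0.35, 112, 6))
```

The adjustment grows with the number of predictors, which matters when comparing models built from different sets of site and soil attributes.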

Shrinkage Structure of Ridge Partial Least Squares Regression

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society / v.18 no.2 / pp.327-344 / 2007
  • Ridge partial least squares regression (RPLS) is a regression method obtained by combining ridge regression and partial least squares regression, and is intended to provide better predictive ability and to be less sensitive to overfitting. In this paper, explicit expressions for the shrinkage factor of RPLS are developed. Using a near-infrared data set, the structure of the shrinkage factor is explored and compared with those of other biased regression methods, such as ridge regression, principal component regression, ridge principal component regression, and partial least squares regression.

Simultaneous Determination of Tryptophan and Tyrosine by Spectrofluorimetry Using Multivariate Calibration Method (다변량 분석법을 이용한 Tryptophan과 Tyrosine의 형광분광법적 정량)

  • Lee, Sang-Hak;Park, Ju-Eun;Son, Beom-Mok
    • Journal of the Korean Chemical Society / v.46 no.4 / pp.309-317 / 2002
  • A spectrofluorimetric method for the simultaneous determination of amino acids (tryptophan and tyrosine), based on applying multivariate calibration methods such as principal component regression (PCR) and partial least squares (PLS) to luminescence measurements, has been studied. Emission spectra of synthetic mixtures of the two amino acids were obtained at an excitation wavelength of 257 nm. The PCR and PLS calibration models were obtained from spectral data in the range of 280-500 nm for a calibration set of 32 standards, each containing different amounts of the two amino acids. The relative standard error of prediction ($RSEP_a$) was used to assess how well the models quantify each analyte in a validation set. The overall relative standard error of prediction ($RSEP_m$) for the mixture, obtained from a validation set of 6 independent mixtures, was also used to validate the method.

Variable Selection in PLS Regression with Penalty Function (벌점함수를 이용한 부분최소제곱 회귀모형에서의 변수선택)

  • Park, Chong-Sun;Moon, Guy-Jong
    • Communications for Statistical Applications and Methods / v.15 no.4 / pp.633-642 / 2008
  • A variable selection algorithm for partial least squares regression using a penalty function is proposed. We use the fact that the usual partial least squares regression problem can be expressed as a maximization problem with appropriate constraints, and we add a penalty function to this maximization problem. A simulated annealing algorithm can then be used to search for optimal solutions of the penalized maximization problem. The HARD penalty function is suggested as the best in several respects. Illustrations with real and simulated examples are provided.
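
The maximization view of PLS mentioned above can be sketched for the first component: among unit-length weight vectors w, the one proportional to X'y maximizes the squared covariance between Xw and y. The check below uses hypothetical toy data and compares that weight vector with random unit vectors. It does not implement the penalty functions or the simulated annealing search, which the abstract does not specify.

```python
import math
import random

def center(col):
    """Subtract the column mean."""
    m = sum(col) / len(col)
    return [v - m for v in col]

# Hypothetical toy data: 6 observations, 3 predictors (stored column-wise).
cols = [center(c) for c in ([1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
                            [2.0, 1.0, 4.0, 3.0, 6.0, 5.0],
                            [0.5, 0.1, 0.4, 0.2, 0.3, 0.6])]
y = center([1.1, 1.9, 3.2, 3.8, 5.1, 6.0])

# s = X'y; (w's)^2 is the squared covariance of Xw and y up to a constant factor.
s = [sum(x * v for x, v in zip(col, y)) for col in cols]
norm_s = math.sqrt(sum(v * v for v in s))

def objective(w):
    """Unpenalized first-component PLS objective for a weight vector w."""
    return sum(wi * si for wi, si in zip(w, s)) ** 2

w_pls = [v / norm_s for v in s]            # first PLS weight vector

def random_unit_vector(p):
    v = [random.gauss(0.0, 1.0) for _ in range(p)]
    n = math.sqrt(sum(a * a for a in v))
    return [a / n for a in v]

random.seed(0)
best_random = max(objective(random_unit_vector(3)) for _ in range(1000))

print("objective at PLS weights:   ", objective(w_pls))
print("best of 1000 random weights:", best_random)
```

A penalized variant would subtract a penalty term from `objective` to push some weights toward zero. At that point the closed-form maximizer no longer applies, which is where a stochastic search such as simulated annealing comes in.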