• Title/Summary/Keyword: PLS Regression

Search Result 175, Processing Time 0.032 seconds

Combining Ridge Regression and Latent Variable Regression

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.18 no.1
    • /
    • pp.51-61
    • /
    • 2007
  • Ridge regression (RR), principal component regression (PCR) and partial least squares regression (PLS) are among popular regression methods for collinear data. While RR adds a small quantity called ridge constant to the diagonal of X'X to stabilize the matrix inversion and regression coefficients, PCR and PLS use latent variables derived from original variables to circumvent the collinearity problem. One problem of PCR and PLS is that they are very sensitive to overfitting. A new regression method is presented by combining RR and PCR and PLS, respectively, in a unified manner. It is intended to provide better predictive ability and improved stability for regression models. A real-world data from NIR spectroscopy is used to investigate the performance of the newly developed regression method.

  • PDF

Expressions for Shrinkage Factors of PLS Estimator

  • Kim, Jong-Duk
    • Journal of the Korean Data and Information Science Society
    • /
    • v.17 no.4
    • /
    • pp.1169-1180
    • /
    • 2006
  • Partial least squares regression (PLS) is a biased, non-least squares regression method and is an alternative to the ordinary least squares regression (OLS) when predictors are highly collinear or predictors outnumber observations. One way to understand the properties of biased regression methods is to know how the estimators shrink the OLS estimator. In this paper, we introduce an expression for the shrinkage factor of PLS and develop a new shrinkage expression, and then prove the equivalence of the two representations. We use two near-infrared (NIR) data sets to show general behavior of the shrinkage and in particular for what eigendirections PLS expands the OLS coefficients.

  • PDF

SELECTION OF WAELENGTH REGION FOR PLS BRIX CALIBRATION OF MANGO BY MLR METHOD

  • Sarawong, Sirinnapa;Sornsrivichai, Jinda;Kawano, Sumio
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1625-1625
    • /
    • 2001
  • The calibration equations for Brix value determination of intact mango were developed using the NIR spectra in a short wavelength region from 700 to 1100 nm. Multiple linear regression (MLR) and partial least square regression (PLS) was used for the calibration. It was found that the best wavelength region for PLS calibration from 900 to 1000 nm was similar to the wavelength region selected by MLR from 906 nm to 996 nm. Both MLR and selected region PLS provided sufficiently accurate prediction equations for Brix determination of intact mango. For MLR, the prediction results were SEP = 0.45 Brix and Bias = -0.04 Brix while PLS prediction results were SEP : 0.46 Brix and Bias = -0.2 Brix. It was concluded that MLR and PLS would have similar abilities in making calibration equation for Brix determination of intact mango if the appropriate wavelengths or wavelength region were selected. The appropriate wavelength region for PLS regression could be assumed by using the wavelength region selected by MLR in place of random selection, The relationship between calibration results of MLR and PLS regression is discussed.

  • PDF

Non-linear PLS based on non-linear principal component analysis and neural network (비선형 주성분해석과 신경망에 기반한 비선형 PLS)

  • 손정현;정신호;송상옥;윤인섭
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2000.10a
    • /
    • pp.394-394
    • /
    • 2000
  • This Paper proposes a new nonlinear partial least square method that extends the linear PLS. Proposed nonlinear PLS uses self-organizing feature map as PLS outer relation and multilayer neural network as PLS inner regression method.

  • PDF

Analysis of Success Factors for Mobile Commerce using Text Mining and PLS Regression

  • Kim, Yong-Hwan;Kim, Ja-Hee;Park, Ji hoon;Lee, Seung-Jun
    • Journal of the Korea Society of Computer and Information
    • /
    • v.21 no.11
    • /
    • pp.127-134
    • /
    • 2016
  • In this paper, we propose factors that influence on the mobile commerce satisfaction conducted by data mining and a PLS regression analysis. We extracted the most frequent words from mobile application reviews in which there are a large number of user's requests. We employed the content analysis to condense the large number of texts. We took a survey with the categories by which data are condensed and specified as factors that influence on the mobile commerce satisfaction. To avoid multicollinearity, we employed a PLS regression analysis instead of using a multiple regression analysis. Discovered factors that are potential consequences of customer satisfaction from direct requests by customers, the result may be an appropriate indicator for the mobile commerce market to improve its services.

AI Technology Analysis using Partial Least Square Regression

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.3
    • /
    • pp.109-115
    • /
    • 2020
  • In this paper, we propose an artificial intelligence(AI) technology analysis using partial least square(PLS) regression model. AI technology is now affecting most areas of our society. So, it is necessary to understand this technology. To analyze the AI technology, we collect the patent documents related to AI from the patent databases in the world. We extract AI technology keywords from the patent documents by text mining techniques. In addition, we analyze the AI keyword data by PLS regression model. This regression model is based on the technique of partial least squares used in the advanced analyses such as bioinformatics, social science, and engineering. To show the performance of our proposed method, we make experiments using AI patent documents, and we illustrate how our research can be applied to real problems. This paper is applicable not only to AI technology but also to other technological fields. This also contributes to understanding other various technologies by PLS regression analysis.

Compositional Analysis of Naphtha by FT-Raman Spectroscopy

  • 구민식;정호일
    • Bulletin of the Korean Chemical Society
    • /
    • v.20 no.2
    • /
    • pp.159-162
    • /
    • 1999
  • Three different chemical compositions of total paraffin, total naphthene, total aromatic content in naphtha have been successfully analyzed using FT-Raman spectroscopy. Partial least squares (PLS) regression has been utilized to develop calibration models for each composition from Raman spectral bands. The PLS calibration results showed Blood correlation with those of gas chromatography (GC). Using PLS regression, the spectral information related to each composition has been successfully extracted from highly overlapped Raman spectra of naphtha.

A Method for Screening Product Design Variables for Building A Usability Model : Genetic Algorithm Approach (사용편의성 모델수립을 위한 제품 설계 변수의 선별방법 : 유전자 알고리즘 접근방법)

  • Yang, Hui-Cheol;Han, Seong-Ho
    • Journal of the Ergonomics Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.45-62
    • /
    • 2001
  • This study suggests a genetic algorithm-based partial least squares (GA-based PLS) method to select the design variables for building a usability model. The GA-based PLS uses a genetic algorithm to minimize the root-mean-squared error of a partial least square regression model. A multiple linear regression method is applied to build a usability model that contains the variables seleded by the GA-based PLS. The performance of the usability model turned out to be generally better than that of the previous usability models using other variable selection methods such as expert rating, principal component analysis, cluster analysis, and partial least squares. Furthermore, the model performance was drastically improved by supplementing the category type variables selected by the GA-based PLS in the usability model. It is recommended that the GA-based PLS be applied to the variable selection for developing a usability model.

  • PDF

Multiple Factors in the Second Trimester of Pregnancy on Preterm Labor Symptoms and Preterm Birth (임신 2삼분기 여성의 조기진통 증상과 조산에 영향을 미치는 다인성 요인)

  • Kim, Jeung-Im;Cho, Mi-Ock;Choi, Gyu-Yeon
    • Journal of Korean Academy of Nursing
    • /
    • v.47 no.3
    • /
    • pp.357-366
    • /
    • 2017
  • Purpose: The aim of this study was to determine the influence of various factors on preterm labor symptoms (PLS) and preterm birth (PB). Methods: This prospective cohort study included 193 women in the second stage of pregnancy. Multiple characteristics including body mass index (BMI), smoking, and pregnancy complications were collected through a self-report questionnaire. Pregnancy stress and PLS were each measured with a related scale. Cervical length and birth outcome were evaluated from medical charts. Multiple regression was used to predict PLS and logistic regression was used to predict PB. Results: Multiple regression showed smoking experience, pregnancy complications and pregnancy specific stress were predictors of PLS and accounted for 19.2% of the total variation. Logistic regression showed predictors of PB to be twins (OR=13.68, CI=3.72~50.33, p<.001), shorter cervix (<25mm) (OR=5.63, CI=1.29~24.54, p<.05), BMI >25 ($kg/m^2$) (OR=3.50, CI=1.35~9.04, p<.01) and a previous PB (OR=4.15, CI=1.07~16.03, p<.05). Conclusion: The results of this study show that the multiple factors affect stage II pregnant women can result in PLS or PB. And preterm labor may predict PB. These findings highlight differences in predicting variables for pretrm labor and for PB. Future research is needed to develop a screening tool to predict the risk of preterm birth in pregnant women.

Predicting Future Terrestrial Vegetation Productivity Using PLS Regression (PLS 회귀분석을 이용한 미래 육상 식생의 생산성 예측)

  • CHOI, Chul-Hyun;PARK, Kyung-Hun;JUNG, Sung-Gwan
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.20 no.1
    • /
    • pp.42-55
    • /
    • 2017
  • Since the phases and patterns of the climate adaptability of vegetation can greatly differ from region to region, an intensive pixel scale approach is required. In this study, Partial Least Squares (PLS) regression on satellite image-based vegetation index is conducted for to assess the effect of climate factors on vegetation productivity and to predict future productivity of forests vegetation in South Korea. The results indicate that the mean temperature of wettest quarter (Bio8), mean temperature of driest quarter (Bio9), and precipitation of driest month (Bio14) showed higher influence on vegetation productivity. The predicted 2050 EVI in future climate change scenario have declined on average, especially in high elevation zone. The results of this study can be used in productivity monitoring of climate-sensitive vegetation and estimation of changes in forest carbon storage under climate change.