• Title/Summary/Keyword: Partial least squares regression (PLSR)

Search Result 50, Processing Time 0.022 seconds

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.

A Study on the Several Robust Regression Estimators

  • Kim, Jee-Yun;Roh, Kyung-Mi;Hwang, Jin-Soo
    • Journal of the Korean Data and Information Science Society
    • /
    • v.15 no.2
    • /
    • pp.307-316
    • /
    • 2004
  • Principal Component Regression(PCR) and Partial Least Squares Regression(PLSR) are the two most popular regression techniques in chemometrics. In the field of chemometrics usually the number of regressor variables greatly exceeds the number of observation. So we have to reduce the number of regressors to avoid the identifiability problem. In this paper we compare PCR and PLSR techniques combined with various robust regression methods including regression depth estimation. We compare the efficiency, goodness-of-fit and robustness of each estimators under several contamination schemes.

  • PDF

Milling tool wear forecast based on the partial least-squares regression analysis

  • Xu, Chuangwen;Chen, Hualing
    • Structural Engineering and Mechanics
    • /
    • v.31 no.1
    • /
    • pp.57-74
    • /
    • 2009
  • Power signals resulting from spindle and feed motor, present a rich content of physical information, the appropriate analysis of which can lead to the clear identification of the nature of the tool wear. The partial least-squares regression (PLSR) method has been established as the tool wear analysis method for this purpose. Firstly, the results of the application of widely used techniques are given and their limitations of prior methods are delineated. Secondly, the application of PLSR is proposed. The singular value theory is used to noise reduction. According to grey relational degree analysis, sample variable is filtered as part sample variable and all sample variables as independent variables for modelling, and the tool wear is taken as dependent variable, thus PLSR model is built up through adapting to several experimental data of tool wear in different milling process. Finally, the prediction value of tool wear is compare with actual value, in order to test whether the model of the tool wear can adopt to new measuring data on the independent variable. In the new different cutting process, milling tool wear was predicted by the methods of PLSR and MLR (Multivariate Linear Regression) as well as BPNN (BP Neural Network) at the same time. Experimental results show that the methods can meet the needs of the engineering and PLSR is more suitable for monitoring tool wear.

Development of Virtual Metrology Models in Semiconductor Manufacturing Using Genetic Algorithm and Kernel Partial Least Squares Regression (유전알고리즘과 커널 부분최소제곱회귀를 이용한 반도체 공정의 가상계측 모델 개발)

  • Kim, Bo-Keon;Yum, Bong-Jin
    • IE interfaces
    • /
    • v.23 no.3
    • /
    • pp.229-238
    • /
    • 2010
  • Virtual metrology (VM), a critical component of semiconductor manufacturing, is an efficient way of assessing the quality of wafers not actually measured. This is done based on a model between equipment sensor data (obtained for all wafers) and the quality characteristics of wafers actually measured. This paper considers principal component regression (PCR), partial least squares regression (PLSR), kernel PCR (KPCR), and kernel PLSR (KPLSR) as VM models. For each regression model, two cases are considered. One utilizes all explanatory variables in developing a model, and the other selects significant variables using the genetic algorithm (GA). The prediction performances of 8 regression models are compared for the short- and long-term etch process data. It is found among others that the GA-KPLSR model performs best for both types of data. Especially, its prediction ability is within the requirement for the short-term data implying that it can be used to implement VM for real etch processes.

Partial least squares regression theory and application in spectroscopic diagnosis of total hemoglobin in whole blood (부분최소제곱회귀(Partial Least Squares Regression) 이론과 분광학적 혈중 헤모글로빈 진단에의 응용)

  • 김선우;김연주;김종원;윤길원
    • The Korean Journal of Applied Statistics
    • /
    • v.10 no.2
    • /
    • pp.227-239
    • /
    • 1997
  • PLSR is a powerful multivariate statistical tool that has been successfully applied to the quantitative analyses of data in spectroscopy, chemistry, and industrial process control. Data in spectorscopy is represented by spectrum matrix measured in many wavelengths. Problems of many kinds of noise in data and itercorrelation between wavelengths are quite common in such data. PLSR utilizes whole data set measured in many wavelengths to the analysis, and handles such problems through data compression method. We investigated the PLSR theory, and applied this method to the data for spectroscopic diagnosis of Total Hemoglobin in whole blood.

  • PDF

Quantitative Analysis of Taurine Using Near Infrared Spectrometry (NIRS) (근적외선 분광분석법을 이용한 타우린의 정량 분석)

  • Cho, Chang-Hee;Kim, Hyo-Jin;Meang, Dae-Young;Seo, Sang-Hun;Cho, Jung-Hwan
    • YAKHAK HOEJI
    • /
    • v.42 no.6
    • /
    • pp.545-551
    • /
    • 1998
  • Near Infrared transmittance Spectroscopy (NIRS) was used to evaluate and quantify the pharmaceutical active compounds. In the paper, taurine (2-Aminoethanesulfonic acid) was quantitatively analyzed in commercial pharmaceutical preparations. For calibration a central composite factorial design was used to determine concentrations of ingredients in reference samples. For the quantitative analysis of taurine, the most suitable data analysis method includes the calculation of second derivatives and a partial least squares regression (PLSR) model. By NIR spectrometry, combined with PLSR, the taurine concentration was successfully predicted with a relative standard error of prediction (SEP) lower than 1.04%.

  • PDF

Development of Nondestructive Detection Method for Adulterated Powder Products Using Raman Spectroscopy and Partial Least Squares Regression (라만 분광법과 부분최소자승법을 이용한 불량 분말식품 비파괴검사 기술 개발)

  • Lee, Sangdae;Lohumi, Santosh;Cho, Byoung-Kwan;Kim, Moon S.;Lee, Soo-Hee
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.34 no.4
    • /
    • pp.283-289
    • /
    • 2014
  • This study was conducted to develop a non-destructive detection method for adulterated powder products using Raman spectroscopy and partial least squares regression(PLSR). Garlic and ginger powder, which are used as natural seasoning and in health supplement foods, were selected for this experiment. Samples were adulterated with corn starch in concentrations of 5-35%. PLSR models for adulterated garlic and ginger powders were developed and their performances evaluated using cross validation. The $R^2_c$ and SEC of an optimal PLSR model were 0.99 and 2.16 for the garlic powder samples, and 0.99 and 0.84 for the ginger samples, respectively. The variable importance in projection (VIP) score is a useful and simple tool for the evaluation of the importance of each variable in a PLSR model. After the VIP scores were taken pre-selection, the Raman spectrum data was reduced by one third. New PLSR models, based on a reduced number of wavelengths selected by the VIP scores technique, gave good predictions for the adulterated garlic and ginger powder samples.

Determination of Human Skin Moisture in the Near-Infrared Region from 1100 to 2200 nm by Portable NIR System (1100∼2200 nm 파장 영역의 휴대용 근적외선 분광분석기를 이용한 사람피부의 수분측정)

  • 안지원;서은정;우영아;김효진
    • YAKHAK HOEJI
    • /
    • v.47 no.3
    • /
    • pp.148-153
    • /
    • 2003
  • Skin moisture is an important factor in skin health. Measurement of moisture content can provide diagnostic information on the condition of skin. In this study, a portable near-infrared (NIR) system was newly integrated with a photo diode array detector that has no moving parts, and this system has been successfully applied for the evaluation of human skin moisture. Diffuse reflectance spectra were collected and transformed to absorbance using 1 nm step size over the wavelength range of 1100 nm to 2200 nm. Partial least squares regression (PLSR) was applied to develop a calibration model. For practical use for the evaluation of human skin moisture, the PLS model for human skin moisture was developed in vivo using the portable NIR system on the basis of the relative water content values of stratum corneum from the conventional capacitance method. The PLS model showed a good correlation. The calibration with the use of PLS model predicted human moisture with a standard error of prediction (SEP) of 3.5 at 1120∼1730 nm range. This study showed the possibility of skin moisture measurement using portable NIR system.

Quantification of Skin Moisture in Hairless Mouse by using a Portable NIR System and a FT NIR Spectrometer (Photo Diode Array형의 휴대용 근적외 분광기와 FT 근적외 분광기를 이용한 Hairless Mouse 피부 수분 정량)

  • Suh, Eun-Jung;Woo, Young-Ah;Kim, Hyo-Jin
    • YAKHAK HOEJI
    • /
    • v.49 no.2
    • /
    • pp.115-121
    • /
    • 2005
  • In this study, the performance of a portable NIR system and a FT NIR spectrometer were compared to determine water content of hairless mouse skin. The stratum corneum parts wer e separated from the epidermal tissues by trypsin solution. NIR diffuse reflectance spectra of hairless mouse skin were acquired using a fiber optic probe. In the near infrared, water molecules show two clear absorption bands at 1450 nm from first overtone of O-H stretching and 1940 nm from the combination involving O-H stretching and O-H deformation. It was found that the variations of O-H absorption band according to water content. Partial least squares regression (PLSR) was applied to develop a calibration model. The PLS model showed a good correlation between NIR predicted value and the absolute water content of separated hairless mouse skin, in vitro. For both the portable and the FT NIR spectrometer, These studies showed the possibility of a rapid and nondestructive skin moisture measurement using NIR spectroscopy. The portable NIR spectrometer with a photodiode arrays-microsensor could be more rapidly applied for the determination of water content with comparable accuracy with the performance of a FT spectrometer .

Non-destructive and Rapid Prediction of Moisture Content in Red Pepper (Capsicum annuum L.) Powder Using Near-infrared Spectroscopy and a Partial Least Squares Regression Model

  • Lim, Jongguk;Mo, Changyeun;Kim, Giyoung;Kang, Sukwon;Lee, Kangjin;Kim, Moon S.;Moon, Jihea
    • Journal of Biosystems Engineering
    • /
    • v.39 no.3
    • /
    • pp.184-193
    • /
    • 2014
  • Purpose: The aim of this study was to develop a technique for the non-destructive and rapid prediction of the moisture content in red pepper powder using near-infrared (NIR) spectroscopy and a partial least squares regression (PLSR) model. Methods: Three red pepper powder products were separated into three groups based on their particle sizes using a standard sieve. Each product was prepared, and the expected moisture content range was divided into six or seven levels from 3 to 21% wb with 3% wb intervals. The NIR reflectance spectra acquired in the wavelength range from 1,100 to 2,300 nm were used for the development of prediction models of the moisture content in red pepper powder. Results: The values of $R{_V}{^2}$, SEP, and RPD for the best PLSR model to predict the moisture content in red pepper powders of varying particle sizes below 1.4 mm were 0.990, ${\pm}0.487%$ wb, and 10.00, respectively. Conclusions: These results demonstrated that NIR spectroscopy and a PLSR model could be useful techniques for measuring rapidly and non-destructively the moisture content in red pepper powder.