• Title/Summary/Keyword: modified partial least squares

Search Result 43, Processing Time 0.028 seconds

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.

Hybrid Linear Analysis Based on the Net Analyte Signal in Spectral Response with Orthogonal Signal Correction

  • Park, Kwang-Su;Jun, Chi-Hyuck
    • Near Infrared Analysis
    • /
    • v.1 no.2
    • /
    • pp.1-8
    • /
    • 2000
  • Using the net analyte signal, hybrid linear analysis was proposed to predict chemical concentration. In this paper, we select a sample from training set and apply orthogonal signal correction to obtain an improved pseudo unit spectrum for hybrid least analysis. using the mean spectrum of a calibration training set, we first show the calibration by hybrid least analysis is effective to the prediction of not only chemical concentrations but also physical property variables. Then, a pseudo unit spectrum from a training set is also tested with and without orthogonal signal correction. We use two data sets, one including five chemical concentrations and the other including ten physical property variables, to compare the performance of partial least squares and modified hybrid least analysis calibration methods. The results show that the hybrid least analysis with a selected training spectrum instead of well-measured pure spectrum still gives good performances, which is a little better than partial least squares.

Modified partial least squares method implementing mixed-effect model

  • Kyunga Kim;Shin-Jae Lee;Soo-Heang Eo;HyungJun Cho;Jae Won Lee
    • Communications for Statistical Applications and Methods
    • /
    • v.30 no.1
    • /
    • pp.65-73
    • /
    • 2023
  • Contemporary biomedical data often involve an ill-posed problem owing to small sample size and large number of multi-collinear variables. Partial least squares (PLS) method could be a plausible alternative to an ill-conditioned ordinary least squares. However, in the case of a PLS model that includes a random-effect, how to deal with a random-effect or mixed effects remains a widely open question worth further investigation. In the present study, we propose a modified multivariate PLS method implementing mixed-effect model (PLSM). The advantage of PLSM is its versatility in handling serial longitudinal data or its ability for taking a randomeffect into account. We conduct simulations to investigate statistical properties of PLSM, and showcase its real clinical application to predict treatment outcome of esthetic surgical procedures of human faces. The proposed PLSM seemed to be particularly beneficial 1) when random-effect is conspicuous; 2) the number of predictors is relatively large compared to the sample size; 3) the multicollinearity is weak or moderate; and/or 4) the random error is considerable.

Determination of Protein Content in Pea by Near Infrared Spectroscopy

  • Lee, Jin-Hwan;Choung, Myoung-Gun
    • Food Science and Biotechnology
    • /
    • v.18 no.1
    • /
    • pp.60-65
    • /
    • 2009
  • Near infrared reflectance spectroscopy (NIRS) was used as a rapid and non-destructive method to determine the protein content in intact and ground seeds of pea (Pisum sativum L.) germplasms grown in Korea. A total of 115 samples were scanned in the reflectance mode of a scanning monochromator at intact seed and flour condition, and the reference values for the protein content was measured by auto-Kjeldahl system. In the developed ground and intact NIRS equations for analysis of protein, the most accurate equation were obtained at 2, 8, 6, 1 math treatment conditions with standard normal variate and detrend scatter correction method and entire spectrum (400-2,500 nm) by using modified partial least squares regression (n=78). External validation (n=34) of these NIRS equations showed significant correlation between reference values and NIRS estimated values based on the standard error of prediction (SEP), $R^2$, and the ratio of standard deviation of reference data to SEP. Therefore, these ground and intact NIRS equations can be applicable and reliable for determination of protein content in pea seeds, and non-destructive NIRS method could be used as a mass analysis technique for selection of high protein pea in breeding program and for quality control in food industry.

Prediction of Chemical Compositions for On-line Quality Measurement of Red Pepper Powder Using Near Infrared Reflectance Spectroscopy (NIRS)

  • Lee, Sun-Mee;Kim, Su-Na;Park, Jae-Bok;Hwang, In-Kyeong
    • Food Science and Biotechnology
    • /
    • v.14 no.2
    • /
    • pp.280-285
    • /
    • 2005
  • Applicability of near infrared reflectance spectroscopy (NIRS) was examined for quality control of red pepper powder in milling factories. Prediction of chemical composition was performed using modified partial least square (MPLS) techniques. Analysis of total 51 and 21 red pepper powder samples by conventional methods for calibration and validation, respectively, revealed standard error of prediction (SEP) and correlation coefficient ($R^2$) of moisture content, ASTA color value, capsaicinoid content, and total sugar content were 0.55 and 0.90, 8.58 and 0.96, 31.60 and 0.65, and 1.82 and 0.86, respectively; SEP and $R^2$ were low and high, respectively, except for capsaicinoid content. The results indicate, with slight improvement, on-line quality measurement of red pepper powder with NIRS could be applied in red pepper milling factories.

Quantification of Tocopherol and Tocotrienol Content in Rice Bran by Near Infrated Reflectance Spectroscopy (근적외선분광분석기를 이용한 미강의 Tocopherol과 Tocotrienol 함량 분석)

  • 김용호;강창성;이영상
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.49 no.3
    • /
    • pp.211-215
    • /
    • 2004
  • Near infrared reflectance spectroscopy (NIRS) is a rapid and accurate analytical method for determining the composition of agricultural products and feeds. This study was conducted to determine tocopherol and tocotrienol contents in rice bran by using NIRS system. Total 80 rice bran samples previously analyzed by HPLC were scanned by NIRS and over 60 samples were selected for calibration and validation equation. A calibration equation calculated by MPLS(modified partial least squares) regression technique was developed and coefficient of determination for tocopheyol and tocotyienol content were 0.975 and 0.984, respectively, in calibration sets. Each calibration equation was fitted to validation set that was performed with the remaining samples not included is the calibration set, which showed high positive correlation both in tocopherol and tocotrienol content file. This results demonstrate that the developed NIRS equation can be practically used as a rapid screening method for quantification of tocopherol and tocotrienol contents in rice bran.

Quantification of Icariin Contents in Epimedium koreanum N. by Using a Near Infrared Reflectance Spectroscopy (NIRS를 이용한 삼지구엽초의 이카린 함량 분석)

  • Kim, Yong-Ho;Choi, Byoung-Ryourl;Baek, Hum-Young;Lee, Young-Sang
    • Korean Journal of Medicinal Crop Science
    • /
    • v.10 no.5
    • /
    • pp.340-343
    • /
    • 2002
  • Near infrared reflectance spectroscopy (NIRS) has become widely accepted for rapid quantitative analysis of components in some crops. Our object was to determine icariin contents in whole plant of Epimedium koreanum by using an NIRS system. Total 150 plant samples previously analyzed by HPLC were scanned by NIRS and 68 samples were selected for calibration and validation equation. A calibration equation calculated by MPLS(modified partial least squares) regression technique was developed and a coefficient of determination in calibration and validation sets were 0.95 and 0.82, respectively. A comparison between NIRS estimation and HPLC value was performed with the remaining samples not included in the calibration and validation sets. Most of samples also showed a positive correlation like a validation set. Our results demonstrate that this developed NIRS equation can be practically used as a mass screening method for rapid quantification of icarin contents in Epimedium koreanum N.

Discrimination of Korean Domestic and Foreign Soybeans using Near Infrared Reflectance Spectroscopy (근적외선분광광도계(NIRS)를 이용한 국내산 콩과 수입콩의 판별분석)

  • Ahn, Hyung-Gyun;Kim, Yong-Ho
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.57 no.3
    • /
    • pp.296-300
    • /
    • 2012
  • Discrimination of geographic origin of agricultural products is a important issue in Korea because the price difference between Korean domestic and imported cereals is a key among some reasons. NIRS (Near Infrared Reflectance Spectroscopy) has been applied to classify the geographical origin of soybeans. Total 135 samples (Korean domestic 92 and foreign 43) were used to obtain calibration equation through 400~2,500 nm wavelength. The math treatment with 1st derivative and 4 nm gap and the modified partial least squares(MPLS)regression was outstanding for calibration equation. The standard error of calibration and determination coefficient in calibration set(n=115) was 6.65 and 0.98, respectively. And it showed that the extra 20 samples for validation equation were identified their authentication correctly. This study describes that the application of NIRS would be possible for discrimination of geographical origin between Korean domestic and imported soybeans.

A Study on the Structural Relationship of Perceived Value, Price Sensitivity, and Satisfaction between Brand Image and Purchase Intention in Overseas Direct Purchase (해외직접구매 소비자의 브랜드이미지와 구매의도 간 지각된가치, 가격민감도, 만족도의 구조적 관계 연구)

  • Jeong, Boon-Do;Kim, Ji-Hoon
    • Korea Trade Review
    • /
    • v.44 no.6
    • /
    • pp.169-185
    • /
    • 2019
  • The purpose of this study is to analyze the structural relationships of perceived value, price sensitivity, and satisfaction between brand image and purchase intention of consumers who have experience of overseas direct purchase. This study collected questionnaires used to analyze these structural relationships. Using the R's plspm package, we analyzed the PLS (partial least squares) structural equation model. In order to examine the relationship between perceived value and price sensitivity, the research model was modified and analyzed. As a result, not only the adoption of the research hypothesis, but also the goodness of fit was higher than before the research model modifying, and the relationship between perceived value and price sensitivity was further verified. The modified research model has higher academic value, so it is necessary to select it as the final proposal model.

RAPID PREDICTION OF ENERGY CONTENT IN CEREAL FOOD PRODUCTS WITH NIRS.

  • Kays, Sandra E.;Barton, Franklin E.
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • 2001.06a
    • /
    • pp.1511-1511
    • /
    • 2001
  • Energy content, expressed as calories per gram, is an important part of the evaluation and marketing of foods in developed countries. Currently accepted methods of measurement of energy by U.S. food labeling legislation include measurement of gross calories by bomb calorimetry with an adjustment for undigested protein and by calculation using specific factors for the energy values of protein, carbohydrate less the amount of insoluble dietary fiber, and total fat. The ability of NIRS to predict the energy value of diverse, processed and unprocessed cereal food products was investigated. NIR spectra of cereal products were obtained with an NIR Systems monochromator and the wavelength range used for analysis was 1104-2494 nm. Gross energy of the foods was measured by oxygen bomb calorimetry (Parr Manual No. 120) and expressed as calories per gram (CPGI, range 4.05-5.49 cal/g). Energy value was adjusted for undigested protein (CPG2, range 3.99-5.38 cal/g) and undigested protein and insoluble dietary fiber (CPG3, range 2.42-5.35 cal/g). Using a multivariate analysis software package (ISI International, Inc.) partial least squares models were developed for the prediction of energy content. The standard error of cross validation and multiple coefficient of determination for CPGI using modified partial least squares regression (n=127) was 0.060 cal/g and 0.95, respectively, and the standard error of performance, coefficient of determination, bias and slope using an independent validation set (n=59) were 0.057 cal/g, 0.98, -0.027 cal/g and 1.05 respectively. The PLS loading for factor 1 (Pearson correlation coefficient 0.92) had significant absorption peaks correlated to C-H stretch groups in lipid at 1722/1764 nm and 2304/2346 nm and O-H groups in carbohydrate at 1434 and 2076 nm. Thus the model appeared to be predominantly influenced by lipid and carbohydrate. Models for CPG2 and CPG3 showed similar trends with standard errors of performance, using the independent validation set, of 0.058 and 0.088 cal/g, respectively, and coefficients of determination of 0.96. Thus NIRS provides a rapid and efficient method of predicting energy content of diverse cereal foods.

  • PDF