• Title/Summary/Keyword: partial least squares component

Search Result 74, Processing Time 0.024 seconds

Face recognition by PLS

  • Baek, Jang-Sun
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2003.10a
    • /
    • pp.69-72
    • /
    • 2003
  • The paper considers partial least squares (PLS) as a new dimension reduction technique for the feature vector to overcome the small sample size problem in face recognition. Principal component analysis (PCA), a conventional dimension reduction method, selects the components with maximum variability, irrespective of the class information. So PCA does not necessarily extract features that are important for the discrimination of classes. PLS, on the other hand, constructs the components so that the correlation between the class variable and themselves is maximized. Therefore PLS components are more predictive than PCA components in classification. The experimental results on Manchester and ORL databases show that PLS is to be preferred over PCA when classification is the goal and dimension reduction is needed.

  • PDF

Discrimination Model of Cultivation Area of Alismatis Rhizoma using a GC-MS-Based Metabolomics Approach (GC-MS 기반 대사체학 기법을 이용한 택사의 산지판별모델)

  • Leem, Jae-Yoon
    • YAKHAK HOEJI
    • /
    • v.60 no.1
    • /
    • pp.29-35
    • /
    • 2016
  • Traditional Korean medicines may be managed more scientifically, through the development of logical criterion to verify their cultivation region. It contributes to advance the industry of traditional herbal medicines. Volatile compounds were obtained from 14 samples of domestic Taeksa and 30 samples of Chinese Taeksa by steam distillation. The metabolites were identified by NIST mass spectral library in the obtained gas chromatography/mass spectrometer (GC/MS) data of 35 training samples. The multivariate statistical analysis, such as Principal Component Analysis (PCA), Partial Least Squares Discriminant Analysis (PLS-DA), and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA), were performed based on the qualitative and quantitative data. Finally trans-(2,3-diphenylcyclopropyl)methyl phenyl sulfoxide (47.265 min), 1,2,3,4-tetrahydro-1-phenyl-naphthalene (47.781 min), spiro[4-oxatricyclo[5.3.0.0.(2,6)]decan-3-one-5,2'-cyclohexane] (54.62 min), 6-[7-nitrobenzofurazan-4-yl]amino-morphinan-4,5-epoxy (54.86 min), p-hydroxynorephedrine (55.14 min) were determined as marker metabolites to verify candidates for the origin of Taeksa. The statistical model was well established to determine the origin of Taeksa. The cultivation areas of test samples, each 3 domestic and 6 Chinese Taeksa were predicted by the established OPLS-DA model and it was confirmed that all 9 samples were precisely classified.

A Study on Face Recognition based on Partial Least Squares (부분 최소제곱법을 이용한 얼굴 인식에 관한 연구)

  • Lee Chang-Beom;Kim Do-Hyang;Baek Jang-Sun;Park Hyuk-Ro
    • The KIPS Transactions:PartB
    • /
    • v.13B no.4 s.107
    • /
    • pp.393-400
    • /
    • 2006
  • There are many feature extraction methods for face recognition. We need a new method to overcome the small sample problem that the number of feature variables is larger than the sample size for face image data. The paper considers partial least squares(PLS) as a new dimension reduction technique for feature vector. Principal Component Analysis(PCA), a conventional dimension reduction method, selects the components with maximum variability, irrespective of the class information. So, PCA does not necessarily extract features that are important for the discrimination of classes. PLS, on the other hand, constructs the components so that the correlation between the class variable and themselves is maximized. Therefore PLS components are more predictive than PCA components in classification. The experimental results on Manchester and ORL databases shows that PLS is to be preferred over PCA when classification is the goal and dimension reduction is needed.

Comparative Analysis of Cultivation Region of Angelica gigas Using a GC-MS-Based Metabolomics Approach (GC-MS 기반 대사체학 기술을 응용한 참당귀의 산지비교분석)

  • Jiang, Guibao;Leem, Jae Yoon
    • Korean Journal of Medicinal Crop Science
    • /
    • v.24 no.2
    • /
    • pp.93-100
    • /
    • 2016
  • Background: A set of logical criteria that can accurately identify and verify the cultivation region of raw materials is a critical tool for the scientific management of traditional herbal medicine. Methods and Results: Volatile compounds were obtained from 19 and 32 samples of Angelica gigas Nakai cultivated in Korea and China, respectively, by using steam distillation extraction. The metabolites were identified using GC/MS by querying against the NIST reference library. Data binning was performed to normalize the number of variables used in statistical analysis. Multivariate statistical analyses, such as Principal Component Analysis (PCA), Partial Least Squares-Discriminant Analysis (PLS-DA), and Orthogonal Partial Least Squares-Discriminant Analysis (OPLS-DA) were performed using the SIMCA-P software. Significant variables with a Variable Importance in the Projection (VIP) score higher than 1.0 as obtained through OPLS-DA and those that resulted in p-values less than 0.05 through one-way ANOVA were selected to verify the marker compounds. Among the 19 variables extracted, styrene, ${\alpha}$-pinene, and ${\beta}$-terpinene were selected as markers to indicate the origin of A. gigas. Conclusions: The statistical model developed was suitable for determination of the geographical origin of A. gigas. The cultivation regions of six Korean and eight Chinese A. gigas. samples were predicted using the established OPLS-DA model and it was confirmed that 13 of the 14 samples were accurately classified.

A Statistical Approach to Screening Product Design Variables for Modeling Product Usability (사용편의성에 영향을 미치는 제품 설계 변수의 통계적 선별 방법)

  • Kim, Jong-Seo;Han, Seong-Ho
    • Journal of the Ergonomics Society of Korea
    • /
    • v.19 no.3
    • /
    • pp.23-37
    • /
    • 2000
  • Usability is one of the most important factors that affect customers' decision to purchase a product. Several studies have been conducted to model the relationship between the product design variables and the product usability. Since there could be hundreds of design variables to be considered in the model, a variable screening method is required. Traditional variable screening methods are based on expert opinions (Expert screening) in most Kansei engineering studies. Suggested in this study are statistical methods for screening important design variables by using the principal component regression(PCR), cluster analysis, and partial least squares(PLS) method. Product variables with high effect (PCR screening and PLS screening) or representative variables (Cluster screening) can be used to model the usability. Proposed variable screening methods are used to model the usability for 36 audio/visual products. The three analysis methods (PCR, Cluster, and PLS) show better model performance than the Expert screening in terms of $R^2$, the number of variables in the model, and PRESS. It is expected that these methods can be used for screening the product design variables efficiently.

  • PDF

A Study on Factor Analytical Methods and Procedures for PLS-SEM (Partial Least Squares Structural Equation Modeling)

  • YIM, Myung-Seong
    • The Journal of Industrial Distribution & Business
    • /
    • v.10 no.5
    • /
    • pp.7-20
    • /
    • 2019
  • Purpose - This study provides appropriate procedures for EFA to help researchers conduct empirical studies by using PLS-SEM. Research design, data, and methodology - This study addresses the absolute and relative sample size criteria, sampling adequacy, factor extraction models, factor rotation methods, the criterion for the number of factors to retain, interpretation of results, and reporting information. Results - The factor analysis procedure for PLS-SEM consists of the following five stages. First, it is important to look at whether both the Bartlett test of sphericity and the KMO MSA meet the qualitative criteria. Second, PAF is a better choice of methodology. Third, an oblique technique is a suitable method for PLS-SEM. Fourth, a combined approach is strongly recommended to factor retention. PA should be used at the onset. Next, it is recommended using the K1 criterion. In addition, it is necessary to extract factors that increase the total variance explanatory power through the PVA-FS. Finally, it is appropriate to select an item with a factor loading into 0.5 or higher and a communality of 0.5. Conclusions - It is expected that the accurate factor analysis processed for PLS-SEM as previously presented will help us extract more precise factors of the structural model.

A quantitative determination of surfactant mixtures by FT-IR (FT-IR을 이용한 계면활성제 혼합물의 정량)

  • 최종근;노경원
    • Journal of the Society of Cosmetic Scientists of Korea
    • /
    • v.21 no.2
    • /
    • pp.129-139
    • /
    • 1995
  • To confirm the usefulness of partial least-squares(PLS) and multiple scattering correction(MSC) method for quantitation of surfactants in [quantitative methods using FT-lR, reconsitituted mixtures of LAS, MES and ELA-9 were tested. Each mixture was dissolved in 50% EtOH, dried, and applied to the KBr cell. From the IR spectra of these mixture, the variance spectrum was obtained. After repeated calibrations for the various regios of this spectrum, we found that 1245-1130cm-1 and 1070-1010cm-1 showed the strong correlation with each component of the sample mixture: all the correlation coefficients were 1.000 and quantitative errors did not exceed 0.32%. From this result, we concluded that PLS method and MSC method are very useful and can be successfully applied to Quality control.

  • PDF

Comparison of 12 Isoflavone Profiles of Soybean (Glycine max (L.) Merrill) Seed Sprouts from Three Different Countries

  • Park, Soo-Yun;Kim, Jae Kwang;Kim, Eun-Hye;Kim, Seung-Hyun;Prabakaran, Mayakrishnan;Chung, Ill-Min
    • KOREAN JOURNAL OF CROP SCIENCE
    • /
    • v.63 no.4
    • /
    • pp.360-377
    • /
    • 2018
  • The levels of 12 isoflavones were measured in soybean (Glycine max (L.) Merrill) sprouts of 68 genetic varieties from three countries (China, Japan, and Korea). The isoflavone profile differences were analyzed using data mining methods. A principal component analysis (PCA) revealed that the CSRV021 variety was separated from the others by the first two principal components. This variety appears to be most suited for functional food production due to its high isoflavone levels. Partial least squares discriminant analysis (PLS-DA) and orthogonal projections to latent structures discriminant analysis (OPLS-DA) showed that there are meaningful isoflavone compositional differences in samples that have different countries of origin. Hierarchical clustering analysis (HCA) of these phytochemicals resulted in clusters derived from closely related biochemical pathways. These results indicate the usefulness of metabolite profiling combined with chemometrics as a tool for assessing the quality of foods and identifying metabolic links in biological systems.

Forensic Classification of Latent Fingerprints Applying Laser-induced Plasma Spectroscopy Combined with Chemometric Methods (케모메트릭 방법과 결합된 레이저 유도 플라즈마 분광법을 적용한 유류 지문의 법의학적 분류 연구)

  • Yang, Jun-Ho;Yoh, Jai-Ick
    • Korean Journal of Optics and Photonics
    • /
    • v.31 no.3
    • /
    • pp.125-133
    • /
    • 2020
  • An innovative method for separating overlapping latent fingerprints, using laser-induced plasma spectroscopy (LIPS) combined with multivariate analysis, is reported in the current study. LIPS provides the capabilities of real-time analysis and high-speed scanning, as well as data regarding the chemical components of overlapping fingerprints. These spectra provide valuable chemical information for the forensic classification and reconstruction of overlapping latent fingerprints, by applying appropriate multivariate analysis. This study utilizes principal-component analysis (PCA) and partial-least-squares (PLS) techniques for the basis classification of four types of fingerprints from the LIPS spectra. The proposed method is successfully demonstrated through a classification example of four distinct latent fingerprints, using discrimination such as soft independent modeling of class analogy (SIMCA) and partial-least-squares discriminant analysis (PLS-DA). This demonstration develops an accuracy of more than 85% and is proven to be sufficiently robust. In addition, by laser-scanning analysis at a spatial interval of 125 ㎛, the overlapping fingerprints were separated as two-dimensional forms.

Simultaneous Determination of Tryptophan and Tyrosine by Spectrofluorimetry Using Multivariate Calibration Method (다변량 분석법을 이용한 Tryptophan과 Tyrosine의 형광분광법적 정량)

  • Lee, Sang-Hak;Park, Ju-Eun;Son, Beom-Mok
    • Journal of the Korean Chemical Society
    • /
    • v.46 no.4
    • /
    • pp.309-317
    • /
    • 2002
  • A spectrofluorimetric method for the simultaneous determination of amino acids (tryptophan and tyrosine) based on the application of multivariate calibration method such as principal component regression and partial least squares (PLS) to luminescence measurements has been studied. Emission spectra of synthetic mixtures of two amino acids were obtained at excitation wavelength of 257 ㎚. The calibration model in PCR and PLS was obtained from the spectral data in the range of 280-500 ㎚ for each standard of a calibration set of 32 standards, each containing different amounts of two amino acids. The relative standard error of prediction ($RSEP_a$) was obtained to assess the model goodness in quantifying each analyte in a validation set. The overall relative standard error of prediction ($RSEP_m$) for the mixture obtained from the results of a validation set, formed by 6 independent mixtures was also used to validate the present method.