• 제목/요약/키워드: Partial Least Square Analysis

Search Result 292, Processing Time 0.023 seconds

Analysis of Partial Least Square Regression on Textural Data from Back Extrusion Test for Commercial Instant Noodles (시중 즉석 조리 면의 Back Extrusion 텍스처 데이터에 대한 Partial Least Square Regression 분석)

  • Kim, Su kyoung;Lee, Seung Ju
    • Food Engineering Progress
    • /
    • v.14 no.1
    • /
    • pp.75-79
    • /
    • 2010
  • Partial least square regression (PLSR) was executed on curve data of force-deformation from back extrusion test and sensory data for commercial instant noodles. Sensory attributes considered were hardness (A), springiness (B), roughness (C), adhesiveness to teeth (D), and thickness (E). Eight and two kinds of fried and non-fried instant noodles respectively were used in the tests. Changes in weighted regression coefficients were characterized as three stages: compaction, yielding, and extrusion. Correlation coefficients appeared in the order of E>D>A>B>C, root mean square error of prediction D>C>E>B>A, and relative ability of prediction D>C>E>B>A. Overall, 'D' was the best in the correlation and prediction. 'A' with poor prediction ability but high correlation was considered good when determining the order of magnitude.

A modified partial least squares regression for the analysis of gene expression data with survival information

  • Lee, So-Yoon;Huh, Myung-Hoe;Park, Mira
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.5
    • /
    • pp.1151-1160
    • /
    • 2014
  • In DNA microarray studies, the number of genes far exceeds the number of samples and the gene expression measures are highly correlated. Partial least squares regression (PLSR) is one of the popular methods for dimensional reduction and known to be useful for the classifications of microarray data by several studies. In this study, we suggest a modified version of the partial least squares regression to analyze gene expression data with survival information. The method is designed as a new gene selection method using PLSR with an iterative procedure of imputing censored survival time. Mean square error of prediction criterion is used to determine the dimension of the model. To visualize the data, plot for variables superimposed with samples are used. The method is applied to two microarray data sets, both containing survival time. The results show that the proposed method works well for interpreting gene expression microarray data.

A Method for Screening Product Design Variables for Building A Usability Model : Genetic Algorithm Approach (사용편의성 모델수립을 위한 제품 설계 변수의 선별방법 : 유전자 알고리즘 접근방법)

  • Yang, Hui-Cheol;Han, Seong-Ho
    • Journal of the Ergonomics Society of Korea
    • /
    • v.20 no.1
    • /
    • pp.45-62
    • /
    • 2001
  • This study suggests a genetic algorithm-based partial least squares (GA-based PLS) method to select the design variables for building a usability model. The GA-based PLS uses a genetic algorithm to minimize the root-mean-squared error of a partial least square regression model. A multiple linear regression method is applied to build a usability model that contains the variables seleded by the GA-based PLS. The performance of the usability model turned out to be generally better than that of the previous usability models using other variable selection methods such as expert rating, principal component analysis, cluster analysis, and partial least squares. Furthermore, the model performance was drastically improved by supplementing the category type variables selected by the GA-based PLS in the usability model. It is recommended that the GA-based PLS be applied to the variable selection for developing a usability model.

  • PDF

Discrimination of Alismatis Rhizoma According to Geographical Origins using Near Infrared Spectroscopy (근적외선분광법을 이용한 택사의 산지 판별법 연구)

  • Lee, Dong Young;Kim, Seung Hyun;Kim, Hyo Jin;Sung, Sang Hyun
    • Korean Journal of Pharmacognosy
    • /
    • v.44 no.4
    • /
    • pp.344-349
    • /
    • 2013
  • Near infrared spectroscopy (NIRS) combined with multivariate analysis was used to discriminate the geographical origin of Alisma orientale from Korea (n=94) and China (n=72). Two-thirds of samples were selected randomly for the training set, and one-third of samples for the test set. Second derivative was used for the pretreatment of NIR spectra. Partial least square discriminant analysis (PLS-DA) models correctly discriminated 100% of the Korean and Chinese A. orientale samples. These results demonstrate the potential use of NIR spectroscopy combined with multivariate analysis as a rapid and accurate method to discriminate A. orientale according to their geographical origin.

Quantitative Analysis of Indomethacin by the Portable Near-Infrared (NIR) System (근적외분광분석법을 이용한 인도메타신의 정량분석)

  • 김도형;우영아;김효진
    • YAKHAK HOEJI
    • /
    • v.47 no.5
    • /
    • pp.261-265
    • /
    • 2003
  • Near-infrared (NIR) system was used to determine rapidly and simply indomethacin in buffer solution for a dissolution test of tablets and capsules. Indomethacin standards were prepared ranging from 10 to 50 ppm using the mixture of phosphate buffer (pH 7.2) and water (1 : 4). The near-infrared (NIR) transmittance spectra of indomethacin standard solutions were collected by using a quartz cell in 1 mm and 2 mm pathlength. Partial least square regression (PLSR) was explored to develop calibration models over the spectral range 1100∼1700 nm. The model using 1 mm quartz cell was better than that using 2 mm quartz cell. The PLSR models developed gave standard error of prediction (SEP) of 0.858 ppm. In order to validate the developed calibration model, routine analysis was performed using another standard solutions. The NIR routine analysis showed good correlation with actual values. Standard error of prediction (SEP) is 1.414 ppm for 7 indomethacin samples in routine analysis and its error was permeable in the regulation of Korean Pharmacopoeia (VII). These results show the potential use of the real time monitoring for indomethacin during a dissolution test.

AI Technology Analysis using Partial Least Square Regression

  • Choi, JunHyeog;Jun, Sunghae
    • Journal of the Korea Society of Computer and Information
    • /
    • v.25 no.3
    • /
    • pp.109-115
    • /
    • 2020
  • In this paper, we propose an artificial intelligence(AI) technology analysis using partial least square(PLS) regression model. AI technology is now affecting most areas of our society. So, it is necessary to understand this technology. To analyze the AI technology, we collect the patent documents related to AI from the patent databases in the world. We extract AI technology keywords from the patent documents by text mining techniques. In addition, we analyze the AI keyword data by PLS regression model. This regression model is based on the technique of partial least squares used in the advanced analyses such as bioinformatics, social science, and engineering. To show the performance of our proposed method, we make experiments using AI patent documents, and we illustrate how our research can be applied to real problems. This paper is applicable not only to AI technology but also to other technological fields. This also contributes to understanding other various technologies by PLS regression analysis.

Determination of Urban-Life Housing Price and Return Ratio by Location (도시형생활주택의 입지별 분양가격 및 수익률 결정요인)

  • Park, Jin-A;Woo, Chul-Min;Baik, Min-Seok;Shim, Gyo-Eon
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.11
    • /
    • pp.469-481
    • /
    • 2012
  • The demand for small-sized housing has been increasing due to the recession of real-estate price and the increase of small-sized households. Especially, the demand for affordable housing has been increasing since the style of housing and the location fits the lifestyle of small-sized household. In addition, many investors have been buying it because it has advertised as an investment property holding high-return ratio. However, an empirical analysis about the selling price and the return ratio has not been done yet. Therefore, the purpose of the research is having the empirical analysis based on the selling price and return ration by examining the affordable housing in Seoul. The urban-life housing more than 50 generations of the Seoul was irradiated for the analysis. And the linear regression analysis and PLS(Partial Least Square Regression) analysis was used for the empirical analysis. The result of analysis, based on the linear regression analysis, showed that factors including neighboring housing price and subway catchment area have a significant effect to the determinant factors of housing price. The analysis for return ratio showed neighboring housing price, subway catchment area and amenities affects the ratio. Especially, the fault of using small sample was covered by using the partial least square regression in this research.

Utilization of R Program for the Partial Least Square Model: Comparison of SmartPLS and R (부분최소제곱모형을 위한 R 프로그램의 활용: SmartPLS와 R의 비교)

  • Kim, Yong-Tae;Lee, Sang-Jun
    • Journal of Digital Convergence
    • /
    • v.13 no.12
    • /
    • pp.117-124
    • /
    • 2015
  • As the acceptance of statistical analysis has been increased because of Big Data, the needs for an advanced second generation of statistical analysis method like Structural Equation Model are also increasing. This study suggests how R-Program, as open software, can be utilized when Partial Least Square Model, one of the SEMs, is applied to statistical analysis. R is a free software as a part of GNU projects as well as a powerful and useful tool for statistical analysis including Big Data. The study utilized R and SmartPLS, a representative statistical package of PLS-SEM, and analyzed internal consistency reliability, convergent validity, and discriminant validity of the measurement model. The study also analyzed path coefficients and moderator effects of the structural model and compared the results, respectively. The results indicated that R showed the same results with SmartPLS on the measurement model and the structural model. Therefore, the study confirmed that R could be a powerful tool that is alternative to a commercial statistical package in the future.

Non-linear Data Classification Using Partial Least Square and Residual Compensator (부분 최소 자승법과 잔차 보상기를 이용한 비선형 데이터 분류)

  • 김경훈;김태영;최원호
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.10 no.2
    • /
    • pp.185-191
    • /
    • 2004
  • Partial least squares(PLS) is one of multiplicate statistical process methods and has been developed in various algorithms with the characteristics of principal component analysis, dimensionality reduction, and analysis of the relationship between input variables and output variables. But it has been limited somewhat by their dependency on linear mathematics. The algorithm is proposed to classify for the non-linear data using PLS and the residual compensator(RC) based on radial basis function network (RBFN). It compensates for the error of the non-linear data using the RC based on RBFN. The experimental result is given to verify its efficiency compared with those of previous works.

Development of On-line Sorting System for Detection of Infected Seed Potatoes Using Visible Near-Infrared Transmittance Spectral Technique (가시광 및 근적외선 투과분광법을 이용한 감염 씨감자 온라인 선별시스템 개발)

  • Kim, Dae Yong;Mo, Changyeun;Kang, Jun-Soon;Cho, Byoung-Kwan
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.35 no.1
    • /
    • pp.1-11
    • /
    • 2015
  • In this study, an online seed potato sorting system using a visible and near infrared (40 1100 nm) transmittance spectral technique and statistical model was evaluated for the nondestructive determination of infected and sound seed potatoes. Seed potatoes that had been artificially infected with Pectobacterium atrosepticum, which is known to cause a soil borne disease infection, were prepared for the experiments. After acquiring transmittance spectra from sound and infected seed potatoes, a determination algorithm for detecting infected seed potatoes was developed using the partial least square discriminant analysis method. The coefficient of determination($R^2_p$) of the prediction model was 0.943, and the classification accuracy was above 99% (n = 80) for discriminating diseased seed potatoes from sound ones. This online sorting system has good potential for developing a technique to detect agricultural products that are infected and contaminated by pathogens.