• Title/Summary/Keyword: Multivariate Discriminant Analysis

Search Result 172, Processing Time 0.034 seconds

Establishment of discrimination system using multivariate analysis of FT-IR spectroscopy data from different species of artichoke (Cynara cardunculus var. scolymus L.) (FT-IR 스펙트럼 데이터 기반 다변량통계분석기법을 이용한 아티초크의 대사체 수준 품종 분류)

  • Kim, Chun Hwan;Seong, Ki-Cheol;Jung, Young Bin;Lim, Chan Kyu;Moon, Doo Gyung;Song, Seung Yeob
    • Horticultural Science & Technology
    • /
    • v.34 no.2
    • /
    • pp.324-330
    • /
    • 2016
  • To determine whether FT-IR spectral analysis based on multivariate analysis for whole cell extracts can be used to discriminate between artichoke (Cynara cardunculus var. scolymus L.) plants at the metabolic level, leaves of ten artichoke plants were subjected to Fourier transform infrared(FT-IR) spectroscopy. FT-IR spectral data from leaves were analyzed by principal component analysis (PCA), partial least square discriminant analysis (PLS-DA) and hierarchical clustering analysis (HCA). FT-IR spectra confirmed typical spectral differences between the frequency regions of 1,700-1,500, 1,500-1,300 and $1,100-950cm^{-1}$, respectively. These spectral regions reflect the quantitative and qualitative variations of amide I, II from amino acids and proteins ($1,700-1,500cm^{-1}$), phosphodiester groups from nucleic acid and phospholipid ($1,500-1,300cm^{-1}$) and carbohydrate compounds ($1,100-950cm^{-1}$). PCA revealed separate clusters that corresponded to their species relationship. Thus, PCA could be used to distinguish between artichoke species with different metabolite contents. PLS-DA showed similar species classification of artichoke. Furthermore these metabolic discrimination systems could be used for the rapid selection and classification of useful artichoke cultivars.

Application of Mahalanobis Taguchi System for Analysis of Multivariate System (Mahalanobis Taguchi System을 이용한 다변량 시스템의 해석에 관한 연구)

  • Hong, Jeong-Eui;Kim, Yong-Beom
    • Proceedings of the Safety Management and Science Conference
    • /
    • 2005.11a
    • /
    • pp.300-310
    • /
    • 2005
  • Mahalanobis Taguchi System (MTS) is developed by Genishi Taguchi as a part of his quality engineering methodology. The basic idea of Taguchi's quality engineering is looking for the way of effectiveness of analyzing multivariate system. In the MTS, with the standardized variables of healthy normal data, Mahalanobis Distance(MD) calculated and that can be discriminate between normal and abnormal objects. If this discrimination process is successful, next step is optimization which is try to reduce number of attributes by neglecting less effective attributes to MD. Orthogonal Array (OA) and Signal to Noise ratio (S/N) are used to evaluate the amount contribution of each attribute to the MD. Wisconsin Breast Cancer study, from machining learning repository at University of California at Irvine, used for examining the discriminant ability of MTS.

  • PDF

A Study on Forest Land Classification Using Multivariate Statistical Methods : A Case Study at Mt. Kwanak (다변수통계방법을 이용한 산지분류에 관한 연구)

  • 정순오
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.13 no.1
    • /
    • pp.43-66
    • /
    • 1985
  • Korea needs proper and rational public policies on conservation and use of forest land and other natural resources because of the accelerating expansion of national land developments in recent years. Unfortunately, there is no systematic planning system to support the needs. Generally, forest land use planning needs suitability analysis based on efficient land classification system. The goal of this study was to classify a forest land using multivariate satistical methods. A case study was carried out in winter of 1983 on a mountainous area higher than 100m above sea level located at Mt. Kwanak in Anyang -city, Kyung-gi-do (province). The study area was 19.80 km$^2$wide and was divided into 1, 383 Operational Taxonomic Units (OTU's) by a 120m$\times$120m grid. Fourteen descriptors were identified and quantified for each OTU from existing national land data : elevation, slope, aspect, terrain form, geologic material, surface soil permeability, topsoil type, depth of the solum, soil acidity, forest cover type, stand size class, stand age class, stand density class, and simple forest soil capability class. For this study, a FORTRAN IV program was written for input and output map data, and the computer statistics packages, SPSS and BMD, were used to perform the multivariate statistical analysis. Fourteen variables were analyzed to investigate the characteristics of their fire quench distribution and to estimate the correlation coefficients among them. Principal component analysis was executed to find the dimensions of forest land characteristics, and factor scores were used for proper samples of OTU throughout the study area. In order to develop the classes of forest land classification based on 102 surrogates, cluster and discriminant analyses of principal descriptor variable matrix were undertaken. Results obtained through a series of multivariate statistical analyses were as follows ; 1) Principal component analysis was proved to be a useful tool for data selection and identification of principal descriptor variables which represented the characteristics of forest land and facilitated the selection of samples.

  • PDF

Unraveling dynamic metabolomes underlying different maturation stages of berries harvested from Panax ginseng

  • Lee, Mee Youn;Seo, Han Sol;Singh, Digar;Lee, Sang Jun;Lee, Choong Hwan
    • Journal of Ginseng Research
    • /
    • v.44 no.3
    • /
    • pp.413-423
    • /
    • 2020
  • Background: Ginseng berries (GBs) show temporal metabolic variations among different maturation stages, determining their organoleptic and functional properties. Methods: We analyzed metabolic variations concomitant to five different maturation stages of GBs including immature green (IG), mature green (MG), partially red (PR), fully red (FR), and overmature red (OR) using mass spectrometry (MS)-based metabolomic profiling and multivariate analyses. Results: The partial least squares discriminant analysis score plot based on gas chromatography-MS datasets highlighted metabolic disparity between preharvest (IG and MG) and harvest/postharvest (PR, FR, and OR) GB extracts along PLS1 (34.9%) with MG distinctly segregated across PLS2 (18.2%). Forty-three significantly discriminant primary metabolites were identified encompassing five developmental stages (variable importance in projection > 1.0, p < 0.05). Among them, most amino acids, organic acids, 5-C sugars, ethanolamines, purines, and palmitic acid were detected in preharvest GB extracts, whereas 6-C sugars, phenolic acid, and oleamide levels were distinctly higher during later maturation stages. Similarly, the partial least squares discriminant analysis based on liquid chromatography-MS datasets displayed preharvest and harvest/postharvest stages clustered across PLS1 (11.1 %); however, MG and PR were separated from IG, FR, and OR along PLS2 (5.6 %). Overall, 24 secondary metabolites were observed significantly discriminant (variable importance in projection > 1.0, p < 0.05), with most displaying higher relative abundance during preharvest stages excluding ginsenosides Rg1 and Re. Furthermore, we observed strong positive correlations between total flavonoid and phenolic metabolite contents in GB extracts and antioxidant activity. Conclusion: Comprehending the dynamic metabolic variations associated with GB maturation stages rationalize their optimal harvest time per se the related agroeconomic traits.

Operation Modes Classification of Chemical Processes for History Data-Based Fault Diagnosis Methods (데이터 기반 이상진단법을 위한 화학공정의 조업모드 판별)

  • Lee, Chang Jun;Ko, Jae Wook;Lee, Gibaek
    • Korean Chemical Engineering Research
    • /
    • v.46 no.2
    • /
    • pp.383-388
    • /
    • 2008
  • The safe and efficient operation of the chemical processes has become one of the primary concerns of chemical companies, and a variety of fault diagnosis methods have been developed to diagnose faults when abnormal situations arise. Recently, many research efforts have focused on fault diagnosis methods based on quantitative history data-based methods such as statistical models. However, when the history data-based models trained with the data obtained on an operation mode are applied to another operating condition, the models can make continuous wrong diagnosis, and have limits to be applied to real chemical processes with various operation modes. In order to classify operation modes of chemical processes, this study considers three multivariate models of Euclidean distance, FDA (Fisher's Discriminant Analysis), and PCA (principal component analysis), and integrates them with process dynamics to lead dynamic Euclidean distance, dynamic FDA, and dynamic PCA. A case study of the TE (Tennessee Eastman) process having six operation modes illustrates the conclusion that dynamic PCA model shows the best classification performance.

Classification of Forest Cover Types in the Baekdudaegan, South Korea

  • Chung, Sang Hoon;Lee, Sang Tae
    • Journal of Forest and Environmental Science
    • /
    • v.37 no.4
    • /
    • pp.269-279
    • /
    • 2021
  • This study was carried out to introduce the forest cover types of the Baekdudaegan inhabiting the number of native tree species. In order to understand the vegetation distribution characteristics of the Baekdudaegan, a vegetation survey was conducted on the major 20 mountains of the Baekdudaegan. The vegetation data were collected from 3,959 sample points by the point-centered quarter method. Each mountain was classified into 4-7 forests by using various multivariate statistical methods such as cluster analysis, indicator species analysis, multiple discriminant analysis, and species composition analysis. The forests were classified mainly according to the relative abundance of Quercus mongolica. There was a total of 111 classified forests and these forests were integrated into the following nine forest cover types using the percentage similarity index and by clustering according to vegetation type: 1) Mongolian oak, 2) Mongolian oak and other deciduous, 3) Oaks (Mixed Quercus spp.), 4) Korean red pine, 5) Korean red pine and oaks, 6) ash, 7) mixed mesophytic, 8) subalpine zone coniferous, and 9) miscellaneous forest. Forests grouped within the subalpine zone coniferous and miscellaneous classifications were characterized by similar environmental conditions and those forests that did not fit in any other category, respectively.

An approach for simultaneous determination for geographical origins of Korean Panax ginseng by UPLC-QTOF/MS coupled with OPLS-DA models

  • Song, Hyuk-Hwan;Kim, Doo-Young;Woo, Soyeun;Lee, Hyeong-Kyu;Oh, Sei-Ryang
    • Journal of Ginseng Research
    • /
    • v.37 no.3
    • /
    • pp.341-348
    • /
    • 2013
  • Identification of the origins of Panax ginseng has been issued in Korea scientifically and economically. We describe a metabolomics approach used for discrimination and prediction of ginseng roots from different origins in Korea. The fresh ginseng roots from six ginseng cooperative associations (Gangwon, Gaeseong, Punggi, Chungbuk, Jeonbuk, and Anseong) were analyzed by UPLC-MS-based approach combined with orthogonal projections to latent structure-discriminant analysis multivariate analysis. The ginsengs from Gangwon and Gaeseong were easily differentiated. We further analyzed the metabolomics results in subgroups. Punggi, Chungbuk, Jeonbuk, and Anseong ginseng could be easily differentiated by the first two orthogonal components. As a validation of the discrimination model, we performed blind prediction tests of sample origins using an external test set. Our model predicted their geographical origins as 99.7% probability. The robust discriminatory power and statistical validity of our method suggest its general applicability for determining the origins of P. ginseng samples.

ROENTGENOCEPHALOMETRIC STUDY ON CRANIOFACIAL MORPHOLOGY OF DEEPBITES (과개교합자의 악안면 형태에 관한 두부 X-선사진 계측학적 연구)

  • Kim, Hee-Jeong;Nahm, Dong-Seok
    • The korean journal of orthodontics
    • /
    • v.23 no.3 s.42
    • /
    • pp.341-358
    • /
    • 1993
  • This study was investigated to evaluate the morphologic characteristics of deepbite tendency as multiple factors. The subjects consisted of 60 control subjects(male 25, female 35) and 137 deephite patients(68 male, 69 female). The deepbite group was composed of 4 subgroups(Class I 44, Class II div. 1 40, Class II div. 2 13, Class III 40). The mean age was 21.57 year for the control group 21 year for deepbite group lateral cephalograph in centric occlusion were taken, traced, and digitized for each subject. The statistically computerized analysis was carried out with SAS program. The results were as follows ; 1. In deepbite group, saddle angle is lesser than that of normal group. 2. The vertical dysplasia is prominent on anterior lower face and is closely related with mandibular form and inclination. 3. Without consideration of sagittal relationship, the dental factors such as curve of Spee, interincisal angle, U1 to upper lip length were prominent in the deepbite group. 4. Although there were individual variances in the perioral soft tissue profile, the lip presented more protruded pattern. 5. There was no significant difference in hyoid bone position and inclination between normal and deepbite group. 6. The multivariate discriminant analysis between normal and Class I deepbite group showed that curve of Spee, AB-MP angle, interincisal angle, articular agnle were critical in the determination of deepbite as multiple factors.

  • PDF

Wing Morphometric Analysis of Psylla elaeagni Complex (Homoptera : Psyllidae) (보리나무이종군의 날개에 대한 수량형태학적 분석 (동시목: 나무이과))

  • Park, Hee-Cheon;Lee, Chang-Eon;Kim, Hoon-Soo
    • Animal Systematics, Evolution and Diversity
    • /
    • no.nspc2
    • /
    • pp.243-250
    • /
    • 1988
  • The wing morphometric characters of P.elaeagni complex feeding on the genus Elaeagnus plants was analysed by the multivariate methods using clustering of generalized distance and discriminant analysis. On the clustering of the species, the effect of sexual differences, seasonal variation and geographic population sensitively appeared . However, four species of this group was precicely divided by the discriminant analysis.

  • PDF

Discrimination model of cultivation area of Corni Fructus using a GC-MS-Based metabolomics approach (GC-MS 기반 대사체학 기법을 이용한 산수유의 산지판별모델)

  • Leem, Jae-Yoon
    • Analytical Science and Technology
    • /
    • v.29 no.1
    • /
    • pp.1-9
    • /
    • 2016
  • It is believed that traditional Korean medicines can be managed more scientifically through the development of logical criteria to verify their region of cultivation, and that this could contribute to the advancement of the traditional herbal medicine industry. This study attempted to determine such criteria for Sansuyu. The volatile compounds were obtained from 20 samples of domestic Corni fructus (Sansuyu) and 45 samples of Chinese Sansuyu by steam distillation. The metabolites were identified in the NIST Mass Spectral Library via the obtained gas chromatography/mass spectrometer (GC/MS) data of 53 training samples. Data binning at 0.2 min intervals was performed to normalize the number of variables used in the statistical analysis. Multivariate statistical analyses, such as principle component analysis (PCA), partial least squares-discriminant analysis (PLS-DA), and orthogonal partial least squares-discriminant analysis (OPLS-DA) were performed using the SIMCA-P software package. Significant variables with a variable importance in the projection (VIP) score higher than 1.0 were obtained from OPLS-DA, and variables that resulted in a p-value of less than 0.05 through one-way ANOVA were selected to verify the marker compounds. Finally, among the 11 variables extracted, 1-ethylbutyl-hydroperoxide (9.089 min), nonadecane (20.170 min), butylated hydroxytoluene (25.319 min), 5β,7βH,10α-eudesm-11-en-1α-ol (25.921 min), 7,9-bis(2-methyl-2-propanyl)-1-oxaspiro[4.5]deca-6,9-diene-2,8-dione (34.257 min), and 2-decyldodecyl-benzene (54.717 min) were selected as markers to indicate the origin of Sansuyu. The statistical model developed was suitable for the determination of the geographical origin of Sansuyu. The cultivation areas of four Korean and eight Chinese Sansuyu samples were predicted via the established OPLS-DA model, and it was confirmed that 11 of the 12 samples were accurately classified.