• Title/Summary/Keyword: multivariate classification

Search Result 309, Processing Time 0.026 seconds

A study on rock mass classification in the design of tunnel using multivariate discriminant analysis (다변량 판별분석을 통한 터널 설계시의 암반분류 연구)

  • Lee, Song;Ahn, Tae Hun;You, Oh Shick
    • Journal of Korean Tunnelling and Underground Space Association
    • /
    • v.6 no.3
    • /
    • pp.237-245
    • /
    • 2004
  • In designing a tunnel, RMR has been widely used to classify rock mass and to decide the support pattern according to the class of rock mass. However, this RMS system can't help relying on the empirical judgment of engineers who use variables which can be obtained only through consideration of the site conditions. In actuality, it is impossible to consider all the rating factors of RMS when using RMR system at the stage of designing. Therefore, in order to confirm possibility of RMR by use of only the quantitative factors for designing, this paper has done discriminant analysis. Rock strength or RQD has high coefficient of correlation with RMR value, and in consideration of the existing standards for rock mass classification, rock intensity and RQD are important factors for classification of rock mass. Through rock mass classification by the existing RMR system and rock mass classification by the discriminant analysis which has considered two variables only, the discriminant analysis using the rock intensity as an independent variable has shown 74.8% accuracy while the discriminant analysis using RQD as an independent variable has shown 74.3% accuracy. In case of the discriminant analysis which has considered both rock intensity and RQD, it has shown 82.5% accuracy. The existing cases have shown 40.3% accuracy at the stage of designing in which all the RMR factors are considered. It means that at the stage of designing, RMR system can work only with the rock intensity and RQD.

  • PDF

Classification Tree Analysis to Assess Contributing Factors Influencing Biosecurity Level on Farrow-to-Finish Pig Farms in Korea (분류 트리 기법을 이용한 국내 일괄사육 양돈장의 차단방역 수준에 영향을 미치는 기여 요인 평가)

  • Kim, Kyu-Wook;Pak, Son-Il
    • Journal of Veterinary Clinics
    • /
    • v.33 no.2
    • /
    • pp.107-112
    • /
    • 2016
  • The objective of this study was to determine potential contributing factors associated with biosecurity level of farrow-to-finish pig farms and to develop a classification tree model to explore how these factors related to each other based on prediction model. To this end, the author analyzed data (n = 193) extracted from a cross-sectional study of 344 farrow-to-finish farms which was conducted between March and September 2014 aimed to explore swine disease status at farm level. Standardized questionnaires with information about basic demographical data and management practices were collected in each farm by on-site visit of trained veterinarians. For the classification of the data sets regarding biosecurity level as a dependent variable and predictor variables, Chi-squared Automatic Interaction Detection (CHAID) algorithm was applied for modeling classification tree. The statistics of misclassification risk was used to evaluate the fitness of the model in terms of prediction results. Categorical multivariate input data (40 variables) was used to construct a classification tree, and the target variable was biosecurity level dichotomized into low versus high. In general, the level of biosecurity was lower in the majority of farms studied, mainly due to the limited implementation of on-farm basic biosecurity measures aimed at controlling the potential introduction and transmission of swine diseases. The CHAID model illustrated the relative importance of significant predictors in explaining the level of biosecurity; maintenance of medical records of treatment and vaccination, use of dedicated clothing to enter the farm, installing fence surrounding the farm perimeter, and periodic monitoring of the herd using written biosecurity plan in place. The misclassification risk estimate of the prediction model was 0.145 with the standard error of 0.025, indicating that 85.5% of the cases could be classified correctly by using the decision rule based on the current tree. Although CHAID approach could provide detailed information and insight about interactions among factors associated with biosecurity level, further evaluation of potential bias intervened in the course of data collection should be included in future studies. In addition, there is still need to validate findings through the external dataset with larger sample size to improve the external validity of the current model.

The Structure of Plant Community in Kwangnung Forest(I) -Analysis on the Forest Community of Soribong Area by the Classification and Ordination Techniques- (광릉(光陵) 삼림(森林)의 식물군집구조(植物群集構造)(I) -Classification 및 Ordination 방법에 의한 소리봉(蘇利峯)지역의 식생분석(植生分析)-)

  • Lee, Kyong Jae;Jo, Jae Chang;Lee, Bong Su;Lee, Do Suck
    • Journal of Korean Society of Forest Science
    • /
    • v.79 no.2
    • /
    • pp.173-186
    • /
    • 1990
  • To investigate the structure of the plant community of Soribong area in Kwangnung forest, forty-six plots were set up by the clumped sampling method. The classification by TWINSPAN and four kinds of multivariate ordination(PO, PCA, RA, DCA) were applied to the study area in order to classify them into several groups based on woody plants and environmental variables. The classification had been successfully overlayed on an ordination of the same data using DCA. The plots can be classified into four groups by TWINSPAN and DCA. The successional trends of tree species by both techniques seem to be from Pinus densiflora through Quercus mongolica, Q. serrata, Q. aliena, Carpinus laxiflora, Sorbus alnifolia to C. cordata, Fraxinus rhynchophylla, Cornus controversa in the canopy layer, and from Rhododendron mucronulatum, Rhus triohocarpa, Lespeoleza cyrtobotrya, Weigela subsessilis through Corylus sieboldiana, Lindera obtusiloba to Slaphylea bumalda, Callicarpa japonica, Lonicera maackii in the understory layer. As a result of the analysis for the relationship between the stand scores of DCA and environmental variables, they had a tendancy to increase significantly from the P. densiflora community to C. cordata community that was soil pH and the amount of humus, total nitrogen and exchangeable cations.

  • PDF

Chemotaxonomy of Trichoderma spp. Using Mass Spectrometry-Based Metabolite Profiling

  • Kang, Dae-Jung;Kim, Ji-Young;Choi, Jung-Nam;Liu, Kwang-Hyeon;Lee, Choong-Hwan
    • Journal of Microbiology and Biotechnology
    • /
    • v.21 no.1
    • /
    • pp.5-13
    • /
    • 2011
  • In this study, seven Trichoderma species (33 strains) were classified using secondary metabolite profile-based chemotaxonomy. Secondary metabolites were analyzed by liquid chromatography-electrospray ionization tandem mass spectrometry (LC-ESI-MS-MS) and multivariate statistical methods. T. longibrachiatum and T. virens were independently clustered based on both internal transcribed spacer (ITS) sequence and secondary metabolite analyses. T. harzianum formed three subclusters in the ITS-based phylogenetic tree and two subclusters in the metabolitebased dendrogram. In contrast, T. koningii and T. atroviride strains were mixed in one cluster in the phylogenetic tree, whereas T. koningii was grouped in a different subcluster from T. atroviride and T. hamatum in the chemotaxonomic tree. Partial least-squares discriminant analysis (PLS-DA) was applied to determine which metabolites were responsible for the clustering patterns observed for the different Trichoderma strains. The metabolites were hetelidic acid, sorbicillinol, trichodermanone C, giocladic acid, bisorbicillinol, and three unidentified compounds in the comparison of T. virens and T. longibrachiatum; harzianic acid, demethylharzianic acid, homoharzianic acid, and three unidentified compounds in T. harzianum I and II; and koninginin B, E, and D, and six unidentified compounds in T. koningii and T. atroviride. The results of this study demonstrate that secondary metabolite profiling-based chemotaxonomy has distinct advantages relative to ITS-based classification, since it identified new Trichoderma clusters that were not found using the latter approach.

A Study on the Node Split in Decision Tree with Multivariate Target Variables (다변량 목표변수를 갖는 의사결정나무의 노드분리에 관한 연구)

  • Kim, Seong-Jun
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.13 no.4
    • /
    • pp.386-390
    • /
    • 2003
  • Data mining is a process of discovering useful patterns for decision making from an amount of data. It has recently received much attention in a wide range of business and engineering fields. Classifying a group into subgroups is one of the most important subjects in data mining. Tree-based methods, known as decision trees, provide an efficient way to finding the classification model. The primary concern in tree learning is to minimize a node impurity, which is evaluated using a target variable in the data set. However, there are situations where multiple target variable should be taken into account, for example, such as manufacturing process monitoring, marketing science, and clinical and health analysis. The purpose of this article is to present some methods for measuring the node impurity, which are applicable to data sets with multivariate target variables. For illustration, a numerical cxample is given with discussion.

Charlson comorbidity index as a predictor of periodontal disease in elderly participants

  • Lee, Jae-Hong;Choi, Jung-Kyu;Jeong, Seong-Nyum;Choi, Seong-Ho
    • Journal of Periodontal and Implant Science
    • /
    • v.48 no.2
    • /
    • pp.92-102
    • /
    • 2018
  • Purpose: This study investigated the validity of the Charlson comorbidity index (CCI) as a predictor of periodontal disease (PD) over a 12-year period. Methods: Nationwide representative samples of 149,785 adults aged ${\geq}60$ years with PD (International Classification of Disease, 10th revision [ICD-10], K052-K056) were derived from the National Health Insurance Service-Elderly Cohort during 2002-2013. The degree of comorbidity was measured using the CCI (grade 0-6), including 17 diseases weighted on the basis of their association with mortality, and data were analyzed using multivariate Cox proportional-hazards regression in order to investigate the associations of comorbid diseases (CDs) with PD. Results: The multivariate Cox regression analysis with adjustment for sociodemographic factors (sex, age, household income, insurance status, residence area, and health status) and CDs (acute myocardial infarction, congestive heart failure, peripheral vascular disease, cerebral vascular accident, dementia, pulmonary disease, connective tissue disorders, peptic ulcer, liver disease, diabetes, diabetes complications, paraplegia, renal disease, cancer, metastatic cancer, severe liver disease, and human immunodeficiency virus [HIV]) showed that the CCI in elderly comorbid participants was significantly and positively correlated with the presence of PD (grade 1: hazard ratio [HR], 1.11; P<0.001; grade ${\geq}2$: HR, 1.12, P<0.001). Conclusions: We demonstrated that a higher CCI was a significant predictor of greater risk for PD in the South Korean elderly population.

Application of Multivariate Statistics and Geostatistical Techniques to Identify the Distribution Modes of the Co, Ni, As and Au-Ag ore in the Bou Azzer-East Deposits (Central Anti-Atlas Morocco)

  • Souiri, Muhammad;Aissa, Mohamed;Gois, Joaquim;Oulgour, Rachid;Mezougane, Hafid;El Azmi, Mohammed;Moussaid, Azizi
    • Economic and Environmental Geology
    • /
    • v.53 no.4
    • /
    • pp.363-381
    • /
    • 2020
  • The polymetallic Co, Ni, Cu, As, Au, and Ag deposits of Bou Azzer East are located in the western part of the Bou Azzer inlier in the Central Anti Atlas, Morocco. Six stages of emplacement of the mineralization have been identified. Precious metals (native gold and electrum) are present in all stages of this deposit except the early nickeliferous stage. From the Statistical analysis of the Co, As, Ni, Au, and Ag contents of a set of 501 samples, shows that the Pearson correlation coefficient between As-Co elements (0.966) is the highest followed by that of the Au-Ag couple (0.506). Principal component analysis (PCA) and hierarchical ascending classification (HAC) of the grades show, that Ni is associated with the pair (As-Co) and Cu is rather related to the pair (Au-Ag). The kriging maps show that the highest values of the Co, As and Ni appear in the contact of the serpentinite with other facies, as for those of Au and Ag, in addition to anomalous zones concordant with those of Co, Ni and As, they show anomalies at the extreme South and North of the study area. The development of the anomalous Au and Ag zones is mainly along the N40-50°E and N145°E directions.

Nomogram Estimating the Probability of Intraabdominal Abscesses after Gastrectomy in Patients with Gastric Cancer

  • Eom, Bang Wool;Joo, Jungnam;Kim, Young-Woo;Park, Boram;Yoon, Hong Man;Ryu, Keun Won;Kim, Soo Jin
    • Journal of Gastric Cancer
    • /
    • v.15 no.4
    • /
    • pp.262-269
    • /
    • 2015
  • Purpose: Intraabdominal abscess is one of the most common reasons for re-hospitalization after gastrectomy. This study aimed to develop a model for estimating the probability of intraabdominal abscesses that can be used during the postoperative period. Materials and Methods: We retrospectively reviewed the clinicopathological data of 1,564 patients who underwent gastrectomy for gastric cancer between 2010 and 2012. Twenty-six related markers were analyzed, and multivariate logistic regression analysis was used to develop the probability estimation model for intraabdominal abscess. Internal validation using a bootstrap approach was employed to correct for bias, and the model was then validated using an independent dataset comprising of patients who underwent gastrectomy between January 2008 and March 2010. Discrimination and calibration abilities were checked in both datasets. Results: The incidence of intraabdominal abscess in the development set was 7.80% (122/1,564). The surgical approach, operating time, pathologic N classification, body temperature, white blood cell count, C-reactive protein level, glucose level, and change in the hemoglobin level were significant predictors of intraabdominal abscess in the multivariate analysis. The probability estimation model that was developed on the basis of these results showed good discrimination and calibration abilities (concordance index=0.828, Hosmer-Lemeshow chi-statistic P=0.274). Finally, we combined both datasets to produce a nomogram that estimates the probability of intraabdominal abscess. Conclusions: This nomogram can be useful for identifying patients at a high risk of intraabdominal abscess. Patients at a high risk may benefit from further evaluation or treatment before discharge.

Characterization of Korean Archaeological Artifacts by Neutron Activation Analysis (II). Multivariate Classification of Korean Ancient Glass Pieces (중성자 방사화분석에 의한 한국산 고고학적 유물의 특성화 연구 (II). 다변량 해석법에 의한 고대 유리제품의 분류 연구)

  • Chul Lee;Oh Cheun Kwun;Ihn Chong Lee;Nak Bae Kim
    • Journal of the Korean Chemical Society
    • /
    • v.31 no.6
    • /
    • pp.567-575
    • /
    • 1987
  • Fourty five ancient Korean glass pieces have been determined for 19 elements such as Ag, As, Br, Ce, Co, Cr, Eu, Fe, Hf, K, La, Lu, Na, Ru, Sb, Sc, Sm, Th and Zn, and for one such as Pb by instrumental neutron activation analysis and by atomic absorption spectrometry, respectively. The multivariate data have been analyzed for the relation among elemental contents through the variance-covariance matrix. The data have been further analyzed by a principal component mapping method. As the results training set of 5 class have been chosen, based on the spread of sample points in an eigen vector plot and archaeological data. The 5 training set consisting of 36 species and a test set consisting of 9 species bave finally been analyzed for the assignment to certain classes or outliers through the statistical isolinear multiple component analysis (SIMCA). The results have showed the whole species for 5 training set and 3 species in the test set are assigned appropriately and these are in accord with the results by principal component mapping.

  • PDF

A Study on the Validity of the Technology Appraisal Model through the Analysis of the Business Performance and Technology Appraisal Items (기술금융기업의 경영성과와 기술력 평가항목 간 분석을 통한 기술력 평가모형의 타당성 연구)

  • Jun-won Lee
    • Information Systems Review
    • /
    • v.22 no.1
    • /
    • pp.73-89
    • /
    • 2020
  • This study started to identify the "Forward-looking" of the technology appraisal model introduced to diversify financing methods of SMEs and improve financial accessibility. The multivariate regression analysis was performed by setting the business performance(growth, profitability, and stability) of technology financing companies as dependent variables, technology appraisal items as independent variables, number of employees, age of the company, asset and the Korea Standard of Industry Classification related to firm size and industry characteristics as control variables. As a result of the analysis, the technology appraisal items did not explain the profitability of the company significantly and had a limited explanatory power on growth potential. However, in terms of stability, we confirmed that R&D capacity is a significant variable explaining the debt ratio of technology financing companies. Therefore, it is concluded that the 'Forward-looking' reflection on the growth and profitability of the company should be strengthened in the future adjustment of the technology appraisal model and the development of the technology appraisal model for investment.