• Title/Abstract/Keywords: Multivariate analysis of variance

224 search results (processing time: 0.026 s)

Principal Discriminant Variate (PDV) Method for Classification of Multicollinear Data: Application to Diagnosis of Mastitic Cows Using Near-Infrared Spectra of Plasma Samples

  • Jiang, Jian-Hui;Tsenkova, Roumiana;Yu, Ru-Qin;Ozaki, Yukihiro
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • The Korean Society of Near Infrared Spectroscopy, 2001 NIR-2001
    • /
    • pp.1244-1244
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposing them to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from mastitic and healthy cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from mastitic and healthy cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference, thereby providing a useful means for spectroscopy-based clinical applications.
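The separability criterion mentioned in the abstract above (the ratio of between-class to within-class variance) can be illustrated with a minimal two-class sketch. This is not the PDV method, whose formulations the abstract does not give. The function names, the synthetic data, and the `ridge` term (a generic stand-in for regularization) are all assumptions made for illustration.

```python
import numpy as np

def fisher_direction(X0, X1, ridge=0.0):
    """Two-class Fisher direction, proportional to (S_w + ridge*I)^-1 (m1 - m0)."""
    m0, m1 = X0.mean(axis=0), X1.mean(axis=0)
    # Within-class scatter: sum of the two per-class scatter matrices.
    S_w = (X0 - m0).T @ (X0 - m0) + (X1 - m1).T @ (X1 - m1)
    # Hypothetical ridge term; a generic regularizer, NOT the PDV formulation.
    S_w = S_w + ridge * np.eye(S_w.shape[0])
    w = np.linalg.solve(S_w, m1 - m0)
    return w / np.linalg.norm(w)

def fisher_ratio(w, X0, X1):
    """Between-class over within-class variance of the projected scores."""
    z0, z1 = X0 @ w, X1 @ w
    between = (z1.mean() - z0.mean()) ** 2
    within = z0.var() + z1.var()
    return between / within

# Synthetic data (assumed): two equally sized classes shifted along the first axis.
rng = np.random.default_rng(0)
X0 = rng.normal(size=(40, 3))
X1 = rng.normal(size=(40, 3)) + np.array([2.0, 0.0, 0.0])

w = fisher_direction(X0, X1, ridge=1e-6)
print("direction:", w)
print("Fisher ratio:", fisher_ratio(w, X0, X1))
```

With strongly collinear variables the within-class scatter matrix becomes nearly singular, which is the instability the abstract describes; increasing `ridge` trades some separability for a better-conditioned solve.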


PRINCIPAL DISCRIMINANT VARIATE (PDV) METHOD FOR CLASSIFICATION OF MULTICOLLINEAR DATA WITH APPLICATION TO NEAR-INFRARED SPECTRA OF COW PLASMA SAMPLES

  • Jiang, Jian-Hui;Yuqing Wu;Yu, Ru-Qin;Yukihiro Ozaki
    • Proceedings of the Korean Society of Near Infrared Spectroscopy Conference
    • /
    • The Korean Society of Near Infrared Spectroscopy, 2001 NIR-2001
    • /
    • pp.1042-1042
    • /
    • 2001
  • In linear discriminant analysis there are two important properties concerning the effectiveness of discriminant function modeling. The first is the separability of the discriminant function for different classes. The separability reaches its optimum by maximizing the ratio of between-class to within-class variance. The second is the stability of the discriminant function against noise present in the measurement variables. One can optimize the stability by exploring the discriminant variates in a principal variation subspace, i.e., the directions that account for a majority of the total variation of the data. An unstable discriminant function will exhibit inflated variance in the prediction of future unclassified objects, exposing them to a significantly increased risk of erroneous prediction. Therefore, an ideal discriminant function should not only separate different classes with a minimum misclassification rate for the training set, but also possess good stability such that the prediction variance for unclassified objects can be as small as possible. In other words, an optimal classifier should find a balance between separability and stability. This is of special significance for multivariate spectroscopy-based classification, where multicollinearity always leads to discriminant directions located in low-spread subspaces. A new regularized discriminant analysis technique, the principal discriminant variate (PDV) method, has been developed for effectively handling multicollinear data commonly encountered in multivariate spectroscopy-based classification. The motivation behind this method is to seek a sequence of discriminant directions that not only optimize the separability between different classes, but also account for a maximized variation present in the data. Three different formulations for the PDV methods are suggested, and an effective computing procedure is proposed for a PDV method.
Near-infrared (NIR) spectra of blood plasma samples from daily monitoring of two Japanese cows have been used to evaluate the behavior of the PDV method in comparison with principal component analysis (PCA), discriminant partial least squares (DPLS), soft independent modeling of class analogies (SIMCA) and Fisher linear discriminant analysis (FLDA). Results obtained demonstrate that the PDV method exhibits improved stability in prediction without significant loss of separability. The NIR spectra of blood plasma samples from the two cows are clearly discriminated by the PDV method. Moreover, the proposed method provides superior performance to PCA, DPLS, SIMCA and FLDA, indicating that PDV is a promising tool in discriminant analysis of spectra-characterized samples with only small compositional difference.


Genetic parameters for worm resistance in Santa Inês sheep using the Bayesian animal model

  • Rodrigues, Francelino Neiva;Sarmento, Jose Lindenberg Rocha;Leal, Tania Maria;de Araujo, Adriana Mello;Filho, Luiz Antonio Silva Figueiredo
    • Animal Bioscience
    • /
    • Vol. 34, No. 2
    • /
    • pp.185-191
    • /
    • 2021
  • Objective: The objective of this study was to estimate the genetic parameters for worm resistance (WR) and associated characteristics, using the linear-threshold animal model via Bayesian inference in single- and multiple-trait analyses. Methods: Data were collected from a herd of Santa Inês breed sheep. All information was collected with animals submitted to natural contamination conditions. All data (number of eggs per gram of feces [FEC], Famacha score [FS], body condition score [BCS], and hematocrit [HCT]) were collected on the same day. The animals were weighed individually on the day after collection (after 12-h fasting). The WR trait was defined by the multivariate cluster analysis, using the FEC, HCT, BCS, and FS of material collected from naturally infected sheep of the Santa Inês breed. The variance components and genetic parameters for the WR, FEC, HCT, BCS, and FS traits were estimated using the Bayesian inference under the linear and threshold animal model. Results: A low magnitude was obtained for repeatability of worm-related traits. The mean values estimated for heritability were of low-to-high (0.05 to 0.88) magnitude. The FEC, HCT, BCS, FS, and body weight traits showed higher heritability (although low magnitude) in the multiple-trait model due to increased information about traits. All WR characters showed a significant genetic correlation, and heritability estimates ranged from low (0.44; single-trait model) to high (0.88; multiple-trait model). Conclusion: Therefore, we suggest that FS be included as a criterion of ovine genetic selection for endoparasite resistance using the trait defined by multivariate cluster analysis, as it will provide greater genetic gains when compared to any single trait. In addition, its measurement is easy and inexpensive, exhibiting greater heritability and repeatability and a high genetic correlation with the trait of resistance to worms.

Hedging effectiveness of KOSPI200 index futures through VECM-CC-GARCH model

  • 권동안;이태욱
    • Journal of the Korean Data and Information Science Society
    • /
    • Vol. 25, No. 6
    • /
    • pp.1449-1466
    • /
    • 2014
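  • This paper studies hedging strategies that use futures on the underlying asset. Regression analysis is the traditional method for obtaining the optimal hedge ratio, but it cannot account for features such as the long-run equilibrium relationship between spot and futures prices or the volatility clustering present in the variance of financial time series. To overcome this limitation, we fit a vector error correction model as the mean model and a multivariate GARCH model as the variance model to KOSPI200 index and futures data, estimate the variance-covariance matrix, and use it to obtain the optimal hedge ratio. The empirical results show that regression analysis makes little difference when the market is stable, but that in periods when the market becomes unstable and volatility rises, using the vector error correction model together with the multivariate GARCH model yields markedly better hedging performance.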
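As a rough illustration of how a hedge ratio follows from a variance-covariance matrix, the sketch below uses the standard minimum-variance formula h = Cov(spot, futures) / Var(futures). It is a static example with made-up numbers, not the paper's VECM and multivariate GARCH procedure; in a time-varying setting the same formula would be applied to each period's estimated conditional covariance matrix. The function names are hypothetical.

```python
import numpy as np

def min_variance_hedge_ratio(cov):
    """h = Cov(spot, futures) / Var(futures) for a 2x2 matrix ordered [spot, futures]."""
    cov = np.asarray(cov, dtype=float)
    return cov[0, 1] / cov[1, 1]

def hedging_effectiveness(cov, h):
    """Fraction of spot variance removed by shorting h futures per unit of spot."""
    cov = np.asarray(cov, dtype=float)
    hedged_var = cov[0, 0] - 2.0 * h * cov[0, 1] + h ** 2 * cov[1, 1]
    return 1.0 - hedged_var / cov[0, 0]

# Made-up covariance matrix of spot and futures returns (assumption, not data from the paper).
cov = np.array([[1.0, 0.9],
                [0.9, 1.2]])

h = min_variance_hedge_ratio(cov)
he = hedging_effectiveness(cov, h)
print(f"hedge ratio = {h:.3f}, hedging effectiveness = {he:.3f}")
```

At the minimum-variance ratio, the hedging effectiveness equals the squared correlation between spot and futures returns, which is why a covariance estimate that tracks volatile periods more closely can change the measured hedging performance.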

Classification of Bodytype of Lower Part on Adult Male for the Apparel Sizing System (II)

  • 김구자
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • Vol. 17, No. 4
    • /
    • pp.602-607
    • /
    • 1993
  • The concept of comfort and fit has become a major concern in the basic function of ready-made clothes. This research was performed to classify and characterize Korean adult males anthropometrically. The sample comprised 1290 subjects aged 19 to 54 years. Sampling was carried out by the stratified sampling method. A total of 75 variables were applied to classify the bodytypes. Data were analyzed by multivariate methods, especially factor and cluster analysis. The items with high factor loadings extracted by factor analysis were used as the basis for choosing the variables of the cluster analysis for similar bodytypes. For the lower body, 14 variables from the data were applied to classify the bodytypes of the lower part by Ward's minimum variance method. The groups forming a cluster were subdivided into 5 sets by cross-tabulation extracted by the hierarchical cluster analysis. Types 3 and 4 of the lower body together made up the majority (53.1%) of the subjects. The Korean adult males had relatively well-balanced lower bodies.


Application of varimax rotated principal component analysis in quantifying some zoometrical traits of a relict cow

  • Pares-Casanova, P.M.;Sinfreu, I.;Villalba, D.
    • Korean Journal of Veterinary Research
    • /
    • Vol. 53, No. 1
    • /
    • pp.7-10
    • /
    • 2013
  • A study was conducted to determine the interdependence among the conformation traits of 28 "Pallaresa" cows using principal component analysis. Originally 21 body linear measurements were obtained, from which eight traits were subsequently eliminated. From the principal components analysis, with raw varimax rotation of the transformation matrix, two principal components were extracted, which accounted for 65.8% of the total variance. The first principal component alone explained 51.6% of the variation and tended to describe general size, while the second principal component had its loadings on back-sternal diameter. The two extracted principal components, which are traits related to dorsal heights and back-sternal diameter, could be considered in selection programs.

Paper - The Classification of Dam Heightening Reservoir using Factor and Cluster Analysis

  • 김해도;이광야;정인균;정광욱;권진욱
    • 한국관개배수논문집 (Korean journal of irrigation and drainage)
    • /
    • Vol. 18, No. 2
    • /
    • pp.66-75
    • /
    • 2011
  • Multivariate statistical analysis was applied to 110 dam heightening reservoirs to classify the conditions for building waterfronts centered on cultivated areas, using variables related to land cover, landscape, additional water quantity, local economy, tourism resources, and accessibility. Five factors were extracted through factor analysis based on the criterion of eigenvalues greater than one. These five factors together account for 68.2% of the total variance. The five factors for the downstream areas of dam heightening reservoirs characterize, respectively, the building conditions of the waterfront, economic conditions, additional water quantity, eco-tours, and accessibility of tourism resources. Five clusters were classified through cluster analysis based on factor scores. The classification shows that the third cluster has favorable conditions for building a waterfront.
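The eigenvalue-greater-than-one rule (Kaiser criterion) mentioned in the abstract above can be sketched as follows. The data here are synthetic, with two latent factors driving six observed variables; the function name and all numbers are assumptions, and the sketch uses the eigenvalues of the correlation matrix rather than a full factor-analysis fit.

```python
import numpy as np

def kaiser_retained(X):
    """Eigenvalues of the correlation matrix (descending), the number exceeding 1,
    and the share of total variance explained by the retained components."""
    R = np.corrcoef(X, rowvar=False)
    eigvals = np.linalg.eigvalsh(R)[::-1]   # eigvalsh returns ascending order
    n_keep = int(np.sum(eigvals > 1.0))
    explained = eigvals[:n_keep].sum() / eigvals.sum()
    return eigvals, n_keep, explained

# Synthetic data (assumed): two latent factors, each driving three noisy variables.
rng = np.random.default_rng(0)
f = rng.normal(size=(200, 2))
noise = 0.3 * rng.normal(size=(200, 6))
X = np.column_stack([f[:, 0], f[:, 0], f[:, 0], f[:, 1], f[:, 1], f[:, 1]]) + noise

eigvals, n_keep, explained = kaiser_retained(X)
print("eigenvalues:", np.round(eigvals, 3))
print("retained:", n_keep, "explained share:", round(float(explained), 3))
```

Because the correlation matrix has ones on its diagonal, its eigenvalues sum to the number of variables, so an eigenvalue above one marks a component that carries more variance than a single standardized variable.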


Analysis of Consumer's Purchasing Behavior on ICT Devices and Convergence Services in Korea

  • 신정우;김창섭;이미숙
    • Informatization Policy
    • /
    • Vol. 21, No. 4
    • /
    • pp.81-97
    • /
    • 2014
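  • This study analyzes consumers' purchasing behavior for ICT devices and related services and identifies the correlations among various ICT devices and services. By considering multiple-choice situations across various products and services simultaneously, it examines correlations between groups in addition to correlations within each product and service group. The data were collected through a consumer survey, and we estimated a multivariate probit model that accounts for demographic variables and an alternative specific constant model for analyzing the variance-covariance matrix. We also used multidimensional scaling to map the relationships among products and services, and derived the substitute or complementary relationships among various ICT devices and services. By helping to understand and predict consumers' purchasing behavior, this study is expected to provide useful information for the development of new products and services.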

Genetic parameters and principal components analysis of breeding value for birth and weaning weight in Egyptian buffalo

  • Salem, Mohamed Mahmoud Ibrahim;Amin, Amin Mohamed Said;Ashour, Ayman Fouad;Ibrahim, Mohamed Mohamed El-said;Abo-Ismail, Mohammed Kotb
    • Animal Bioscience
    • /
    • Vol. 34, No. 1
    • /
    • pp.12-19
    • /
    • 2021
  • Objective: The objectives of the current study were to study the main environmental factors affecting birth weight (BW) and weaning weight (WW), estimate variance components, genetic parameters and genetic trend, and evaluate the variability and relationships among breeding values of BW and WW using principal components analysis (PCA). Methods: A total of 16,370 records were collected from 8,271 buffalo calves. Genetic parameters and breeding values were estimated using a bivariate animal model which includes direct, maternal and permanent maternal effects. These estimates were standardized and used in PCA. Results: The direct heritability estimates were 0.06 and 0.41 for BW and WW, respectively, whereas direct maternal heritability values were 0.03 and 0.14, respectively. Proportions of variance due to permanent environmental effects of dam were 0.455 and 0.280 for BW and WW, respectively. The genetic correlation between BW and WW was weak, approaching zero, but the maternal correlation was 0.26. The first two principal components (PC1 and PC2) were estimated utilizing the standardized breeding values according to the Kaiser method. The total variance explained by the first two PCs was 71.17%, of which 45.91% and 25.25% were explained by PC1 and PC2, respectively. The direct breeding values of BW were related to PC2, but those of WW and the maternal breeding values of BW and WW were associated with PC1. Conclusion: The results of genetic parameters and PCA indicate that BW and WW were not genetically correlated and that improving growth traits of Egyptian buffaloes could be achieved using WW without any adverse effect on BW.

Social Media Marketing Strategies for Tourism Destinations: Effects of Linguistic Features and Content Types

  • Song, Seobgyu;Park, Seunghyun Brian;Park, Kwangsoo
    • Journal of Smart Tourism
    • /
    • Vol. 1, No. 3
    • /
    • pp.21-29
    • /
    • 2021
  • This study explored the relationship between post types and linguistic characteristics in marketer-generated content and social media engagement to find the optimized content to enhance social media engagement level. Data on 23,588 marketer-generated posts were collected from the destination marketing organization Facebook pages of the 50 states in the United States. The collected data were analyzed by employing social media analytics, linguistic analysis, multivariate analysis of variance, and discriminant analysis. The results showed that there are significant differences in both engagement indicators and linguistic scores among the three post types. Based on the research findings, this research not only provided researchers with theoretical implications but also suggested to practitioners the most effective content designs for travel destination marketing on Facebook.