• 제목/요약/키워드: Categorical Traits

검색결과 16건 처리시간 0.017초

Complex Segregation Analysis of Categorical Traits in Farm Animals: Comparison of Linear and Threshold Models

  • Kadarmideen, Haja N.;Ilahi, H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제18권8호
    • /
    • pp.1088-1097
    • /
    • 2005
  • Main objectives of this study were to investigate accuracy, bias and power of linear and threshold model segregation analysis methods for detection of major genes in categorical traits in farm animals. Maximum Likelihood Linear Model (MLLM), Bayesian Linear Model (BALM) and Bayesian Threshold Model (BATM) were applied to simulated data on normal, categorical and binary scales as well as to disease data in pigs. Simulated data on the underlying normally distributed liability (NDL) were used to create categorical and binary data. MLLM method was applied to data on all scales (Normal, categorical and binary) and BATM method was developed and applied only to binary data. The MLLM analyses underestimated parameters for binary as well as categorical traits compared to normal traits; with the bias being very severe for binary traits. The accuracy of major gene and polygene parameter estimates was also very low for binary data compared with those for categorical data; the later gave results similar to normal data. When disease incidence (on binary scale) is close to 50%, segregation analysis has more accuracy and lesser bias, compared to diseases with rare incidences. NDL data were always better than categorical data. Under the MLLM method, the test statistics for categorical and binary data were consistently unusually very high (while the opposite is expected due to loss of information in categorical data), indicating high false discovery rates of major genes if linear models are applied to categorical traits. With Bayesian segregation analysis, 95% highest probability density regions of major gene variances were checked if they included the value of zero (boundary parameter); by nature of this difference between likelihood and Bayesian approaches, the Bayesian methods are likely to be more reliable for categorical data. The BATM segregation analysis of binary data also showed a significant advantage over MLLM in terms of higher accuracy. Based on the results, threshold models are recommended when the trait distributions are discontinuous. Further, segregation analysis could be used in an initial scan of the data for evidence of major genes before embarking on molecular genome mapping.

Investigation of Biases for Variance Components on Multiple Traits with Varying Number of Categories in Threshold Models Using Bayesian Inferences

  • Lee, D.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제15권7호
    • /
    • pp.925-931
    • /
    • 2002
  • Gibbs sampling algorithms were implemented to the multi-trait threshold animal models with any combinations of multiple binary, ordered categorical, and linear traits and investigate the amount of bias on these models with two kinds of parameterization and algorithms for generating underlying liabilities. Statistical models which included additive genetic and residual effects as random and contemporary group effects as fixed were considered on the models using simulated data. The fully conditional posterior means of heritabilities and genetic (residual) correlations were calculated from 1,000 samples retained every 10th samples after 15,000 samples discarded as "burn-in" period. Under the models considered, several combinations of three traits with binary, multiple ordered categories, and continuous were analyzed. Five replicates were carried out. Estimates for heritabilities and genetic (residual) correlations as the posterior means were unbiased when underlying liabilities for a categorical trait were generated given by underlying liabilities of the other traits and threshold estimates were rescaled. Otherwise, when parameterizing threshold of zero and residual variance of one for binary traits, heritability estimates were inflated 7-10% upward. Genetic correlation estimates were biased upward if positively correlated and downward if negatively correlated when underling liabilities were generated without accounting for correlated traits on prior information. Residual correlation estimates were, consequently, much biased downward if positively correlated and upward if negatively correlated in that case. The more categorical trait had categories, the better mixing rate was shown.

다형질 Threshold 개체모형에서 Missing 기록을 포함한 이산형 자료에 대한 Bayesian 분석 (Bayesian Analysis for Categorical Data with Missing Traits Under a Multivariate Threshold Animal Model)

  • 이득환
    • Journal of Animal Science and Technology
    • /
    • 제44권2호
    • /
    • pp.151-164
    • /
    • 2002
  • 한우의 근내지방도 또는 임신 여부 등과 같이 이산형 분포의 성질을 갖는 다수의 형질들에 대한 유전모수 및 종축의 유전능력을 평가하기 위한 방법으로써 Threshold 모형하에서 Bayesian 추론방법의 일종인 Gibbs sampling방법을 모의실험을 통하여 알아보았으며 기록이 누락된 다수의 형질을 포함하는 다형질 Threshold 개체모형에서의 종축평가 방법론을 제시하였다. 이산형 형질의 관측치에 대응하는 임의의 잠재변수는 기록을 갖고 있는 형질들에 대한 사전정보를 고려한 사후조건확률분포에서 Gibbs sampling을 할 때 모수에 근접하는 확률분포를 얻을 수 있었으며 이러한 이산형 기록들에 대한 육종가 추정치는 선형모형에서 보다 Threshold 모형에서의 추정치가 실제 모수에 더욱 근접하는 것을 알 수 있었다. 따라서 기록이 누락된 개체들에 대한 이산형 분포를 갖는 형질들에 대하여 선형분포를 갖는 형질들과 함께 동시 유전분석할 때 Threshod 모형이 일반 선형모형 보다 적합함을 알 수 있었다.

Holstein 젖소의 선형심사형질과 등급형질에 대한 유전변이 추정 (Estimation of Genetic Variations for Linear Type Traits and Composite Traits on Holstein Cows)

  • 이득환
    • Journal of Animal Science and Technology
    • /
    • 제48권2호
    • /
    • pp.161-168
    • /
    • 2006
  • Genetic parameters for linear type and composite traits were estimated by using Bayesian inference via Gibbs sampling with a multiple threshold animal model in Holstein cows. Fifteen linear type traits and 5 composite traits were included to estimate genetic variance and covariance components in the model. In this study, 30,204 records were obtained in the cows from 305 sires. Heritability estimates for linear type traits had the estimates as high as 0.28~0.64. Heritability estimates for composite traits were also high, when the traits were assumed to be categorical traits. Final score was more correlated with the composite traits than with the linear type traits.

범주형 자료를 포함한 다형질 임계개체모형에서 유전능력 추정 알고리즘 (Computing Algorithm for Genetic Evaluations on Several Linear and Categorical Traits in A Multivariate Threshold Animal Model)

  • 이득환
    • Journal of Animal Science and Technology
    • /
    • 제46권2호
    • /
    • pp.137-144
    • /
    • 2004
  • 불연속 범주형 자료에 대한 잠재변수가 존재한다는 가정하에 임계값을 추정하고 잠재변수를 생성하며 생성된 잠재변수 및 기타 연속변량에 대한 관측치를 포함하는 다변량 임계개체모형을 설정하고 유전능력을 예측하기 위한 방법을 제시하였다. 각각의 범주형 조사 자료의 특성을 갖는 형질에 있어서 임계점의 추정은 추정 가능한 임계점에 대한 1차 미분값(gradient)과 2차 미분값(Hessian)을 이용한 Newton 방법을 이용하면 추정가능하며 지역모수인 육종가의 추정은 PCG 방법으로 구현 가능하다. 이러한 이론은 Quaas(2001)가 제시한 하나의 이산형 자료와 하나의 연속형 자료의 2변량 동시 분석방법을 확장하여 전개한 것이며 이때 잠재변수 및 임계점의 추정은 기타 형질의 잔차 회귀계수 및 상관을 고려해야 한다. 본 연구를 위한 모의실험은 2개의 연속변량으로 체중과 유량을 고려하였고 또 다른 2개의 불연속 변량인 분만난이도와 출생시 생존유무를 고려하여 4형질 동시 분석을 실시하였다. 임계모형에 의한 육종가 추정치의 정확도는 4개의 구간으로 분류되어 기록된 분만난이도의 경우에 91${\sim}$92%의 정확도를 보였고 이항분포인 분만시 생존유무에 대하여는 87~89%의 정확도를 보였다. 반면에 이들 범주형 자료를 선형으로 간주하고 분석한 선형 동물개체 혼합모형에서는 72${\sim}$84% 및 59${\sim}$70%으로 비교적 낮은 추정의 정확도를 보였다. 따라서 범주형 자료의 유전분석은 선형 혼합모형 보다 임계형 혼합모형이 크게 타당할 것으로 사료되었다.

베이지안 분계점 모형에 의한 순서 범주형 변수의 대체 (Imputation for Binary or Ordered Categorical Traits Based on the Bayesian Threshold Model)

  • 이승천
    • 응용통계연구
    • /
    • 제18권3호
    • /
    • pp.597-606
    • /
    • 2005
  • 대개의 표본조사에서 무응답은 필연적으로 발생되고 있고, 직접 표본조사에 참가하지 않은 데이터의 사용자는 무응답의 원인을 알 수 없는 것이 일반적이므로 데이터 분석에 어려움을 갖는다. 또 대부분의 통계분석 방법은 무응답을 전제하지 않고 있어 무응답이 있는 항목은 데이터 분석의 걸림돌이 된다고 하겠다. 최근 무응답에 대해 대체법이 하나의 표준적인 처리 방법이 되고 있어 현재까지 대체법에 대한 많은 연구가 있었으나 대부분의 대체법은 정규성 등을 가정한 연속형 변수의 대체법에 대한 것이었다. 그러나 표본조사에서 많은 중요한 항목들이 순서 범주에 의해 측정되는 경우가 많으므로 범주형변수의 대체법에 대한 연구가 필요하며, 본 연구에서는 보조변수가 있는 경우 Bayesian 모형에 의한 순서범주형 항목의 대체법에 대해 알아본다.

순서범주형자료 분석을 위한 베이지안 분계점 모형 (A Bayesian Threshold Model for Ordered Categorical Traits)

  • 최병수;이승천
    • 응용통계연구
    • /
    • 제18권1호
    • /
    • pp.173-182
    • /
    • 2005
  • 순서를 갖는 범주형자료의 분석을 위한 중요한 통계적 방법인 순위로짓모형의 대안으로 무정보 사전분포에 의한 베이지안 분계점 모형을 정의하고, 실증 자료분석을 통해 베이지안 모형의 유용성을 살펴보았다.

Genetic parameters for marbling and body score in Anglonubian goats using Bayesian inference via threshold and linear models

  • Figueiredo Filho, Luiz Antonio Silva;Sarmento, Jose Lindenberg Rocha;Campelo, Jose Elivalto Guimaraes;de Oliveira Almeida, Marcos Jacob;de Sousa, Antonio Junior;da Silva Santos, Natanael Pereira;da Silva Costa, Marcio;Torres, Tatiana Saraiva;Sena, Luciano Silva
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제31권9호
    • /
    • pp.1407-1414
    • /
    • 2018
  • Objective: The aim of this study was to estimate (co) variance components and genetic parameters for categorical carcass traits using Bayesian inference via mixed linear and threshold animal models in Anglonubian goats. Methods: Data were obtained from Anglonubian goats reared in the Brazilian Mid-North region. The traits in study were body condition score, marbling in the rib eye, ribeye area, fat thickness of the sternum, hip height, leg perimeter, and body weight. The numerator relationship matrix contained information from 793 animals. The single- and two-trait analyses were performed to estimate (co) variance components and genetic parameters via linear and threshold animal models. For estimation of genetic parameters, chains with 2 and 4 million cycles were tested. An 1,000,000-cycle initial burn-in was considered with values taken every 250 cycles, in a total of 4,000 samples. Convergence was monitored by Geweke criteria and Monte Carlo error chain. Results: Threshold model best fits categorical data since it is more efficient to detect genetic variability. In two-trait analysis the contribution of the increase in information and the correlations between traits contributed to increase the estimated values for (co) variance components and heritability, in comparison to single-trait analysis. Heritability estimates for the study traits were from low to moderate magnitude. Conclusion: Direct selection of the continuous distribution of traits such as thickness sternal fat and hip height allows obtaining the indirect selection for marbling of ribeye.

Bayesian Analysis of Multivariate Threshold Animal Models Using Gibbs Sampling

  • Lee, Seung-Chun;Lee, Deukhwan
    • Journal of the Korean Statistical Society
    • /
    • 제31권2호
    • /
    • pp.177-198
    • /
    • 2002
  • The estimation of variance components or variance ratios in linear model is an important issue in plant or animal breeding fields, and various estimation methods have been devised to estimate variance components or variance ratios. However, many traits of economic importance in those fields are observed as dichotomous or polychotomous outcomes. The usual estimation methods might not be appropriate for these cases. Recently threshold linear model is considered as an important tool to analyze discrete traits specially in animal breeding field. In this note, we consider a hierarchical Bayesian method for the threshold animal model. Gibbs sampler for making full Bayesian inferences about random effects as well as fixed effects is described to analyze jointly discrete traits and continuous traits. Numerical example of the model with two discrete ordered categorical traits, calving ease of calves from born by heifer and calving ease of calf from born by cow, and one normally distributed trait, birth weight, is provided.

Estimation of Genetic Parameters for Calving Ease by Heifers and Cows Using Multi-trait Threshold Animal Models with Bayesian Approach

  • Lee, D.H.
    • Asian-Australasian Journal of Animal Sciences
    • /
    • 제15권8호
    • /
    • pp.1085-1090
    • /
    • 2002
  • Genetic parameters for birth weights (BWT), calving ease scores observed from calves born by heifers (CEH), and calving ease scores observed from calves born by cows (CEC) were estimated using Bayesian methodology with Gibbs sampling in different threshold animal models. Data consisted of 77,458 records for calving ease scores and birth weights in Gelbvieh cattle. Gibbs samplers were used to obtain the parameters of interest for the categorical traits in two univariate threshold animal models, a bivariate threshold animal model, and a three-trait linear-threshold animal model. Samples of heritabilities and genetic correlations were calculated from the posterior means of dispersion parameters. In a univariate threshold animal model with CEH (model 1), the posterior means of heritabilities for calving ease was 0.35 for direct genetic effects and 0.18 for maternal genetic effects. In the other univariate threshold model with CEC (model 2), the posterior means of heritabilities of CEC was 0.28 for direct genetic effects and 0.18 for maternal genetic effects. In a bivariate threshold model with CEH and CEC (model 3), heritability estimates were similar to those in unvariate threshold models. In this model, genetic correlation between heifer calving ease and cow calving ease was 0.89 and 0.87 for direct genetic effect and maternal genetic effects, respectively. In a three-trait animal model, which contained two categorical traits (CEH and CEC) and one continuous trait (BWT) (model 4), heritability estimates of CEH and CEC for direct (maternal) genetic effects were 0.40 (0.23) and 0.23 (0.13), respectively. In this model, genetic correlation estimates between CEH and CEC were 0.89 and 0.66 for direct genetic effects and maternal effects, respectively. These estimates were greater than estimates between BWT and CEH (0.82 and 0.34) or BWT and CEC (0.85 and 0.26). This result indicates that CEH and CEC should be high correlated rather than estimates between calving ease and birth weight. Genetic correlation estimates between direct genetic effects and maternal effects were -0.29, -0.31 and 0.15 for BWT, CEH and CEC, respectively. Correlation for permanent environmental effects between BWT and CEC was -0.83 in model 4. This study can provide genetic evaluation for calving ease with other continuous traits jointly with assuming that calving ease from first calving was a same trait to calving ease from later parities calving. Further researches for reliability of dispersion parameters would be needed even if the more correlated traits would be concerned in the model, the higher reliability could be obtained, especially on threshold model with property that categorical traits have little information.