• Title/Summary/Keyword: Multifactor Dimensionality Reduction(MDR)

Search Result 30, Processing Time 0.023 seconds

Boosting Multifactor Dimensionality Reduction Using Pre-evaluation

  • Hong, Yingfu;Lee, Sangbum;Oh, Sejong
    • ETRI Journal
    • /
    • v.38 no.1
    • /
    • pp.206-215
    • /
    • 2016
  • The detection of gene-gene interactions during genetic studies of common human diseases is important, and the technique of multifactor dimensionality reduction (MDR) has been widely applied to this end. However, this technique is not free from the "curse of dimensionality" -that is, it works well for two- or three-way interactions but requires a long execution time and extensive computing resources to detect, for example, a 10-way interaction. Here, we propose a boosting method to reduce MDR execution time. With the use of pre-evaluation measurements, gene sets with low levels of interaction can be removed prior to the application of MDR. Thus, the problem space is decreased and considerable time can be saved in the execution of MDR.

A Comparison Study on SVM MDR and D-MDR for Detecting Gene-Gene Interaction in Continuous Data (연속형자료의 유전자 상호작용 규명을 위한 SVM MDR과 D-MDR의 방법 비교)

  • Lee, Jong-Hyeong;Lee, Jea-Young
    • Communications for Statistical Applications and Methods
    • /
    • v.18 no.4
    • /
    • pp.413-422
    • /
    • 2011
  • We have used a multifactor dimensionality reduction(MDR) method to study the major gene interaction effect in general; however, without application of the MDR method in continuous data. In light of this, many methods have been suggested such as Expanded MDR, Dummy MDR and SVM MDR. In this paper, we compare the two methods of SVM MDR and D-MDR. In addition, we identify the gene-gene interaction effect of single nucleotide polymorphisms(SNPs) associated with economic traits in Hanwoo(Korean cattle). Lastly, we discuss a new method in consideration of the advantages that the other methods present.

Main SNP Identification of Hanwoo Carcass Weight with Multifactor Dimensionality Reduction(MDR) Method (MULTIFACTOR DIMENSIONALITY REDUCTION(MDR)을 이용한 한우 도체중에서의 주요 SNP 규명)

  • Lee, Jea-Young;Kim, Dong-Chul
    • The Korean Journal of Applied Statistics
    • /
    • v.21 no.1
    • /
    • pp.53-63
    • /
    • 2008
  • It is commonly believed that disease of human or economic traits of livestock are caused not by single gene acting alone, but by multiple genes interacting with one an-other. This issue is difficult due to the limitations of parametric statistical method like as logistic regression for detection of gene effects that are dependent solely on interactions with other genes and with environmental exposures. Multifactor dimensionality reduction (MDR) nonparametric statistical method, to improve the identification of single nucleotide polymorphism (SNP) associated with the Hanwoo(Korean cattle) carcass cold weight, is applied and compared with ANOVA results.

Multifactor-Dimensionality Reduction in the Presence of Missing Observations

  • Chung, Yu-Jin;Lee, Seung-Yeoun;Park, Tae-Sung
    • Proceedings of the Korean Statistical Society Conference
    • /
    • 2005.11a
    • /
    • pp.31-36
    • /
    • 2005
  • An identification and characterization of susceptibility genes for common complex multifactorial diseases is a challengeable task, in which the effect of single genetic variation will be likely dependent on other genetic variations(gene-gene interaction) and environmental factors (gene-environment interaction). To address is issue, the multifactor dimensionality reduction (MDR) has been proposed and implemented by Ritchie et al. (2001), Moore et al. (2002), Hahn et al.(2003) and Ritchie et al. (2003). With MDR, multilocus genotypes effectively reduce the dimension of genotype predictors from n to one, which improves the identification of polymorphism combinations associated with disease risk. However, MDR cannot handle missing observations appropriately, in which missing observation is treated as an additional genotype category. This approach may suffer from a sparseness problem since when high-order interactions are considered, an additional missing category would make the contingency table cells more sparse. We propose a new MDR approach with minimum loss of sample sizes by considering missing data over all possible multifactor classes. We evaluate the proposed MDR by using the prediction errors and cross validation consistency.

  • PDF

Gene-Gene Interaction Analysis for the Accelerated Failure Time Model Using a Unified Model-Based Multifactor Dimensionality Reduction Method

  • Lee, Seungyeoun;Son, Donghee;Yu, Wenbao;Park, Taesung
    • Genomics & Informatics
    • /
    • v.14 no.4
    • /
    • pp.166-172
    • /
    • 2016
  • Although a large number of genetic variants have been identified to be associated with common diseases through genome-wide association studies, there still exits limitations in explaining the missing heritability. One approach to solving this missing heritability problem is to investigate gene-gene interactions, rather than a single-locus approach. For gene-gene interaction analysis, the multifactor dimensionality reduction (MDR) method has been widely applied, since the constructive induction algorithm of MDR efficiently reduces high-order dimensions into one dimension by classifying multi-level genotypes into high- and low-risk groups. The MDR method has been extended to various phenotypes and has been improved to provide a significance test for gene-gene interactions. In this paper, we propose a simple method, called accelerated failure time (AFT) UM-MDR, in which the idea of a unified model-based MDR is extended to the survival phenotype by incorporating AFT-MDR into the classification step. The proposed AFT UM-MDR method is compared with AFT-MDR through simulation studies, and a short discussion is given.

EFMDR-Fast: An Application of Empirical Fuzzy Multifactor Dimensionality Reduction for Fast Execution

  • Leem, Sangseob;Park, Taesung
    • Genomics & Informatics
    • /
    • v.16 no.4
    • /
    • pp.37.1-37.3
    • /
    • 2018
  • Gene-gene interaction is a key factor for explaining missing heritability. Many methods have been proposed to identify gene-gene interactions. Multifactor dimensionality reduction (MDR) is a well-known method for the detection of gene-gene interactions by reduction from genotypes of single-nucleotide polymorphism combinations to a binary variable with a value of high risk or low risk. This method has been widely expanded to own a specific objective. Among those expansions, fuzzy-MDR uses the fuzzy set theory for the membership of high risk or low risk and increases the detection rates of gene-gene interactions. Fuzzy-MDR is expanded by a maximum likelihood estimator as a new membership function in empirical fuzzy MDR (EFMDR). However, EFMDR is relatively slow, because it is implemented by R script language. Therefore, in this study, we implemented EFMDR using RCPP ($c^{{+}{+}}$ package) for faster executions. Our implementation for faster EFMDR, called EMMDR-Fast, is about 800 times faster than EFMDR written by R script only.

Major SNP Marker Identification with MDR and CART Application

  • Lee, Jea-Young;Choi, Yu-Mi
    • Communications for Statistical Applications and Methods
    • /
    • v.15 no.2
    • /
    • pp.265-271
    • /
    • 2008
  • It is commonly believed that diseases of human or economic traits of livestock are caused not by single genes acting alone, but multiple genes interacting with one another. This issue is difficult due to the limitations of parametric-statistic methods of gene effects. So we introduce multifactor-dimensionality reduction(MDR) as a methods for reducing the dimensionality of multilocus information. The MDR method is nonparametric (i. e., no hypothesis about the value of a statistical parameter is made), model free (i. e., it assumes no particular inheritance model) and is directly applicable to case-control studies. Application of the MDR method revealed the best model with an interaction effect between the SNPs, SNP1 and SNP3, while only one main effect of SNP1 was statistically significant for LMA (p < 0.01) under a general linear mixed model.

Statistical Interaction for Major Gene Combinations (우수 유전자 조합 선별을 위한 통계적 상호작용 방법비교)

  • Lee, Jea-Young;Lee, Yong-Won;Choi, Young-Jin
    • The Korean Journal of Applied Statistics
    • /
    • v.23 no.4
    • /
    • pp.693-703
    • /
    • 2010
  • Diseases of human or economical traits of cattles are occured by interaction of genes. We introduce expanded multifactor dimensionality reduction(E-MDR), dummy multifactor dimensionality reduction(D-MDR) and SNPHarvester which are developed to find interaction of genes. We will select interaction of outstanding gene combinations and select final best genotype groups.

Power of Expanded Multifactor Dimensionality Reduction with CART Algorithm (CART 알고리즘을 활용한 확장된 다중인자 차원축소방법의 검정력 평가)

  • Lee, Jea-Young;Lee, Jong-Hyeong;Lee, Ho-Guen
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.5
    • /
    • pp.667-678
    • /
    • 2010
  • It is important to detect the gene-gene interaction in GWAS(Genome-Wide Association Study). There are many studies about detecting gene-gene interaction. The one is Multifactor dimensionality reduction method. But MDR method is not applied continuous data and expanded multifactor dimensionality reduction(E-MDR) method is suggested. The goal of this study is to evaluate the power of E-MDR for identifying gene-gene interaction by simulation. Also we applied the method on the identify interaction e ects of single nucleotid polymorphisms(SNPs) responsible for economic traits in a Korean cattle population (real data).

Multifactor Dimensionality Reduction(MDR) Analysis by Dummy Variables (더미(dummy) 변수를 활용한 다중인자 차원 축소(MDR) 방법)

  • Lee, Jea-Young;Lee, Ho-Guen
    • The Korean Journal of Applied Statistics
    • /
    • v.22 no.2
    • /
    • pp.435-442
    • /
    • 2009
  • Multiple genes interacting is a difficult due to the limitations of parametric statistical method like as logistic regression for detection of gene effects that are dependent solely on interactions with other genes and with environmental exposures. Multifactor dimensionality reduction(MDR) statistical method by dummy variables was applied to identify interaction effects of single nucleotide polymorphisms(SNPs) responsible for longissimus mulcle dorsi area(LMA), carcass cold weight(CWT) and average daily gain(ADG) in a Hanwoo beef cattle population.