일배체형에 기초한 연쇄분석의 통계학적 알고리즘 연구

Statistical Algorithm in Genetic Linkage Based on Haplotypes

  • 김진흠 (수원대학교 통계정보학과) ;
  • 강대룡 (연세대학교 의과대학 예방의학교실) ;
  • 이윤경 (연세대학교 대학원 보건학과) ;
  • 신선미 (연세대학교 대학원 보건학과) ;
  • 서일 (연세대학교 의과대학 예방의학교실) ;
  • 남정모 (연세대학교 의과대학 예방의학교실)
  • Kim, Jin-Heum (Department of Applied Statistics, University of Suwon) ;
  • Kang, Dae-Ryong (Department of Preventive Medicine and Public Health, Yonsei University College of Medicine) ;
  • Lee, Yun-Kyung (Graduate School of Public Health, Yonsei University) ;
  • Shin, Sun-Mi (Graduate School of Public Health, Yonsei University) ;
  • Suh, Il (Department of Preventive Medicine and Public Health, Yonsei University College of Medicine) ;
  • Nam, Chung-Mo (Department of Preventive Medicine and Public Health, Yonsei University College of Medicine)
  • 발행 : 2004.12.01

초록

Objectives : This study was conducted to propose a new transmission/disequilibrium test(TDT) to test the linkage between genetic markers and disease-susceptibility genes based on haplotypes. Simulation studies were performed to compare the proposed method with that of Zhao et al. in terms of type I error probability and powers. Methods : We estimated the haplotype frequencies using the expectation-maximization(EM) algorithm with parents genotypes taken from a trio dataset, and then constructed a two-way contingency table containing estimated frequencies to all possible pairs of parents haplotypes. We proposed a score test based on differences between column marginals and their corresponding row marginals. The test also involved a covariance structure of marginal differences and their variances. In simulation, we considered a coalescent model with three genetic markers of biallele to investigate the performance of the proposed test under six different configurations. Results : The haplotype-based TDT statistics, our test and Zhao et al.'s test satisfied a type I error probability, but the TDT test based on single locus showed a conservative trend. As expected, the tests based on haplotypes also had better powers than those based on single locus. Our test and that of Zhao et al. were comparable in powers. Conclusion : We proposed a TDT statistic based on haplotypes and showed through simulations that our test was more powerful than the single locus-based test. We will extend our method to multiplex data with affected and/or unaffected sibling(s) or simplex data having only one parent s genotype.

키워드

참고문헌

  1. Spielman RS, McGinnis RE, Ewens W. Transmission test for linkage disequilibriwn: the insulin gene region and insulin-dependent diabetes mellitus (IDDM). Am J Hum Genet 1993; 52: 506-516
  2. Sclnlze TG, McMahon F. Genetic associalion mapping at the crossroads: Which test and why? Overview and practical guidelines, Am J Med Genet 2002; 114: 1-11 https://doi.org/10.1002/ajmg.10042
  3. Kang D, Lee KM, Current status of genomic epidemiology research, Korean J Prev Med 2003; 36: 213-222 (Korean)
  4. Gabriel SB, Schaffner SF, Nguyen H, Moore 1M, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, La-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D. The stucure of haplotype blocks in the human genome. Science 2002; 296: 2225-2229 https://doi.org/10.1126/science.1069424
  5. The international HapMap Consortium. The international HapMap project. Nature 2003; 426: 789-796 https://doi.org/10.1038/nature02168
  6. Jorde LB, Linkage disequilibrium as a genetic mapping tool. Am J Hum Genet 1995; 56:11-14
  7. Keavney B. Genetic epidemiological studies of coronary heart disease, Int J Epidemiol 2002; 31: 730-736 https://doi.org/10.1093/ije/31.4.730
  8. Daly MJ, Rioux JD, Schaffner SF, Hudson TJ, Lander ES, High-resolution haplotype structure in thehuman genome, Nat Genet 2001; 29: 229-232
  9. Clark AG, Inference of heploypes from PCR-arrplified surples of diploid population, Mol Biol Evol 1990; 7: 111-122
  10. org JC, Williams RC, UrbanekM. An E-M algorithm and testing strategy for multiplelocus haplotypes. Am J Hum Genet 1995; 56: 799-810
  11. Stephens M, Smith NJ, Donnelly. A new statistical rrethcd for haplotype reconstruction from pcpulation data, AmJ Hwn Genet 2001; 68: 978-989
  12. Niu T, Qin ZS, Xu X, Liu JS, Bayesian haplotype inference for multiple linked single nucleotide polymorphsms. Am J Hwn Genet 2002; 70: 157-169
  13. Zhao JH, Curtis D, Sham PC. Model-free analysis and permutation tests for allelic associalions. Hum Hered 2000; 50: 133-139 https://doi.org/10.1159/000022901
  14. Wilson SR. On extending the transmission/disequlibrium test(TDT). Ann Human Genet 1997;61: 151-161 https://doi.org/10.1017/S0003480097006040
  15. Clayton D, Jones H. Transmission/disequilibrium tests for extended marker haplotypes. Am J Hum Genet 1999; 65: 1161-1169 https://doi.org/10.1086/302566
  16. Clayton D. A generalization of the transmission/disequilibrium test(TDT) for uncertain haplotype transmission. Am J Hum Genet 1999; 65: 1170-1177 https://doi.org/10.1086/302577
  17. Zhao H, Zhang S, Merikangas KR, Trixler M, Wildenaauer DB, Sun F, Kidd KK, Transmission/disequilibrium tests using multiple tightly linked markers, Am J Hum Genet 2000;67: 936-946
  18. Spielman RS, Ewens WJ. The TDT and other family-based tests for linkage disequilibrium and association. Am J Hum Genet 1996; 59: 983-989
  19. Stuart A. A test for homogeneity of the marginal distributions in a two-way classification. Biometrika 1955; 42: 412-416 https://doi.org/10.1093/biomet/42.3-4.412
  20. Cardon LR, Abecasis GR. Using haplotype blocks to map human complex trait loci. Trends in Genetics 2003; 19: 135-140 https://doi.org/10.1016/S0168-9525(03)00022-2
  21. Terwilliger JD, Weiss KM. Linkage disequilibrim mapping of complex disease: fantasy or reality? Curr Opin Biotechnol 1998; 9: 578-594 https://doi.org/10.1016/S0958-1669(98)80135-3
  22. Rosenberg NA, Nordborg M, Genealogical trees, coalescent theory andthe analysis of genetic polymorphisms. Nature Rev Genet 2002; 3: 380-390
  23. Suh I. Genetic epidemiology study on the long term change of blood pressure and the incidence of hypertension. Korean Research Foundation; 2004 (Korean)