Browse > Article

Identifying Compound Risk Factors of Disease by Evolutionary Learning of SNP Combinatorial Features  

Rhee, Je-Keun (서울대학교 생물정보학 협동과정)
Ha, Jung-Woo (서울대학교 컴퓨터공학부)
Bae, Seol-Hui (질병관리본부 국립보건연구원 유전체센터 바이오과학정보과)
Kim, Soo-Jin (서울대학교 컴퓨터공학부 협동과정)
Lee, Min-Su (서울대학교 컴퓨터공학부)
Park, Keun-Joon (질병관리본부 국립보건연구원 유전체센터 바이오과학정보과)
Zhang, Byoung-Tak (서울대학교 컴퓨터공학부)
Abstract
Most diseases are caused by complex processes of various factors. Although previous researches have tried to identify the causes of the disease, there are still lots of limitations to clarify the complex factors. Here, we present a disease classification model based on an evolutionary learning approach of combinatorial features using the data sets from the genetics and cohort studies. We implemented a system for finding the combinatorial risk factors and visualizing the results. Our results show that the proposed method not only improves classification accuracy but also identifies biologically meaningful sets of risk factors.
Keywords
Machine Learning; Evolutionary Learning; Bioinformatics; Feature Selection;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 R. Sladek, et al., A genome-wide association study identifies novel risk loci for type 2 diabetes, Nature, vol.445, pp.881-885, 2007   DOI   ScienceOn
2 J. Lee, et al., Analysis of asthma related SNP genotype data using normalized mutual inform-ation and support vector machines, Journal of KIISE : Software and Applications, vol.36, no.9, pp.691-696, Sept. 2009 (in Korean).   과학기술학회마을   ScienceOn
3 KEGG: Kyoto Encyclopedia of Genes and Genomes, http://www.genome.jp/kegg/
4 F. S. Collins, et al. The Human Genome Project: Lessons from Large-Scale Biology, Science, vol.300, pp.286-290, 2003   DOI   ScienceOn
5 S. Kim, et al., Text classifiers evolved on a simulated DNA computer, IEEE Congress on Evo-lutionary Computation (CEC 2006), pp.9196-9202, 2006   DOI
6 S. Kim, et al., Evolving hypernetwork classifiers for microRNA expression profile analysis, IEEE Congress on Evolutionary Computation (CEC 2007), pp.313-319, 2007   DOI
7 C.-H. Park, et al., Use of evolutionary hypernet-works for mining prostate cancer data, 8th Inter-national Symposium on advanced Intelligent Sys-tems (ISIS 2007), pp.702-706, 2007
8 N. Tiffin, et al., Computational disease gene iden-tification: a concert of methods prioritizes type 2 diabetes arid obesity candidate genes, Nucleic Acids Res., vol.34, no.10, pp.3067-3081, 2006   DOI   ScienceOn
9 C.-H. Park, et al., Finding cancer-related gene combinations using a molecular evolutionary algo-rithm, IEEE 7th international conference on Bio-inforrmatics & BioEngineering (BIBE 2007), pp.158-163, 2007   DOI
10 J.-K. Kim, B.-T. Zhang. Evolving hypernetworks for pattern classification, IEEE Congress on Evo-lutionary Computation (CEC 2007), pp.1856-1862, 2007   DOI
11 K. Bhasi, et al., Information-theoretic identification of predictive SNPs and supervised visualization of genome-wide association studies, Nucleic Acids Res., vol.34, no.14, pp.e101, 2006   DOI   ScienceOn
12 B.-T. Zhang, "Hypernetworks: A molecular evolu-tionary architecture for cognitive learning and memory," IEEE Computational Intelligence Magazine, vol.3, no.3, pp.49-63, 2008   DOI   ScienceOn
13 M. A. Hall, Correlation-based feature selection of discrete and numeric class machine learning. International Conference on Machine Learning, pp.359-366, 2000