Browse > Article
http://dx.doi.org/10.3745/KIPSTD.2003.10D.6.897

The Training Data Generation and a Technique of Phylogenetic Tree Generation using Decision Tree  

Chae, Deok-Jin (전남대학교 대학원 전산학과)
Sin, Ye-Ho (극동대학교 정보통신학부)
Cheon, Tae-Yeong (충북대학교 생물학과)
Go, Heung-Seon (충북대학교 생물학과)
Ryu, Geun-Ho (충북대학교 전기전자 및 컴퓨터공학부)
Hwang, Bu-Hyeon (전남대학교 전산학과)
Abstract
The traditional animal phylogenetic tree is to align the body structure of the animal phylums from simple to complex based on the initial development character. Currently, molecular systematics research based on the molecular, it is on the fly, is again estimating prior trend and show the new genealogy and interest of the evolution. In this paper, we generate the training set which is obtained from a DNA sequence ans apply to the classification. We made use of the mitochondrial DNA for the experiment, and then proved the accuracy using the MEGA program which is anaysis program, it is used in the biology field. Although the result of the mining has to proved through biological experiment, it can provede the methodology for the efficient classify and can reduce the time and effort to the experiment.
Keywords
Data Mining; Classification; Phylogenetic Tree; Molecular Systematics;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. D. Kocher, W. K. Thomas, A. Meyer, S. V. Edwards, S. Paabo, F. X. Villablanca and A. C. Wilson, 'Dynamics of mitochondrial DNA evolution in animals: amplification and sequencing with conserved primers,' Proc. Natl. Acad. Sci. USA. 16, pp.6196-6200, 1989
2 Usama Fayadd, Gregory Piatetsky-Shapiro and Padhraic Smyth, Chapter 1 From Data Mining to Knowledge Discovery : An Overview, Advances in Knowledge Discovery and Data Mining, AAAI Press, pp.1-34, 1996
3 J L. Boore, D. V. Lavrov and W. M. Brown, 'Gene translocation links insects and crustaceans,' Nature 392, pp, 667-668, 1998   DOI   ScienceOn
4 Manish Mehta, Rakesh Agarawal, and Jorma Rissanen, 'SLIQ : A Fast Scalable Classifier for Data Mining,' EDBT 96, Avignon, France, March, 1996
5 Peiter Adriaans and Dolf Zantinge, 'Data Mining,' Addition Wesley, 1996
6 P. S. Bradley, U. M. Fayyad and O. L. Mangasarian, 'Data Mining: Overview and Optimization Opportunities,' http:// elib.stanford.edu, Technical Report MP-TR-98-01, 1998
7 M. Rebhan, V. Chalifa-Caspi, J Prilusky and D. Lancet, 'GeneCards : a novel functional genornics compendium with automated data mining and query reformulation support,' Bioinformatics, Vol.14, No.8, pp.656-664, 1998   DOI   ScienceOn
8 R. J. Hilderman, H. J. Hamilton and N. Cercone, 'Data Mining in Large Databases Using Domain Generalization Graphs,' Dept. of CS, Univ. of Regina, submitted for publication, 1998
9 T. Zhang, 'Data Clustering for Very Large Datasets Plus Applications,' A dissertation submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy, Dept. of CS, Univ of Wisconsin, 1997
10 J. Setubal and Meidanis J. 'Introduction to Computational Molecular Biology,' MA : PWS Publishing Company, Boston, 1997
11 정재훈 '생물정보학과 인터넷 자원', 한국유전학회, 유전 제3권, pp.176-200, 2000
12 김기중, 분자생물학적 자료와 계통수 제작, 한국유전학회, 유전 제3권, pp.259-271, 2000
13 RuleQuest Research Data Mining Tools, See5, http://www. relequest.comlindex.html
14 M. Mehta, J Rissanen and R. Agrawal, 'MDL-based De cision Tree Pruning,' Proc. of the 1st Int'l Conference on Knowledge Discovery in Databases and Data Mining, Montrcal, Canada, August, 1995
15 P. Adriaans and D. Zanting, 'Data Mining,' Addison Wesley, 1996
16 Alvis Brazma, Inge Jonassen, Ingvar Eidhammer and David Gilbert, 'Approaches to the automatic discovery of pattern biosequences,' The Journal of Computational Biology, November, 1997
17 J. L. Boore, T. M. Collins, D. Stanton, L. L. Daehler and W. M. Brown, 'Deducing the patterns of arthropod phylogeny from mitochondrial DNA rearrangements,' Nature 376, pp.163-165, 1995   DOI   ScienceOn