Identification of Conserved Protein Domain Combination based on Association Rule  

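Jung, Suk-Hoon (Department of Information and Communications Engineering, Korea Advanced Institute of Science and Technology (KAIST))
Jang, Woo-Hyuk (Department of Information and Communications Engineering, Korea Advanced Institute of Science and Technology (KAIST))
Han, Dong-Soo (Department of Computer Science, Korea Advanced Institute of Science and Technology (KAIST))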
Abstract
A protein domain is a conserved unit of compact three-dimensional structure and of evolution that carries a specific function. Because domains have been conserved through evolution to form functional proteins, they may appear in proteins in recurring patterns. In this paper, we propose a formalized method, based on association rules, for analyzing the conservation of domain combinations. In addition to the co-occurrence frequency that is conventionally used, the proposed method measures the mutual dependency of the domains in a combination. Using this method, we extracted conserved domain combinations from S. cerevisiae proteins and analyzed their functions using Gene Ontology. The results indicate that domains in S. cerevisiae proteins form patterns whose members are strongly associated with one another, and that the extracted patterns tend to be associated with molecular function. Moreover, the results show that the proposed method is superior to conventional ones for identifying domain combinations conserved for functional cooperation.
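The abstract does not give the exact formulas, so the following is only a minimal sketch of the general idea, under stated assumptions. Co-occurrence frequency is taken to be the association-rule support of a domain combination, and mutual dependency is illustrated with all-confidence, a standard association-rule interest measure that is swapped in here for illustration and may differ from the measure used in the paper. The function name `domain_combination_scores`, the protein identifiers, and the domain labels are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def domain_combination_scores(proteins, size=2, min_support=0.0):
    """Score domain combinations by support and all-confidence.

    proteins: dict mapping a protein id to the set of domain ids it contains.
    Returns a dict mapping each combination (a sorted tuple of domain ids)
    to a (support, all_confidence) pair.

    Assumption: all-confidence stands in for "mutual dependency"; the
    paper's actual measure is not given in the abstract.
    """
    n = len(proteins)
    single_counts = Counter()  # number of proteins containing each domain
    combo_counts = Counter()   # number of proteins containing each combination

    for domains in proteins.values():
        domains = set(domains)
        single_counts.update(domains)
        combo_counts.update(combinations(sorted(domains), size))

    scores = {}
    for combo, count in combo_counts.items():
        support = count / n  # co-occurrence frequency
        if support < min_support:
            continue
        # all-confidence: co-occurrence count divided by the count of the
        # most frequent member, i.e. the weakest rule confidence in the set
        all_conf = count / max(single_counts[d] for d in combo)
        scores[combo] = (support, all_conf)
    return scores


# Hypothetical toy data: domains A and B always occur together, while
# domain C also occurs in many proteins without D.
toy = {
    "P1": {"A", "B"},
    "P2": {"A", "B"},
    "P3": {"C", "D"},
    "P4": {"C", "D"},
    "P5": {"C"},
    "P6": {"C"},
    "P7": {"C"},
    "P8": {"C"},
}

scores = domain_combination_scores(toy)
for combo, (support, all_conf) in sorted(scores.items()):
    print(combo, round(support, 3), round(all_conf, 3))
```

In this toy data the pairs (A, B) and (C, D) have the same support, 0.25, so frequency alone cannot separate them. Their all-confidence differs (1.0 versus 1/3), which is the kind of distinction a mutual-dependency measure is meant to capture.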
Keywords
protein domain; domain combination; conserved domain combination; association rule