Deciphering FEATURE for Novel Protein Data Analysis and Functional Annotation

단백질 구조 및 기능 분석을 위한 FEATURE 시스템 개선

  • 유승학 (고려대학교 전기전자전파공학부) ;
  • 윤성로 (고려대학교 전기전자전파공학부)
  • Published : 2009.09.30

Abstract

FEATURE is a computational method to recognize functional and structural sites for automatic protein function prediction. By profiling physicochemical properties around residues, FEATURE can characterize and predict functional and structural sites in 3D protein structures in a high-throughput manner. Despite its effectiveness, it has been challenging to apply FEATURE to novel protein data due to limited customization support. To address this problem, we thoroughly analyze the internal modules of FEATURE and propose a methodology to customize FEATURE so that it can be used for new protein data for automatic functional annotations.

FEATURE는 단백질 내에서 특정 기능이나 구조를 가지고 있는 site의 미세환경분포를 이용하여 다른 단백질 내에서 이와 유사한 미세환경을 가지고 있는 부분을 찾아 그 분분이 site일 확률을 수치적으로 제시해 줌으로써 사용자로 하여금 site의 존재 유무와 그 위치를 판단하는데 기준을 제공해주는 유용한 툴이다. 하지만 기존의 FEATURE에서 사용된 데이터 이외의 새로운 단백질 구조 데이터를 FEATURE에 적용하기 위해서는 FEATURE 내부의 module을 입력 데이터 구조에 맞게 수정해야 한다. 그러나 FEATURE 내부의 module 구조를 수정하는 방식이 직관적이지 않기 때문에 많은 연구자들이 FEATURE를 원활하게 사용하지 못하였다. 따라서 본 논문에서는 FEATURE의 내부 구조를 분석하고 FEATURE를 새로운 단백질 데이터에 적용하기 위한 방법을 제시한다.

Keywords

References

  1. Sungroh Yoon, Jessica C. Ebert, Eui-Young Chung, Giovanni De Micheli and Russ B. Altman " Clustering protein environments for function prediction: finding PROSITE motifs in 3D" BMC(BioMedCentral) Bioinformatics 8(Suppl 4):S10, 2007.
  2. Liping Wei and Russ B. Altman, "Recognizing complex, asymmetric functional sites in protein structures using a bayesian scoring function," Journal of Bioinformatics and Computational Biology Vol. 1, pp. 119-138, 2003. https://doi.org/10.1142/S0219720003000150
  3. Steven C. Bagley, Liping Wei, Carol Cheon, and Russ B. Altman "Characterizing oriented protein structural sites using biochemical properties" Proc Int Conf Intell Syst Mol Biol. 3, pp. 12-20, 1995.
  4. Wallace, A.C., N.Borkakoti, and J.M. Thornton, "TESS: a geometric hashing algorithm for deriving 3D coordinate templates for searching structural databases. Application to enzyme active sites" Protein Sci. 6, 11(1997), pp. 2308-2323 , 1997. https://doi.org/10.1002/pro.5560061104
  5. Wallace, A.C., R.A. Laskowski, and J.M. Thornton, "Derivation of 3D coordinate templates for searching structural databases: application to Ser-His-Asp catalytic triads in the serine proteinases and lipases" Protein Sci. 5, 6(1996), pp. 1001-1013, 1996. https://doi.org/10.1002/pro.5560050603
  6. Fetrow, J.S. and J. Skolnick, "Method for prediction of protein function from sequence using the sequence-to structure-to function paradigm with application to glutaredoxins/thioredoxins and T1 ribonucleases" J Mol Biol. 281, 5(1998) pp. 949- 968, 1998. https://doi.org/10.1006/jmbi.1998.1993
  7. Fetrow, J.S., A. Godzik, and J. Skolnick, "Functional analysis of the Escherichia coli genome using the sequence- to structure-to-funtion paradigm: identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity" J Mol Biol. 282, 4(1998), pp. 703-711, 1998. https://doi.org/10.1006/jmbi.1998.2061
  8. Inbal Haplperin, Dariya S Glazer, Shirely Wu and Russ B Altman "The FEATURE framework for protein function annotation: modelling new functions, improving performance, and extending to novel applications" BMC Genomics 9(Suppl 2):52, 2008. https://doi.org/10.1186/1471-2164-9-52
  9. M.P. Liang, D.L. Brutlag, R.B. Altman, "Automated construction of structural motifs for predicting functional sites on protein structures," The Pac Symp Biocomput. pp. 204-215, 2003.
  10. Liping Wei, Russ B. Altman, Jeffrey T. Chang "Using the radial distributions of physical features to compare amino acid environments and align amino acid sequences" Pac Symp Biocomput. pp. 465-76, 1997.
  11. Liping Wei and Russ B. Altman "Recognizing protein binding sites using statistical descriptions of their 3D environments" Pac Symp Biocomput. pp. 497-508, 1998.