DOI QR코드

DOI QR Code

A Study of Flexible Protein Structure Alignment Using Three Dimensional Local Similarities

단백질 3차원 구조의 지역적 유사성을 이용한 Flexible 단백질 구조 정렬에 관한 연구

  • 박찬용 (한국전자통신연구원 라이프인포매틱스팀) ;
  • 황치정 (충남대학교 전기정보통신공학부)
  • Published : 2009.10.31

Abstract

Analysis of 3-dimensional (3D) protein structure plays an important role of structural bioinformatics. The protein structure alignment is the main subjects of the structural bioinformatics and the most fundamental problem. Protein Structures are flexible and undergo structural changes as part of their function, and most existing protein structure comparison methods treat them as rigid bodies, which may lead to incorrect alignment. We present a new method that carries out the flexible structure alignment by means of finding SSPs(Similar Substructure Pairs) and flexible points of the protein. In order to find SSPs, we encode the coordinates of atoms in the backbone of protein into RDA(Relative Direction Angle) using local similarity of protein structure. We connect the SSPs with Floyd-Warshall algorithm and make compatible SSPs. We compare the two compatible SSPs and find optimal flexible point in the protein. On our well defined performance experiment, 68 benchmark data set is used and our method is better than three widely used methods (DALI, CE, FATCAT) in terms of alignment accuracy.

구조적 생물 정보학 분야는 단백질의 3차원 구조를 대상으로 단백질을 연구하는 분야이며, 본 논문에서는 구조적 생물 정보학 분야의 핵심 연구 주제중의 하나인 Flexible 단백질 구조 정렬에 관한 새로운 알고리즘을 제시한다. Flexible 단백질 구조 정렬을 위하여, 단백질의 3차원 구조의 지역적인 유사성을 이용하여 두 단백질의 유사한 부분 구조를 추출해 내고, 이 추출된 유사 구조간에 연결 가능성을 검색하여 정렬이 가능한 모든 유사 구조를 찾고, 이 유사 구조에 꺽임점을 도입하여 Flexible 단백질 구조 정렬을 수행하였다. 이 과정에서 단백질의 지역적 유사성을 정확히 비교하기 위하여 RDA를 이용한 방법을 제안하였고, Flexible 단백질 구조 정렬시 신뢰성 있는 꺽임점 위치 선정 방법과 그래프를 이용한 최적화 방법을 제안하였다. 성능 평가를 위하여 다양한 방법으로 Flexible 단백질 구조 정렬의 성능 평가를 수행하였고, 기존의 방법인 DALI, CE, FATCAT 보다 성능의 우수함을 나타내었다.

Keywords

References

  1. J. W. Kimball. Biology. Wm. C. Brown Publishers, 6th edition, 1994.
  2. 박찬용, 황치정. "기하인스턴싱 기법을 이용한 단백질 구조 가 시 및 속도 향상에 관한 연구," 정보처리논문지, 제16-A권 제3 호, pp.153-158, 2009.
  3. K. Arun, T. Huang, and S. Blostein, "Least-squares fitting of two 3-D point sets," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.9, No.5, pp.698-700, 1987. https://doi.org/10.1109/TPAMI.1987.4767965
  4. L. Holm and C. Sander, "3-D lookup: fast protein structure database searches at 90% reliability," In Proceedings of 3rd International Conference on Intelligent Systems for Molecular Biology (ISMB'95), pp.179-187, 1995.
  5. L. Holm and C. Sander, "Protein structure comparison by alignment of distance matrices," Journal of Molecular Biology, Vol.233, pp.123-138, 1993. https://doi.org/10.1006/jmbi.1993.1489
  6. I. N. Shindyalov and P. E. Bourne, "Protein structure alignment by incremental combinatorial extension (CE) of the optimal path," Protein Engineering, Vol.11, No.9, pp.739-747, 1998. https://doi.org/10.1093/protein/11.9.739
  7. J. F. Gibrat, T. Madej, and H. Bryant, "Surprising similarities in structure comparison," Current Opinion in Structural Biology, Vol.6, pp.377-385, 1996. https://doi.org/10.1016/S0959-440X(96)80058-3
  8. W. Taylor and C. Orengo, "Protein structure alignment," Journal of Molecular Biology, Vol.208, pp.1-22, 1989. https://doi.org/10.1016/0022-2836(89)90084-3
  9. M. Gerstein, A.M. Lesk and C. Chothia, "Structural mechanisms for domain movements in proteins," Biochemistry Vol.33, pp.6739-6749, 1994. https://doi.org/10.1021/bi00188a001
  10. M. Shatsky, H.J. Wolfson, and R. Nussinov, "Flexible protein alignment and hinge detection," Proteins: Structure, Function, and Genetics Vol.48, pp.242-256, 2002. https://doi.org/10.1002/prot.10100
  11. Ye Yuzhen and Adam Godzik, "Flexible structure alignment by chaining aligned fragment pairs allowing twists," Bioinformatics Vol.19, suppl.2, pp.246-255, 2003.
  12. M. L. Sierk and W. R. Pearson, "Sensitivity and selectivity in protein structure comparison," Protein Science, 13, pp. 773-785, 2004. https://doi.org/10.1110/ps.03328504
  13. Y. Yuzhen and A. Godzik, "Database searching by flexible protein structure alignment," Protein Science, Vol.13, No.7, pp.1841-1850, 2004. https://doi.org/10.1110/ps.03602304
  14. T. Holton, T. R. Ioerger, J. A. Christopher and J. C. Sacchettini, "Determining protein structure from electron-density maps using pattern matching," Acta Cryst. D56, pp.722-734, 2000. https://doi.org/10.1107/S0907444900003450
  15. E. W. Dijkstra, "A note on two problems in connection with graphs," Numerische Math. Vol.1, pp.269-271, 1959. https://doi.org/10.1007/BF01386390
  16. R. Floyd, "Algorithm 97 (shortest path)," Commun. ACM, Vol.5, No.6, pp.345, 1962. https://doi.org/10.1145/367766.368168
  17. P. Koehl, "Protein structure similarities," Current Opinion in Structural Biology, Vol.11, pp.348-353, 2001. https://doi.org/10.1016/S0959-440X(00)00214-1
  18. N. N. Alexandrov and D. Fischer, "Analysis of topological and nontopological structural similarities in the PDB: new examples with old structures," Proteins: Structure, Function, and Genetics, Vol.25, No.3, pp.354-365, 1996. https://doi.org/10.1002/(SICI)1097-0134(199607)25:3<354::AID-PROT7>3.3.CO;2-W
  19. G. J. Kleywegt and A. Jones, "Superposition," CCP4/ESFEACBM Newsletter on Protein Crystallography, Vol.31, pp. 9-14, 1994.
  20. S. Subbiah, D. V. Laurents and M. Levitt, "Structural similarity of DNA-binding domains of bacteriophage repressors and the globin core," Current Biology, Vol.3, pp.141-148, 1993. https://doi.org/10.1016/0960-9822(93)90255-M
  21. D. Fischer, A. Elofsson, D. Rice, and D. Eisenberg, "Assessing the performance of fold recognition methods by means of a comprehensive benchmark," In Proceedings of 1996 Pacific Symposium on Biocomputing (PSB'96), pp.300-318, 1996.
  22. M. Novotny, D. Madsen, and G. J. Kleywegt, "Evaluation of protein fold comparison servers," Proteins: Structure, Function and Bioinformatics, Vol.54, pp.260-270, 2004. https://doi.org/10.1002/prot.10553
  23. M. L. Sierk and W. R. Pearson, "Sensitivity and selectivity in protein structure comparison," Protein Science, Vol.13, pp. 773-785, 2004. https://doi.org/10.1110/ps.03328504