Genome Scale Protein Secondary Structure Prediction Using a Data Distribution on a Grid Computing

  • Cho, Min-Kyu (HTSD-NMR Laboratory, Department of Biochemistry, Yonsei University) ;
  • Lee, Soojin (Distributed and Mobile Computing Laboratory Graduate School of Information and Communication, Ajou University) ;
  • Jung, Jin-Won (HTSD-NMR Laboratory, Department of Biochemistry, Yonsei University) ;
  • Kim, Jai-Hoon (Distributed and Mobile Computing Laboratory Graduate School of Information and Communication, Ajou University) ;
  • Lee, Weontae (HTSD-NMR Laboratory, Department of Biochemistry, Yonsei University)
  • 발행 : 2003.06.01

초록

After many genome projects, algorithms and software to process explosively growing biological information have been developed. To process huge amount of biological information, high performance computing equipments are essential. If we use the remote resources such as computing power, storages etc., through a Grid to share the resources in the Internet environment, we will be able to obtain great efficiency to process data at a low cost. Here we present the performance improvement of the protein secondary structure prediction (PSIPred) by using the Grid platform, distributing protein sequence data on the Grid where each computer node analyzes its own part of protein sequence data to speed up the structure prediction. On the Grid, genome scale secondary structure prediction for Mycoplasma genitalium, Escherichia coli, Helicobacter pylori, Saccharomyces cerevisiae and Caenorhabditis slogans were performed and analyzed by a statistical way to show the protein structural deviation and comparison between the genomes. Experimental results show that the Grid is a viable platform to speed up the protein structure prediction and from the predicted structures.

키워드