Web Services Based Biological Data Analysis Tool

  • Kim, Min Kyung (Department of Computer Science and Engineering, Ewha University) ;
  • Choi, Yo Hahn (School of Computer Science and Engineering, Sejong University) ;
  • Yoo, Seong Joon (School of Computer Science and Engineering, Sejong University) ;
  • Park, Hyun Seok (Department of Computer Science and Engineering, Ewha University, Institute of Bioinformatics, Macrogen Inc.)
  • Published : 2004.09.01

Abstract

Biological data and analysis tools are accumulated in distributed databases and web servers. For this reason, biologists who want to find information from the web should be aware of the various kinds of resources where it is located and how it is retrieved. Integrating the data from heterogeneous biological resources will enable biologists to discover new knowledge across the specific domain boundaries from sequences to expression, structure, and pathway. And inevitably biological databases contain noisy data. Therefore, consensus among databases will confirm the reliability of its contents. We have developed WeSAT that integrates distributed and heterogeneous biological databases and analysis tools, providing through Web Services protocols. In WeSAT, biologists are retrieved specific entries in SWISS-PROT/EMBL, PDB, and KEGG, which have annotated information about sequence, structure, and pathway. And further analysis is carried by integrated services for example homology search and multiple alignments. WeSAT makes it possible to retrieve real time updated data and analysis from the scattered databases in a single platform through Web Services.

Keywords

References

  1. Benson, D.A., Karsch-Mizrachi, I., Lipman, D.J., Ostell, J., and Wheeler, D.L. (2004). GenBank: update. Nucleic Acids Res. 32, D23-26. https://doi.org/10.1093/nar/gkh045
  2. Cuff, J.A., Coates, G.M., Cutts, T.J., and Rae, M. (2004). The Ensembl computing architecture. Genome Res. 14, 971-975 https://doi.org/10.1101/gr.1866304
  3. FlyBase Consortium. (2003). The FlyBase database of the Drosophila genome projects and community literature. Nucleic Acids Res. 31, 172-175 https://doi.org/10.1093/nar/gkg094
  4. Harris, T.W., Chen, N., Cunningham, F., Tello-Ruiz, M., Antoshechkin, I., Bastiani, C., Bieri, T., Blasiar, D., Bradnam, K., Chan, J., Chen, C.K, Chen, W.J., Davis, P., Kenny, E., Kishore, R, Lawson, D., Lee, R., Muller, H.M., Nakamura, C., Ozersky, P., Petcherski, A, Rogers, A, Saba, A, Schwarz, E.M., Van Auken, K., Wang, Q., Durbin, R., Spieth, J., Sternberg, P.W., and Stein, L.D. (2004). WormBase: a multi-species resource for nematode biology and genomics. Nucleic Acids Res. 32,D411-417 https://doi.org/10.1093/nar/gkh066
  5. Kanehisa, M., Goto, S., Kawashima, S., Okuno, Y., and Hattori, M. (2004). The KEGG resource for deciphering the genome. Nucleic Acids Res. 32, 277-280 https://doi.org/10.1093/nar/gkh063
  6. Siepel, A, Farmer, A., Tolopko, A, Zhuang, M., Mendes, P., Beavis, W., and Sobral, B. (2001). ISYS: a decentralized, component-based approach to the integration of heterogeneous bioinformatics resources. Bioinformatics. 17,83-94 https://doi.org/10.1093/bioinformatics/17.1.83
  7. Stein, L. (2002). Creating a bioinformatics nation. Nature 417.119-120 https://doi.org/10.1038/417119a
  8. Stein, L.D. (2003). Integrating biological databases. Nat. Rev. Genet. 4, 337-345
  9. Stevens, R.D., Robinson, A. J., and Goble, C.A. (2003). myGrid: personalised bioinformatics on the information grid. Bioinformatics 19, i302-304 https://doi.org/10.1093/bioinformatics/btg1041
  10. Sugawara, H. and Miyazaki, S. (2003). Biological SOAP servers and web services provided by the public sequence data bank. Nucleic Acids Res. 31, 3836-3839 https://doi.org/10.1093/nar/gkg558
  11. Westbrook, J., Feng, Z., Chen, L., Yang, H., and Berman, H.M. (2003). The Protein Data Bank and structural genomics. Nucleic Acids Res. 31, 489-491 https://doi.org/10.1093/nar/gkg068
  12. Wilkinson, M.D. and Links, M. (2002) BioMOBY: an open source biological web services proposal. Brief Bioinform. 3,331-341 https://doi.org/10.1093/bib/3.4.331
  13. Wren, J.D. (2004). 404 not found: the stability and persistence of URLs published in MEDLINE. Bioinformatics 20,668-672 https://doi.org/10.1093/bioinformatics/btg465