Browse > Article
http://dx.doi.org/10.7472/jksii.2014.15.1.29

A Framework of Intelligent Middleware for DNA Sequence Analysis in Cloud Computing Environment  

Oh, Junseok (Communications Policy Research Center, Yonsei University)
Lee, Yoonjae (Graduate School of Information, Yonsei University)
Lee, Bong Gyou (Graduate School of Information, Yonsei University)
Publication Information
Journal of Internet Computing and Services / v.15, no.1, 2014 , pp. 29-43 More about this Journal
Abstract
The development of NGS technologies, such as scientific workflows, has reduced the time required for decoding DNA sequences. Although the automated technologies change the genome sequence analysis environment, limited computing resources still pose problems for the analysis. Most scientific workflow systems are pre-built platforms and are highly complex because a lot of the functions are implemented into one system platform. It is also difficult to apply components of pre-built systems to a new system in the cloud environment. Cloud computing technologies can be applied to the systems to reduce analysis time and enable simultaneous analysis of massive DNA sequence data. Web service techniques are also introduced for improving the interoperability between DNA sequence analysis systems. The workflow-based middleware, which supports Web services, DBMS, and cloud computing, is proposed in this paper for expecting to reduceanalysis time and aiding lightweight virtual instances. It uses DBMS for managing the pipeline status and supporting the creation of lightweight virtual instances in the cloud environment. Also, the RESTful Web services with simple URI and XML contents are applied for improving the interoperability. The performance test of the system needs to be conducted by comparing results other developed DNA analysis services at the stabilization stage.
Keywords
Intelligent middleware; Cloud environment; DNA sequence analysis; Database management; RESTful web services;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Genome Informatics System, http://gisys.kr, 2012.
2 J. H. Song, K. H. Kim, "An XPDL-based Workflow Model Analyzer," Review of Korean Society for Internet Information, Vol. 11, No. 6, pp.145-157, 2010.   과학기술학회마을
3 A. Kaya, "Workflow Interoperability: The WfMC Reference Model and an Imple-mentation," Master Thesis, Technical University Hamburg-Harburg, Germany, 2001.
4 D. Hoolingsworth, "The Workflow Reference Model: 10 Years On," Fujitsu Services, United Kingdom, pp. 295-312, 2004.
5 S. Kim, K. Yoon and Y. Kim, "A Design of Integrated Scientific Workflow Execution Environment for A Computational Scientific Application," Reviewof Korean Society for Internet Information, Vol. 13, No. 1, pp.37-44, 2012.   과학기술학회마을   DOI
6 B. Ludascher, I. Altintas, C. Berkley, D. Higgins, E. Jaeger, M. Hones, E. A. Lee, J. Tao, and Y. Zhao, "Scientific Workflow Management and the KEPLER System," Concurrency and Computation: Practice and Experience, Vol. 18, No. 10, pp. 1039-1065, 2006..   DOI
7 A. Barker and J. V. Hemert, "Scientific Workflow: A Survey and Research Direc-tions," in Proceedings of the 7th international conference on Parallel processing and applied mathematics, pp. 746-753, 2007.
8 Y. Hahn and S. Lee, "Bioworks, A scientific Workflow Platform for Problem Solving in Biological Domain," in Proceedings of the 5th KOCON Conference, 2007.
9 Kepler Project, https://kepler-project.org, 2009.
10 T. Oinn, M. Addis, J. Ferris, D. Marvin, M. Senger, M. Greenwood, T. Carver, K. Glover, M. R. Pocock, A. Wipat, and P. Li, "Taverna: a Tool for the Composition and Enactment of Bioinformatics Workflows," Bioinformatics, Vol. 20, No. 17, pp. 3045-3054, 2004.   DOI
11 T. Oinn, M. Addis, J. Ferris, D. Marvin, M. Greenwood, C. Goble, A. Wipat, P. Li, and T. Carver, "Delivering Web Service Coordination Capability to Users," in Proceedings of the 13th international World Wide Web conference, pp. 438-439, 2004.
12 Bioinformatics Workflow Builder Interface, http://www.alphaworks.ibm
13 S. Lee, T. D. Wang, N. Hashmi and M. P. Cummings, Bio-STEER: A Semantic Web Workflow Tool for Grid Computing in the Life Science. Future Generation Computer Systems, 23, 3 (2007)
14 I. Taylor, M. Shields, I. Wang, and R. Philp, "Distributed P2P Computing within Triana: A Galaxy Visualization Test Case," in Proceedings of the 17th International Parallel and Distributed Processing Symposium, 2003.
15 L.D. Stein, "The case for cloud computing in genome informatics," Stein Genome Biology, Vol. 11, p. 207, 2010.   DOI
16 G. Allen, K. Davis, K. N. Dolkas, N. D. Doulamis, T. Goodale, T. Kielmann, A. Merzky, J. Nabrzyski, J. Pukacki, T. Radke, M. Russell, J. Shal, and I. Taylor, "Enabling Applications on the Grid: A GridLab Overview," International Journal of High Performance Computing Applications: Special Issue on Grid Computing: Infra-structure and Applications, Vol. 17, No. 4, pp. 1-22, 2003.
17 D. Blankenberg, G. V. Kuster, N. Coraor, G. Andanda, R. Lazarus, M. Mangan, A. Nekrutenko, and J. Taylor, "Galaxy: A Web-Based Genome Analysis Tool for Ex-perimentalists," Informatics for Molecular Biologists, Vol. 19, pp. 1-21, 2010.
18 S. Kim and Y. Park, "Overcoming limits of Bioinformatics using Cloud Computing," Journal of KIISE: Computer Systems and Theory, Vol. 27, No. 6, pp. 27-34, 2009.
19 E. Deelman, D. Gannon, M. Shields, and I. Taylor, "Workflows and e-science: an Overview of Workflow System Features and Capabilities," Future Generation Computer Systems, Vol. 25, No. 5, pp. 528-540, 2009.   DOI   ScienceOn
20 A.J. Nebro, G. Luque, F. Luna, E. Alba, "DNA fragment assembly using a grid-based genetic algorithm," Computers & Operations Research, Vol. 35, pp. 2776-2790, 2008.   DOI
21 Schadt, Eric E., et al., "Computational solutions to large-scale data management and analysis," Nature Reviews Genetics, Vol. 11, No. 9, pp. 647-657, 2010.   DOI
22 Geospiza, http://www.geospiza.com/cloud, 2010.
23 Teragen Etax, http://www.thera-gen.com, 2011.
24 KT ucloud, https://en.ucloudbiz.olleh.com/, 2010.
25 Clunix supercomputing http://eng.clunix.com/, 2000.