To create such an environment, the bioKepler project will create scientific workflow components to develop an array of bioinformatics tools using distributed execution techniques. Once customized, these components will be used on multiple distributed platforms, including various cloud and grid computing platforms. The tools will be selected to meet the diverse needs of researchers, and organized into eight groups covering most aspects of bioinformatics applications: sequence database searches; mapping; sequence assembly; gene prediction; clustering; multiple sequence alignment, phylogeny and taxonomy; protein annotation; and other miscellaneous utilities such as data format transformation and parsing.
Training Next-Generation Scientists
"These tools will be applicable to a wide range of bioinformatics and computational biology problems," said Altintas, noting that "a key part of this project will also focus on education and outreach efforts, underscoring the importance of training next-generation scientists, as well as the need to narrow the gap between bioinformatics and technology."
All the resources, materials, and open-source software products produced by the bioKepler project will be integrated with Calit2's Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA), a data repository and a bioinformatics resource for metagenomic analysis.
"The Kepler workflow system has already been used comprehensively in the CAMERA project," said project co-investigator Weizhong Li, a research scientist at Cal
|Contact: Jan Zverina|
University of California - San Diego