Technological advances in DNA sequencing make determining how living things are related possible by analyzing the ways in which their genes have been rearranged on chromosomes. However, inferring these evolutionary relationships from rearrangement events requires massive computing impossible even on the most advanced computing systems available today.
A four-year $1 million project, funded by the National Science Foundation's PetaApps program, aims to develop computational tools that will use next-generation petascale computers to understand genomic evolution. A team of universities received the grant, including the Georgia Institute of Technology, the University of South Carolina and Penn State. The funding is part of the American Recovery and Reinvestment Act.
"Genome sequences are now available for many organisms, but making biological sense of the genomic data requires high-performance computing methods and an evolutionary perspective, whether you are trying to understand how genes of new functions arise, why genes are organized as they are in chromosomes, or why these arrangements are subject to change," said lead investigator David A. Bader, professor, Computational Science and Engineering Division, Georgia Tech's College of Computing.
Even on today's fastest parallel computers, it could take centuries to analyze genome rearrangements for large, complex organisms. So, the research team -- which also includes Jijun Tang, associate professor, department of computer science and engineering, University of South Carolina, and Stephen Schaeffer, associate professor of biology, Penn State -- is focusing on future generations of petascale machines, which will be able to process more than a thousand trillion calculations per second. Today, most personal computers can only process a few hundred thousand calculations per second.
The researchers plan to develop new algorithms in an open-source software framework that will use the
|Contact: A'ndrea Elyse Messer|