Navigation Links
Petascale computing tools could provide deeper insight into genomic evolution
Date:11/17/2009

Technological advances in high-throughput DNA sequencing have opened up the possibility of determining how living things are related by analyzing the ways in which their genes have been rearranged on chromosomes. However, inferring such evolutionary relationships from rearrangement events is computationally intensive on even the most advanced computing systems available today.

Research recently funded by the American Recovery and Reinvestment Act of 2009 aims to develop computational tools that will utilize next-generation petascale computers to understand genomic evolution. The four-year $1 million project, supported by the National Science Foundation's PetaApps program, was awarded to a team of universities that includes the Georgia Institute of Technology, the University of South Carolina and The Pennsylvania State University.

"Genome sequences are now available for many organisms, but making biological sense of the genomic data requires high-performance computing methods and an evolutionary perspective, whether you are trying to understand how genes of new functions arise, why genes are organized as they are in chromosomes, or why these arrangements are subject to change," said lead investigator David A. Bader, a professor in the Computational Science and Engineering Division of Georgia Tech's College of Computing.

Even on today's fastest parallel computers, it could take centuries to analyze genome rearrangements for large, complex organisms. That is why the research team -- which also includes Jijun Tang, an associate professor in the Department of Computer Science and Engineering at the University of South Carolina; and Stephen Schaeffer, an associate professor of biology at Penn State -- is focusing on future generations of petascale machines, which will be able to process more than a thousand trillion, or 10^15, calculations per second. Today, most personal computers can only process a few hundred thousand calculations per second.

The researchers plan to develop new algorithms in an open-source software framework that will utilize the capabilities of parallel, petascale computing platforms to infer ancestral rearrangement events. The starting point for developing these new algorithms will be GRAPPA, an open-source code co-developed by Bader and initially released in 2000 that reconstructed the evolutionary relatedness among species.

"GRAPPA is currently the most accurate method for determining genome rearrangement, but it has only been applied to small genomes with simple events because of the limitation of the algorithms and the lack of computational power," explained Bader, who is also executive director of high-performance computing at Georgia Tech.

On a dataset of a dozen bellflower genomes, the latest version of GRAPPA determined the flowers' evolutionary relatedness one billion times faster than the original implementation that did not utilize parallel processing or optimization.

The researchers will test the performance of their new algorithms by analyzing a collection of fruit fly genomes.

"Fruit flies -- formally known as Drosophila -- are an excellent model system for studying genome rearrangement because the genome sizes are relatively small for animals, the mechanism that alters gene order is reasonably well understood, and the evolutionary relationships among the 12 sequenced genomes are known," said Schaeffer.

The analysis of genome rearrangements in Drosophila will provide a relatively simple system to understand the mechanisms that underlie gene order diversity, which can later be extended to more complex mammalian genomes, such as primates.

The researchers believe these new algorithms will make genome rearrangement analysis more reliable and efficient, while potentially revealing new evolutionary patterns. In addition, the algorithms will enable a better understanding of the mechanisms and rate of gene rearrangements in genomes, and the importance of the rearrangements in shaping the organization of genes within the genome.

"Ultimately this information can be used to identify microorganisms, develop better vaccines, and help researchers better understand the dynamics of microbial communities and biochemical pathways," added Bader.


'/>"/>

Contact: Abby Vogel
avogel@gatech.edu
404-385-3364
Georgia Institute of Technology Research News
Source:Eurekalert  

Related biology news :

1. Petascale climate modeling heats up at University of Miami
2. European light research opens door for optical storage and computing
3. Multithreaded supercomputer seeks software for data-intensive computing
4. PNNL researchers earn top honors at Supercomputing conference
5. Developer of advanced computing memory, father of biochemical engineering, and innovative engineering educators win highest engineering honors of 2009
6. Cloud computing brings cost of protein research down to Earth
7. Harnessing cloud computing for data-intensive research on oceans, galaxies
8. A genomic CluE for cloud computing
9. New caledonian crows find 2 tools better than 1
10. Ginkgo SRMs: Tools for product analysis/quality
11. Interacting protein theory awaits test from new neutron analysis tools
Post Your Comments:
*Name:
*Comment:
*Email:
Related Image:
Petascale computing tools could provide deeper insight into genomic evolution
(Date:3/15/2016)... New York , March 15, 2016 ... new market report published by Transparency Market Research "Digital Door ... Trends and Forecast 2015 - 2023," the global digital door ... US$ 731.9 Mn in 2014 and is forecast to grow ... 2023. Growth of micro, small and medium enterprises (MSMEs) across ...
(Date:3/11/2016)... http://www.apimages.com ) - --> http://www.apimages.com ) - ... Images ( http://www.apimages.com ) - Germany . ... new refugee identity cards. DERMALOG will be unveiling this device, and ... Hanover next week.   --> Germany ... the new refugee identity cards. DERMALOG will be unveiling this device, ...
(Date:3/9/2016)... NEW YORK , March 9, 2016 ... current and future states of the RNA Sequencing (RNA ... in segments such as instruments, tools and reagents, data ... Analyze various segments of the RNA-Sequencing market such ... RNA-Sequencing services Identify the main factors affecting each segment ...
Breaking Biology News(10 mins):
(Date:5/23/2016)... ... 2016 , ... The need for blood donations in South Texas and across the nation is ... & Tissue Center, blood donations are on the decline. In fact, donations across the country ... percent in South Texas in the last four years alone. , There is no substitute ...
(Date:5/23/2016)... Ind. , May 23, 2016 Zimmer Biomet ... musculoskeletal healthcare, today announced that its Board of Directors has ... for the second quarter of 2016. The ... or about July 29, 2016 to stockholders of record as ... declarations of dividends are subject to approval of the Board ...
(Date:5/23/2016)... May 23, 2016 - Leading CRO,s Use ... - Frontage Implement a Single Platform to Manage End-to-end Operations ... Within the Bioanalytical lab Frontage Laboratories, a full-service contract ... and China , has selected IDBS, ... In addition to serving as the global electronic lab notebook (ELN), ...
(Date:5/23/2016)... ... May 23, 2016 , ... Foresight Institute ... announced the winners for the 2015 Foresight Institute Feynman Prizes. , These ... two categories, one for experiment and the other for theory in nanotechnology. Prof. ...
Breaking Biology Technology: