Navigation Links
Genome comparison tools found to be susceptible to slip-ups

You might call it comparing apples and oranges, but lining up different species' genomes is common practice in evolutionary research. Scientists can see how species have evolved, pinpoint which sections of DNA are similar between species, meaning they probably are crucial to the animals' survival, or sketch out evolutionary trees in places where the fossil record is spotty.

But the tools used to align genomes from different species have serious quality-control issues, according to a study published online this week in the journal Nature Biotechnology.

"We discovered that there's a disturbingly low level of agreement between genome alignments produced by different tools," said corresponding author Martin Tompa, a UW professor of computer science and engineering and of genome sciences. "What this should suggest to biologists is that they should be very cautious about trusting these alignments in their entirety."

This is especially true when comparing distantly related species, and in regions of the genome that do not code for a protein, he said.

Aligning genomes, while simple in theory, is difficult in practice. Aligning more than two sequences becomes much harder with every additional sequence. At the scale of a mammal's entire genome, all of its genetic code, finding the optimal alignment of many genomes is far beyond the capabilities of any computer, Tompa said.

Various software tools instead use strategic shortcuts.

"At a high level the tools are very similar," Tompa said. "They make different decisions at the lower, more detailed levels, and those decisions seem to have widespread effect on the outcome."

The new paper compared the alignments from a previous study in which four research teams each took the same 1 percent of the human genome and aligned it to the genomes of 27 other vertebrate animals, ranging from mouse to elephant.

"This is a marvelous dataset," Tompa said. "It's a very large-scale multiple sequence alignment, done by four expert teams using four different tools, all of them working on the same input sequences."

However, the new study found that the resulting alignments were quite different. The authors also compared the coverage of each tool, meaning how much of the human DNA it was able to match to each other species, as well as what fraction of alignments were suspiciously close to a random match.

The best-performing tool was the newest one, Pecan, developed by the European Bioinformatics Institute.

"Our study pretty clearly points to Pecan as being the highest-quality alignment of the four tools we compared," Tompa said. It aligned as much of the human genome to other species as any of the other tools, and its matches were considerably more reliable, especially between more distantly related species.

The other tools in the study were Threaded Blockset Aligner (or TBA), Multiple Limited Area Global Alignment of Nucleotides (or MLAGAN) and Mavid. All four are free programs developed by academic institutions, Tompa said.

"I'm hoping that the designers of these tools will take a very close look at our paper and might be able to improve their tools as a result," he said. "I think we're all interested in having a better understanding of which methods work the best and how to make them better."


Contact: Hannah Hickey
University of Washington

Related biology news :

1. Unravelling new complexity in the genome
2. Conquest of land began in shark genome
3. One species entire genome discovered inside anothers
4. Genome study shines light on genetic link to height
5. First individual genome sequence published
6. Ultraconserved elements in the genome: Are they indispensable?
7. $10 million gift to support cutting-edge epigenome center at USC
8. Fungus genome yielding answers to protect grains, people and animals
9. Which came first, the chicken genome or the egg genome?
10. Researchers expand efforts to explore functional landscape of the human genome
11. Genome update defines landscape of breast and colon cancers
Post Your Comments:
(Date:10/29/2015)... , Oct. 29, 2015  Rubicon Genomics, Inc., ... U.S. distribution of its DNA library preparation products, ... Rubicon,s new ThruPLEX Plasma-seq kit. ThruPLEX Plasma-seq has ... preparation of NGS libraries for liquid biopsies--the analysis ... and prognostic applications in cancer and other conditions. ...
(Date:10/29/2015)... NXTD ) ("NXT-ID" or ... the growing mobile commerce market and creator of ... leading marketplace to discover and buy innovative technology ... on StackSocial for this holiday season.   ... a biometric authentication company focused on the growing ...
(Date:10/27/2015)... NEW YORK , Oct. 27, 2015 ... the major issues of concern for various industry verticals ... This is due to the growing demand for secure ... practices in various ,sectors, such as hacking of bank ... concerns for electronic equipment such as PC,s, laptops, and ...
Breaking Biology News(10 mins):
(Date:11/25/2015)... , November 26, 2015 ... Biobanking Market 2016 - 2020 report analyzes that ... integrity and quality in long-term samples, minimizing manual ... cost-effectiveness. Automation minimizes manual errors such as mislabeling ... efficiency. Further, it plays a vital role in ...
(Date:11/25/2015)... ANGELES and HOLLISTON, Mass. ... Technology, Inc. (Nasdaq: HART ), a biotechnology company ... that CEO Jim McGorry will present at ... December 1, 2015 at 2:30 p.m. PT. The presentation ... below) for 30 days. Management will also be available ...
(Date:11/25/2015)... DIEGO , Nov. 25, 2015 Orexigen® ... management will participate in a fireside chat discussion at ... New York . The discussion is scheduled ... .  A replay will be ... Media Contact:McDavid Stilwell  , Julie NormartVP, Corporate Communications ...
(Date:11/24/2015)... , ... November 24, 2015 , ... The United States ... of the 2016 USGA Green Section Award. Presented annually since 1961, the USGA Green ... or her work with turfgrass. , Clarke, of Iselin, N.J., is an ...
Breaking Biology Technology: