Navigation Links
Database gives access to the latest findings about the tree of life
Date:3/29/2010

Durham, NC If scientists have identified some two million species, where can you find the latest information about the tree of life that unites them all? A vastly improved database gives scientists and educators access to state-of-the-art knowledge about the evolutionary relationships among living things.

TreeBASE a database designed to help scientists store, share, and study evolutionary trees was first developed in the mid-1990s as way to archive the vast amounts of phylogenetic information accumulating in the literature.

"Phylogenies were being published at an explosive rate," said Bill Piel of Yale University. "What we needed was a database where we could compile them so people could use them later."

The database allows researchers to archive and retrieve published phylogenetic trees and data from different studies. "People can store sequence alignments, morphological character sets, and the resulting phylogenetic trees all in digital form. They can also be recovered and reanalyzed or combined with other data," Piel said.

Since the first prototype was developed, researchers have contributed more than 6,500 trees from over 2400 articles, describing the relationships among well over 60,000 terminal taxa. A variety of journals now require their authors to deposit phylogenetic data in TreeBASE, and peer reviewers are given anonymous access to the data prior to publication.

Years of work have gone into improving and upgrading the original version. "At some point we knew we had to make it bigger and better," said Michael Donoghue of Yale University. Now, a team of biologists and computer scientists is releasing a new version that is completely rebuilt. With this upgrade, the database is poised to become an increasingly valuable resource for a number of fields, including conservation biology, biogeography, and education, developers say.

"We have introduced a wide variety of features that didn't exist before," said Val Tannen at the University of Pennsylvania. "In terms of data deposition and how users interact with it, it has taken a huge leap forward," Donoghue added.

For one, TreeBASE can now store much richer information. "Trees can contain information such as the length of each branch, which is important for studying the timing of evolutionary events," Piel explained. The database also has an improved system for making sure that information such as taxonomic names and DNA sequence IDs match those found in other sources.

Researchers will also be able to take advantage of a more user-friendly interface and more advanced search techniques. "There are things you can query now that you couldn't before," said Piel. "For example, you can search for trees that share a certain topology."

"The visualization tools have also received a major upgrade," Piel added. "For example, now users can manipulate large trees and zoom in and out."

A number of advanced features have also been introduced that will allow bioinformaticians to do new and creative things with the data without being blocked by the user interface, said Piel. These include support for new machine-readable phylogenetic data exchange and web service standards. In addition, the metadata in TreeBASE are being made available for harvesting en masse.

According to Rutger Vos of the University of Reading, "all these features basically mean that TreeBASE plays nice with other Linked Data resources on the web, allowing the next generation of web applications to automatically understand the connections among different biological data resources."

In addition to getting a major makeover, the database also has a new home. Most recently housed at the San Diego Supercomputer Center with support from the CIPRES project, TreeBASE is now being hosted by the National Evolutionary Synthesis Center (NESCent) in Durham, North Carolina.

NESCent has made an initial commitment to host TreeBASE for up to five years, explained Todd Vision, Associate Director of Informatics at NESCent. "This partnership enables TreeBASE to continue serving the scientific needs of the community and to keep pace with technological innovations," said Vision.

Looking to the future, the team has established a non-profit foundation to ensure the database's long-term sustainability. "The foundation will become a caretaker of TreeBASE and other phylogenetic resources, such as the Tree of Life Web (ToLWeb) project," said Piel.

To enable wider participation in TreeBASE's future development, the code has been made open source and is hosted by SourceForge. The developers now communicate on a public forum. "In essence, this allows anyone with the necessary skills to participate in TreeBASE development, whether small or large," says Hilmar Lapp, Assistant Director for Informatics at NESCent.

"The really good news is that we now have a much better product much more stable, much more industrial-strength and we have an arrangement with NESCent that's going to be very successful," Donoghue added. "Now that we have a new home for it, we can service it, we can build it, and we can continue to modify it," he added. "This is a good place to be right now."


'/>"/>

Contact: Todd Vision
tjv@bio.unc.edu
919-668-4596
National Evolutionary Synthesis Center (NESCent)
Source:Eurekalert

Related biology news :

1. Penn, Georgia collaboration awarded $14.6 million to expand pathogen database
2. Largest-ever database for liver proteins may lead to treatments for hepatitis
3. Rutgers-Camden developing enzyme function database
4. International collaboration by scientists culminates in novel ion channels database
5. New MegaMatcher Accelerator Boosts Speed for High-Volume Biometric Identification and Database Duplicate Searching
6. Canadian scientist mines drugs database for new diabetes treatment
7. Scientists mine drugs database for new diabetes treatment
8. Soybean database will help breeders engineer better-performing plants
9. Pfizer inks global license to Genomatix Software and databases
10. Database shows effects of acid rain on microorganisms in Adirondack Lakes
11. Lincoln Park Zoo launches first-of-its-kind wildlife reintroduction database
Post Your Comments:
*Name:
*Comment:
*Email:
(Date:5/12/2016)... DALLAS , May 12, 2016 ... has just published the overview results from the Q1 ... of the recent wave was consumers, receptivity to a ... wearables data with a health insurance company. ... choose to share," says Michael LaColla , CEO ...
(Date:4/28/2016)... -- First quarter 2016:   , Revenues ... first quarter of 2015 The gross margin was 49% ... and the operating margin was 40% (-13) Earnings per ... from operations was SEK 249.9 M (21.2) , Outlook ... 7,000-8,500 M. The operating margin for 2016 is estimated ...
(Date:4/15/2016)... CHICAGO , April 15, 2016  A ... companies make more accurate underwriting decisions in a ... offering timely, competitively priced and high-value life insurance ... health screenings. With Force Diagnostics, rapid ... and lifestyle data readings (blood pressure, weight, pulse, ...
Breaking Biology News(10 mins):
(Date:6/24/2016)... ... June 24, 2016 , ... While the majority of commercial ... Cary 5000 and the 6000i models are higher end machines that use the more ... the spectrophotometer’s light beam from the bottom of the cuvette holder. , FireflySci ...
(Date:6/23/2016)... , ... June 23, 2016 , ... ... release of its second eBook, “Clinical Trials Patient Recruitment and Retention Tips.” Partnering ... retention in this eBook by providing practical tips, tools, and strategies for clinical ...
(Date:6/23/2016)... CAMBRIDGE, Mass. , June 23, 2016 /PRNewswire/ ... the development of novel compounds designed to target ... compound, napabucasin, has been granted Orphan Drug Designation ... in the treatment of gastric cancer, including gastroesophageal ... cancer stemness inhibitor designed to inhibit cancer stemness ...
(Date:6/23/2016)... SILVER SPRING, Md. , June 23, 2016 ... evidence collected from the crime scene to track the criminal ... sick, and the U.S. Food and Drug Administration (FDA) uses ... Sound far-fetched? It,s not. ... whole genome sequencing to support investigations of foodborne illnesses. Put ...
Breaking Biology Technology: