"We live in the post-genomic era, when DNA sequence data is growing exponentially", says Miami University (Ohio) computational biologist Iddo Friedberg. "But for most of the genes that we identify, we have no idea of their biological functions. They are like words in a foreign language, waiting to be deciphered." Understanding the function of genes is a problem that has emerged at the forefront of molecular biology. Many groups develop and employ sophisticated algorithms to decipher these "words". However, until now there was no comprehensive picture of how well these methods perform, "To use the information in our genes to our advantage, we first need to take stock of how well we are doing in interpreting these data".
To do so, Friedberg and his colleagues, Predrag Radivojac, of Indiana University, Bloomington IN and Sean Mooney, Buck Institute for Research on Aging, Novato CA organized the Critical Assessment of protein Function Annotation, or CAFA. CAFA is a community-wide experiment to assess the performance of the many methods used today to predict the functions of proteins, the workhorses of the cell coded by our genes.
Thirty research groups comprising 102 scientists and students participated in CAFA, presented a total of 54 methods. The participating groups came from leading universities in North America, Europe, Asia and Australia. The groups participated in blind-test experiments in which they predicted the function of protein sequences for which the functions are already known but haven't yet been made publicly available. Independent assessors then judged their performance.
The results are published in this month's issue of Nature Methods co-authored by members of all the participating groups, with Friedberg and Radivojac as lead authors. Fifteen companion papers have been published in a special issue of BMC Bioinformatics detailing the methods
"We have discovered a great enthusiasm and community spir
|Contact: Iddo Friedberg|