In addition to the 2,700-plus real genes, the DNA sequence contained more than 2,500 would-be genes. These had architectural features common to genes but didn't look like recognized genes from other organisms. The team found about 500 of these that produced proteins, so the researchers re-classified these genes as functioning. Lastly, the scientists also found 38 proteins out of another 12,000 sequences that were gene longshots.
"Using proteomics, we always suspected we'd be able to detect genes not called out in the genome, but it was surprising how many hypothetical genes actually produced proteins," said Jacobs.
For the next round, additional DOE resources will enable the sequencing and analysis of the genomes of six other Cyanothece strains in a quest to find the best one to produce hydrogen.
"The goal is to find the hydrogen-producing workhorse of these seven," Pakrasi said. "Work is ongoing, and I expect in a year or so we will learn a lot more."
|Contact: Mary Beckman|
DOE/Pacific Northwest National Laboratory