Like all sources of information, DNA sequences come in various degrees of quality and reliability. To identify, proof, and discard compromised molecular data has thus become a critical component of the scientific endeavor - one that everyone generating sequence data is assumed to carry out before using the sequences for research purposes.
"Many researchers find sequence quality control difficult, though", says Dr. Henrik Nilsson of the University of Gothenburg and the lead author of a new article on sequence reliability, published in the Open Access journal MycoKeys. "There just isn't any straightforward document to put in their hands to give them a flying start. As a result, scientists differ in the degree to which they are aware of the need to exercise sequence quality control and in what measures they take." Previous studies have highlighted several shortcomings of publicly available DNA sequences - more than ten percent of the fungal DNA sequences may be misidentified at the species level, for example.
"A second complication", adds co-author Prof. Urmas Koljalg of the University of Tartu, "is that the software available for sequence quality management tend to be very complex and resource intensive. It borders on the unfair to expect everyone to have access to, and to master, such computer environments. Fortunately, a whole lot can be done towards quality control of DNA sequences using just manual means and a web browser. The
|Contact: Dr. Henrik Nilsson|