November 28, Shenzhen and Hong Kong, China BGI, the world's largest genomics organization, today announced its latest advances in the analysis, management and dissemination of "Big Genomics Data" at their 3rd bioinformatics software and data release conference. Released at the conference include new bioinformatics analysis pipelines and software, including SOAPhecate v2.5 and SOAPgaea, as well as an updated version of EasyGenomics, one cloud-based bioinformatics solution. Additionally, BGI's big-data journal GigaScience also provided an update on its integrated GigaDB database and reported on plans for its data analysis platform based on the Galaxy workflow system.
Genomics and next-generation sequencing (NGS) technologies have revolutionized life sciences research. With the cost of DNA sequencing steadily plummeting in price, the amount of data generated with NGS technologies continues to grow at an unprecedented pace. This has led in recent years to an over 400,000% increase in daily sequencing data generation. In the age of "Big Genomics Data", the handling, storing and sharing of these tremendous volumes of data has become a significant research bottleneck.
For dealing with big data efficiently, BGI has integrated the Apache Hadoop MapReduce framework into algorithms for NGS analyses. Based on this framework, they have also developed two new algorithms: SOAP-Hecate and SOAP-Gaea. These algorithms are two of the key components of the flexible green cloud computing infrastructure at BGI for de novo Assembly and NGS Analysis. They have successfully been applied into analyzing the sequencing data in clinical and biological research with a fast turnaround time, high efficiency and low cost.
In the conference, Yan Li, Director of Bioinformatics Products from the BGI, announced a new version of distributed genome assembler SOAP-Hecate v2.5. This software not only outputs linearized sequences, but is also a flexible and easy-to-use platform t
|Contact: Jia Liu|