Navigation Links
Samtools CRAMS in support for improved compression formats
Date:8/15/2014

Computer scientists at the Wellcome Trust Sanger Institute have released a major upgrade of Samtools, one of the most popular next-generation sequence analysis tools. The revised Samtools 1.0 enables researchers to easily compress, share and analyse genomic sequence data, reducing costs and supporting genomics research around the world.

The Global Alliance for Genomics and Health, in which the Sanger Institute is a partner, has been set up to enable researchers and clinicians to work together using standardised and efficient DNA sequence data formats to find the genetic variants responsible for disease. Samtools 1.0 supports this initiative by enabling researchers to read and write data in the new CRAM format, which was recently adopted by the Global Alliance, in addition to the existing SAM and BAM file formats for genomic sequence information.

The benefits of using CRAM are immediate: it gives a size reduction of 10-30 per cent. In addition, in a similar fashion to the JPEG format for images, CRAM supports much greater compression up to a hundred fold in "lossy" mode which preserves almost all of the important information.

"This major rebuild of Samtools reflects our commitment to supporting the global use of sequencing data," says Dr Richard Durbin, Head of Computational Genomics at the Sanger Institute. "Genome science worldwide relies on fast and efficient data analysis and storage, and Samtools 1.0 fulfils this need by supporting new sequencing and analysis technologies."

Samtools software is embedded in many bioinformatics pipelines and is the foundation of many thousands of genomic research papers. Since its creation in 2009, the program has been downloaded more than 225,000 times. Samtools 1.0 is freely available at http://www.htslib.org/. This new version was substantially rewritten to support the highly efficient genomic data format CRAM, add new functionality, and integrate more cleanly with other tools.

"Samtools 1.0 embeds CRAM into genomic data analysis pipelines and removes the need for additional processing," says Dr John Marshall, from the Sanger Institute. "This development paves the way for widespread uptake of this highly efficient file format in genomic research and will lead to lower storage costs."

The significant savings in storage that can be achieved are due to incorporating data compression techniques developed jointly by the Sanger Institute and the EMBL-European Bioinformatics Institute.

"It has been exciting to work on implementing CRAM into Samtools," says James Bonfield, at the Sanger Institute. "The great flexibility of CRAM has allowed a number of new compression techniques to be incorporated, which when combined with Samtools 1.0 will help to future-proof genomic data storage and analysis."


'/>"/>

Contact: Mark Thomson
mt9@sanger.ac.uk
44-122-371-0865
Wellcome Trust Sanger Institute
Source:Eurekalert

Related biology news :

1. New funding supports search for solutions to white-nose syndrome
2. NOAA, EPA-supported scientists find average but large Gulf dead zone
3. Supportive moms and sisters boost female baboons rank
4. NSF grant to Wayne State supports new concept for manufacturing nanoscale devices
5. Widespread support for rapid HIV testing in dental surgeries -- new study
6. Virus infection supports organ acceptance
7. Pew grants 22 young scientists support for biomedical research
8. Report supports shutdown of all high seas fisheries
9. Phase 3 study strengthens support of ibrutinib as second-line therapy for CLL
10. Information technology can simplify weight-loss efforts; social support still important for success
11. Surveys find that despite economic challenges Malagasy fishers support fishing regulations
Post Your Comments:
*Name:
*Comment:
*Email:
(Date:1/25/2016)...   Unisys Corporation (NYSE: UIS ) today announced ... International Airport, New York City , to help ... to enter the United States using passports ... pilot testing of the system at Dulles last year. The ... during January 2016. --> pilot testing of the ...
(Date:1/22/2016)... , January 22, 2016 ... the addition of the  "Global Behavioral ... offering. --> http://www.researchandmarkets.com/research/4lmf2s/global_behavioral ) ... "Global Behavioral Biometric Market 2016-2020"  report ... Research and Markets ( http://www.researchandmarkets.com/research/4lmf2s/global_behavioral ) has ...
(Date:1/21/2016)... , January 21, 2016 ... to a new market research report "Emotion Detection and ... Others), Software Tools (Facial Expression, Voice Recognition and ... - Global forecast to 2020", published by MarketsandMarkets, ... expected to reach USD 22.65 Billion by 2020, ...
Breaking Biology News(10 mins):
(Date:2/8/2016)... SHELTON, Conn. , Feb. 8, 2016  NanoViricides, Inc. (NYSE ... that its CEO, Eugene Seymour , MD, MPH, will present ... 5:30PM at the Waldorf-Astoria Hotel in New York City ... presentation will be in the Windsor Room at 5:30PM EST. Registered ... New York City . --> ...
(Date:2/6/2016)... VA (PRWEB) , ... February 06, 2016 , ... The ... session, cost-free, for middle and high school teachers on Wednesday February 10, 2016. ... held at the Smithsonian-Mason School of Conservation, located at 1500 Remount Road in Front ...
(Date:2/5/2016)... On Thursday, February 11, 2-1-1 San ... health and disaster services, and the Community Information ... care coordination and service delivery for the community to ... to better connect service providers to the information they ... Diego has handled more than 2.5 million ...
(Date:2/4/2016)... , February 4, 2016 - New FDA action date ... New FDA action date of July 22, ... July 22, 2016   - Lifitegrast ... the past decade indicated for the treatment of signs and symptoms of ... has the potential to be the only product approved in the U.S. in the ...
Breaking Biology Technology: