Author(s): Deorowicz S, Grabowski S
Abstract MOTIVATION: Modern sequencing instruments are able to generate at least hundreds of millions of short reads of genomic data. Such huge volumes of data require effective means to store them, provide quick access to any record, and enable fast decompression. RESULTS: We present a specialized compression algorithm for genomic data in FASTQ format which dominates its competitor, G-SQZ, as is shown on a number of datasets from the 1000 Genomes Project (www.1000genomes.org). AVAILABILITY: DSRC is freely available at http://sun.aei.polsl.pl/dsrc.
This article was published in Bioinformatics
and referenced in Applied Microbiology: Open Access