Illumina is committed to delivering innovative sequencing technologies, and to helping customers manage growing volumes of data output that result from the proliferation of sequencing-based research. Enancio’s genomic data compression technology offers optimal levels of speed and efficiency, and nicely complements other Illumina informatics solutions.
Genomic data compression allows for:
Enancio’s lossless genomic data compression technology compresses the output from Illumina sequencers, reducing the data storage footprint by up to a factor of five. It uses a reference-based compression method: an ultra-fast mapping scheme maps reads onto a reference genome, and only the data needed to regenerate each read is stored, namely a position and a list of differences.
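The reference-based idea can be illustrated with a minimal Python sketch. This is not Enancio's implementation. The function names, the record layout, and the brute-force mapper are assumptions made for illustration, and only substitutions are handled (no insertions or deletions).

```python
from typing import List, Tuple

# Hypothetical record layout: (position on reference, read length, substitutions)
Record = Tuple[int, int, List[Tuple[int, str]]]


def best_position(reference: str, read: str) -> int:
    """Brute-force stand-in for a fast mapper: the first offset with the fewest mismatches."""
    def mismatches(pos: int) -> int:
        window = reference[pos:pos + len(read)]
        return sum(ref_base != base for ref_base, base in zip(window, read))

    return min(range(len(reference) - len(read) + 1), key=mismatches)


def encode_read(reference: str, read: str) -> Record:
    """Store only a position and the bases that differ from the reference."""
    pos = best_position(reference, read)
    window = reference[pos:pos + len(read)]
    diffs = [(i, base)
             for i, (ref_base, base) in enumerate(zip(window, read))
             if ref_base != base]
    return pos, len(read), diffs


def decode_read(reference: str, record: Record) -> str:
    """Regenerate the read by copying the reference window and applying the differences."""
    pos, length, diffs = record
    bases = list(reference[pos:pos + length])
    for offset, base in diffs:
        bases[offset] = base
    return "".join(bases)


reference = "ACGTACGTTAGCCGATAGGCTTAACG"
read = "TAGCCGTTAGG"  # matches the reference at offset 8 except for one base

record = encode_read(reference, read)
restored = decode_read(reference, record)
print(record, restored == read)
```

In this toy case an 11-base read shrinks to one position, one length, and one substitution, and decoding returns the read exactly. A production tool would also need to handle unmapped reads, indels, and read identifiers.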
Other data compression technologies are often slow. Enancio technology is optimized for high compression ratios and for fast compression and decompression, while preserving data integrity. Quality scores are encoded losslessly using a range encoder with context models adapted to the different quality-score schemes.
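To give a feel for why context models help with quality scores, the sketch below estimates the ideal code length, the sum of -log2(p), that a range encoder would approach when driven by an adaptive model. It does not implement a range encoder, and it is not Enancio's model. The function name, the Laplace smoothing, and the 64-symbol alphabet are assumptions chosen for illustration.

```python
import math
from collections import defaultdict


def ideal_bits(symbols: str, order: int, alphabet_size: int = 64) -> float:
    """Ideal code length in bits under an adaptive model.

    The context is the previous `order` symbols. Probabilities use
    add-one (Laplace) smoothing over an assumed alphabet size, and the
    counts are updated after each symbol, so a decoder could mirror the
    same updates and recover the input exactly.
    """
    counts = defaultdict(lambda: defaultdict(int))  # context -> symbol -> count
    totals = defaultdict(int)                       # context -> symbols seen
    bits = 0.0
    for i, sym in enumerate(symbols):
        ctx = symbols[max(0, i - order):i]
        p = (counts[ctx][sym] + 1) / (totals[ctx] + alphabet_size)
        bits += -math.log2(p)
        counts[ctx][sym] += 1
        totals[ctx] += 1
    return bits


# Toy quality string in which each score is fully determined by the previous one.
qualities = "F:" * 100

print(f"order-0 model: {ideal_bits(qualities, order=0):.1f} bits")
print(f"order-1 model: {ideal_bits(qualities, order=1):.1f} bits")
```

On this artificial input the order-1 model needs fewer bits than the order-0 model, because the previous score predicts the next one. Real quality strings are far less regular, so the gain depends on how well the chosen context matches the sequencer's quality scheme.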
All files compressed with the Illumina compression technology can easily be decompressed using the decompression software available here. The decompression software is free to download and to use.
Once installed, a simple command can be used to directly pipe the output of decompression on the fly into a wide range of popular mapping tools such as BWA, STAR, and BowTie. The compression and decompression technology will also be seamlessly integrated within the DRAGEN secondary analysis workflow.
Contact us to learn more.
Enancio’s genomic data compression technology will be directly integrated into DRAGEN, which provides accurate, ultra-rapid secondary genomic analysis of sequencing data.
We offer a variety of resources and information to help simplify the process of setting up your informatics infrastructure.
Our sequencing data analysis software helps you spend more time doing research, and less time configuring and running analysis workflows.
Explore a broad range of informatics products designed to simplify genomic data analysis and management.