Toward Extreme Lossy Compression Data to Enable Higher Data Rates MX
Jean Jakoncic, Herbert J Bernstein

TL;DR
A new lossy compression method for macromolecular crystallography (MX) data allows high compression ratios while preserving usable structural information for data archival.
Contribution
A flexible lossy compression tool (l_bnl_compress) that enables controlled data compression for MX data without excessive loss.
Findings
Compression ratios of up to 300:1 are feasible for permanent data archival with minimal impact on structural analysis.
Electron density maps from highly compressed data showed few measurable differences compared to those from uncompressed data.
Three test data sets from different experiments demonstrated the effectiveness of the compression method.
Abstract
At last, a useful lossy compression method is available for MX crystallographic data. One could achieve a compression ratio as high as 3000 to 1, but with data loss so high that structural data cannot be interpreted faithfully. On the other hand, our l_bnl_compress lossy-but-not-(too)-lossy compression application gives one the flexibility to control how much compression to apply. We have successfully applied lossy compression to three test data sets, representing an array of samples and experiments performed at state-of-the-art synchrotron beamlines. These were a Hen-Egg-White Lysozyme data set from a crystal collected at 7.5 keV for S-SAD phasing, a data set from Thermolysin with a complexed fragment molecule from a fragment-screening campaign, and a data set from activated Spheugomonas CBASS Casp5 at high resolution. We present results on the compressed data for these three test…
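The trade-off the abstract describes, where more aggressive compression gives a higher ratio at the cost of detail, can be illustrated with a minimal sketch. This is not the l_bnl_compress algorithm, whose internals are not described here. It assumes a generic lossy step (summing square blocks of pixels) followed by lossless zlib compression, applied to a synthetic frame. The names make_frame, bin_frame, packed_size, and compression_ratio are hypothetical and exist only for this example.

```python
import random
import zlib


def make_frame(width, height, seed=0):
    """Build a synthetic frame: low-count background plus a few bright spots.

    This is made-up test data, not a real diffraction image.
    """
    rng = random.Random(seed)
    frame = [[rng.choice((0, 0, 0, 1, 1, 2)) for _ in range(width)]
             for _ in range(height)]
    for _ in range(20):
        y, x = rng.randrange(height), rng.randrange(width)
        frame[y][x] += rng.randint(200, 5000)
    return frame


def bin_frame(frame, factor):
    """Sum factor x factor blocks of pixels.

    This is the lossy step: total counts are kept, but the position of
    each count inside its block is discarded.
    """
    out_h = len(frame) // factor
    out_w = len(frame[0]) // factor
    return [[sum(frame[y * factor + dy][x * factor + dx]
                 for dy in range(factor) for dx in range(factor))
             for x in range(out_w)]
            for y in range(out_h)]


def packed_size(frame):
    """Size in bytes after storing pixels as 32-bit integers and applying zlib."""
    raw = b"".join(v.to_bytes(4, "little") for row in frame for v in row)
    return len(zlib.compress(raw, 9))


def compression_ratio(frame, factor):
    """Uncompressed size of the original frame over the binned, zlib-packed size."""
    raw_bytes = len(frame) * len(frame[0]) * 4
    return raw_bytes / packed_size(bin_frame(frame, factor))
```

Calling compression_ratio(frame, factor) with factor set to 1, 2, and 4 on the same frame shows how a single parameter controls the ratio. Whether the binned data still supports a faithful structural interpretation is the question the paper's tests on real data sets address, and this sketch says nothing about it.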
Taxonomy
Topics: Advanced Data Compression Techniques · Algorithms and Data Compression · Advanced Data Storage Technologies
