The impact of lossy data compression on the power spectrum of the high redshift 21-cm signal with LOFAR
J. K. Chege, L.V.E. Koopmans, A. R. Offringa, B. K. Gehlot, S.A., Brackenhoff, E. Ceccotti, S. Ghosh, C. H\"ofer, F.G. Mertens, M. Mevius and, S. Munshi

TL;DR
This study evaluates lossy data compression using Dysco on LOFAR's high-redshift 21-cm signal data, demonstrating it introduces negligible noise compared to thermal noise, enabling significant data volume reduction without bias.
Contribution
It provides a detailed analysis of Dysco compression effects on the 21-cm power spectrum, establishing optimal settings for minimal impact on scientific results.
Findings
Compression noise is over five orders of magnitude lower than thermal noise.
Compression does not affect calibration or introduce coherent artifacts.
Data volumes can be reduced by a factor of about 4 without bias.
Abstract
Current radio interferometers output multi-petabyte-scale volumes of data per year making the storage, transfer, and processing of this data a sizeable challenge. This challenge is expected to grow with the next-generation telescopes such as the Square Kilometre Array. Lossy compression of interferometric data post-correlation can abate this challenge. However, since high-redshift 21-cm studies impose strict precision requirements, the impact of such lossy data compression on the 21-cm signal power spectrum statistic should be understood. We apply Dysco visibility compression, a technique to normalize and quantize specifically designed for radio interferometric data. We establish the level of the compression noise in the power spectrum in comparison to the thermal noise as well as its coherency behavior. Finally, for optimal compression results, we compare the compression noise obtained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
