Understanding The Effectiveness of Lossy Compression in Machine Learning   Training Sets

Robert Underwood; Jon C. Calhoun; Sheng Di; Franck Cappello

arXiv:2403.15953·cs.LG·March 26, 2024·1 cites

Understanding The Effectiveness of Lossy Compression in Machine Learning Training Sets

Robert Underwood, Jon C. Calhoun, Sheng Di, Franck Cappello

PDF

Open Access

TL;DR

This paper systematically evaluates 17 lossy data compression methods across 7 ML/AI applications, demonstrating significant compression ratios with minimal quality loss, and provides insights for future compressor design.

Contribution

It introduces a comprehensive evaluation methodology and offers the first extensive comparison of lossy compression effects on diverse ML/AI tasks.

Findings

01

Achieves 50-100x compression with ≤1% quality loss

02

Modern lossy methods outperform traditional approaches

03

Provides guidelines for designing ML/AI-friendly compressors

Abstract

Learning and Artificial Intelligence (ML/AI) techniques have become increasingly prevalent in high performance computing (HPC). However, these methods depend on vast volumes of floating point data for training and validation which need methods to share the data on a wide area network (WAN) or to transfer it from edge devices to data centers. Data compression can be a solution to these problems, but an in-depth understanding of how lossy compression affects model quality is needed. Prior work largely considers a single application or compression method. We designed a systematic methodology for evaluating data reduction techniques for ML/AI, and we use it to perform a very comprehensive evaluation with 17 data reduction methods on 7 ML/AI applications to show modern lossy compression methods can achieve a 50-100x compression ratio improvement for a 1% or less loss in quality. We identify…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications