Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments
Angie Boggust, Venkatesh Sivaraman, Yannick Assogba, Donghao Ren,, Dominik Moritz, Fred Hohman

TL;DR
This paper introduces Compress and Compare, an interactive visual system that aids machine learning practitioners in comparing and analyzing model compression experiments, revealing behavior changes and supporting complex trade-off evaluations.
Contribution
We developed an interactive visualization tool that consolidates compression experiment analysis, enabling comprehensive comparison and understanding of model behavior changes due to compression.
Findings
Supports common compression analysis tasks effectively
Helps identify subtle behavior changes and artifacts
Enhances practitioner intuition and workflow structure
Abstract
To deploy machine learning models on-device, practitioners use compression algorithms to shrink and speed up models while maintaining their high-quality output. A critical aspect of compression in practice is model comparison, including tracking many compression experiments, identifying subtle changes in model behavior, and negotiating complex accuracy-efficiency trade-offs. However, existing compression tools poorly support comparison, leading to tedious and, sometimes, incomplete analyses spread across disjoint tools. To support real-world comparative workflows, we develop an interactive visual system called Compress and Compare. Within a single interface, Compress and Compare surfaces promising compression strategies by visualizing provenance relationships between compressed models and reveals compression-induced behavior changes by comparing models' predictions, weights, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReal-time simulation and control systems · Embedded Systems Design Techniques · Parallel Computing and Optimization Techniques
MethodsVisual Analytics · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
