Sparse-by-Design Cross-Modality Prediction: L0-Gated Representations for Reliable and Efficient Learning
Filippo Cenacchi

TL;DR
This paper introduces L0GM, a unified, modality-agnostic sparsification method that improves efficiency and calibration in cross-modality predictive systems by directly controlling active features.
Contribution
L0GM provides a novel, representation-level gating framework applicable across diverse modalities, enabling comparable accuracy-efficiency trade-offs and reliable calibration analysis.
Findings
L0GM achieves competitive accuracy on multiple benchmarks.
It activates fewer features while maintaining performance.
L0GM reduces calibration error across modalities.
Abstract
Predictive systems increasingly span heterogeneous modalities such as graphs, language, and tabular records, but sparsity and efficiency remain modality-specific (graph edge or neighborhood sparsification, Transformer head or layer pruning, and separate tabular feature-selection pipelines). This fragmentation makes results hard to compare, complicates deployment, and weakens reliability analysis across end-to-end KDD pipelines. A unified sparsification primitive would make accuracy-efficiency trade-offs comparable across modalities and enable controlled reliability analysis under representation compression. We ask whether a single representation-level mechanism can yield comparable accuracy-efficiency trade-offs across modalities while preserving or improving probability calibration. We propose L0-Gated Cross-Modality Learning (L0GM), a modality-agnostic, feature-wise hard-concrete…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
