TL;DR
LiVeAction introduces a lightweight neural codec architecture optimized for resource-constrained environments, offering superior rate-distortion performance across diverse modalities with practical deployment considerations.
Contribution
The paper proposes a novel neural codec design that reduces complexity and generalizes across modalities, outperforming existing generative tokenizers in rate-distortion efficiency.
Findings
Outperforms state-of-the-art generative tokenizers in rate-distortion.
Designed for low-power sensors with practical deployment.
Supports arbitrary signal modalities with simplified training.
Abstract
Modern sensors generate rich, high-fidelity data, yet applications operating on wearable or remote sensing devices remain constrained by bandwidth and power budgets. Standardized codecs such as JPEG and MPEG achieve efficient trade-offs between bitrate and perceptual quality but are designed for human perception, limiting their applicability to machine-perception tasks and non-traditional modalities such as spatial audio arrays, hyperspectral images, and 3D medical images. General-purpose compression schemes based on scalar quantization or resolution reduction are broadly applicable but fail to exploit inherent signal redundancies, resulting in suboptimal rate-distortion performance. Recent generative neural codecs, or tokenizers, model complex signal dependencies but are often over-parameterized, data-hungry, and modality-specific, making them impractical for resource-constrained…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
