Test-Time Steering for Lossless Text Compression via Weighted Product of Experts

Qihang Zhang; Muchen Li; Ziao Wang; Renjie Liao; and Lele Wang

arXiv:2511.10660·cs.CL·November 17, 2025

Test-Time Steering for Lossless Text Compression via Weighted Product of Experts

Qihang Zhang, Muchen Li, Ziao Wang, Renjie Liao, and Lele Wang

PDF

Open Access

TL;DR

This paper introduces a novel test-time method that adaptively combines universal and neural language models using a weighted product of experts to improve lossless text compression without additional training.

Contribution

It proposes a test-time steering framework with weighted product of experts to enhance neural compression models' generalization to unseen data.

Findings

01

Improves compression rates over individual models

02

No fine-tuning required for new data distributions

03

Compatible with any autoregressive language model

Abstract

Lossless compression techniques are crucial in an era of rapidly growing data. Traditional universal compressors like gzip offer low computational overhead, high speed, and broad applicability across data distributions. However, they often lead to worse compression rates than modern neural compressors, which leverage large-scale training data to model data distributions more effectively. Despite their advantages, neural compressors struggle to generalize to unseen data. To address this limitation, we propose a novel framework that performs Test-Time Steering via a Weighted Product of Experts (wPoE). At inference, our method adaptively combines a universal compression model with a pretrained neural language model, ensuring the compression rate is at least as good as that of the best individual model. Extensive experiments demonstrate that our approach improves the performance of text…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAlgorithms and Data Compression · Advanced Data Compression Techniques · Big Data and Digital Economy