Minimal Communication-Cost Statistical Learning
Milad Sefidgaran, Abdellatif Zaidi, Piotr Krasnowski

TL;DR
This paper introduces a joint training and source coding scheme for statistical learning that minimizes communication costs while ensuring small risk and model complexity, with provable guarantees.
Contribution
It proposes a novel scheme combining training and source coding with theoretical guarantees on risk, generalization, and communication cost.
Findings
Guarantees small empirical risk and generalization error.
Achieves minimal communication cost through KL divergence constraints.
Provides one-shot guarantees for every encoder message.
Abstract
A client device which has access to training data samples needs to obtain a statistical hypothesis or model and then to send it to a remote server. The client and the server devices share some common randomness sequence as well as a prior on the hypothesis space. In this problem a suitable hypothesis or model should meet two distinct design criteria simultaneously: (i) small (population) risk during the inference phase and (ii) small 'complexity' for it to be conveyed to the server with minimum communication cost. In this paper, we propose a joint training and source coding scheme with provable in-expectation guarantees, where the expectation is over the encoder's output message. Specifically, we show that by imposing a constraint on a suitable Kullback-Leibler divergence between the conditional distribution induced by a compressed learning model given and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition
