Compositional Distributed Learning for Multi-View Perception: A Maximal Coding Rate Reduction Perspective
Zhuojun Tian, Mehdi Bennis

TL;DR
This paper introduces a novel distributed learning framework for multi-view perception that uses maximal coding rate reduction and subspace fusion to improve classification accuracy and preserve representation diversity.
Contribution
It proposes a new algorithm combining maximal coding rate reduction with subspace basis fusion for distributed multi-view learning, with theoretical guarantees.
Findings
Achieves high classification accuracy in simulations.
Maintains diversity of learned representations.
Ensures basis fusion consistency theoretically.
Abstract
In this letter, we formulate a compositional distributed learning framework for multi-view perception by leveraging the maximal coding rate reduction principle combined with subspace basis fusion. In the proposed algorithm, each agent conducts a periodic singular value decomposition on its learned subspaces and exchanges truncated basis matrices, based on which the fused subspaces are obtained. By introducing a projection matrix and minimizing the distance between the outputs and its projection, the learned representations are enforced towards the fused subspaces. It is proved that the trace on the coding-rate change is bounded and the consistency of basis fusion is guaranteed theoretically. Numerical simulations validate that the proposed algorithm achieves high classification accuracy while maintaining representations' diversity, compared to baselines showing correlated subspaces and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Domain Adaptation and Few-Shot Learning · Advanced Vision and Imaging
