Tighter Risk Bounds for Mixtures of Experts
Wissam Akretche, Fr\'ed\'eric LeBlanc, Mario Marchand

TL;DR
This paper derives tighter risk bounds for mixtures of experts with local differential privacy constraints on their gating mechanism, improving understanding of their generalization capabilities.
Contribution
It introduces novel risk bounds for mixtures of experts with LDP gating, especially for one-out-of-$n$ mechanisms, showing significant improvements over existing bounds.
Findings
Bounds depend logarithmically on the number of experts
Experimental results confirm improved generalization with LDP gating
Theoretical guarantees are tighter than previous bounds under reasonable conditions
Abstract
In this work, we provide upper bounds on the risk of mixtures of experts by imposing local differential privacy (LDP) on their gating mechanism. These theoretical guarantees are tailored to mixtures of experts that utilize the one-out-of- gating mechanism, as opposed to the conventional -out-of- mechanism. The bounds exhibit logarithmic dependence on the number of experts, and encapsulate the dependence on the gating mechanism in the LDP parameter, making them significantly tighter than existing bounds, under reasonable conditions. Experimental results support our theory, demonstrating that our approach enhances the generalization ability of mixtures of experts and validating the feasibility of imposing LDP on the gating mechanism.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychology of Social Influence · Decision-Making and Behavioral Economics · Philosophy and History of Science
