Mediated Experts for Deep Convolutional Networks
Sebastian Agethen, Winston H. Hsu

TL;DR
The paper introduces Mediated Mixture-of-Experts, a supervised deep learning architecture that improves classification accuracy and supports incremental learning by combining expert networks with a mediator and complexity control mechanisms.
Contribution
It proposes a novel Mediated Mixture-of-Experts architecture that enhances accuracy, enables incremental learning, and reduces computational complexity through shared layers and early stopping.
Findings
Achieved improved accuracy over single models on a benchmark dataset.
Demonstrated effective incremental learning by adding new experts.
Showed that complexity can be controlled without retraining.
Abstract
We present a new supervised architecture termed Mediated Mixture-of-Experts (MMoE) that allows us to improve classification accuracy of Deep Convolutional Networks (DCN). Our architecture achieves this with the help of expert networks: A network is trained on a disjoint subset of a given dataset and then run in parallel to other experts during deployment. A mediator is employed if experts contradict each other. This allows our framework to naturally support incremental learning, as adding new classes requires (re-)training of the new expert only. We also propose two measures to control computational complexity: An early-stopping mechanism halts experts that have low confidence in their prediction. The system allows to trade-off accuracy and complexity without further retraining. We also suggest to share low-level convolutional layers between experts in an effort to avoid computation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
