Mediated Experts for Deep Convolutional Networks

Sebastian Agethen; Winston H. Hsu

arXiv:1511.06072·cs.LG·November 20, 2015

Mediated Experts for Deep Convolutional Networks

Sebastian Agethen, Winston H. Hsu

PDF

Open Access

TL;DR

The paper introduces Mediated Mixture-of-Experts, a supervised deep learning architecture that improves classification accuracy and supports incremental learning by combining expert networks with a mediator and complexity control mechanisms.

Contribution

It proposes a novel Mediated Mixture-of-Experts architecture that enhances accuracy, enables incremental learning, and reduces computational complexity through shared layers and early stopping.

Findings

01

Achieved improved accuracy over single models on a benchmark dataset.

02

Demonstrated effective incremental learning by adding new experts.

03

Showed that complexity can be controlled without retraining.

Abstract

We present a new supervised architecture termed Mediated Mixture-of-Experts (MMoE) that allows us to improve classification accuracy of Deep Convolutional Networks (DCN). Our architecture achieves this with the help of expert networks: A network is trained on a disjoint subset of a given dataset and then run in parallel to other experts during deployment. A mediator is employed if experts contradict each other. This allows our framework to naturally support incremental learning, as adding new classes requires (re-)training of the new expert only. We also propose two measures to control computational complexity: An early-stopping mechanism halts experts that have low confidence in their prediction. The system allows to trade-off accuracy and complexity without further retraining. We also suggest to share low-level convolutional layers between experts in an effort to avoid computation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning