Diversified Dynamic Routing for Vision Tasks
Botos Csaba, Adel Bibi, Yanwei Li, Philip Torr, Ser-Nam Lim

TL;DR
This paper introduces Diversified Dynamic Routing, an unsupervised method for training mixture of experts in vision tasks, improving data partitioning and expert assignment for better performance in segmentation and detection.
Contribution
It proposes a novel Diversified Dynamic Routing approach that explicitly trains models to find optimal data partitions and assign experts without supervision.
Findings
Improved semantic segmentation accuracy on Cityscapes.
Enhanced object detection and instance segmentation on MS-COCO.
Outperforms several baseline models in experiments.
Abstract
Deep learning models for vision tasks are trained on large datasets under the assumption that there exists a universal representation that can be used to make predictions for all samples. Whereas high complexity models are proven to be capable of learning such representations, a mixture of experts trained on specific subsets of the data can infer the labels more efficiently. However using mixture of experts poses two new problems, namely (i) assigning the correct expert at inference time when a new unseen sample is presented. (ii) Finding the optimal partitioning of the training data, such that the experts rely the least on common features. In Dynamic Routing (DR) a novel architecture is proposed where each layer is composed of a set of experts, however without addressing the two challenges we demonstrate that the model reverts to using the same subset of experts. In our method,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification · Video Surveillance and Tracking Methods
