DeGMix: Efficient Multi-Task Dense Prediction with Deformable and Gating Mixer
Yangyang Xu, Yibo Yang, Bernard Ghanem, Lefei Zhang, Bo Du, Jun Zhu

TL;DR
DeGMix is an efficient multi-task dense prediction model that combines deformable CNNs and query-based Transformers with shared gating, achieving superior performance with lower computational cost.
Contribution
This work introduces DeGMix, a novel multi-task dense prediction framework integrating deformable CNNs and gating transformers for improved efficiency and accuracy.
Findings
DeGMix outperforms existing models on multiple dense prediction datasets.
DeGMix uses fewer GFLOPs than comparable Transformer and CNN models.
DeGMix achieves higher accuracy with less computational complexity.
Abstract
Convolution neural networks and Transformers have their own advantages and both have been widely used for dense prediction in multi-task learning (MTL). Existing studies typically employ either CNNs (effectively capture local spatial patterns) or Transformers (capturing long-range dependencies) independently, but integrating their strengths may yield more robust models. In this work, we present an efficient MTL model that combines the adaptive capabilities of deformable CNN and query-based Transformer with shared gating for MTL of dense prediction. This combination may offer a simple and efficient solution owing to its powerful and flexible task-specific learning and the advantages of lower cost, less complexity, and smaller parameters than traditional MTL methods. We introduce an efficient multi-task dense prediction with deformable and gating mixer (DeGMix). First, the deformable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Brain Tumor Detection and Classification
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Layer Normalization · Label Smoothing · Adam · Residual Connection · Dense Connections · Dropout
