DeMT: Deformable Mixer Transformer for Multi-Task Learning of Dense Prediction
Yangyang Xu, Yibo Yang, Lefei Zhang

TL;DR
DeMT is a novel multi-task learning model that combines deformable CNN and query-based Transformer, achieving superior dense prediction performance with fewer computational resources.
Contribution
This work introduces a deformable mixer encoder and task-aware transformer decoder, integrating CNN and Transformer advantages for multi-task dense prediction.
Findings
Outperforms existing models on NYUD-v2 and PASCAL-Context datasets.
Uses fewer GFLOPs while achieving higher accuracy.
Effective multi-task learning with improved task interaction.
Abstract
Convolution neural networks (CNNs) and Transformers have their own advantages and both have been widely used for dense prediction in multi-task learning (MTL). Most of the current studies on MTL solely rely on CNN or Transformer. In this work, we present a novel MTL model by combining both merits of deformable CNN and query-based Transformer for multi-task learning of dense prediction. Our method, named DeMT, is based on a simple and effective encoder-decoder architecture (i.e., deformable mixer encoder and task-aware transformer decoder). First, the deformable mixer encoder contains two types of operators: the channel-aware mixing operator leveraged to allow communication among different channels ( efficient channel location mixing), and the spatial-aware deformable operator with deformable convolution applied to efficiently sample more informative spatial locations (i.e.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · AI in cancer detection
MethodsMulti-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Adam · Byte Pair Encoding · Linear Layer · Dropout · Softmax · Residual Connection · Label Smoothing
