Prompt Guided Transformer for Multi-Task Dense Prediction

Yuxiang Lu; Shalayiding Sirejiding; Yue Ding; Chunlin Wang; Hongtao; Lu

arXiv:2307.15362·cs.CV·July 31, 2023

Prompt Guided Transformer for Multi-Task Dense Prediction

Yuxiang Lu, Shalayiding Sirejiding, Yue Ding, Chunlin Wang, Hongtao, Lu

PDF

Open Access 1 Repo

TL;DR

The paper introduces a lightweight Prompt Guided Transformer that uses task-specific prompts within a shared architecture to efficiently perform multi-task dense prediction, achieving state-of-the-art results with fewer parameters.

Contribution

It proposes a novel prompt-conditioned Transformer block integrated into a task-conditional model, balancing performance and parameter efficiency in multi-task dense prediction.

Findings

01

Achieves state-of-the-art results on PASCAL-Context and NYUD-v2 benchmarks.

02

Uses only 2.7% of total parameters in the lightweight decoder.

03

Maintains high performance with significantly fewer parameters.

Abstract

Task-conditional architecture offers advantage in parameter efficiency but falls short in performance compared to state-of-the-art multi-decoder methods. How to trade off performance and model parameters is an important and difficult problem. In this paper, we introduce a simple and lightweight task-conditional model called Prompt Guided Transformer (PGT) to optimize this challenge. Our approach designs a Prompt-conditioned Transformer block, which incorporates task-specific prompts in the self-attention mechanism to achieve global dependency modeling and parameter-efficient feature adaptation across multiple tasks. This block is integrated into both the shared encoder and decoder, enhancing the capture of intra- and inter-task features. Moreover, we design a lightweight decoder to further reduce parameter usage, which accounts for only 2.7% of the total model parameters. Extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

innovator-zero/MTDP_Lib
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Brain Tumor Detection and Classification

MethodsMulti-Head Attention · Attention Is All You Need · Softmax · Dense Connections · Linear Layer · Dropout · Adam · Label Smoothing · Absolute Position Encodings · Byte Pair Encoding