Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions

Jingdong Zhang; Hanrong Ye; Xin Li; Wenping Wang; Dan Xu

arXiv:2411.18823·cs.CV·August 1, 2025

Multi-Task Label Discovery via Hierarchical Task Tokens for Partially Annotated Dense Predictions

Jingdong Zhang, Hanrong Ye, Xin Li, Wenping Wang, Dan Xu

PDF

Open Access

TL;DR

This paper introduces a hierarchical task token approach to improve multi-task dense prediction with partial labels, enabling direct pixel-wise supervision and outperforming existing methods on multiple datasets.

Contribution

It proposes a novel hierarchical task token framework for discovering pixel-wise supervision signals, reducing reliance on heavy mapping networks and enhancing multi-task learning.

Findings

01

Significant performance improvements on NYUD-v2, Cityscapes, and PASCAL Context datasets.

02

Effective discovery of pseudo dense labels for multi-task learning.

03

Enhanced cross-task feature interactions through hierarchical task tokens.

Abstract

In recent years, simultaneous learning of multiple dense prediction tasks with partially annotated label data has emerged as an important research area. Previous works primarily focus on leveraging cross-task relations or conducting adversarial training for extra regularization, which achieve promising performance improvements, while still suffering from the lack of direct pixel-wise supervision and extra training of heavy mapping networks. To effectively tackle this challenge, we propose a novel approach to optimize a set of compact learnable hierarchical task tokens, including global and fine-grained ones, to discover consistent pixel-wise supervision signals in both feature and prediction levels. Specifically, the global task tokens are designed for effective cross-task feature interactions in a global context. Then, a group of fine-grained task-specific spatial tokens for each task…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Rough Sets and Fuzzy Logic · Text and Document Classification Technologies

MethodsSparse Evolutionary Training · Focus