$\mathcal{B}^{3}$-Net: Controlled Posterior Bridge Learning for Multi-Task Dense Prediction
Meihua Zhou, Li Yang

TL;DR
The paper introduces $\\mathcal{B}^{3}$-Net, a novel multi-task dense prediction framework that explicitly models evidence reliability to improve task feature fusion and reduce negative transfer.
Contribution
It proposes a controlled posterior bridge learning approach with reliability estimation, heteroscedastic evidence fusion, and bounded redistribution for better multi-task dense prediction.
Findings
Achieves competitive or superior results on NYUD-v2, PASCAL-Context, and Cityscapes.
Demonstrates that controlled posterior bridge learning improves task performance.
Verifies gains are due to the proposed method, not backbone capacity.
Abstract
Multi-task dense prediction solves complementary pixel-level tasks in a unified model, such as semantic segmentation, depth estimation, surface normal estimation, and edge detection. Existing decoder-side interactions use attention, prompts, routing, diffusion, Mamba, or bridge features to exchange task evidence, but most of them organize this evidence implicitly. They usually fuse task features by similarity or affinity, without explicitly modeling that evidence reliability varies across tasks and spatial locations. As a result, unreliable evidence may contaminate the shared representation and intensify negative transfer. We propose -Net, a controlled posterior bridge learning framework for multi-task dense prediction. Our method decomposes decoder-side interaction into reliability estimation, posterior bridge construction, and bounded redistribution. The Precision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
