SGW-based Multi-Task Learning in Vision Tasks

Ruiyuan Zhang; Yuyao Chen; Yuchi Huo; Jiaxiang Liu; Dianbing Xi; Jie Liu; Chao Wu

arXiv:2410.03778 · cs.CV · October 8, 2024

TL;DR

This paper introduces a novel SGW-based multi-task learning framework that employs an information bottleneck and neural collapse to improve knowledge sharing and task performance in complex vision tasks.

Contribution

It proposes a new KEM module with ETF space projection to reduce inter-task interference and enhance robustness in multi-task learning.

Findings

01. Significant performance improvements over existing methods.

02. Effective reduction of inter-task interference.

03. Enhanced robustness through ETF space projection.

Abstract

Multi-task learning (MTL) is a multi-target optimization task. Within MTL, neural networks try to realize each target using a shared interpretative space. However, as the scale of datasets expands and the complexity of tasks increases, knowledge sharing becomes increasingly challenging. In this paper, we first re-examine previous cross-attention MTL methods from the perspective of noise. We theoretically analyze this issue and identify it as a flaw in the cross-attention mechanism. To address this issue, we propose an information bottleneck knowledge extraction module (KEM). This module aims to reduce inter-task interference by constraining the flow of information, thereby reducing computational complexity. Furthermore, we employ neural collapse to stabilize the knowledge-selection process. That is, before the features are input to KEM, we project them into ETF space. This mapping makes…
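The abstract does not spell out how the ETF projection is built, so the following is only a minimal sketch of the standard simplex equiangular tight frame (ETF) from the neural collapse literature, not the authors' implementation. The function names `simplex_etf` and `project_to_etf`, the random orthonormal basis, and the choice of plain matrix multiplication as the "projection" are all assumptions made for illustration. A simplex ETF with K directions has unit-norm columns whose pairwise cosine similarity is exactly -1/(K-1), which is the maximally separated configuration.

```python
import numpy as np


def simplex_etf(num_directions: int, dim: int, seed: int = 0) -> np.ndarray:
    """Return a (dim, K) matrix whose columns form a simplex ETF.

    Hypothetical helper: uses M = sqrt(K / (K - 1)) * U @ (I - 11^T / K),
    where U has orthonormal columns. Requires 2 <= K <= dim.
    """
    k = num_directions
    if k < 2 or dim < k:
        raise ValueError("need 2 <= num_directions <= dim")
    rng = np.random.default_rng(seed)
    # Reduced QR of a random Gaussian matrix gives orthonormal columns U.
    u, _ = np.linalg.qr(rng.standard_normal((dim, k)))
    centering = np.eye(k) - np.ones((k, k)) / k
    return np.sqrt(k / (k - 1)) * u @ centering


def project_to_etf(features: np.ndarray, etf: np.ndarray) -> np.ndarray:
    """Express (N, dim) features by their coordinates along the K ETF directions.

    Assumption: a plain linear map stands in for the paper's projection step.
    """
    return features @ etf


if __name__ == "__main__":
    etf = simplex_etf(num_directions=4, dim=16)
    gram = etf.T @ etf
    print(np.round(gram, 3))  # diagonal entries 1, off-diagonal entries -1/3
    feats = np.random.default_rng(1).standard_normal((8, 16))
    print(project_to_etf(feats, etf).shape)
```

In this sketch the ETF is fixed rather than learned, which is one common way such a frame is used to keep target directions stable during training; whether KEM uses a fixed or learned frame is not stated in the abstract.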

Taxonomy

Topics: Infrared Target Detection Methodologies