The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution

Erjian Zhang; Yatong Hao; Liejun Wang; Zhiqing Guo

arXiv:2605.22635·cs.LG·May 22, 2026

The Double Dilemma in Multi-Task Radiology Report Generation: A Gradient Dynamics Analysis and Solution

Erjian Zhang, Yatong Hao, Liejun Wang, Zhiqing Guo

PDF

1 Repo

TL;DR

This paper analyzes the limitations of linear scalarization in multi-task radiology report generation using gradient dynamics, and proposes a novel optimizer, CAME-Grad, to improve clinical report quality.

Contribution

The paper introduces CAME-Grad, a conflict-averse, magnitude-enhanced optimizer that addresses the double dilemma in multi-task learning for radiology report generation.

Findings

01

CAME-Grad improves performance across eight RRG methods.

02

Achieves an average of 2.3% improvement on MIMIC-CXR.

03

Achieves an average of 1.9% improvement on IU X-Ray.

Abstract

While multi-task learning based automatic radiology report generation (RRG) is widely adopted to ensure clinical consistency, most focus on architectural designs yet remain limited to coarse linear scalarization strategies. These strategies cannot effectively balance the hard constraints of discriminative clinical supervision with the smoothness requirements of report generation. To address these problems, we analyze the failure mechanism of linear scalarization from the perspective of gradient dynamics, utilizing the stochastic differential equation (SDE) framework to characterize it as a "Double Dilemma" of drift term deviation and diffusion term decay. Based on this, we propose a backbone-agnostic optimizer named Conflict-Averse Magnitude-Enhanced Gradient Descent (CAME-Grad). Through conflict-averse direction rectification and magnitude-enhanced energy injection, the algorithm not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vpsg-research/CAME-Grad
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.