Hybrid Reinforced Medical Report Generation with M-Linear Attention and Repetition Penalty
Wenting Xu, Zhenghua Xu, Junyang Chen, Chang Qi, Thomas Lukasiewicz

TL;DR
This paper introduces a novel medical report generation method combining high-order feature interactions, a hybrid reward system, and a repetition penalty, significantly improving report quality and reducing generation errors.
Contribution
It proposes a hybrid reinforcement learning framework with m-linear attention and a search algorithm for optimal reward weighting, addressing key limitations of existing methods.
Findings
Outperforms state-of-the-art baselines on two public datasets
Effectively reduces repeated terms in generated reports
Demonstrates efficiency of the reward search algorithm
Abstract
To reduce doctors' workload, deep-learning-based automatic medical report generation has recently attracted more and more research efforts, where deep convolutional neural networks (CNNs) are employed to encode the input images, and recurrent neural networks (RNNs) are used to decode the visual features into medical reports automatically. However, these state-of-the-art methods mainly suffer from three shortcomings: (i) incomprehensive optimization, (ii) low-order and unidimensional attention mechanisms, and (iii) repeated generation. In this article, we propose a hybrid reinforced medical report generation method with m-linear attention and repetition penalty mechanism (HReMRG-MR) to overcome these problems. Specifically, a hybrid reward with different weights is employed to remedy the limitations of single-metric-based rewards. We also propose a search algorithm with linear complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Image Retrieval and Classification Techniques
