Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report   Generation

Christy Y. Li; Xiaodan Liang; Zhiting Hu; Eric P. Xing

arXiv:1805.08298·cs.CV·November 27, 2018·153 cites

Hybrid Retrieval-Generation Reinforced Agent for Medical Image Report Generation

Christy Y. Li, Xiaodan Liang, Zhiting Hu, Eric P. Xing

PDF

Open Access

TL;DR

This paper introduces HRGR-Agent, a hybrid model combining retrieval and generation for medical report creation, achieving state-of-the-art results with diverse, coherent, and accurate reports guided by reinforcement learning.

Contribution

The novel HRGR-Agent framework integrates retrieval and generation with hierarchical decision-making and reinforcement learning for improved medical report generation.

Findings

01

Achieves state-of-the-art results on two datasets.

02

Generates well-structured and diverse reports.

03

Improves detection accuracy of medical terminologies.

Abstract

Generating long and coherent reports to describe medical images poses challenges to bridging visual patterns with informative human linguistic descriptions. We propose a novel Hybrid Retrieval-Generation Reinforced Agent (HRGR-Agent) which reconciles traditional retrieval-based approaches populated with human prior knowledge, with modern learning-based approaches to achieve structured, robust, and diverse report generation. HRGR-Agent employs a hierarchical decision-making procedure. For each sentence, a high-level retrieval policy module chooses to either retrieve a template sentence from an off-the-shelf template database, or invoke a low-level generation module to generate a new sentence. HRGR-Agent is updated via reinforcement learning, guided by sentence-level and word-level rewards. Experiments show that our approach achieves the state-of-the-art results on two medical report…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning