A reinforcement learning approach for VQA validation: an application to   diabetic macular edema grading

Tatiana Fountoukidou; Raphael Sznitman

arXiv:2307.09886·cs.CV·July 20, 2023

A reinforcement learning approach for VQA validation: an application to diabetic macular edema grading

Tatiana Fountoukidou, Raphael Sznitman

PDF

TL;DR

This paper presents a reinforcement learning-based method to validate Visual Question Answering algorithms in medical imaging, specifically for diabetic macular edema grading, by simulating clinical reasoning through an adaptive questioning approach.

Contribution

It introduces an RL agent for automatic, adaptive questioning to evaluate VQA models' reasoning, enhancing validation beyond traditional static methods.

Findings

01

The RL agent asks clinically relevant questions similar to a clinician.

02

The approach improves understanding of VQA model reasoning in medical diagnosis.

03

Demonstrated effectiveness in diabetic macular edema grading context.

Abstract

Recent advances in machine learning models have greatly increased the performance of automated methods in medical image analysis. However, the internal functioning of such models is largely hidden, which hinders their integration in clinical practice. Explainability and trust are viewed as important aspects of modern methods, for the latter's widespread use in clinical communities. As such, validation of machine learning models represents an important aspect and yet, most methods are only validated in a limited way. In this work, we focus on providing a richer and more appropriate validation approach for highly powerful Visual Question Answering (VQA) algorithms. To better understand the performance of these methods, which answer arbitrary questions related to images, this work focuses on an automatic visual Turing test (VTT). That is, we propose an automatic adaptive questioning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocus