Reinforcement learning using Deep Q Networks and Q learning accurately   localizes brain tumors on MRI with very small training sets

Joseph N Stember; Hrithwik Shalu

arXiv:2010.10763·cs.CV·March 22, 2021

Reinforcement learning using Deep Q Networks and Q learning accurately localizes brain tumors on MRI with very small training sets

Joseph N Stember, Hrithwik Shalu

PDF

TL;DR

This study demonstrates that Deep Q Networks, a reinforcement learning approach, can accurately localize brain tumors in MRI images with very small training sets, outperforming supervised methods that tend to overfit.

Contribution

The paper introduces a generalized Deep Q learning method in a gridworld environment for medical image analysis, reducing the need for extensive annotations.

Findings

01

Deep Q learning achieved 70% accuracy on test images.

02

Supervised keypoint detection overfitted with only 11% accuracy.

03

Reinforcement learning showed improved generalization over training.

Abstract

Purpose Supervised deep learning in radiology suffers from notorious inherent limitations: 1) It requires large, hand-annotated data sets, 2) It is non-generalizable, and 3) It lacks explainability and intuition. We have recently proposed Reinforcement Learning to address all threes. However, we applied it to images with radiologist eye tracking points, which limits the state-action space. Here we generalize the Deep-Q Learning to a gridworld-based environment, so that only the images and image masks are required. Materials and Methods We trained a Deep Q network on 30 two-dimensional image slices from the BraTS brain tumor database. Each image contained one lesion. We then tested the trained Deep Q network on a separate set of 30 testing set images. For comparison, we also trained and tested a keypoint detection supervised deep learning network for the same set of training / testing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.