FigureNet: A Deep Learning model for Question-Answering on Scientific   Plots

Revanth Reddy; Rahul Ramesh; Ameet Deshpande; Mitesh M. Khapra

arXiv:1806.04655·cs.LG·April 3, 2019

FigureNet: A Deep Learning model for Question-Answering on Scientific Plots

Revanth Reddy, Rahul Ramesh, Ameet Deshpande, Mitesh M. Khapra

PDF

TL;DR

FigureNet is a deep learning model designed to answer questions about scientific plots by identifying plot elements, quantifying values, and understanding their relationships, demonstrating improved accuracy and efficiency over previous methods.

Contribution

We introduce FigureNet, a novel architecture that effectively reasons about scientific plots, outperforming existing models in accuracy and training efficiency.

Findings

01

Outperforms Relation Networks baseline by ~7% on FigureQA dataset.

02

Reduces training time by over an order of magnitude.

03

Successfully identifies plot elements and their relationships for question-answering.

Abstract

Deep Learning has managed to push boundaries in a wide variety of tasks. One area of interest is to tackle problems in reasoning and understanding, with an aim to emulate human intelligence. In this work, we describe a deep learning model that addresses the reasoning task of question-answering on categorical plots. We introduce a novel architecture FigureNet, that learns to identify various plot elements, quantify the represented values and determine a relative ordering of these statistical values. We test our model on the FigureQA dataset which provides images and accompanying questions for scientific plots like bar graphs and pie charts, augmented with rich annotations. Our approach outperforms the state-of-the-art Relation Networks baseline by approximately $7%$ on this dataset, with a training time that is over an order of magnitude lesser.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.