FigureNet: A Deep Learning model for Question-Answering on Scientific Plots
Revanth Reddy, Rahul Ramesh, Ameet Deshpande, Mitesh M. Khapra

TL;DR
FigureNet is a deep learning model designed to answer questions about scientific plots by identifying plot elements, quantifying values, and understanding their relationships, demonstrating improved accuracy and efficiency over previous methods.
Contribution
We introduce FigureNet, a novel architecture that effectively reasons about scientific plots, outperforming existing models in accuracy and training efficiency.
Findings
Outperforms Relation Networks baseline by ~7% on FigureQA dataset.
Reduces training time by over an order of magnitude.
Successfully identifies plot elements and their relationships for question-answering.
Abstract
Deep Learning has managed to push boundaries in a wide variety of tasks. One area of interest is to tackle problems in reasoning and understanding, with an aim to emulate human intelligence. In this work, we describe a deep learning model that addresses the reasoning task of question-answering on categorical plots. We introduce a novel architecture FigureNet, that learns to identify various plot elements, quantify the represented values and determine a relative ordering of these statistical values. We test our model on the FigureQA dataset which provides images and accompanying questions for scientific plots like bar graphs and pie charts, augmented with rich annotations. Our approach outperforms the state-of-the-art Relation Networks baseline by approximately on this dataset, with a training time that is over an order of magnitude lesser.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
