DVQA: Understanding Data Visualizations via Question Answering

Kushal Kafle; Brian Price; Scott Cohen; Christopher Kanan

arXiv:1801.08163·cs.CV·March 30, 2018·27 cites

DVQA: Understanding Data Visualizations via Question Answering

Kushal Kafle, Brian Price, Scott Cohen, Christopher Kanan

PDF

Open Access 1 Repo

TL;DR

DVQA introduces a dataset and methods for understanding bar charts through question answering, addressing the challenge of extracting numeric and semantic info from diverse visualizations.

Contribution

The paper presents DVQA, a novel dataset for bar chart question answering, and proposes baseline algorithms that outperform existing visual question answering models.

Findings

01

State-of-the-art VQA models perform poorly on DVQA.

02

Proposed baselines significantly outperform existing models on DVQA.

03

DVQA enables automatic extraction of information from bar charts in various domains.

Abstract

Bar charts are an effective way to convey numeric information, but today's algorithms cannot parse them. Existing methods fail when faced with even minor variations in appearance. Here, we present DVQA, a dataset that tests many aspects of bar chart understanding in a question answering framework. Unlike visual question answering (VQA), DVQA requires processing words and answers that are unique to a particular bar chart. State-of-the-art VQA algorithms perform poorly on DVQA, and we propose two strong baselines that perform considerably better. Our work will enable algorithms to automatically extract numeric and semantic information from vast quantities of bar charts found in scientific publications, Internet articles, business reports, and many other areas.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kushalkafle/DVQA_dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Topic Modeling · Anomaly Detection Techniques and Applications