Why Did the Chicken Cross the Road? Rephrasing and Analyzing Ambiguous Questions in VQA

Elias Stengel-Eskin; Jimena Guallar-Blasco; Yi Zhou; Benjamin Van Durme

arXiv:2211.07516·cs.CL·June 5, 2023

TL;DR

This paper addresses ambiguity in visual questions by creating a dataset, analyzing linguistic causes, and developing a question-generation model that reduces ambiguity and integrates answer group information.

Contribution

It introduces a dataset of ambiguous visual questions, analyzes their linguistic causes, and proposes a question-generation model that reduces ambiguity without direct supervision.

Findings

01. The dataset reveals a linguistically-aligned ontology of ambiguity reasons.

02. The question-generation model produces less ambiguous questions.

03. The model effectively integrates answer group information without explicit supervision.

Abstract

Natural language is ambiguous. Resolving ambiguous questions is key to successfully answering them. Focusing on questions about images, we create a dataset of ambiguous examples. We annotate these, grouping answers by the underlying question they address and rephrasing the question for each group to reduce ambiguity. Our analysis reveals a linguistically-aligned ontology of reasons for ambiguity in visual questions. We then develop an English question-generation model which we demonstrate via automatic and human evaluation produces less ambiguous questions. We further show that the question generation objective we use allows the model to integrate answer group information without any direct supervision.
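The annotation scheme described in the abstract (answers grouped by the underlying question they address, with one rephrased question per group) can be pictured as a small data structure. The following is a minimal sketch only: the class names, field names, image identifier, rephrasings, and answers are all hypothetical illustrations, not the schema or contents of the released dataset.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class AnswerGroup:
    """One reading of the question (hypothetical schema, not the dataset's)."""
    rephrased_question: str          # less ambiguous rewrite for this reading
    answers: List[str] = field(default_factory=list)


@dataclass
class AmbiguousExample:
    """An image question whose answers fall into one or more groups."""
    image_id: str
    original_question: str
    groups: List[AnswerGroup] = field(default_factory=list)

    def is_ambiguous(self) -> bool:
        # In this sketch, a question counts as ambiguous when its answers
        # address more than one underlying question.
        return len(self.groups) > 1

    def question_for(self, answer: str) -> Optional[str]:
        # Return the rephrased question of the group containing `answer`.
        for group in self.groups:
            if answer in group.answers:
                return group.rephrased_question
        return None


# Invented illustration based on the paper's title; not real annotation data.
example = AmbiguousExample(
    image_id="hypothetical-0001",
    original_question="Why did the chicken cross the road?",
    groups=[
        AnswerGroup(
            "What is the chicken's goal in crossing the road?",
            ["to get to the other side"],
        ),
        AnswerGroup(
            "What caused the chicken to cross the road?",
            ["it was chased", "a dog scared it"],
        ),
    ],
)
```

Under these assumptions, `example.question_for("it was chased")` looks up the rephrasing attached to that answer's group, which mirrors the idea of pairing each answer group with its own less ambiguous question.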

Code & Models

Repositories

esteng/ambiguous_vqa (PyTorch, official)

Taxonomy

Topics: Multimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques

Methods: Ontology