A Mutual Information Maximization Approach for the Spurious Solution   Problem in Weakly Supervised Question Answering

Zhihong Shao; Lifeng Shang; Qun Liu; Minlie Huang

arXiv:2106.07174·cs.CL·June 15, 2021

A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering

Zhihong Shao, Lifeng Shang, Qun Liu, Minlie Huang

PDF

Open Access 1 Repo

TL;DR

This paper introduces a mutual information maximization method to address the spurious solution problem in weakly supervised question answering, improving model accuracy by leveraging semantic correlations.

Contribution

It proposes explicitly maximizing mutual information between questions and solutions to better distinguish correct solutions from spurious ones.

Findings

01

Significant performance improvements over previous methods.

02

More effective training in producing correct solutions.

03

Validated on four question answering datasets.

Abstract

Weakly supervised question answering usually has only the final answers as supervision signals while the correct solutions to derive the answers are not provided. This setting gives rise to the spurious solution problem: there may exist many spurious solutions that coincidentally derive the correct answer, but training on such solutions can hurt model performance (e.g., producing wrong solutions or answers). For example, for discrete reasoning tasks as on DROP, there may exist many equations to derive a numeric answer, and typically only one of them is correct. Previous learning methods mostly filter out spurious solutions with heuristics or using model confidence, but do not explicitly exploit the semantic correlations between a question and its solution. In this paper, to alleviate the spurious solution problem, we propose to explicitly exploit such semantic correlations by maximizing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ZhihongShao/MIMAX
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications