LaSQuE: Improved Zero-Shot Classification from Explanations Through   Quantifier Modeling and Curriculum Learning

Sayan Ghosh; Rakesh R Menon; Shashank Srivastava

arXiv:2212.09104·cs.CL·December 20, 2022

LaSQuE: Improved Zero-Shot Classification from Explanations Through Quantifier Modeling and Curriculum Learning

Sayan Ghosh, Rakesh R Menon, Shashank Srivastava

PDF

Open Access

TL;DR

LaSQuE is a novel method that enhances zero-shot classification by modeling linguistic quantifiers, aggregating explanations with attention, and employing curriculum learning, leading to significant generalization improvements.

Contribution

The paper introduces LaSQuE, a new approach that leverages quantifier semantics, explanation aggregation, and curriculum learning for improved zero-shot classification from language explanations.

Findings

01

Up to 7% improvement in generalization to unseen tasks.

02

Effective modeling of quantifiers enhances explanation-based learning.

03

Aggregation and curriculum strategies outperform prior methods.

Abstract

A hallmark of human intelligence is the ability to learn new concepts purely from language. Several recent approaches have explored training machine learning models via natural language supervision. However, these approaches fall short in leveraging linguistic quantifiers (such as 'always' or 'rarely') and mimicking humans in compositionally learning complex tasks. Here, we present LaSQuE, a method that can learn zero-shot classifiers from language explanations by using three new strategies - (1) modeling the semantics of linguistic quantifiers in explanations (including exploiting ordinal strength relationships, such as 'always' > 'likely'), (2) aggregating information from multiple explanations using an attention-based mechanism, and (3) model training via curriculum learning. With these strategies, LaSQuE outperforms prior work, showing an absolute gain of up to 7% in generalizing to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Explainable Artificial Intelligence (XAI)