Striking a Balance: Alleviating Inconsistency in Pre-trained Models for   Symmetric Classification Tasks

Ashutosh Kumar; Aditya Joshi

arXiv:2203.13491·cs.CL·March 28, 2022

Striking a Balance: Alleviating Inconsistency in Pre-trained Models for Symmetric Classification Tasks

Ashutosh Kumar, Aditya Joshi

PDF

Open Access

TL;DR

This paper addresses the inconsistency issue in pre-trained models for symmetric classification tasks in NLP by applying a consistency loss, improving prediction stability without sacrificing accuracy across multiple datasets.

Contribution

It introduces a consistency loss function specifically designed to reduce prediction inconsistency in symmetric classification tasks during fine-tuning.

Findings

01

Significant improvement in prediction consistency on paraphrase detection datasets.

02

No notable decrease in classification accuracy across tested datasets.

03

Effective approach for symmetric tasks without compromising performance.

Abstract

While fine-tuning pre-trained models for downstream classification is the conventional paradigm in NLP, often task-specific nuances may not get captured in the resultant models. Specifically, for tasks that take two inputs and require the output to be invariant of the order of the inputs, inconsistency is often observed in the predicted labels or confidence scores. We highlight this model shortcoming and apply a consistency loss function to alleviate inconsistency in symmetric classification. Our results show an improved consistency in predictions for three paraphrase detection datasets without a significant drop in the accuracy scores. We examine the classification performance of six datasets (both symmetric and non-symmetric) to showcase the strengths and limitations of our approach.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques