Does QA-based intermediate training help fine-tuning language models for text classification?

Shiwei Zhang; Xiuzhen Zhang

arXiv:2112.15051 · cs.CL · January 3, 2022 · 1 citation

Open Access

TL;DR

This study investigates whether intermediate training on QA tasks improves fine-tuning of language models for text classification, revealing variable benefits across models and tasks.

Contribution

It provides empirical evidence on the effects of QA-based intermediate training across multiple models and classification tasks, highlighting inconsistent transfer performance.

Findings

01 QA training benefits vary across models

02 Similar QA tasks lead to better transfer performance

03 Transfer performance is inconsistent across different models

Abstract

Fine-tuning pre-trained language models for downstream tasks has become the norm in NLP. Recent work has found that intermediate training on high-level inference tasks such as Question Answering (QA) can improve the performance of some language models on target tasks. However, it is not clear whether intermediate training generally benefits various language models. In this paper, we use the SQuAD-2.0 QA task as intermediate training for target text classification tasks, and experiment on eight single-sequence classification tasks and eight sequence-pair classification tasks using two base and two compact language models. Our experiments show that QA-based intermediate training yields varying transfer performance across language models, except on target tasks that are similar to QA.
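The size of the experimental design described in the abstract can be sketched as a simple grid. This is only an illustration: the abstract gives counts, not names, so every model and task name below is a placeholder, and the two-condition comparison (direct fine-tuning versus SQuAD-2.0 intermediate training followed by fine-tuning) is an assumption about how the effect would be measured.

```python
from itertools import product

# Placeholder names: the abstract states only the counts, not the identities.
MODELS = [f"base_model_{i}" for i in (1, 2)] + [f"compact_model_{i}" for i in (1, 2)]
SINGLE_TASKS = [f"single_seq_task_{i}" for i in range(1, 9)]
PAIR_TASKS = [f"seq_pair_task_{i}" for i in range(1, 9)]

# Assumed comparison: fine-tune directly, or train on SQuAD-2.0 first.
CONDITIONS = ["direct_fine_tuning", "squad2_then_fine_tuning"]


def experiment_grid():
    """Return every (model, task, condition) combination as a tuple."""
    return list(product(MODELS, SINGLE_TASKS + PAIR_TASKS, CONDITIONS))


grid = experiment_grid()
print(len(grid))  # 4 models * 16 tasks * 2 conditions = 128
```

Under these assumptions, each model is evaluated on 16 target tasks in both conditions, which is what allows transfer gains to be compared across models rather than reported for a single model.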

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

Methods: Balanced Selection