TL;DR
This paper introduces Match-Prompt, a prompt learning approach that enhances the multi-task generalization ability of pre-trained language models in text matching by focusing on essential matching signals across diverse tasks.
Contribution
It proposes a two-stage training strategy using prompt tokens to improve generalization across multiple text matching tasks, outperforming traditional fine-tuning methods.
Findings
Improves multi-task generalization in text matching models.
Enhances out-of-domain and new task adaptation performance.
Outperforms previous fine-tuning paradigms.
Abstract
Text matching is a fundamental technique in both information retrieval and natural language processing. Text matching tasks share the same paradigm that determines the relationship between two given texts. The relationships vary from task to task, e.g.~relevance in document retrieval, semantic alignment in paraphrase identification and answerable judgment in question answering. However, the essential signals for text matching remain in a finite scope, i.e.~exact matching, semantic matching, and inference matching. Ideally, a good text matching model can learn to capture and aggregate these signals for different matching tasks to achieve competitive performance, while recent state-of-the-art text matching models, e.g.~Pre-trained Language Models (PLMs), are hard to generalize. It is because the end-to-end supervised learning on task-specific dataset makes model overemphasize the data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
