Diversity Over Quantity: A Lesson From Few Shot Relation Classification
Amir DN Cohen, Shauli Ravfogel, Shaltiel Shmidman, Yoav Goldberg

TL;DR
This paper emphasizes that in few-shot relation classification, increasing the diversity of relation types in training data improves generalization to unseen relations more effectively than merely increasing dataset size.
Contribution
The authors introduce REBEL-FS, a new benchmark with more diverse relation types, and demonstrate that relation diversity enhances FSRC performance.
Findings
Diverse relation training data improves generalization to unseen relations.
REBEL-FS contains an order of magnitude more relation types than existing datasets.
Diversity-focused data curation reduces the need for large datasets in FSRC.
Abstract
In few-shot relation classification (FSRC), models must generalize to novel relations with only a few labeled examples. While much of the recent progress in NLP has focused on scaling data size, we argue that diversity in relation types is more crucial for FSRC performance. In this work, we demonstrate that training on a diverse set of relations significantly enhances a model's ability to generalize to unseen relations, even when the overall dataset size remains fixed. We introduce REBEL-FS, a new FSRC benchmark that incorporates an order of magnitude more relation types than existing datasets. Through systematic experiments, we show that increasing the diversity of relation types in the training data leads to consistent gains in performance across various few-shot learning scenarios, including high-negative settings. Our findings challenge the common assumption that more data alone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Text and Document Classification Technologies
MethodsSparse Evolutionary Training
