Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes
Tim Schopf, Alexander Blatzheim, Nektarios Machner, Florian Matthes

TL;DR
FusionSent is a novel approach that fine-tunes and fuses sentence embedding models for efficient, few-shot multi-label classification of scientific documents, significantly outperforming baselines on new and existing datasets.
Contribution
The paper introduces FusionSent, a fusion-based sentence embedding fine-tuning method for few-shot classification of scientific documents with many classes, along with a new large-scale dataset.
Findings
FusionSent outperforms strong baselines by 6 F1 points.
The method is efficient and effective for label-scarce scenarios.
A new dataset with 203,961 articles and 130 classes is provided.
Abstract
Scientific document classification is a critical task and often involves many classes. However, collecting human-labeled data for many classes is expensive and usually leads to label-scarce scenarios. Moreover, recent work has shown that sentence embedding model fine-tuning for few-shot classification is efficient, robust, and effective. In this work, we propose FusionSent (Fusion-based Sentence Embedding Fine-tuning), an efficient and prompt-free approach for few-shot classification of scientific documents with many classes. FusionSent uses available training examples and their respective label texts to contrastively fine-tune two different sentence embedding models. Afterward, the parameters of both fine-tuned models are fused to combine the complementary knowledge from the separate fine-tuning steps into a single model. Finally, the resulting sentence embedding model is frozen to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching Methods · Text and Document Classification Technologies
