Boosting Few-Shot Text Classification via Distribution Estimation
Han Liu, Feng Zhang, Xiaotong Zhang, Siyang Zhao, Fenglong Ma,, Xiao-Ming Wu, Hongyang Chen, Hong Yu, Xianchao Zhang

TL;DR
This paper introduces a novel distribution estimation method for few-shot text classification that leverages unlabeled query samples to improve performance, addressing challenges of negative transfer across categories.
Contribution
The paper proposes two strategies to estimate class distributions using unlabeled data, enhancing few-shot text classification without negative transfer effects.
Findings
Outperforms state-of-the-art baselines on eight datasets
Effective distribution estimation improves classification accuracy
Utilizes unlabeled query samples for better supervision
Abstract
Distribution estimation has been demonstrated as one of the most effective approaches in dealing with few-shot image classification, as the low-level patterns and underlying representations can be easily transferred across different tasks in computer vision domain. However, directly applying this approach to few-shot text classification is challenging, since leveraging the statistics of known classes with sufficient samples to calibrate the distributions of novel classes may cause negative effects due to serious category difference in text domain. To alleviate this issue, we propose two simple yet effective strategies to estimate the distributions of the novel classes by utilizing unlabeled query samples, thus avoiding the potential negative transfer issue. Specifically, we first assume a class or sample follows the Gaussian distribution, and use the original support set and the nearest…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Text and Document Classification Technologies · COVID-19 diagnosis using AI
