CrossTune: Black-Box Few-Shot Classification with Label Enhancement
Danqing Luo, Chen Zhang, Yan Zhang, Haizhou Li

TL;DR
This paper introduces CrossTune, a label-enhanced black-box adaptation method for few-shot text classification that improves performance by modeling label semantics and leveraging ChatGPT-generated data, outperforming previous gradient-free tuning methods.
Contribution
CrossTune is a novel label-enhanced cross-attention network that models semantic relatedness between inputs and labels for black-box LLM adaptation without prompt search.
Findings
Outperforms previous black-box tuning methods by 5.7% on average.
Effective even without ChatGPT data augmentation.
Demonstrates strong results across seven benchmark datasets.
Abstract
Training or finetuning large-scale language models (LLMs) requires substantial computation resources, motivating recent efforts to explore parameter-efficient adaptation to downstream tasks. One approach is to treat these models as black boxes and use forward passes (Inference APIs) to interact with them. Current research focuses on adapting these black-box models to downstream tasks using gradient-free prompt optimization, but this often involves an expensive process of searching task-specific prompts. Therefore, we are motivated to study black-box language model adaptation without prompt search. Specifically, we introduce a label-enhanced cross-attention network called CrossTune, which models the semantic relatedness between the input text sequence and task-specific label descriptions. Its effectiveness is examined in the context of few-shot text classification. To improve the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
