An Empirical Study on Few-shot Knowledge Probing for Pretrained Language   Models

Tianxing He; Kyunghyun Cho; James Glass

arXiv:2109.02772·cs.AI·September 14, 2021·5 cites

An Empirical Study on Few-shot Knowledge Probing for Pretrained Language Models

Tianxing He, Kyunghyun Cho, James Glass

PDF

Open Access 1 Repo

TL;DR

This paper investigates the effectiveness of few-shot learning approaches for probing world knowledge in pretrained language models, introducing a new dataset for 2-hop relations and demonstrating that simple fine-tuning methods outperform prompt engineering in low-data scenarios.

Contribution

It compares various few-shot knowledge probing methods, introduces TREx-2p dataset for 2-hop relations, and shows simple bias fine-tuning surpasses prompt engineering techniques.

Findings

01

Few-shot examples significantly improve probing performance.

02

Fine-tuning bias vectors outperforms prompt-engineering methods.

03

The new TREx-2p dataset enables evaluation of 2-hop relations.

Abstract

Prompt-based knowledge probing for 1-hop relations has been used to measure how much world knowledge is stored in pretrained language models. Existing work uses considerable amounts of data to tune the prompts for better performance. In this work, we compare a variety of approaches under a few-shot knowledge probing setting, where only a small number (e.g., 10 or 20) of example triples are available. In addition, we create a new dataset named TREx-2p, which contains 2-hop relations. We report that few-shot examples can strongly boost the probing performance for both 1-hop and 2-hop relations. In particular, we find that a simple-yet-effective approach of finetuning the bias vectors in the model outperforms existing prompt-engineering methods. Our dataset and code are available at \url{https://github.com/cloudygoose/fewshot_lama}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cloudygoose/fewshot_lama
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications