P-Adapters: Robustly Extracting Factual Information from Language Models with Diverse Prompts
Benjamin Newman, Prafulla Kumar Choubey, Nazneen Rajani

TL;DR
This paper introduces P-Adapters, lightweight models that improve the consistency and accuracy of factual information extraction from LLMs across diverse prompts without requiring additional annotations.
Contribution
The work proposes P-Adapters, a novel lightweight approach that enhances factual extraction consistency from LLMs, outperforming complex MoE models without extra annotation needs.
Findings
P-Adapters achieve 12-26% improvement in precision.
P-Adapters show 36-50% improvement in consistency.
Access to original prompt embeddings is key to success.
Abstract
Recent work (e.g. LAMA (Petroni et al., 2019)) has found that the quality of the factual information extracted from Large Language Models (LLMs) depends on the prompts used to query them. This inconsistency is problematic because different users will query LLMs for the same information using different wording, but should receive the same, accurate responses regardless. In this work we aim to address this shortcoming by introducing P-Adapters: lightweight models that sit between the embedding layer and first attention layer of LLMs. They take LLM embeddings as input and output continuous prompts that are used to query the LLM. Additionally, we investigate Mixture of Experts (MoE) models that learn a set of continuous prompts ("experts") and select one to query the LLM. They require a separate classifier trained on human-annotated data to map natural language prompts to the continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Tanh Activation · Dropout · Layer Normalization · Dense Connections · Softmax · Residual Connection
