SPICA: Retrieving Scenarios for Pluralistic In-Context Alignment
Quan Ze Chen, K.J. Kevin Feng, Chan Young Park, Amy X. Zhang

TL;DR
SPICA is a framework that improves in-context model alignment by considering group-level differences during example retrieval, leading to more accurate and equitable alignment across diverse demographic groups.
Contribution
SPICA introduces a novel retrieval method incorporating group differences, including scenario banks and group-informed metrics, enhancing alignment for diverse groups.
Findings
More accurate retrieval of preferences with SPICA
Higher user ratings with SPICA compared to similarity-based methods
More uniform benefits across different demographic groups
Abstract
When different groups' values differ, one approach to model alignment is to steer models at inference time towards each group's preferences. However, techniques like in-context learning only consider similarity when drawing few-shot examples and not cross-group differences in values. We propose SPICA, a framework that accounts for group-level differences during in-context example retrieval. SPICA introduces three designs: scenario banks, group-informed retrieval metrics, and in-context alignment prompts. From an evaluation of SPICA on an alignment task collecting inputs from four demographic groups (), our metrics retrieve in-context examples that more closely match observed preferences, with the best prompt configuration using multiple contrastive responses to demonstrate examples. In an end-to-end evaluation (), we observe that SPICA is higher rated than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsHigher Education Learning Practices
MethodsALIGN
