CoPA: Benchmarking Personalized Question Answering with Data-Informed Cognitive Factors

Hang Su; Zequn Liu; Chen Hu; Xuesong Lu; Yingce Xia; and Zhen Liu

arXiv:2604.14773·cs.CL·April 17, 2026

CoPA: Benchmarking Personalized Question Answering with Data-Informed Cognitive Factors

Hang Su, Zequn Liu, Chen Hu, Xuesong Lu, Yingce Xia, and Zhen Liu

PDF

1 Repo 1 Datasets

TL;DR

CoPA is a new benchmark for evaluating personalized question answering by measuring how well models align with individual user preferences derived from interaction data.

Contribution

It introduces a data-driven method to assess personalization in QA models using six cognitive factors and a benchmark with nearly 2,000 user profiles.

Findings

01

CoPA enables fine-grained, factor-level evaluation of personalized QA.

02

It provides a more comprehensive standard than generic metrics.

03

The benchmark correlates well with user-specific preferences.

Abstract

While LLMs have demonstrated remarkable potential in Question Answering (QA), evaluating personalization remains a critical bottleneck. Existing paradigms predominantly rely on lexical-level similarity or manual heuristics, often lacking sufficient data-driven validation. We address this by mining Community-Individual Preference Divergence (CIPD), where individual choices override consensus, to distill six key personalization factors as evaluative dimensions. Accordingly, we introduce CoPA, a benchmark with 1,985 user profiles for fine-grained, factor-level assessment. By quantifying the alignment between model outputs and user-specific cognitive preferences inferred from interaction patterns, CoPA provides a more comprehensive and discriminative standard for evaluating personalized QA than generic metrics. The code is available at https://github.com/bjzgcai/CoPA.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bjzgcai/CoPA
github

Datasets

sssss-hang/CoPA
dataset· 409 dl
409 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.