From Chat Logs to Collective Insights: Aggregative Question Answering

Wentao Zhang; Woojeong Kim; Yuntian Deng

arXiv:2505.23765·cs.CL·September 23, 2025

From Chat Logs to Collective Insights: Aggregative Question Answering

Wentao Zhang, Woojeong Kim, Yuntian Deng

PDF

Open Access 1 Video

TL;DR

This paper introduces a new task called Aggregative Question Answering that involves reasoning over large-scale chatbot conversations to extract collective insights, supported by a benchmark dataset and experimental analysis.

Contribution

It proposes the novel task of Aggregative Question Answering, creates the WildChat-AQA benchmark, and highlights the challenges faced by current methods in reasoning and computational efficiency.

Findings

01

Existing methods struggle with reasoning accuracy.

02

Current approaches are computationally expensive.

03

The benchmark enables future research in collective conversational insights.

Abstract

Conversational agents powered by large language models (LLMs) are rapidly becoming integral to our daily interactions, generating unprecedented amounts of conversational data. Such datasets offer a powerful lens into societal interests, trending topics, and collective concerns. Yet, existing approaches typically treat these interactions as independent and miss critical insights that could emerge from aggregating and reasoning across large-scale conversation logs. In this paper, we introduce Aggregative Question Answering, a novel task requiring models to reason explicitly over thousands of user-chatbot interactions to answer aggregative queries, such as identifying emerging concerns among specific demographics. To enable research in this direction, we construct a benchmark, WildChat-AQA, comprising 6,027 aggregative questions derived from 182,330 real-world chatbot conversations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

From Chat Logs to Collective Insights: Aggregative Question Answering· underline

Taxonomy

TopicsTopic Modeling · AI in Service Interactions · Expert finding and Q&A systems