PolInterviews -- A Dataset of German Politician Public Broadcast Interviews
Lukas Birkenmaier, Laureen Sieber, Felix Bergstein

TL;DR
This paper introduces PolInterviews, a comprehensive dataset of German politician interviews from YouTube, enabling research on political communication dynamics and politician behavior in German media contexts.
Contribution
It provides the first open, transcribed dataset of German politician interviews with detailed speaker info, facilitating new research avenues.
Findings
Contains 99 interviews with 33 politicians
Includes 28,146 sentences in a structured format
Enables analysis of political communication patterns
Abstract
This paper presents a novel dataset of public broadcast interviews featuring high-ranking German politicians. The interviews were sourced from YouTube, transcribed, processed for speaker identification, and stored in a tidy and open format. The dataset comprises 99 interviews with 33 different German politicians across five major interview formats, containing a total of 28,146 sentences. As the first of its kind, this dataset offers valuable opportunities for research on various aspects of political communication in the (German) political contexts, such as agenda-setting, interviewer dynamics, or politicians' self-presentation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Sentiment Analysis and Opinion Mining · Media Studies and Communication
