Can Commercial LLMs Be Parliamentary Political Companions? Comparing LLM Reasoning Against Romanian Legislative Expuneri de Motive
Iulian Luc\u{a}u, Adelin-George Voicu

TL;DR
This study assesses the reasoning capabilities of various commercial LLMs against Romanian legislative explanations, revealing strengths in frontier models and limitations due to contextual ignorance and confabulation.
Contribution
It introduces a novel evaluation framework comparing LLM reasoning to official legislative reasoning and conceptualizes the legislator-LLM relationship through principal-agent theory.
Findings
Frontier models outperform open-weight models in semantic similarity scores.
All models show task-dependent confabulation, especially on politically idiosyncratic proposals.
Failures in reasoning compound across bounded rationality levels, influenced by training data coverage.
Abstract
This paper evaluates whether commercial large language models (LLMs) can function as reliable political advisory tools by comparing their outputs against official legislative reasoning. Using a dataset of 15 Romanian Senate law proposals paired with their official explanatory memoranda (expuneri de motive), we test six LLMs spanning three provider families and multiple capability tiers: GPT-5-mini, GPT-5-chat (OpenAI), Claude Haiku 4.5 (Anthropic), and Llama 4 Maverick, Llama 3.3 70B, and Llama 3.1 8B (Meta). Each model generates predicted rationales evaluated through a dual framework combining LLM-as-Judge semantic scoring and programmatic text similarity metrics. We frame the LLM-politician relationship through principal-agent theory and bounded rationality, conceptualizing the legislator as a principal delegating advisory tasks to a boundedly rational agent under structural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
