SteuerLLM: Local specialized large language model for German tax law analysis
Sebastian Wind, Jeta Sopa, Laurin Schmid, Quirin Jackl, Sebastian Kiefer, Fei Wu, Martin Mayr, Harald Köstler, Gerhard Wellein, Andreas Maier, Soroosh Tayebi Arasteh

TL;DR
SteuerLLM is a specialized large language model for German tax law. Trained on domain-specific synthetic data and evaluated on a structured benchmark, it outperforms comparable general-purpose models.
Contribution
The paper introduces SteuerEx, a novel benchmark for German tax law, and SteuerLLM, a domain-adapted LLM trained on synthetic data, demonstrating improved performance over general models.
Findings
SteuerLLM outperforms comparable general-purpose models.
Domain-specific training data enhances legal reasoning accuracy.
Open release of datasets and models supports reproducible research.
Abstract
Large language models (LLMs) demonstrate strong general reasoning and language understanding, yet their performance degrades in domains governed by strict formal rules, precise terminology, and legally binding structure. Tax law exemplifies these challenges, as correct answers require exact statutory citation, structured legal argumentation, and numerical accuracy under rigid grading schemes. We introduce SteuerEx, the first open benchmark derived from authentic German university tax law examinations. SteuerEx comprises 115 expert-validated examination questions spanning six core tax law domains and multiple academic levels, and employs a statement-level, partial-credit evaluation framework that closely mirrors real examination practice. We further present SteuerLLM, a domain-adapted LLM for German tax law trained on a large-scale synthetic dataset generated from…
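To make the idea of statement-level, partial-credit evaluation concrete, the following is a minimal sketch of how such a score could be computed. It is not the SteuerEx implementation: the `Statement` structure, the rubric entries, the point values, and the points-weighted aggregation in `benchmark_score` are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class Statement:
    """One gradable statement from a hypothetical exam rubric."""
    description: str
    points: float   # maximum points available for this statement
    awarded: float  # points the grader gave for this statement


def question_score(statements):
    """Return (awarded, maximum) points for a single question."""
    # Clamp each award to [0, points] so a grading slip cannot inflate the total.
    awarded = sum(min(max(s.awarded, 0.0), s.points) for s in statements)
    maximum = sum(s.points for s in statements)
    return awarded, maximum


def benchmark_score(questions):
    """Fraction of all achievable points earned (assumed aggregation rule)."""
    totals = [question_score(q) for q in questions]
    awarded = sum(a for a, _ in totals)
    maximum = sum(m for _, m in totals)
    return awarded / maximum if maximum else 0.0


# Illustrative rubric values only; these are not taken from SteuerEx.
example_question = [
    Statement("cites the applicable statutory provision", 2.0, 2.0),
    Statement("applies the provision to the facts", 3.0, 1.5),
    Statement("computes the resulting amount", 1.0, 0.0),
]

print(question_score(example_question))
print(benchmark_score([example_question]))
```

Scoring each statement separately lets an answer earn credit for a correct citation even when the final computation is wrong, which is the behavior a partial-credit scheme is meant to capture.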
Taxonomy
Topics: Artificial Intelligence in Law · Text Readability and Simplification · Topic Modeling
