AppellateGen: A Benchmark for Appellate Legal Judgment Generation

Hongkun Yang; Lionel Z. Wang; Wei Fan; Yiran Hu; Lixu Wang; Chenyu Liu; Yu Zeng; Shenghong Fu; Lei Gong; Zhengxin Zhang; Haoyang Li; Jiexin Zheng; Xin Xu

arXiv:2601.01331·cs.CY·March 31, 2026

AppellateGen: A Benchmark for Appellate Legal Judgment Generation

Hongkun Yang, Lionel Z. Wang, Wei Fan, Yiran Hu, Lixu Wang, Chenyu Liu, Yu Zeng, Shenghong Fu, Lei Gong, Zhengxin Zhang, Haoyang Li, Jiexin Zheng, Xin Xu

PDF

1 Repo

TL;DR

AppellateGen introduces a benchmark dataset and a multi-agent system for second-instance legal judgment generation, emphasizing reasoning over initial verdicts and evidentiary updates, highlighting challenges for current LLMs.

Contribution

The paper presents a new appellate judgment dataset and a multi-agent system to improve legal reasoning modeling in second-instance trials.

Findings

01

SLMAS enhances logical consistency in judgments.

02

Current LLMs struggle with complex appellate reasoning.

03

The dataset and code are publicly available.

Abstract

Legal judgment generation is a critical task in legal intelligence. However, existing research in legal judgment generation has predominantly focused on first-instance trials, relying on static fact-to-verdict mappings while neglecting the dialectical nature of appellate (second-instance) review. To address this, we introduce AppellateGen, a benchmark for second-instance legal judgment generation comprising 7,351 case pairs. The task requires models to draft legally binding judgments by reasoning over the initial verdict and evidentiary updates, thereby modeling the causal dependency between trial stages. We further propose a judicial Standard Operating Procedure (SOP)-based Legal Multi-Agent System (SLMAS) to simulate judicial workflows, which decomposes the generation process into discrete stages of issue identification, retrieval, and drafting. Experimental results indicate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://anonymous.4open.science/r/AppellateGen-5763
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.