SG-LegalCite: A Principle-Augmented Benchmark for Legal Citation Retrieval in Singapore Law
Shannon Lee Yueh Ern, Kaidong Feng, Yingpeng Du, Chloe Lee En Jia, Zhu Sun

TL;DR
This paper introduces SG-LegalCite, a new benchmark dataset for legal citation retrieval in Singapore law that emphasizes the importance of explicit legal principles in the retrieval process.
Contribution
It proposes a principle-augmented retrieval paradigm and provides a large dataset to improve legal citation retrieval accuracy in Singapore's unique legal context.
Findings
Explicit legal principles significantly improve retrieval accuracy.
SG-LegalCite contains 100,890 case-principle pairs from Singapore Supreme Court judgments.
Experiments show the effectiveness of the principle-augmented approach.
Abstract
Legal citation in common-law systems depends not only on factual similarity, but also on the legal principle for which a precedent is invoked. However, existing benchmarks for legal citation retrieval use case facts, citation context, or full judgments as inputs, where the governing legal principle is often missing or only implicitly expressed and entangled with broader context. As a result, models may retrieve precedents that are factually similar yet doctrinally irrelevant. This limitation is particularly consequential in Singapore, where the legal system has evolved independently: only domestic precedents are binding, while foreign authorities serve merely as persuasive references. Thus, we propose a new retrieval paradigm that ranks cited cases based on queries integrating case facts and explicit legal principles, inspired by real-world legal reasoning workflows. To support this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
