LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Hung Yun Tseng; Wuzhen Li; Blerina Gkotse; Grigorios Chrysos

arXiv:2603.20572·cs.LG·March 24, 2026

LJ-Bench: Ontology-Based Benchmark for U.S. Crime

Hung Yun Tseng, Wuzhen Li, Blerina Gkotse, Grigorios Chrysos

PDF

Open Access

TL;DR

LJ-Bench introduces an ontology-based benchmark grounded in U.S. legal frameworks to evaluate LLM robustness against a wide spectrum of illegal activities, addressing gaps in existing benchmarks.

Contribution

This work presents the first comprehensive, ontology-based benchmark for assessing LLM vulnerability to diverse crime categories grounded in legal standards.

Findings

01

LLMs are more vulnerable to societal harm attacks than individual harm attacks.

02

The benchmark covers 76 crime types organized taxonomically.

03

Experiments reveal specific weaknesses in LLMs across different crime categories.

Abstract

The potential of Large Language Models (LLMs) to provide harmful information remains a significant concern due to the vast breadth of illegal queries they may encounter. Unfortunately, existing benchmarks only focus on a handful types of illegal activities, and are not grounded in legal works. In this work, we introduce an ontology of crime-related concepts grounded in the legal frameworks of Model Panel Code, which serves as an influential reference for criminal law and has been adopted by many U.S. states, and instantiated using Californian Law. This structured knowledge forms the foundation for LJ-Bench, the first comprehensive benchmark designed to evaluate LLM robustness against a wide range of illegal activities. Spanning 76 distinct crime types organized taxonomically, LJ-Bench enables systematic assessment of diverse attacks, revealing valuable insights into LLM vulnerabilities…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCybercrime and Law Enforcement Studies · Crime Patterns and Interventions · Crime, Illicit Activities, and Governance