AITP: Traffic Accident Responsibility Allocation via Multimodal Large Language Models

Zijin Zhou; Songan Zhang

arXiv:2604.20878·cs.CL·April 24, 2026

AITP: Traffic Accident Responsibility Allocation via Multimodal Large Language Models

Zijin Zhou, Songan Zhang

PDF

TL;DR

This paper introduces AITP, a multimodal large language model designed for traffic accident responsibility allocation, leveraging reasoning mechanisms and legal knowledge integration, and presents a comprehensive benchmark for traffic accident reasoning tasks.

Contribution

The paper presents AITP, a novel multimodal LLM with reasoning and legal knowledge integration, and introduces DecaTARA, a large benchmark for traffic accident reasoning tasks.

Findings

01

AITP achieves state-of-the-art performance in responsibility allocation.

02

DecaTARA includes 67,941 videos and 195,821 QA pairs for comprehensive evaluation.

03

The approach establishes a new paradigm for reasoning-driven multimodal traffic analysis.

Abstract

Multimodal Large Language Models (MLLMs) have achieved remarkable progress in Traffic Accident Detection (TAD) and Traffic Accident Understanding (TAU). However, existing studies mainly focus on describing and interpreting accident videos, leaving room for deeper causal reasoning and integration of legal knowledge. Traffic Accident Responsibility Allocation (TARA) is a more challenging task that requires multi-step reasoning grounded in traffic regulations. To address this, we introduce AITP (Artificial Intelligence Traffic Police), a multimodal large language model for responsibility reasoning and allocation. AITP enhances reasoning via a Multimodal Chain-of-Thought (MCoT) mechanism and integrates legal knowledge through Retrieval-Augmented Generation (RAG). We further present DecaTARA, a decathlon-style benchmark unifying ten interrelated traffic accident reasoning tasks with 67,941…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.