Semantic-Drive: Democratizing Long-Tail Data Curation via Open-Vocabulary Grounding and Neuro-Symbolic VLM Consensus

Antonio Guillen-Perez

arXiv:2512.12012·cs.CV·December 17, 2025

Semantic-Drive: Democratizing Long-Tail Data Curation via Open-Vocabulary Grounding and Neuro-Symbolic VLM Consensus

Antonio Guillen-Perez

PDF

Open Access

TL;DR

Semantic-Drive is a neuro-symbolic framework that enhances long-tail safety-critical event detection in autonomous vehicle data by combining real-time open-vocabulary grounding with forensic scene analysis, all on local hardware.

Contribution

It introduces a novel local neuro-symbolic approach with a multi-model consensus mechanism for precise, privacy-preserving long-tail data curation in autonomous driving.

Findings

01

Achieves 0.966 recall on nuScenes, outperforming CLIP's 0.475

02

Reduces risk assessment error by 40% compared to single models

03

Operates entirely on consumer hardware, ensuring privacy

Abstract

The development of robust Autonomous Vehicles (AVs) is bottlenecked by the scarcity of "Long-Tail" training data. While fleets collect petabytes of video logs, identifying rare safety-critical events (e.g., erratic jaywalking, construction diversions) remains a manual, cost-prohibitive process. Existing solutions rely on coarse metadata search, which lacks precision, or cloud-based VLMs, which are privacy-invasive and expensive. We introduce Semantic-Drive, a local-first, neuro-symbolic framework for semantic data mining. Our approach decouples perception into two stages: (1) Symbolic Grounding via a real-time open-vocabulary detector (YOLOE) to anchor attention, and (2) Cognitive Analysis via a Reasoning VLM that performs forensic scene analysis. To mitigate hallucination, we implement a "System 2" inference-time alignment strategy, utilizing a multi-model "Judge-Scout" consensus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning