MILD: Mediator Agent System with Bidirectional Perception and Multi-Layered Alignment for Human-Vehicle Collaboration
Jiyao Wang, Yunbiao Wang, Yubo Jiao, Xiao Yang, Dengbo He, Sasan Jafarnejad, Luis Miranda-Moreno, Raphael Frank, and Jiangbo Yu

TL;DR
The paper introduces MILD, an agentic system architecture for human-vehicle collaboration that enhances transparency, alignment, and safety through perception, explainable strategies, and regulatory constraints, improving performance and user experience.
Contribution
It proposes the MILD system with a perception and strategy agent, along with ECPO and retrieval-augmented generation, to improve alignment, safety, and explainability in human-vehicle interaction.
Findings
MILD outperforms baselines in perception accuracy and strategy quality.
MILD yields higher human-rated policy adequacy, comfort, and explanation.
Field experiments validate the effectiveness of the proposed system.
Abstract
Prior studies report that partial driving automation can increase the cognitive demands on human drivers. This effect largely arises from human drivers' lack of transparent insight into the vehicle's intentions and decision logic, as well as from automated systems' limited awareness of the driver's dynamic state and preferences. This bidirectional misalignment undermines shared situational awareness and exacerbates coordination failures in human-vehicle interaction. To address these limitations, we argue for a paradigm shift that elevates the human role from passive supervisor to active manager. We introduce the Mediator-in-the-Loop-Driving (MILD) system, based on an agentic system architecture to facilitate synergistic human-vehicle collaboration. MILD integrates a perception agent for joint in-cabin and out-of-cabin understanding with a lightweight strategy agent that generates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
