AutoMT: A Multi-Agent LLM Framework for Automated Metamorphic Testing of Autonomous Driving Systems
Linfeng Liang, Chenkai Tan, Yao Deng, Yingfeng Cai, T.Y Chen, Xi Zheng

TL;DR
AutoMT introduces an automated multi-agent framework powered by LLMs for generating diverse metamorphic test cases in autonomous driving, significantly improving fault detection and test coverage over manual methods.
Contribution
It automates the extraction of metamorphic relations from traffic rules and integrates vision-language analysis, enhancing testing efficiency and coverage in autonomous driving systems.
Findings
Achieves up to 5x higher test diversity compared to manual methods.
Detects up to 20.55% more behavioral violations.
Automates MR extraction, reducing manual effort.
Abstract
Autonomous Driving Systems (ADS) are safety-critical, where failures can be severe. While Metamorphic Testing (MT) is effective for fault detection in ADS, existing methods rely heavily on manual effort and lack automation. We present AutoMT, a multi-agent MT framework powered by Large Language Models (LLMs) that automates the extraction of Metamorphic Relations (MRs) from local traffic rules and the generation of valid follow-up test cases. AutoMT leverages LLMs to extract MRs from traffic rules in Gherkin syntax using a predefined ontology. A vision-language agent analyzes scenarios, and a search agent retrieves suitable MRs from a RAG-based database to generate follow-up cases via computer vision. Experiments show that AutoMT achieves up to 5 x higher test diversity in follow-up case generation compared to the best baseline (manual expert-defined MRs) in terms of validation rate, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Software Testing and Debugging Techniques · Safety Systems Engineering in Autonomy
