Test and Evaluation Framework for Multi-Agent Systems of Autonomous   Intelligent Agents

Erin Lanus; Ivan Hernandez; Adam Dachowicz; Laura Freeman; Melanie; Grande; Andrew Lang; Jitesh H. Panchal; Anthony Patrick; Scott Welch

arXiv:2101.10430·eess.SY·January 21, 2022

Test and Evaluation Framework for Multi-Agent Systems of Autonomous Intelligent Agents

Erin Lanus, Ivan Hernandez, Adam Dachowicz, Laura Freeman, Melanie, Grande, Andrew Lang, Jitesh H. Panchal, Anthony Patrick, Scott Welch

PDF

TL;DR

This paper proposes a comprehensive test and evaluation framework for complex multi-agent cyber-physical systems with embedded AI, addressing challenges of diverse system integration, ongoing adaptation, and resource constraints.

Contribution

It introduces a unifying framework that supports testing throughout development and operation, accommodating system learning and environmental variability.

Findings

01

Framework supports continuous testing during system operation

02

Addresses testing challenges at multiple hierarchical levels

03

Provides a generic use case illustrating practical application

Abstract

Test and evaluation is a necessary process for ensuring that engineered systems perform as intended under a variety of conditions, both expected and unexpected. In this work, we consider the unique challenges of developing a unifying test and evaluation framework for complex ensembles of cyber-physical systems with embedded artificial intelligence. We propose a framework that incorporates test and evaluation throughout not only the development life cycle, but continues into operation as the system learns and adapts in a noisy, changing, and contended environment. The framework accounts for the challenges of testing the integration of diverse systems at various hierarchical scales of composition while respecting that testing time and resources are limited. A generic use case is provided for illustrative purposes and research directions emerging as a result of exploring the use case via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.