Strongly-Typed Agents are Guaranteed to Interact Safely

David Balduzzi

arXiv:1702.07450·cs.LG·June 7, 2018

Strongly-Typed Agents are Guaranteed to Interact Safely

David Balduzzi

PDF

Open Access

TL;DR

This paper introduces strongly-typed agents, ensuring safe interactions in multi-agent systems, especially under gradient descent, by providing formal guarantees and connecting to concepts like convexity and tensor decomposition.

Contribution

It formalizes strongly-typed agents and proves they guarantee safe interactions, extending understanding of multi-agent safety in gradient-based learning.

Findings

01

Strongly-typed agents guarantee safe interactions.

02

Gradient descent converges to Nash equilibrium in safe games.

03

Connections to convexity, blind source separation, and tensor decomposition.

Abstract

As artificial agents proliferate, it is becoming increasingly important to ensure that their interactions with one another are well-behaved. In this paper, we formalize a common-sense notion of when algorithms are well-behaved: an algorithm is safe if it does no harm. Motivated by recent progress in deep learning, we focus on the specific case where agents update their actions according to gradient descent. The paper shows that that gradient descent converges to a Nash equilibrium in safe games. The main contribution is to define strongly-typed agents and show they are guaranteed to interact safely, thereby providing sufficient conditions to guarantee safe interactions. A series of examples show that strong-typing generalizes certain key features of convexity, is closely related to blind source separation, and introduces a new perspective on classical multilinear games based on tensor…

Figures1

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Reinforcement Learning in Robotics · Model Reduction and Neural Networks