Many Episode Learning in a Modular Embodied Agent via End-to-End   Interaction

Yuxuan Sun; Ethan Carlson; Rebecca Qian; Kavya Srinet; Arthur Szlam

arXiv:2204.08687·cs.AI·January 11, 2023

Many Episode Learning in a Modular Embodied Agent via End-to-End Interaction

Yuxuan Sun, Ethan Carlson, Rebecca Qian, Kavya Srinet, Arthur Szlam

PDF

Open Access

TL;DR

This paper presents a modular embodied agent that improves through end-to-end interactions with crowd-workers, utilizing a combination of learned and heuristic modules, and a credit assignment system for continuous learning.

Contribution

It introduces a novel framework for self-improving embodied agents via crowd-sourced annotations and credit assignment in an end-to-end interaction setting.

Findings

01

Agent performance improved over multiple interaction rounds.

02

Effective credit assignment enabled targeted module updates.

03

Crowd-sourcing facilitated continuous learning and adaptation.

Abstract

In this work we give a case study of an embodied machine-learning (ML) powered agent that improves itself via interactions with crowd-workers. The agent consists of a set of modules, some of which are learned, and others heuristic. While the agent is not "end-to-end" in the ML sense, end-to-end interaction is a vital part of the agent's learning mechanism. We describe how the design of the agent works together with the design of multiple annotation interfaces to allow crowd-workers to assign credit to module errors from end-to-end interactions, and to label data for individual modules. Over multiple automated human-agent interaction, credit assignment, data annotation, and model re-training and re-deployment, rounds we demonstrate agent improvement.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Anomaly Detection Techniques and Applications · Data Stream Mining Techniques