Addressing Failures in Robotics using Vision-Based Language Models   (VLMs) and Behavior Trees (BT)

Faseeh Ahmad; Jonathan Styrud; Volker Krueger

arXiv:2411.01568·cs.RO·November 5, 2024

Addressing Failures in Robotics using Vision-Based Language Models (VLMs) and Behavior Trees (BT)

Faseeh Ahmad, Jonathan Styrud, Volker Krueger

PDF

Open Access

TL;DR

This paper presents a novel approach combining Vision Language Models and Behavior Trees to detect, identify, and recover from both known and unknown failures in robotic systems, enhancing autonomy and robustness.

Contribution

It introduces the integration of VLMs with BTs for failure detection and recovery, enabling autonomous handling of unforeseen failures in robotics.

Findings

01

Effective failure detection using VLMs in simulations

02

Successful incorporation of VLM-generated conditions into BTs

03

Improved robustness in robotic failure management

Abstract

In this paper, we propose an approach that combines Vision Language Models (VLMs) and Behavior Trees (BTs) to address failures in robotics. Current robotic systems can handle known failures with pre-existing recovery strategies, but they are often ill-equipped to manage unknown failures or anomalies. We introduce VLMs as a monitoring tool to detect and identify failures during task execution. Additionally, VLMs generate missing conditions or skill templates that are then incorporated into the BT, ensuring the system can autonomously address similar failures in future tasks. We validate our approach through simulations in several failure scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning