End-to-End Neuro-Symbolic Reinforcement Learning with Textual   Explanations

Lirui Luo; Guoxi Zhang; Hongming Xu; Yaodong Yang; Cong Fang; Qing Li

arXiv:2403.12451·cs.AI·June 14, 2024·2 cites

End-to-End Neuro-Symbolic Reinforcement Learning with Textual Explanations

Lirui Luo, Guoxi Zhang, Hongming Xu, Yaodong Yang, Cong Fang, Qing Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a neuro-symbolic reinforcement learning framework that jointly learns structured states and symbolic policies, uses GPT-4 for generating textual explanations, and demonstrates effectiveness on Atari tasks.

Contribution

It proposes a novel joint learning approach combining vision models and symbolic policies, with GPT-4 generated explanations to improve interpretability.

Findings

01

Effective on nine Atari tasks

02

GPT-4 provides clear textual explanations

03

Joint learning refines structured states and policies

Abstract

Neuro-symbolic reinforcement learning (NS-RL) has emerged as a promising paradigm for explainable decision-making, characterized by the interpretability of symbolic policies. NS-RL entails structured state representations for tasks with visual observations, but previous methods cannot refine the structured states with rewards due to a lack of efficiency. Accessibility also remains an issue, as extensive domain knowledge is required to interpret symbolic policies. In this paper, we present a neuro-symbolic framework for jointly learning structured states and symbolic policies, whose key idea is to distill the vision foundation model into an efficient perception module and refine it during policy learning. Moreover, we design a pipeline to prompt GPT-4 to generate textual explanations for the learned policies and decisions, significantly reducing users' cognitive load to understand the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

liruiluo/nsrl-vision-pub
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces

MethodsAttention Is All You Need · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Adam · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer