BF++: a language for general-purpose program synthesis

Vadim Liventsev; Aki H\"arm\"a; Milan Petkovi\'c

arXiv:2101.09571·cs.AI·July 11, 2022

BF++: a language for general-purpose program synthesis

Vadim Liventsev, Aki H\"arm\"a, Milan Petkovi\'c

PDF

Open Access 1 Repo

TL;DR

BF++ is a new programming language designed for automatic agent programming in POMDPs, enabling neural program synthesis to improve decision-making in reinforcement learning tasks, especially where expert knowledge integration is crucial.

Contribution

The paper introduces BF++, a novel language tailored for program synthesis in decision systems, bridging the gap between neural models and symbolic, expert-validated programs.

Findings

01

Successfully applied neural program synthesis to OpenAI Gym benchmarks.

02

Demonstrated improved incorporation of expert knowledge into decision models.

03

Showed potential for more transparent and reviewable decision mechanisms.

Abstract

Most state of the art decision systems based on Reinforcement Learning (RL) are data-driven black-box neural models, where it is often difficult to incorporate expert knowledge into the models or let experts review and validate the learned decision mechanisms. Knowledge-insertion and model review are important requirements in many applications involving human health and safety. One way to bridge the gap between data and knowledge driven systems is program synthesis: replacing a neural network that outputs decisions with a symbolic program generated by a neural network or by means of genetic programming. We propose a new programming language, BF++, designed specifically for automatic programming of agents in a Partially Observable Markov Decision Process (POMDP) setting and apply neural program synthesis to solve standard OpenAI Gym benchmarks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vadim0x60/cibi
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Fault Detection and Control Systems · Adversarial Robustness in Machine Learning