Goals as Reward-Producing Programs

Guy Davidson; Graham Todd; Julian Togelius; Todd M. Gureckis; Brenden M. Lake

arXiv:2405.13242·cs.AI·May 20, 2025

Goals as Reward-Producing Programs

Guy Davidson, Graham Todd, Julian Togelius, Todd M. Gureckis, Brenden M. Lake

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to modeling human goals as reward-producing programs, enabling the generation of human-like goals and games that are evaluated as fun and authentic by human judges.

Contribution

It presents a method to model goals as reward-producing programs, learn a fitness function over these programs, and generate novel, human-like goals through program synthesis and quality-diversity sampling.

Findings

01

Generated goals are indistinguishable from human-created games.

02

Model's fitness scores predict fun and human-like qualities.

03

Human evaluators favor model-generated goals in experiments.

Abstract

People are remarkably capable of generating their own goals, beginning with child's play and continuing into adulthood. Despite considerable empirical and computational work on goals and goal-oriented behavior, models are still far from capturing the richness of everyday human goals. Here, we bridge this gap by collecting a dataset of human-generated playful goals (in the form of scorable, single-player games), modeling them as reward-producing programs, and generating novel human-like goals through program synthesis. Reward-producing programs capture the rich semantics of goals through symbolic operations that compose, add temporal constraints, and allow for program execution on behavioral traces to evaluate progress. To build a generative model of goals, we learn a fitness function over the infinite set of possible goal programs and sample novel goals with a quality-diversity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

guydav/goals-as-reward-producing-programs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEconomic and Social Issues

MethodsSparse Evolutionary Training