AlgoPilot: Fully Autonomous Program Synthesis Without Human-Written Programs

Xiaoxin Yin

arXiv:2501.06423·cs.AI·January 14, 2025

TL;DR

AlgoPilot introduces a fully autonomous program synthesis method that generates algorithms from scratch using reinforcement learning guided by a trajectory language model, without relying on human-written programs or prior knowledge.

Contribution

The paper presents an RL-based framework guided by a Trajectory Language Model (TLM) trained on trajectories generated by random Python functions, enabling the discovery of classical algorithms such as Bubble Sort without prior algorithmic knowledge.
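To make the notion of a "trajectory" concrete, the sketch below records the sequence of operations a hand-written Bubble Sort performs on one input. This is only an illustration: the token format (`"compare"`/`"swap"` tuples) and the function name `bubble_sort_trajectory` are assumptions made here, not the paper's representation, and AlgoPilot is described as discovering such behavior through RL rather than being given it.

```python
def bubble_sort_trajectory(values):
    """Sort a copy of `values` with Bubble Sort and log each operation.

    The (op, i, j) token format is a hypothetical choice for illustration.
    """
    arr = list(values)
    trajectory = []
    n = len(arr)
    # After each outer pass the largest remaining element sits at index `end`.
    for end in range(n - 1, 0, -1):
        for i in range(end):
            trajectory.append(("compare", i, i + 1))
            if arr[i] > arr[i + 1]:
                arr[i], arr[i + 1] = arr[i + 1], arr[i]
                trajectory.append(("swap", i, i + 1))
    return arr, trajectory


sorted_values, steps = bubble_sort_trajectory([3, 1, 2])
for step in steps:
    print(step)
```

The printed sequence is the kind of operation trace that a trajectory-level model could score, independent of the concrete values being sorted.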

Findings

01 Successfully generated interpretable sorting algorithms

02 Demonstrated ability to synthesize classical algorithms from scratch

03 Established a new paradigm for autonomous algorithm discovery

Abstract

Program synthesis has traditionally relied on human-provided specifications, examples, or prior knowledge to generate functional algorithms. Existing methods either emulate human-written algorithms or solve specific tasks without generating reusable programmatic logic, limiting their ability to create novel algorithms. We introduce AlgoPilot, a groundbreaking approach for fully automated program synthesis without human-written programs or trajectories. AlgoPilot leverages reinforcement learning (RL) guided by a Trajectory Language Model (TLM) to synthesize algorithms from scratch. The TLM, trained on trajectories generated by random Python functions, serves as a soft constraint during the RL process, aligning generated sequences with patterns likely to represent valid algorithms. Using sorting as a test case, AlgoPilot demonstrates its ability to generate trajectories that are…
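One way to read "soft constraint" is as a shaping term added to the task reward, so that trajectories the TLM finds likely are preferred without being strictly required. The sketch below illustrates that idea under loud assumptions: a smoothed bigram model stands in for the TLM (the abstract does not say what model is used), the training corpus is a tiny hand-made list rather than trajectories from random Python functions, and the names `BigramTrajectoryModel`, `shaped_reward`, and the weight `beta` are hypothetical.

```python
import math
from collections import Counter


class BigramTrajectoryModel:
    """Toy stand-in for a trajectory language model (assumption: bigram
    counts with add-alpha smoothing, not the paper's actual TLM)."""

    def __init__(self, trajectories, alpha=1.0):
        self.alpha = alpha
        self.vocab = {tok for t in trajectories for tok in t} | {"<s>"}
        self.bigrams = Counter()
        self.contexts = Counter()
        for t in trajectories:
            prev = "<s>"
            for tok in t:
                self.bigrams[(prev, tok)] += 1
                self.contexts[prev] += 1
                prev = tok

    def log_prob(self, trajectory):
        """Smoothed log-likelihood of a token sequence."""
        total = 0.0
        prev = "<s>"
        vocab_size = len(self.vocab)
        for tok in trajectory:
            num = self.bigrams[(prev, tok)] + self.alpha
            den = self.contexts[prev] + self.alpha * vocab_size
            total += math.log(num / den)
            prev = tok
        return total


def shaped_reward(task_reward, trajectory, tlm, beta=0.1):
    """Task reward plus a length-normalized TLM bonus (hypothetical form)."""
    bonus = tlm.log_prob(trajectory) / max(len(trajectory), 1)
    return task_reward + beta * bonus


# Hand-made corpus standing in for trajectories of random functions.
corpus = [["compare", "swap", "compare"], ["compare", "compare", "swap"]]
tlm = BigramTrajectoryModel(corpus)

regular = ["compare", "swap", "compare"]
odd = ["swap", "swap", "swap"]
print(shaped_reward(1.0, regular, tlm), shaped_reward(1.0, odd, tlm))
```

With equal task reward, the trajectory that resembles the corpus receives the higher shaped reward, which is the sense in which the model constrains the search softly rather than filtering candidates outright.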
