Polymath: A Self-Optimizing Agent with Dynamic Hierarchical Workflow

Chia-Tung Ho; Jing Gong; Xufeng Yao; Yunsheng Bai; Abhishek B Akkur; Haoxing Ren

arXiv:2508.02959·cs.AI·August 8, 2025

Polymath: A Self-Optimizing Agent with Dynamic Hierarchical Workflow

Chia-Tung Ho, Jing Gong, Xufeng Yao, Yunsheng Bai, Abhishek B Akkur, Haoxing Ren

PDF

TL;DR

Polymath is a self-optimizing agent that dynamically refines hierarchical workflows using graph optimization and self-reflection, enabling effective problem-solving without labeled data across diverse tasks.

Contribution

The paper introduces Polymath, a novel self-optimizing agent that automates workflow generation and refinement through a code-based, dynamic hierarchical structure without relying on labeled datasets.

Findings

01

Achieves 8.1% average improvement over baselines

02

Effectively solves real-world, dynamic problems

03

Operates across coding, math, and QA tasks

Abstract

Large language models (LLMs) excel at solving complex tasks by executing agentic workflows composed of detailed instructions and structured operations. Yet, building general-purpose agents by manually embedding foundation models into agentic systems such as Chain-of-Thought, Self-Reflection, and ReACT through text interfaces limits scalability and efficiency. Recently, many researchers have sought to automate the generation and optimization of these workflows through code-based representations. However, existing methods often rely on labeled datasets to train and optimize workflows, making them ineffective and inflexible for solving real-world, dynamic problems where labeled data is unavailable. To address this challenge, we introduce Polymath, a self-optimizing agent with dynamic hierarchical workflow that leverages the flexibility of task flow graphs and the expressiveness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.