Synatra: Turning Indirect Knowledge into Direct Demonstrations for Digital Agents at Scale
Tianyue Ou, Frank F. Xu, Aman Madaan, Jiarui Liu, Robert Lo, Abishek Sridhar, Sudipta Sengupta, Dan Roth, Graham Neubig, Shuyan Zhou

TL;DR
Synatra transforms indirect online knowledge, such as tutorials written for human readers, into large-scale direct supervision for digital agents. The resulting synthetic demonstrations improve agent performance at a fraction of the cost of human-collected data.
Contribution
This work introduces Synatra, a novel method to convert indirect knowledge into direct demonstrations for training digital agents at scale.
Findings
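Synthetic demonstrations can be more effective than the same number of human demonstrations collected from limited domains.
Synthetic demonstrations cost about 3% as much as human demonstrations, a 97% reduction in supervision cost.
Agents trained with Synatra surpass comparably sized models on multiple web-agent benchmarks.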
Abstract
LLMs can now act as autonomous agents that interact with digital environments and complete specific objectives (e.g., arranging an online meeting). However, accuracy is still far from satisfactory, partly due to a lack of large-scale, direct demonstrations for digital tasks. Obtaining supervised data from humans is costly, and automatic data collection through exploration or reinforcement learning relies on complex environmental and content setup, resulting in datasets that lack comprehensive coverage of various scenarios. On the other hand, there is abundant knowledge that may indirectly assist task completion, such as online tutorials that were created for human consumption. In this work, we present Synatra, an approach that effectively transforms this indirect knowledge into direct supervision at scale. We define different types of indirect knowledge, and carefully study the…
Taxonomy
Topics: Multi-Agent Systems and Negotiation · Semantic Web and Ontologies · Modular Robots and Swarm Intelligence
Methods: Attention Is All You Need · Cosine Annealing · Linear Layer · Weight Decay · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Softmax
