AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent

Yu Li; Lehui Li; Lin Chen; Qingmin Liao; Fengli Xu; Yong Li

arXiv:2511.04921·cs.CL·April 2, 2026

AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent

Yu Li, Lehui Li, Lin Chen, Qingmin Liao, Fengli Xu, Yong Li

PDF

TL;DR

This paper introduces AgentExpt, a framework utilizing LLMs and citation networks to automate experiment design by recommending datasets and baselines with improved accuracy and interpretability.

Contribution

It presents a comprehensive data collection pipeline, a collective perception enhanced retriever, and a reasoning-augmented reranker for better dataset and baseline recommendation.

Findings

01

Curated dataset covers 85% of datasets and baselines used at top AI conferences.

02

Proposed method outperforms prior baselines with +5.85% Recall@20 and +8.30% HitRate@5.

03

Achieves more reliable and interpretable automation of experimental design.

Abstract

Large language model agents are becoming increasingly capable at web-centric tasks such as information retrieval, complex reasoning. These emerging capabilities have given rise to surge research interests in developing LLM agent for facilitating scientific quest. One key application in AI research is to automate experiment design through agentic dataset and baseline retrieval. However, prior efforts suffer from limited data coverage, as recommendation datasets primarily harvest candidates from public portals and omit many datasets actually used in published papers, and from an overreliance on content similarity that biases model toward superficial similarity and overlooks experimental suitability. Harnessing collective perception embedded in the baseline and dataset citation network, we present a comprehensive framework for baseline and dataset recommendation. First, we design an…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.