Decoding ML Decision: An Agentic Reasoning Framework for Large-Scale Ranking System
Longfei Yun, Yihan Wu, Haoran Liu, Xiaoxuan Liu, Ziyun Xu, Yi Wang, Yang Xia, Pengfei Wang, Mingze Gao, Yunxiang Wang, Changfan Chen, Junfeng Pan

TL;DR
This paper introduces GEARS, a novel framework that transforms ranking system optimization into an autonomous, agentic discovery process, integrating expert reasoning and validation to improve policy quality and stability.
Contribution
GEARS redefines ranking optimization as an autonomous discovery process, incorporating expert reasoning skills and validation mechanisms for more effective and reliable large-scale ranking systems.
Findings
GEARS consistently finds superior ranking policies.
The framework maintains deployment stability.
It effectively combines algorithmic signals with deep context.
Abstract
Modern large-scale ranking systems operate within a sophisticated landscape of competing objectives, operational constraints, and evolving product requirements. Progress in this domain is increasingly bottlenecked by the engineering context constraint: the arduous process of translating ambiguous product intent into reasonable, executable, verifiable hypotheses, rather than by modeling techniques alone. We present GEARS (Generative Engine for Agentic Ranking Systems), a framework that reframes ranking optimization as an autonomous discovery process within a programmable experimentation environment. Rather than treating optimization as static model selection, GEARS leverages Specialized Agent Skills to encapsulate ranking expert knowledge into reusable reasoning capabilities, enabling operators to steer systems via high-level intent vibe personalization. Furthermore, to ensure production…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Forecasting Techniques and Applications · Mobile Crowdsensing and Crowdsourcing
