MaRCA: Multi-Agent Reinforcement Learning for Dynamic Computation Allocation in Large-Scale Recommender Systems

Wan Jiang; Xinyi Zang; Yudong Zhao; Yusi Zou; Yunfei Lu; Junbo Tong; Yang Liu; Ming Li; Jiani Shi; Xin Yang

arXiv:2512.24325·cs.IR·January 1, 2026

MaRCA: Multi-Agent Reinforcement Learning for Dynamic Computation Allocation in Large-Scale Recommender Systems

Wan Jiang, Xinyi Zang, Yudong Zhao, Yusi Zou, Yunfei Lu, Junbo Tong, Yang Liu, Ming Li, Jiani Shi, Xin Yang

PDF

Open Access

TL;DR

MaRCA is a multi-agent reinforcement learning framework that optimizes computation resource allocation in large-scale recommender systems, significantly improving revenue while efficiently managing computational costs.

Contribution

It introduces a novel multi-agent RL approach with end-to-end optimization, AutoBucket cost estimation, and MPC-based traffic forecasting for recommender systems.

Findings

01

Handled hundreds of billions of ad requests daily

02

Achieved 16.67% revenue uplift

03

Demonstrated effectiveness in a real-world e-commerce platform

Abstract

Modern recommender systems face significant computational challenges due to growing model complexity and traffic scale, making efficient computation allocation critical for maximizing business revenue. Existing approaches typically simplify multi-stage computation resource allocation, neglecting inter-stage dependencies, thus limiting global optimality. In this paper, we propose MaRCA, a multi-agent reinforcement learning framework for end-to-end computation resource allocation in large-scale recommender systems. MaRCA models the stages of a recommender system as cooperative agents, using Centralized Training with Decentralized Execution (CTDE) to optimize revenue under computation resource constraints. We introduce an AutoBucket TestBench for accurate computation cost estimation, and a Model Predictive Control (MPC)-based Revenue-Cost Balancer to proactively forecast traffic loads and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Mobile Crowdsensing and Crowdsourcing