Design Once, Deploy at Scale: Template-Driven ML Development for Large Model Ecosystems

Jiang Liu; John Martabano Landy; Yao Xuan; Swamy Muddu; Nhat Le; Munaf Sahaf; Luc Kien Hang; Rupinder Khandpour; Kevin De Angeli; Chang Yang; Shouyuan Chen; Shiblee Sadik; Anirudh Agrawal; Djordje Gligorijevic; Jingzheng Qin; Peggy Yao; Alireza Vahdatpour

arXiv:2603.24963·cs.AI·March 31, 2026

Design Once, Deploy at Scale: Template-Driven ML Development for Large Model Ecosystems

Jiang Liu, John Martabano Landy, Yao Xuan, Swamy Muddu, Nhat Le, Munaf Sahaf, Luc Kien Hang, Rupinder Khandpour, Kevin De Angeli, Chang Yang, Shouyuan Chen, Shiblee Sadik, Anirudh Agrawal, Djordje Gligorijevic, Jingzheng Qin, Peggy Yao, Alireza Vahdatpour

PDF

TL;DR

This paper introduces the Standard Model Template (SMT), a framework for scalable, standardized ML model development in large recommendation ecosystems, significantly improving efficiency and model performance.

Contribution

The paper presents SMT, a composable, standardized approach that simplifies technique propagation and enhances model performance across large-scale recommendation systems.

Findings

01

0.63% average improvement in cross-entropy

02

92% reduction in engineering time per model

03

6.3x increase in technique-model pair adoption throughput

Abstract

Modern computational advertising platforms typically rely on recommendation systems to predict user responses, such as click-through rates, conversion rates, and other optimization events. To support a wide variety of product surfaces and advertiser goals, these platforms frequently maintain an extensive ecosystem of machine learning (ML) models. However, operating at this scale creates significant development and efficiency challenges. Substantial engineering effort is required to regularly refresh ML models and propagate new techniques, which results in long latencies when deploying ML innovations across the ecosystem. We present a large-scale empirical study comparing model performance, efficiency, and ML technique propagation between a standardized model-building approach and independent per-model optimization in recommendation systems. To facilitate this standardization, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.