Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Jieming Di; Xiaoyu Chen; Ying She; Siyu Wang; Lizzie Liu; Fenggang Wu; Jiaoying Mu; Tony Tsui; Amr Elroumy; Hsing Tang; Zewei Jiang; Qiao Yang; Lin Qi; Haibo Lin; Weifeng Cui; Daniel Li; Kapil Gupta; Shivendra Pratap Singh; Jie Zheng; Arnold Overwijk; Ling Leng; Sri Reddy; Robert Malkin; Rocky Liu

arXiv:2605.00324·cs.IR·May 4, 2026

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Jieming Di, Xiaoyu Chen, Ying She, Siyu Wang, Lizzie Liu, Fenggang Wu, Jiaoying Mu, Tony Tsui, Amr Elroumy, Hsing Tang, Zewei Jiang, Qiao Yang, Lin Qi, Haibo Lin, Weifeng Cui, Daniel Li, Kapil Gupta, Shivendra Pratap Singh, Jie Zheng, Arnold Overwijk, Ling Leng, Sri Reddy

PDF

TL;DR

The paper presents IEFF, a system enabling retrain-free feature efficiency rollouts in large-scale ranking systems by elastically controlling feature coverage at serving time, reducing iteration time and resource use.

Contribution

Introducing IEFF, a novel infrastructure system that allows elastic feature fading at scale, eliminating the need for model retraining during feature efficiency improvements.

Findings

01

IEFF accelerates efficiency rollouts by 5× in production.

02

Eliminates retraining-related GPU overhead.

03

Gradual feature fading prevents 50-55% of online performance degradation.

Abstract

Large-scale ranking systems depend on thousands of features derived from user behavior across multiple time horizons. Typically requires model retraining -- resulting in long iteration cycles (3--6 months), substantial GPU resource consumption, and limited rollout throughput. We introduce Intelligent Elastic Feature Fading (IEFF), a production infrastructure system that enables retrain-free feature efficiency rollouts by elastically controlling feature coverage and distribution at serving time. IEFF supports incremental feature coverage adjustments while models adapt through recurring training, eliminating dependencies on explicit retraining cycles. The system incorporates strict safety guardrails, reversibility mechanisms, and comprehensive monitoring to ensure stability at scale. Across multiple production use cases, IEFF accelerates efficiency-related rollouts by 5 $\times$ ,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.