Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale
Jieming Di, Xiaoyu Chen, Ying She, Siyu Wang, Lizzie Liu, Fenggang Wu, Jiaoying Mu, Tony Tsui, Amr Elroumy, Hsing Tang, Zewei Jiang, Qiao Yang, Lin Qi, Haibo Lin, Weifeng Cui, Daniel Li, Kapil Gupta, Shivendra Pratap Singh, Jie Zheng, Arnold Overwijk, Ling Leng, Sri Reddy

TL;DR
The paper presents IEFF, a system enabling retrain-free feature efficiency rollouts in large-scale ranking systems by elastically controlling feature coverage at serving time, reducing iteration time and resource use.
Contribution
Introducing IEFF, a novel infrastructure system that allows elastic feature fading at scale, eliminating the need for model retraining during feature efficiency improvements.
Findings
IEFF accelerates efficiency rollouts by 5× in production.
Eliminates retraining-related GPU overhead.
Gradual feature fading prevents 50-55% of online performance degradation.
Abstract
Large-scale ranking systems depend on thousands of features derived from user behavior across multiple time horizons. Typically requires model retraining -- resulting in long iteration cycles (3--6 months), substantial GPU resource consumption, and limited rollout throughput. We introduce Intelligent Elastic Feature Fading (IEFF), a production infrastructure system that enables retrain-free feature efficiency rollouts by elastically controlling feature coverage and distribution at serving time. IEFF supports incremental feature coverage adjustments while models adapt through recurring training, eliminating dependencies on explicit retraining cycles. The system incorporates strict safety guardrails, reversibility mechanisms, and comprehensive monitoring to ensure stability at scale. Across multiple production use cases, IEFF accelerates efficiency-related rollouts by 5,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
