Optimizing Feature Extraction for On-device Model Inference with User Behavior Sequences
Chen Gong, Zhenzhe Zheng, Yiliu Chen, Sheng Wang, Fan Wu, Guihai Chen

TL;DR
This paper introduces AutoFeature, an automated engine that optimizes feature extraction from raw logs to significantly reduce on-device model inference latency in mobile apps.
Contribution
The paper proposes a graph-based optimization approach and a caching strategy for feature extraction, improving efficiency without sacrificing model inference accuracy.
Findings
AutoFeature reduces on-device inference latency by up to 4.53x.
AutoFeature has been implemented in five industrial services, with significant performance gains.
It eliminates redundant extraction operations across different model features and consecutive model inferences.
Abstract
Machine learning models are widely integrated into modern mobile apps to analyze user behaviors and deliver personalized services. Ensuring low-latency on-device model execution is critical for maintaining high-quality user experiences. While prior research has primarily focused on accelerating model inference with given input features, we identify an overlooked bottleneck in real-world on-device model execution pipelines: extracting input features from raw application logs. In this work, we explore a new direction of feature extraction optimization by analyzing and eliminating redundant extraction operations across different model features and consecutive model inferences. We then introduce AutoFeature, an automated feature extraction engine designed to accelerate the on-device feature extraction process without compromising model inference accuracy. AutoFeature comprises three core…
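The two kinds of redundancy named in the abstract can be illustrated with a small sketch. This is not AutoFeature's actual design, whose components are cut off above. It is a toy illustration under stated assumptions: the log is an append-only list of hypothetical `(timestamp, event_type, value)` records, each feature is a windowed sum over one event type, and the names `extract_naive`, `extract_shared`, and `CachedExtractor` are invented for this example.

```python
from collections import defaultdict

# Assumption: a feature is (event_type, window); its value is the sum of
# `value` over matching records with now - timestamp <= window.

def extract_naive(logs, features, now):
    """Baseline: one full scan of the raw log per feature."""
    return {
        name: sum(v for t, e, v in logs if e == etype and now - t <= window)
        for name, (etype, window) in features.items()
    }

def extract_shared(logs, features, now):
    """Cross-feature reuse: scan the log once and route each record
    to every feature defined over its event type."""
    by_type = defaultdict(list)
    for name, (etype, window) in features.items():
        by_type[etype].append((name, window))
    out = {name: 0 for name in features}
    for t, e, v in logs:
        for name, window in by_type.get(e, ()):
            if now - t <= window:
                out[name] += v
    return out

class CachedExtractor:
    """Cross-inference reuse: remember already-filtered records so each
    inference only parses log entries appended since the previous one.
    Assumes the log is append-only; eviction of stale entries is omitted."""

    def __init__(self, features):
        self.features = features
        self.wanted = {etype for etype, _ in features.values()}
        self.cache = defaultdict(list)  # event_type -> [(timestamp, value)]
        self.cursor = 0                 # number of log records already parsed

    def extract(self, logs, now):
        for t, e, v in logs[self.cursor:]:
            if e in self.wanted:
                self.cache[e].append((t, v))
        self.cursor = len(logs)
        return {
            name: sum(v for t, v in self.cache[etype] if now - t <= window)
            for name, (etype, window) in self.features.items()
        }

# Hypothetical data for illustration only.
logs = [(1, "click", 1), (2, "buy", 5), (8, "click", 1), (9, "click", 1)]
features = {
    "clicks_5s": ("click", 5),
    "clicks_10s": ("click", 10),
    "spend_10s": ("buy", 10),
}
extractor = CachedExtractor(features)
first = extractor.extract(logs, now=10)
logs.append((11, "buy", 2))
second = extractor.extract(logs, now=12)  # parses only the new record
```

All three extractors return the same feature values; they differ only in how many raw records are re-read. A production version would also need to evict cached entries older than the largest window.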
Taxonomy
Topics: Green IT and Sustainability · Software System Performance and Reliability · Context-Aware Activity Recognition Systems
