InFi: End-to-End Learning to Filter Input for Resource-Efficiency in Mobile-Centric Inference
Mu Yuan, Lan Zhang, Fengxiang He, Xueting Tong, Miao-Hui Song,, Zhengyuan Xu, Xiang-Yang Li

TL;DR
InFi is an end-to-end learnable input filtering framework that enhances resource efficiency in mobile AI inference by providing theoretical guidance and robust feature discriminability, leading to significant throughput and bandwidth savings.
Contribution
It introduces the first formal theoretical analysis of input filterability and presents a comprehensive, learnable filtering framework applicable across multiple modalities and deployments.
Findings
InFi achieves 8.5x throughput improvement.
InFi reduces bandwidth by 95%.
Maintains over 90% accuracy in mobile video analytics.
Abstract
Mobile-centric AI applications have high requirements for resource-efficiency of model inference. Input filtering is a promising approach to eliminate the redundancy so as to reduce the cost of inference. Previous efforts have tailored effective solutions for many applications, but left two essential questions unanswered: (1) theoretical filterability of an inference workload to guide the application of input filtering techniques, thereby avoiding the trial-and-error cost for resource-constrained mobile applications; (2) robust discriminability of feature embedding to allow input filtering to be widely effective for diverse inference tasks and input content. To answer them, we first formalize the input filtering problem and theoretically compare the hypothesis complexity of inference models and input filters to understand the optimization potential. Then we propose the first end-to-end…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Data Stream Mining Techniques · Water Quality Monitoring Technologies
