Optimizing Machine Learning Inference Queries with Correlative Proxy Models
Zhihui Yang, Zuozhi Wang, Yicong Huang, Yao Lu, Chen Li, X. Sean, Wang

TL;DR
This paper introduces CORE, a query optimizer that exploits predicate correlations to accelerate machine learning inference queries on unstructured datasets, outperforming existing methods by up to 80%.
Contribution
CORE builds proxy models online and uses branch-and-bound search to better exploit predicate correlations, improving query throughput.
Findings
CORE improves query throughput by up to 80%.
CORE outperforms Probabilistic Predicates by up to 63%.
The approach is validated on real-world datasets.
Abstract
We consider accelerating machine learning (ML) inference queries on unstructured datasets. Expensive operators such as feature extractors and classifiers are deployed as user-defined functions(UDFs), which are not penetrable with classic query optimization techniques such as predicate push-down. Recent optimization schemes (e.g., Probabilistic Predicates or PP) assume independence among the query predicates, build a proxy model for each predicate offline, and rewrite a new query by injecting these cheap proxy models in the front of the expensive ML UDFs. In such a manner, unlikely inputs that do not satisfy query predicates are filtered early to bypass the ML UDFs. We show that enforcing the independence assumption in this context may result in sub-optimal plans. In this paper, we propose CORE, a query optimizer that better exploits the predicate correlations and accelerates ML…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Advanced Neural Network Applications
