InferF: Declarative Factorization of AI/ML Inferences over Joins
Kanchan Chowdhury, Lixi Zhou, Lulu Xie, Xinwei Fu, Jia Zou

TL;DR
InferF is a system that optimizes AI/ML inference workflows over multi-way joins by factorizing computations, significantly reducing inference costs and improving speed through novel algorithms and formalization.
Contribution
The paper introduces a formal framework and algorithms for factorizing inference workflows over joins, addressing limitations in existing methods and implementing them in a real system.
Findings
Achieved up to 11.3x speedup in inference workflows
Formalized the problem of factorized inference over joins
Developed greedy and genetic algorithms for optimal factorization
Abstract
Real-world AI/ML workflows often apply inference computations to feature vectors joined from multiple datasets. To avoid the redundant AI/ML computations caused by repeated data records in the join's output, factorized ML has been proposed to decompose ML computations into sub-computations to be executed on each normalized dataset. However, there is insufficient discussion on how factorized ML could impact AI/ML inference over multi-way joins. To address the limitations, we propose a novel declarative InferF system, focusing on the factorization of arbitrary inference workflows represented as analyzable expressions over the multi-way joins. We formalize our problem to flexibly push down partial factorized computations to qualified nodes in the join tree to minimize the overall inference computation and join costs and propose two algorithms to resolve the problem: (1) a greedy algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Cloud Computing and Resource Management · Advanced Database Systems and Queries
