A Declarative System for Optimizing AI Workloads
Chunwei Liu, Matthew Russo, Michael Cafarella, Lei Cao, Peter Baille, Chen, Zui Chen, Michael Franklin, Tim Kraska, Samuel Madden, Gerardo, Vitagliano

TL;DR
Palimpzest is a declarative system that optimizes AI workload execution by balancing runtime, cost, and data quality, simplifying complex AI query orchestration for users.
Contribution
It introduces a cost-aware optimization framework and declarative language for AI workloads, enabling efficient and user-friendly query processing.
Findings
Achieves up to 3.3x faster and 2.9x cheaper plans than baseline
Parallelism yields up to 90.3x speedup at lower cost
Maintains high data quality with F1-score within 83.5% of baseline
Abstract
A long-standing goal of data management systems has been to build systems which can compute quantitative insights over large corpora of unstructured data in a cost-effective manner. Until recently, it was difficult and expensive to extract facts from company documents, data from scientific papers, or metrics from image and video corpora. Today's models can accomplish these tasks with high accuracy. However, a programmer who wants to answer a substantive AI-powered query must orchestrate large numbers of models, prompts, and data operations. For even a single query, the programmer has to make a vast number of decisions such as the choice of model, the right inference method, the most cost-effective inference hardware, the ideal prompt design, and so on. The optimal set of decisions can change as the query changes and as the rapidly-evolving technical landscape shifts. In this paper we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Data Processing Techniques · AI-based Problem Solving and Planning
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection · Byte Pair Encoding · Label Smoothing · Adam · Absolute Position Encodings
