Efficiently Answering Durability Prediction Queries
Junyang Gao, Yifan Xu, Pankaj K. Agarwal, Jun Yang

TL;DR
This paper introduces Multi-Level Splitting Sampling (MLSS), a novel method that efficiently answers complex durability prediction queries by improving Monte Carlo simulations, reducing computational costs while maintaining accuracy.
Contribution
The paper presents MLSS, a new importance splitting technique for durability prediction queries that handles complex models and reduces simulation costs without sacrificing accuracy.
Findings
MLSS achieves an order-of-magnitude cost reduction compared to standard Monte Carlo methods.
MLSS provides unbiased estimates with the same error guarantees as traditional methods.
The approach effectively handles black-box models and complex query scenarios.
Abstract
We consider a class of queries called durability prediction queries that arise commonly in predictive analytics, where we use a given predictive model to answer questions about possible futures to inform our decisions. Examples of durability prediction queries include "what is the probability that this financial product will keep losing money over the next 12 quarters before turning in any profit?" and "what is the chance for our proposed server cluster to fail the required service-level agreement before its term ends?" We devise a general method called Multi-Level Splitting Sampling (MLSS) that can efficiently handle complex queries and complex models -- including those involving black-box functions -- as long as the models allow us to simulate possible futures step by step. Our method addresses the inefficiency of standard Monte Carlo (MC) methods by applying the idea of importance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Management and Algorithms · Traffic Prediction and Management Techniques
