Causal Search for Skylines (CSS): Causally-Informed Selective Data De-Correlation
Pratanu Mandal, Abhinav Gorantla, K. Sel\c{c}uk Candan, Maria Luisa Sapino

TL;DR
This paper introduces CSS, a causally-informed method to selectively de-correlate data attributes based on causal graphs, significantly improving the efficiency of skyline query algorithms by reducing dominance checks and computation time.
Contribution
The paper proposes a novel causally-informed de-correlation mechanism that enhances skyline algorithms' efficiency by leveraging causal structures to identify beneficial attribute correlations.
Findings
Significant reduction in dominance checks across datasets.
Decreased overall computation time for skyline discovery.
Applicable to multiple skyline algorithms and data types.
Abstract
Skyline queries are popular and effective tools in multi-criteria decision support as they extract interesting (pareto-optimal) points that help summarize the available data with respect to a given set of preference attributes. Unfortunately, the efficiency of the skyline algorithms depends heavily on the underlying data statistics. In this paper, we argue that the efficiency of the skyline algorithms could be significantly boosted if one could erase any attribute correlations that do not agree with the preference criteria, while preserving (or even boosting) correlations that agree with the user provided criteria. Therefore, we propose a causallyinformed selective de-correlation mechanism to enable skyline algorithms to better leverage the pruning opportunities provided by the positively-aligned data distributions, without having to suffer from the mis-alignments. In particular, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Constraint Satisfaction and Optimization · Geographic Information Systems Studies
