Polytope: An Algorithm for Efficient Feature Extraction on Hypercubes
Mathilde Leuridan, James Hawkes, Simon Smart, Emanuele Danovaro and, Tiago Quintino

TL;DR
This paper introduces Polytope, a novel algorithm that efficiently extracts non-rectangular data subsets from large hypercubes by pre-selecting only the necessary data, significantly improving scalability and resource use.
Contribution
The paper presents a new geometric algorithm for precise data extraction from hypercubes, overcoming limitations of traditional box-based methods.
Findings
Reduces I/O by extracting only needed data
Scales efficiently to petabyte-sized hypercubes
Improves data retrieval precision and resource utilization
Abstract
Data extraction algorithms on data hypercubes, or datacubes, are traditionally only capable of cutting boxes of data along the datacube axes. For many use cases however, this is not a sufficient approach and returns more data than users might actually need. This not only forces users to apply post-processing after extraction, but more importantly this consumes more I/O resources than is necessary. When considering very large datacubes from which users only want to extract small non-rectangular subsets, the box approach does not scale well. Indeed, with this traditional approach, I/O systems quickly reach capacity, trying to read and return unwanted data to users. In this paper, we propose a novel technique, based on computational geometry concepts, which instead carefully pre-selects the precise bytes of data which the user needs in order to then only read those from the datacube. As we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed and Parallel Computing Systems · Interconnection Networks and Systems · Parallel Computing and Optimization Techniques
