Randomized Computations on Large Data Sets: Tight Lower Bounds
Martin Grohe, Andre Hernich, Nicole Schweikardt

TL;DR
This paper establishes tight lower bounds on the number of random memory accesses required for certain decision problems in a data stream model with external memory, impacting query evaluation complexity.
Contribution
It introduces tight lower bounds for randomized algorithms solving set equality, multiset equality, and checksort in a model with external memory and limited internal memory.
Findings
Any randomized algorithm for these problems must perform Omega(log N) random accesses.
Lower bounds apply to query evaluation in XQuery, XPath, and relational algebra under memory constraints.
Results demonstrate fundamental limits of data processing in external memory models.
Abstract
We study the randomized version of a computation model (introduced by Grohe, Koch, and Schweikardt (ICALP'05); Grohe and Schweikardt (PODS'05)) that restricts random access to external memory and internal memory space. Essentially, this model can be viewed as a powerful version of a data stream model that puts no cost on sequential scans of external memory (as other models for data streams) and, in addition, (like other external memory models, but unlike streaming models), admits several large external memory devices that can be read and written to in parallel. We obtain tight lower bounds for the decision problems set equality, multiset equality, and checksort. More precisely, we show that any randomized one-sided-error bounded Monte Carlo algorithm for these problems must perform Omega(log N) random accesses to external memory devices, provided that the internal memory size is at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Algorithms and Data Compression · Machine Learning and Algorithms
