Minimizing Congestion for Balanced Dominators
Yosuke Mizutani, Annie Staker, Blair D. Sullivan

TL;DR
This paper introduces methods to minimize congestion and improve scalability in graph partitioning for metagenomic assembly, providing algorithms with theoretical guarantees and demonstrating their effectiveness on real-world data.
Contribution
It proposes new algorithms for sparse dominating sets and balanced neighborhood partitioning, addressing NP-hard problems with approximation and heuristic solutions.
Findings
Sparse dominating sets reduce tie-breaking ambiguity.
Balanced partitioning improves scalability and uniformity.
Algorithms maintain high query accuracy while reducing variance.
Abstract
A primary challenge in metagenomics is reconstructing individual microbial genomes from the mixture of short fragments created by sequencing. Recent work leverages the sparsity of the assembly graph to find -dominating sets which enable rapid approximate queries through a dominator-centric graph partition. In this paper, we consider two problems related to reducing uncertainty and improving scalability in this setting. First, we observe that nodes with multiple closest dominators necessitate arbitrary tie-breaking in the existing pipeline. As such, we propose finding dominating sets which minimize this effect via a new parameter. We prove minimizing congestion is NP-hard, and give an approximation algorithm, where is the max degree. To improve scalability, the graph should be partitioned into…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEconomic theories and models · Game Theory and Applications
