# On the Hardness and Inapproximability of Recognizing Wheeler Graphs

**Authors:** Daniel Gibney, Sharma V. Thankachan

arXiv: 1902.01960 · 2019-02-27

## TL;DR

This paper investigates the computational complexity of recognizing Wheeler graphs, proving NP-completeness for general cases, and explores related optimization problems, providing insights into their approximability and special polynomial-time solvable classes.

## Contribution

It establishes NP-completeness of Wheeler graph recognition for alphabets of size two or more, and analyzes the hardness of related optimization problems, offering new complexity results and algorithms.

## Key findings

- Recognition is NP-complete for alphabets of size ≥ 2.
- Wheeler Graph Violation is APX-hard, with no constant-factor approximation unless P=NP.
- Largest Wheeler subgraph problem is in APX for constant alphabet size.

## Abstract

In recent years several compressed indexes based on variants of the Burrows-Wheeler transformation have been introduced. Some of these index structures far more complex than a single string, as was originally done with the FM-index [Ferragina and Manzini, J. ACM 2005]. As such, there has been an effort to better understand under which conditions such an indexing scheme is possible. This led to the introduction of Wheeler graphs [Gagie it et al., Theor. Comput. Sci., 2017]. A Wheeler graph is a directed graph with edge labels which satisfies two simple axioms. Wheeler graphs can be indexed in a way which is space efficient and allows for fast traversal. Gagie et al. showed that de Bruijn graphs, generalized compressed suffix arrays, and several other BWT related structures can be represented as Wheeler graphs. Here we answer the open question of whether or not there exists an efficient algorithm for recognizing if a graph is a Wheeler graph. We demonstrate:(i) Recognizing if a graph is a Wheeler graph is NP-complete for any edge label alphabet of size $\sigma \geq 2$, even for DAGs. It can be solved in linear time for $\sigma =1$; (ii) An optimization variant called Wheeler Graph Violation (WGV) which aims to remove the minimum number of edges needed to obtain a Wheeler graph is APX-hard, even for DAGs. Hence, unless P = NP, there exists constant $C > 1$ such that there is no $C$-approximation algorithm. We show conditioned on the Unique Games Conjecture, for every constant $C \geq 1$, it is NP-hard to find a $C$-approximation to WGV; (iii) The Wheeler Subgraph problem (WS) which aims to find the largest Wheeler subgraph is in APX for $\sigma=O(1)$; (iv) For the above problems there exist efficient exponential time exact algorithms, relying on graph isomorphism being computed in strictly sub-exponential time; (v) A class of graphs where the recognition problem is polynomial time solvable.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1902.01960/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/1902.01960/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1902.01960/full.md

---
Source: https://tomesphere.com/paper/1902.01960