Perfect phylogenies via branchings in acyclic digraphs and a generalization of Dilworth's theorem

Ademir Hujdurović, Edin Husić, Martin Milanič, Romeo Rizzi, and Alexandru I. Tomescu

arXiv:1701.05492·cs.DM·February 6, 2018

TL;DR

This paper introduces new formulations and algorithms for the minimum conflict-free row split problem in perfect phylogeny, connecting it to branchings in acyclic digraphs and generalizing Dilworth's theorem, with implications for computational complexity and optimization.

Contribution

It provides transparent formulations linking the problem to acyclic digraph branchings, extends Dilworth's theorem, and offers improved algorithms and complexity results.

Findings

01 Strengthened heuristic via a new min-max theorem in digraphs

02 Proved APX-hardness of the problems

03 Developed approximation and exponential-time algorithms

Ademir Hujdurović (a, b)

Edin Husić (c, b)

Martin Milanič (a, b)

Romeo Rizzi (d)

Alexandru I. Tomescu (e)

Abstract

Motivated by applications in cancer genomics and following the work of Hajirasouliha and Raphael (WABI 2014), Hujdurović et al. (IEEE TCBB, to appear) introduced the minimum conflict-free row split (MCRS) problem: split each row of a given binary matrix into a bitwise OR of a set of rows so that the resulting matrix corresponds to a perfect phylogeny and has the minimum possible number of rows among all matrices with this property. Hajirasouliha and Raphael also proposed the study of a similar problem, in which the task is to minimize the number of distinct rows of the resulting matrix. Hujdurović et al. proved that both problems are NP-hard, gave a related characterization of transitively orientable graphs, and proposed a polynomial-time heuristic algorithm for the MCRS problem based on coloring cocomparability graphs.

We give new, more transparent formulations of the two problems, showing that the problems are equivalent to two optimization problems on branchings in a derived directed acyclic graph. Building on these formulations, we obtain new results on the two problems, including: (i) a strengthening of the heuristic by Hujdurović et al. via a new min-max result in digraphs generalizing Dilworth’s theorem, which may be of independent interest, (ii) APX-hardness results for both problems, (iii) approximation algorithms, and (iv) exponential-time algorithms solving the two problems to optimality faster than the naïve brute-force approach. Our work relates to several well studied notions in combinatorial optimization: chain partitions in partially ordered sets, laminar hypergraphs, and (classical and weighted) colorings of graphs.

a University of Primorska, UP IAM, Koper, Slovenia

b University of Primorska, UP FAMNIT, Koper, Slovenia

c London School of Economics, Department of Mathematics, London, United Kingdom

d University of Verona, Department of Computer Science, Verona, Italy

e Helsinki Institute for Information Technology HIIT, Department of Computer Science, University of Helsinki, Finland

Keywords: Perfect phylogeny, Minimum Conflict-Free Row Split problem, branching, acyclic digraph, chain partition, Dilworth’s theorem, min-max theorem, approximation algorithm, APX-hardness

1 Introduction

A perfect phylogeny is a rooted tree representing the evolutionary history of a set of $m$ objects. The objects bijectively label the leaves of the tree and there are $n$ binary variables called characters, each labeling exactly one edge of the tree. For each leaf, the set of characters that appear on the unique root-to-leaf path is the set of characters taking value $1$ at the object labeling the leaf. While every perfect phylogeny naturally corresponds to an $m\times n$ binary matrix having objects as rows and characters as columns, the perfect phylogeny problem asks the opposite question: Does a given binary matrix correspond to a perfect phylogeny? The perfect phylogeny problem and various generalizations of it have been extensively studied in computational biology. In this paper we study two combinatorial optimization problems, both generalizations of the perfect phylogeny problem, first considered by Hajirasouliha and Raphael [18] and motivated by applications in cancer genomics.

Following the work [18], Hujdurović et al. [20] introduced the minimum conflict-free row split problem, which can be informally stated as follows: given a binary matrix $M$, split each row of $M$ into a bitwise OR of a set of rows so that the resulting matrix corresponds to a perfect phylogeny and has the minimum number of rows among all matrices with this property. (Here, OR denotes the usual binary OR function, assuming that value true is represented by $1$ and value false by $0$; that is, $x\text{\,OR\,}y=1$ if and only if at least one of $x$ and $y$ has value $1$.) To state the problem formally, we need two definitions.

Definition 1.1.

Given a matrix $M$ , three distinct rows $r$ , $r^{\prime}$ , $r^{\prime\prime}$ of $M$ and two distinct columns $i$ and $j$ of $M$ , we denote by $M[(r,r^{\prime},r^{\prime\prime}),(i,j)]$ the $3\times 2$ submatrix of $M$ formed by rows $r$ , $r^{\prime}$ , $r^{\prime\prime}$ and columns $i$ , $j$ (in this order). Two columns $i$ and $j$ of a binary matrix $M$ are said to be in conflict if there exist rows $r,r^{\prime},r^{\prime\prime}$ of $M$ such that

$$M[(r,r^{\prime},r^{\prime\prime}),(i,j)]=\begin{pmatrix}1&1\\ 1&0\\ 0&1\end{pmatrix}.$$

We say that a binary matrix $M$ is conflict-free if no two columns of $M$ are in conflict.
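As a minimal sketch of Definition 1.1, assuming the matrix is stored as a list of 0/1 rows, the conflict test can be written as follows. The function names `columns_in_conflict` and `is_conflict_free` are illustrative and do not come from the paper.

```python
from itertools import combinations


def columns_in_conflict(M, i, j):
    """True if some three rows of M restricted to columns (i, j) read
    (1, 1), (1, 0) and (0, 1); such rows are automatically distinct."""
    has_11 = any(row[i] == 1 and row[j] == 1 for row in M)
    has_10 = any(row[i] == 1 and row[j] == 0 for row in M)
    has_01 = any(row[i] == 0 and row[j] == 1 for row in M)
    return has_11 and has_10 and has_01


def is_conflict_free(M):
    """True if no two columns of the binary matrix M are in conflict."""
    n = len(M[0]) if M else 0
    return not any(columns_in_conflict(M, i, j)
                   for i, j in combinations(range(n), 2))


# Hypothetical example: the forbidden 3 x 2 pattern itself.
forbidden = [[1, 1], [1, 0], [0, 1]]
print(is_conflict_free(forbidden))
```

The check inspects every pair of columns and every row, so it is a direct quadratic-in-columns test rather than an optimized one.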

Definition 1.2.

Let $M\in\{0,1\}^{m\times n}$. Label the rows of $M$ as $r_{1},r_{2},\dots,r_{m}$. A binary matrix $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ is a row split of $M$ if there exists a partition of the set of rows of $M^{\prime}$ into $m$ sets $R_{1},R_{2},\dots,R_{m}$ such that for all $i\in\{1,\dots,m\}$, $r_{i}$ is the bitwise OR of the binary vectors in $R_{i}$. The set $R_{i}$ of rows of $M^{\prime}$ is said to be the set of split rows of row $r_{i}$ (with respect to $M^{\prime}$).

For simplicity, we defined a row split as a binary matrix $M^{\prime}$ for which a suitable partition of rows exists. However, throughout the paper we will make a slight technical abuse of this terminology by considering any row split $M^{\prime}$ of $M$ as already equipped with an arbitrary (but fixed) partition of its rows $R_{1},\ldots,R_{m}$ satisfying the above condition. For an example of these notions, see Fig. 1. For the sake of clarity, from now on we omit displaying the zeros in binary matrices.
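Definition 1.2 can be checked mechanically once the partition $R_{1},\ldots,R_{m}$ is given explicitly. The sketch below assumes matrices are lists of 0/1 rows and that the partition is passed as a list `parts`, where `parts[i]` holds the row indices of $M^{\prime}$ forming $R_{i}$. The names `is_row_split`, `bitwise_or`, and `parts`, and the example matrices, are hypothetical.

```python
def bitwise_or(rows, n):
    """Bitwise OR of a list of 0/1 vectors of length n."""
    out = [0] * n
    for row in rows:
        out = [a | b for a, b in zip(out, row)]
    return out


def is_row_split(M, M_split, parts):
    """Check that M_split, with blocks parts[0..m-1], is a row split of M."""
    n = len(M[0])
    # The blocks R_1, ..., R_m must partition the row indices of M'.
    flat = sorted(k for part in parts for k in part)
    if len(parts) != len(M) or flat != list(range(len(M_split))):
        return False
    # Each original row must be the bitwise OR of its (non-empty) block.
    return all(
        len(part) > 0
        and bitwise_or([M_split[k] for k in part], n) == list(M[i])
        for i, part in enumerate(parts)
    )


# Hypothetical example: the first row of M is split into two rows.
M = [[1, 1, 0], [1, 0, 0], [0, 1, 1]]
M_split = [[1, 0, 0], [0, 1, 0], [1, 0, 0], [0, 1, 1]]
parts = [[0, 1], [2], [3]]
print(is_row_split(M, M_split, parts))
```

Passing the partition explicitly mirrors the convention adopted in the text, where a row split is considered as already equipped with a fixed partition of its rows.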

We denote by $\gamma(M)$ the minimum number of rows in a conflict-free row split $M^{\prime}$ of $M$ . Formally, the minimum conflict-free row split problem is defined as follows:

Minimum Conflict-Free Row Split (MCRS):

Input: A binary matrix $M$ .

Task: Compute $\gamma(M)$ .

We will also consider a variant of the problem, proposed by Hajirasouliha and Raphael [18], in which the task is to compute a row split $M^{\prime}$ of $M$ such that the number of distinct rows in $M^{\prime}$ is minimized. Let $\eta(M)$ denote the minimum number of distinct rows in a conflict-free row split $M^{\prime}$ of $M$ . Similarly as above, we consider the corresponding optimization problem.

Minimum Distinct Conflict-Free Row Split (MDCRS):

Input: A binary matrix $M$ .

Task: Compute $\eta(M)$ .

The connection between conflict-free matrices and perfect phylogenies is well known: the rows of a binary matrix $M$ are the leaves of a perfect phylogeny if and only if $M$ is conflict-free (see [9, 17]). Moreover, if this is the case, then the corresponding tree can be retrieved from $M$ in time linear in the size of $M$ [16]. The intuition behind the fact that a conflict-free matrix corresponds to a perfect phylogeny is that one can map each row to a leaf of a tree, and each column to an edge, so that each row has a $1$ exactly on those columns that are mapped to the edges on the path from the root to the leaf corresponding to the row. The presence of the forbidden $3\times 2$ matrix from Definition 1.1 as a submatrix leads to a contradiction, since then the two distinct edges $e_{i}$ and $e_{j}$ to which columns $i$ and $j$ are mapped, respectively, are such that $e_{i}$ appears both before, and after, $e_{j}$ on a root-to-leaf path. We refer to [18, 20] and to references therein for further details on the biological aspects of the MCRS and the MDCRS problems.

Another well studied family of combinatorial objects closely related to the MCRS and the MDCRS problems are laminar set families. A set family $\mathcal{F}$ is said to be laminar if every two sets $A,B\in\mathcal{F}$ satisfy $A\cap B=\emptyset$ , $A\subseteq B$ , or $B\subseteq A$ . The connection with laminar families follows from the fact that a binary matrix $M$ is conflict-free if and only if the sets of rows indicating the positions of ones in the columns of $M$ form a laminar family. This connection will be exploited in Section 4.2. Laminar families of sets play an important role in network design problems [22], in the study of packing and covering problems [13, 5, 27], and in several other areas of combinatorial optimization, see, e.g., [28].
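The laminar characterization can be illustrated with a short sketch, assuming a matrix is a list of 0/1 rows and that each column is represented by the set of row indices where it has a one. The helper names `column_supports` and `is_laminar` are illustrative only.

```python
from itertools import combinations


def column_supports(M):
    """For each column, the set of row indices in which it has a 1."""
    return [frozenset(i for i, row in enumerate(M) if row[j] == 1)
            for j in range(len(M[0]))]


def is_laminar(family):
    """Every two sets are disjoint or one contains the other."""
    return all(a.isdisjoint(b) or a <= b or b <= a
               for a, b in combinations(family, 2))


# Hypothetical examples: a nested pair of columns and the forbidden pattern.
nested = [[1, 1], [1, 0]]
forbidden = [[1, 1], [1, 0], [0, 1]]
print(is_laminar(column_supports(nested)),
      is_laminar(column_supports(forbidden)))
```

Two columns are in conflict exactly when their supports intersect and neither contains the other, which is the situation this test rejects.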

In [20], the MCRS and the MDCRS problems were proved NP-hard, a related characterization of transitively orientable graphs was given, and a polynomial-time heuristic algorithm was proposed for the MCRS problem based on coloring cocomparability graphs (that is, complements of transitively orientable graphs). Following [20], the main aim of this paper is to further advance the understanding of structural and computational aspects of the MCRS and the MDCRS problems.

Our results and techniques. The first and main result of this paper shows that the MCRS and the MDCRS problems can be equivalently formulated as two optimization problems on branchings in a directed acyclic graph derived from the given binary matrix, the so-called containment digraph. (Precise definitions of these notions and the corresponding problems will be given in Section 2.) These equivalences lead to more transparent formulations of the two problems. We demonstrate the applicability and usefulness of these novel formulations by deriving the following results and insights about the MCRS and the MDCRS problems:

• We prove a new min-max result on digraphs strengthening Dilworth’s theorem on chain partitions and antichains in partially ordered sets. This result is described in Section 3.1, which can be read independently of the rest of the paper. This result, besides being interesting on its own as a generalization of a classical min-max result, connects well to the MCRS problem via the problem’s branching formulation. The constructive, algorithmic proof of the result shows that a related problem is polynomially solvable: a problem in which only a subset of all branchings of the containment digraph is examined, namely the so-called linear branchings (branchings corresponding to chain partitions of the partial order underlying the containment digraph). This approach leads to a new heuristic for the MCRS problem, improving on a previous heuristic from [20].

• We strengthen the NP-hardness results for the two problems to APX-hardness results.

• We complement the inapproximability results with three approximation algorithms: a $2$-approximation algorithm for the MDCRS problem (implying that the problem is APX-complete) and two approximation algorithms for the MCRS problem, the approximation ratios of which are expressed in terms of two parameters of the containment digraph, corresponding to the height and the width of the underlying partial order, respectively.

• The branching formulations allow for the development of faster exact exponential-time solutions for the two problems, when compared to a direct brute-force approach that follows directly from the problems’ definitions.

Comparison with related work. In [18], Hajirasouliha and Raphael introduced the so-called Minimum-Split-Row problem, in which only a given subset of rows of the input matrix needs to be split and, roughly speaking, the task is to minimize the number of additional rows in the resulting conflict-free row split. All results from [18] actually deal with the variant of the problem in which all rows need to be split (some perhaps trivially by setting $R_{i}=\{r_{i}\}$ ); in this case, the optimal value of the Minimum-Split-Row problem coincides with the difference $\gamma(M)-r(M)$ , where $r(M)$ is the number of rows of $M$ . In the same paper, a lower bound on the value of $\gamma(M)$ was derived and, in the concluding remarks of the paper, a study of the MDCRS problem was suggested. In subsequent works by Hujdurović et al. [20], the MCRS problem was introduced and several claims from [18] were proved incorrect, including an NP-hardness proof of the Minimum-Split-Row problem (which would imply NP-hardness of the MCRS problem). However, it was shown in [20] that the MCRS problem is indeed NP-hard, as is the MDCRS problem. Moreover, a polynomially solvable case of the MCRS problem was identified and an efficient heuristic algorithm for the problem on general instances was proposed, based on coloring cocomparability graphs.

The results of this paper improve on the previously known results about the two problems: NP-hardness results are strengthened to APX-hardness results, approximation algorithms for the two problems are proposed, and the heuristic algorithm for the MCRS problem given by Hujdurović et al. in [20] is improved. The key tools leading to most of these results are the newly proposed branching formulations and the new min-max theorem strengthening Dilworth’s theorem. The min-max theorem has a constructive algorithmic proof, leading to a polynomial-time algorithm to compute a chain partition of a given partially ordered set equipped with a monotone weight function such that the sum of the maximum weights in the chains is minimized. This result contrasts with known results in the literature implying that two natural variants of the problem are NP-hard: (i) the variant in which the chains used in the partition have to be of bounded size [29, 25], and (ii) the variant in which the weight function is not necessarily monotone, which corresponds to a variant of the graph coloring problem known as Weighted Coloring (see, e.g., [15, 8, 6, 3]), in the class of cocomparability graphs. We refer to the remarks following Corollary 3.5 in Section 3.1 for more details. See also Figure 10 in Section 5, where we summarize the relations between the problems introduced in this paper and several problems studied in the literature, along with the corresponding complexity results.

Structure of the paper. The branching formulations of the two problems are given in Section 2. A strengthening of Dilworth’s theorem and its connection to the MCRS problem is discussed in Section 3. APX-hardness proofs and approximation algorithms are presented in Section 4. We conclude the paper with a summary and some questions for future research in Section 5.

Remark on notation. A binary matrix $M\in\{0,1\}^{m\times n}$ is a matrix having $m$ rows and $n$ columns, and all entries $0$ or $1$. Each row of such a matrix is a vector in $\{0,1\}^{n}$; each column is a vector in $\{0,1\}^{m}$. We will usually denote by $R_{M}=\{r_{1},\ldots,r_{m}\}$ and $C_{M}=\{c_{1},\ldots,c_{n}\}$ the (multi)sets of rows and columns of $M$, respectively. The entry of $M$ at row $r_{i}$ and column $c_{j}$ will be denoted by $M_{i,j}$ or $M_{r_{i},j}$ when appropriate. For brevity, we will often write “the number of distinct rows (resp., columns) of $M$” to mean “the maximum number of pairwise distinct rows (resp., columns) of $M$”. Two rows (resp., columns) are considered distinct if they differ as binary vectors. All binary matrices in this paper will be assumed to contain no row whose entries are all $0$.

In our proofs and constructions we will often simplify the binary matrix $M$ under consideration by working instead with the matrix denoted by $\operatorname*{{Red}}(M)$ , obtained by taking from $M$ exactly one copy from each set of identical columns.
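A minimal sketch of the reduction $\operatorname*{{Red}}(M)$, assuming the matrix is a list of 0/1 rows. Keeping the first copy of each column and preserving the original column order is a choice made here for illustration; the text only requires exactly one copy from each set of identical columns. The name `red` is hypothetical.

```python
def red(M):
    """Keep one copy (the first) from each set of identical columns of M."""
    seen = set()
    keep = []
    for j in range(len(M[0])):
        col = tuple(row[j] for row in M)
        if col not in seen:
            seen.add(col)
            keep.append(j)
    return [[row[j] for j in keep] for row in M]


# Hypothetical example: the first two columns are identical.
print(red([[1, 1, 0], [0, 0, 1]]))
```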

An extended abstract of this work appeared in the proceedings of WG 2017 [19].

2 Formulations in Terms of Branchings in Directed Acyclic Graphs

In this section, we are going to formulate the MCRS and the MDCRS problems in terms of branchings in directed acyclic graphs (DAGs). First, we give the necessary definitions.

Definition 2.1.

Let $D=(V,A)$ be a DAG. A branching of $D$ is a subset $B$ of $A$ such that $(V,B)$ is a digraph in which for each vertex $v$ there is at most one arc leaving $v$ .

The following construction (see, e.g., [18]) can be performed on any given binary matrix $M$ and results in a directed acyclic graph. Given a column $c_{j}\in C_{M}$ , the support of $c_{j}$ is the set defined as $\{r_{i}\in R_{M}:M_{i,j}=1\}$ and denoted by $\operatorname*{supp}_{M}(c_{j})$ . Given a binary matrix $M\in\{0,1\}^{m\times n}$ , the containment digraph $D_{M}$ of $M$ is the directed acyclic graph with vertex set $V=\{\operatorname*{supp}_{M}(c):c\in C_{M}\}$ and arc set $A=\{(v,v^{\prime}):v,v^{\prime}\in V\land v\subset v^{\prime}\}$ where $\subset$ is the relation of proper inclusion of sets. See Fig. 2 for an example.
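The construction of the containment digraph can be sketched as follows, assuming a matrix is a list of 0/1 rows and that each vertex is stored as a `frozenset` of row indices. The function names and the example matrix are illustrative and are not the matrix of Fig. 2.

```python
def supports(M):
    """Support of each column: the set of row indices holding a 1."""
    return [frozenset(i for i, row in enumerate(M) if row[j] == 1)
            for j in range(len(M[0]))]


def containment_digraph(M):
    """Return (V, A): distinct column supports and proper-inclusion arcs."""
    V = set(supports(M))          # identical columns collapse to one vertex
    A = {(v, w) for v in V for w in V if v < w}   # v < w: proper subset
    return V, A


# Hypothetical example with supports {0,1,2}, {0,1} and {0,2}.
M = [[1, 1, 1], [1, 1, 0], [1, 0, 1]]
V, A = containment_digraph(M)
for v, w in sorted(A, key=lambda arc: sorted(arc[0])):
    print(sorted(v), "->", sorted(w))
```

Because arcs always point from a set to a proper superset, the resulting digraph is acyclic and transitively closed.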

Let $M\in\{0,1\}^{m\times n}$ be a binary matrix, let $D_{M}=(V,A)$ be the containment digraph of $M$, and let $B$ be a branching of $D_{M}$. For a vertex $v\in V$, we denote by $N^{-}_{B}(v)$ the set of all vertices $v^{\prime}\in V$ such that $(v^{\prime},v)\in B$. A source of $B$ is a vertex not entered by any arc of $B$. For a vertex $v\in V$, an element $r\in v$ (that is, a row of $M$) is said to be covered in $v$ with respect to $B$ (or just $B$-covered) if $r\in\cup\,N^{-}_{B}(v)$. (When it is clear which branching we are referring to, we will just say that “$r$ is covered in $v$”.) Analogously, we say that $r\in v$ is uncovered in $v$ with respect to $B$ if $r$ is not covered in $v$. A $B$-uncovered pair is a pair $(r,v)$ such that $r$ is a row of $M$, $v$ is a vertex of $D_{M}$ (that is, the support of a column of $M$), $r\in v$, and $r$ is uncovered in $v$ with respect to $B$. For a row $r$ of $M$, we will denote by $U_{B}(r)$ the set of all $B$-uncovered pairs with first coordinate $r$, and by $U(B)$ the set of all $B$-uncovered pairs. To illustrate these notions, we elaborate further on the example from Fig. 2 in Fig. 3, where two branchings $B_{1}$ and $B_{2}$ of $D_{M}$ are depicted, together with the uncovered pairs $(r,v)$ with respect to each of the two branchings.

For a branching $B\subseteq A$, we say that a vertex $v\in V$ is $B$-irreducible if there exists some element $r\in v$ that is uncovered in $v$ with respect to $B$ (equivalently, if $v\not\subseteq\cup\,N^{-}_{B}(v)$). In particular, every source of $B$ is $B$-irreducible. We denote by $I(B)$ the set of all $B$-irreducible vertices; see Fig. 3 for an example.
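The sets $U(B)$ and $I(B)$ can be computed directly from these definitions. The sketch below assumes vertices are `frozenset`s of row indices and a branching is a set of arcs `(tail, head)`. It does not verify that the arcs belong to $D_{M}$, and all names and the example are hypothetical.

```python
def is_branching(B):
    """At most one arc of B leaves each vertex."""
    tails = [v for (v, _) in B]
    return len(tails) == len(set(tails))


def uncovered_pairs(V, B):
    """U(B): pairs (r, v) with r in v and r in no B-in-neighbour of v."""
    U = set()
    for v in V:
        covered = set()
        for (u, w) in B:
            if w == v:
                covered |= u
        U |= {(r, v) for r in v if r not in covered}
    return U


def irreducible_vertices(V, B):
    """I(B): vertices having at least one uncovered element."""
    return {v for (_, v) in uncovered_pairs(V, B)}


# Hypothetical example: two sets a, b and their common superset t.
a, b, t = frozenset({0, 1}), frozenset({0, 2}), frozenset({0, 1, 2})
V = {a, b, t}
B = {(a, t), (b, t)}
print(len(uncovered_pairs(V, B)), len(irreducible_vertices(V, B)))
```

In this example every row of `t` is covered by `a` or `b`, so only the two sources contribute uncovered pairs.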

We denote by $\beta(M)$ the minimum number of elements in $U(B)$ over all branchings $B$ of $D_{M}$. Similarly, we denote by $\zeta(M)$ the minimum number of elements in $I(B)$ over all branchings $B$ of $D_{M}$. The corresponding optimization problems are the following:

Minimum Uncovering Branching (MUB):

Input: A binary matrix $M$ .

Task: Compute $\beta(M)$ .

Minimum Irreducing Branching (MIB):

Input: A binary matrix $M$ .

Task: Compute $\zeta(M)$ .

The announced equivalence between the MCRS and the MUB problems, and between the MDCRS and the MIB problems is captured in the following theorem. We denote by $\omega$ any real number such that there exists an ${\mathcal{O}}(n^{\omega})$ algorithm for multiplying two $n\times n$ binary matrices (e.g., $\omega=2.3728639$ [23]).

Theorem 2.1.

For every binary matrix $M\in\{0,1\}^{m\times n}$ with exactly $k$ distinct columns, the following holds:

1. Any branching $B$ of $D_{M}$ can be transformed in time ${\mathcal{O}}(kmn)$ to a conflict-free row split of $M$ with exactly $|U(B)|$ rows and with exactly $|I(B)|$ distinct rows.

2. Any conflict-free row split $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ of $M$ can be transformed in time ${\mathcal{O}}(mn+m^{\prime}k^{2}+k^{\omega})$ to a branching $B$ of $D_{M}$ such that $|U(B)|$ is at most the number of rows of $M^{\prime}$ and $|I(B)|$ is at most the number of distinct rows of $M^{\prime}$.

Consequently, for every binary matrix $M$ , we have $\gamma(M)=\beta(M)$ and $\eta(M)=\zeta(M)$ .

Results presented in Sections 3.2, 4.1, and 4.3 will rely on Theorem 2.1. Before giving a proof of the theorem, let us discuss one further consequence of it. The theorem allows for the development of faster exact exponential-time solutions for the two problems, when compared to a direct brute-force approach that follows directly from the problems’ definitions. Consider the simple approach of enumerating all possible branchings of $D_{M}$ and selecting the best one. Denoting by $W$ the set of vertices $u$ of $D_{M}$ of out-degree $d^{+}(u)$ at least one and disregarding polynomial factors, the time complexity of this approach is of the order ${\mathcal{O}}(\prod_{u\in W}d^{+}(u))={\mathcal{O}}(n^{n})={\mathcal{O}}(2^{n\log n})$ , where $n$ is the number of distinct columns of the input matrix $M$ . On the other hand, the time complexity of the straightforward approach to the two problems based on generating all possible row splits of $M$ cannot even be expressed as a function of $n$ only. A row with $k$ ones has at least as many splits as the number of partitions of a $k$ -element set, which is the quantity counted by the Bell number $B_{k}$ and clearly bounded from below by $2^{k}$ . Thus, for a matrix with $m$ rows, each with at least $n/2$ ones, the total number of row splits of $M$ is at least $2^{mn/2}$ .
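The enumeration approach described above can be sketched as follows. The sketch assumes matrices are lists of 0/1 rows and tries exactly one outgoing arc for every vertex of $W$, matching the product $\prod_{u\in W}d^{+}(u)$. This restriction loses nothing, since adding an arc to a branching can only enlarge the sets $N^{-}_{B}(v)$ and hence can only shrink $U(B)$ and $I(B)$. The function names are hypothetical, and the code is meant for tiny instances only.

```python
from itertools import product


def containment_digraph(M):
    """Distinct column supports and arcs for proper inclusion."""
    n = len(M[0])
    V = {frozenset(i for i, row in enumerate(M) if row[j] == 1)
         for j in range(n)}
    A = {(v, w) for v in V for w in V if v < w}
    return V, A


def uncovered_pairs(V, B):
    """U(B): pairs (r, v) with r in v and r in no B-in-neighbour of v."""
    U = set()
    for v in V:
        covered = set()
        for (u, w) in B:
            if w == v:
                covered |= u
        U |= {(r, v) for r in v if r not in covered}
    return U


def brute_force(M):
    """Return (beta, zeta) by trying one outgoing arc per vertex of W."""
    V, A = containment_digraph(M)
    W = [v for v in V if any(u == v for (u, _) in A)]
    options = [[(v, w) for (u, w) in A if u == v] for v in W]
    beta = zeta = None
    for arcs in product(*options):      # one branching per combination
        U = uncovered_pairs(V, set(arcs))
        irreducible = {v for (_, v) in U}
        beta = len(U) if beta is None else min(beta, len(U))
        zeta = len(irreducible) if zeta is None else min(zeta, len(irreducible))
    return beta, zeta


# Hypothetical example: columns 2 and 3 are in conflict.
print(brute_force([[1, 1, 1], [1, 1, 0], [1, 0, 1]]))
```

By Theorem 2.1 the two returned values equal $\gamma(M)$ and $\eta(M)$; for a matrix that is already conflict-free and has pairwise distinct rows, both coincide with its number of rows.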

Theorem 2.1 will be proved in two steps. First, we show how to split the input matrix $M$ in a conflict-free way, given a branching $B$ of its containment digraph; the number of rows (resp., distinct rows) of the resulting row split equals the number of $B$ -uncovered pairs (resp., $B$ -irreducible vertices). Second, we show that any conflict-free row split $M^{\prime}$ of $M$ can be reduced, by possibly deleting some rows, into a row split of $M$ obtained from some branching of $D_{M}$ (as in the first step).

The proof of the first part of Theorem 2.1 relies on the notion of a $B$ -split, defined as follows.

Definition 2.2.

Let $M$ be a binary matrix with rows $r_{1},\ldots,r_{m}$ and columns $c_{1},\ldots,c_{n}$ . For a branching $B$ of $D_{M}$ , we define the $B$ -split of $M$ , denoted by $M^{B}$ , as the matrix with rows indexed by the elements of the set $U(B)$ , and columns $c^{\prime}_{1},\ldots,c^{\prime}_{n}$ , as follows. Let $V=V(D_{M})$ and for all $j\in\{1,\ldots,n\}$ , let $v_{j}=\operatorname*{supp}_{M}(c_{j})$ (so $v_{j}\in V$ ). For a vertex $v\in V$ , we denote by $B^{+}(v)$ the set of all vertices in $V$ reachable by a directed path from $v$ in $(V,B)$ (note that $v\in B^{+}(v))$ . For all $(r,v)\in U(B)$ and all $j\in\{1,\ldots,n\}$ , set:

$$M^{B}_{(r,v),j}=\begin{cases}1,&\text{if }v_{j}\in B^{+}(v);\\ 0,&\text{otherwise.}\end{cases}$$

Note that if $M^{B}_{(r,v),j}=1$ , then $r\in v_{j}$ . See Fig. 3 for an example of a binary matrix $M$ with two branchings $B_{1}$ and $B_{2}$ of its containment digraph and the corresponding row splits.
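Definition 2.2 translates into a short construction. The sketch below assumes the matrix is a list of 0/1 rows, vertices are `frozenset`s of row indices, and the branching is given as a set of arcs `(tail, head)` of $D_{M}$. The name `b_split` and the example (which is not the one of Fig. 3) are hypothetical.

```python
def b_split(M, B):
    """Return the B-split of M as a dict mapping each B-uncovered pair
    (r, v) to its 0/1 row."""
    n = len(M[0])
    supp = [frozenset(i for i, row in enumerate(M) if row[j] == 1)
            for j in range(n)]
    succ = dict(B)  # tail -> head; well defined because B is a branching

    def reachable(v):
        """B^+(v): v together with everything on the B-path leaving v."""
        seen = {v}
        while v in succ:
            v = succ[v]
            seen.add(v)
        return seen

    split = {}
    for v in set(supp):
        covered = set()
        for (u, w) in B:
            if w == v:
                covered |= u
        reach = reachable(v)
        for r in v:
            if r not in covered:  # (r, v) is a B-uncovered pair
                split[(r, v)] = [1 if supp[j] in reach else 0
                                 for j in range(n)]
    return split


# Hypothetical example: supports t = {0,1,2}, a = {0,1}, b = {0,2}.
M = [[1, 1, 1], [1, 1, 0], [1, 0, 1]]
a, b, t = frozenset({0, 1}), frozenset({0, 2}), frozenset({0, 1, 2})
rows = b_split(M, {(a, t), (b, t)})
for (r, v), row in sorted(rows.items(), key=lambda kv: (kv[0][0], sorted(kv[0][1]))):
    print(r, sorted(v), row)
```

The walk in `reachable` terminates because every arc of $D_{M}$ leads to a proper superset, so $(V,B)$ has no directed cycle.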

In the following lemma we show that the $B$ -split of $M$ is a conflict-free row split of $M$ and compute the number of rows (resp., the number of distinct rows) of $M^{B}$ .

Lemma 2.2.

Let $M$ be a binary matrix without duplicated columns, $B$ a branching of $D_{M}$ , and let $M^{B}$ be the $B$ -split of $M$ . Then $M^{B}$ is a conflict-free row split of $M$ with $|U(B)|$ rows, splitting each row $r_{i}$ of $M$ into rows of $M^{B}$ indexed by $U_{B}(r_{i})$ . Moreover, the number of distinct rows in $M^{B}$ is $|I(B)|$ .

Proof.

It is clear that the number of rows in $M^{B}$ is $|U(B)|$. For a row $r$ of $M$, we claim that $r$ is the bitwise OR of the rows of $M^{B}$ indexed by the set $U_{B}(r)$. Suppose that $M_{r,j}=1$. Then $r\in v_{j}$. We claim that there exists a vertex $v\in V$ such that $(r,v)\in U_{B}(r)$ and $M^{B}_{(r,v),j}=1$. If $(r,v_{j})\in U_{B}(r)$, we can choose $v=v_{j}$ and we are done. If this is not the case, then $r$ is covered in $v_{j}$, and hence $r\in v_{k}$ for some $v_{k}$ such that $(v_{k},v_{j})\in B$. Now if $(r,v_{k})\not\in U_{B}(r)$, then we repeat the argument with $v_{k}$ replaced by a “covering” in-neighbor. The procedure has to terminate after finitely many steps, because $(V,B)$ is finite and acyclic. Hence, we may assume that $(r,v_{k})\in U_{B}(r)$. Since $v_{j}\in B^{+}(v_{k})$, this implies that $M^{B}_{(r,v_{k}),j}=1$. Suppose now that $M_{r,j}=0$. Then $r\not\in v_{j}$ and therefore $M^{B}_{(r,v),j}=0$ for every $(r,v)\in U_{B}(r)$. This shows that $r$ is the bitwise OR of the rows of $M^{B}$ indexed by $U_{B}(r)$, and therefore $M^{B}$ is a row split of the matrix $M$.

Suppose that two columns $c^{\prime}_{p}$ and $c^{\prime}_{q}$ of $M^{B}$ are in conflict. Then there exist row indices, $(r_{i},v_{i^{\prime}}),(r_{j},v_{j^{\prime}})$ and $(r_{k},v_{k^{\prime}})$ in $U(B)$ such that $M^{B}_{(r_{i},v_{i^{\prime}}),p}=M^{B}_{(r_{i},v_{i^{\prime}}),q}=M^{B}_{(r_{j},v_{j^{\prime}}),p}=M^{B}_{(r_{k},v_{k^{\prime}}),q}=1$ and $M^{B}_{(r_{j},v_{j^{\prime}}),q}=M^{B}_{(r_{k},v_{k^{\prime}}),p}=0$ . Since $M^{B}_{(r_{i},v_{i^{\prime}}),p}=M^{B}_{(r_{i},v_{i^{\prime}}),q}=1$ , we have $v_{p}\in B^{+}(v_{i^{\prime}})$ and $v_{q}\in B^{+}(v_{i^{\prime}})$ , that is, $v_{p}$ and $v_{q}$ are reachable by a directed path from $v_{i^{\prime}}$ in $(V,B)$ . Since $B$ is a branching, this is only possible if $v_{q}\in B^{+}(v_{p})$ or $v_{p}\in B^{+}(v_{q})$ ; we may assume without loss of generality that $v_{q}\in B^{+}(v_{p})$ . Since $M^{B}_{(r_{j},v_{j^{\prime}}),p}=1$ , it follows that $v_{p}\in B^{+}(v_{j^{\prime}})$ . This further implies that $v_{q}\in B^{+}(v_{j^{\prime}})$ . Since $r_{j}\in v_{j^{\prime}}$ , it follows that $r_{j}\in v_{q}$ , which contradicts the fact that $M^{B}_{(r_{j},v_{j^{\prime}}),q}=0$ . The obtained contradiction shows that $M^{B}$ is conflict-free.

It remains to prove that the number of distinct rows in $M^{B}$ is $|I(B)|$. Note that for any row $(r,v)$ in $M^{B}$ we have $v\in I(B)$. Let $v\in I(B)$. It is not difficult to see that for $r_{i},r_{j}\in v$ such that $(r_{i},v)\in U(B)$ and $(r_{j},v)\in U(B)$, the rows of $M^{B}$ indexed by $(r_{i},v)$ and $(r_{j},v)$ are equal, since both are determined by $B^{+}(v)$ alone. Hence the number of distinct rows in $M^{B}$ is at most $|I(B)|$. To complete the proof we construct a set of size $|I(B)|$ of pairwise distinct rows of $M^{B}$. For every $v_{i}\in I(B)$, let $r^{i}$ be an arbitrary element of the (non-empty) set $v_{i}\setminus\cup N^{-}_{B}(v_{i})$. Since $r^{i}$ is uncovered in $v_{i}$ with respect to $B$, the pair $(r^{i},v_{i})$ is an element of $U(B)$. We claim that the rows of $M^{B}$ indexed by $(r^{i},v_{i})$ over all $v_{i}\in I(B)$ are pairwise distinct. Suppose that there exist $v_{i}$ and $v_{j}$ in $I(B)$ such that $v_{i}\neq v_{j}$ and the rows of $M^{B}$ indexed by $(r^{i},v_{i})$ and $(r^{j},v_{j})$ are equal. Since $M^{B}_{(r^{i},v_{i}),i}=M^{B}_{(r^{j},v_{j}),j}=1$ and the two rows are equal, we infer that $M^{B}_{(r^{i},v_{i}),j}=M^{B}_{(r^{j},v_{j}),i}=1$. Therefore $v_{j}\in B^{+}(v_{i})$ and $v_{i}\in B^{+}(v_{j})$. Since the digraph $(V,B)$ is acyclic, it follows that $v_{i}=v_{j}$, a contradiction. This shows that there are exactly $|I(B)|$ distinct rows in $M^{B}$. ∎

The following lemma, exemplified in Fig. 4, is the key to the converse direction.

Lemma 2.3.

There exists an algorithm that takes as input a binary matrix $M$ without duplicated columns and a conflict-free row split $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ of $M$ , and computes in time ${\mathcal{O}}(m^{\prime}n^{2}+n^{\omega})$ a branching $B$ of $D_{M}$ such that $M^{B}$ can be obtained from $M^{\prime}$ by removing some rows.

Proof.

Denote the rows of $M$ with $r_{1},\ldots,r_{m}$ and the columns with $c_{1},\ldots,c_{n}$ . Let $R_{i}$ be the set of split rows of $r_{i}$ , and let $c^{\prime}_{i}$ be the column of $M^{\prime}$ corresponding to $c_{i}$ . For $i\in\{1,\ldots,n\}$ , let $v_{i}=\operatorname*{supp}_{M}(c_{i})$ and $v^{\prime}_{i}=\operatorname*{supp}_{M^{\prime}}(c^{\prime}_{i})$ . We claim that for every $i,j\in\{1,\ldots,n\}$ , if $(v^{\prime}_{i},v^{\prime}_{j})$ is an arc in $D_{M^{\prime}}$ then $(v_{i},v_{j})$ is an arc in $D_{M}$ . Suppose that $(v^{\prime}_{i},v^{\prime}_{j})$ is an arc in $D_{M^{\prime}}$ and $(v_{i},v_{j})$ is not an arc in $D_{M}$ . It follows that $v^{\prime}_{i}\subseteq v^{\prime}_{j}$ and $v_{i}\not\subseteq v_{j}$ . Let $r_{k}\in v_{i}\setminus v_{j}$ . Then $M^{\prime}_{r^{\prime},i}=1$ for some $r^{\prime}\in R_{k}$ . Since $v^{\prime}_{i}\subseteq v^{\prime}_{j}$ , it follows that $M^{\prime}_{r^{\prime},j}=1$ and consequently $r_{k}\in v_{j}$ , a contradiction.

We say that an arc $(v^{\prime}_{i},v^{\prime}_{j})$ is elementary in $D_{M^{\prime}}$ if there exists no $k\in\{1,\ldots,n\}$ such that both $(v^{\prime}_{i},v^{\prime}_{k})$ and $(v^{\prime}_{k},v^{\prime}_{j})$ are arcs of $D_{M^{\prime}}$ . Let $B$ be the subset of the arc set of $D_{M}$ defined by $(v_{i},v_{j})\in B$ if and only if $v^{\prime}_{i}\neq\emptyset$ and $(v^{\prime}_{i},v^{\prime}_{j})$ is an elementary arc of $D_{M^{\prime}}$ . We claim that $B$ is a branching of $D_{M}$ . Suppose that $(v_{i},v_{j})\in B$ and $(v_{i},v_{k})\in B$ , for $j\neq k$ . Then, both $(v^{\prime}_{i},v^{\prime}_{j})$ and $(v^{\prime}_{i},v^{\prime}_{k})$ are elementary arcs of $D_{M^{\prime}}$ , which implies that $v_{i}^{\prime}\subseteq v_{j}^{\prime}\cap v_{k}^{\prime}$ . Since $v^{\prime}_{i}\neq\emptyset$ and $M^{\prime}$ is conflict-free, it follows that $v_{j}^{\prime}\subseteq v_{k}^{\prime}$ or vice versa. By definition of $B$ , we obtain that $v_{j}^{\prime}=v_{k}^{\prime}$ . However, since $v_{j}\neq v_{k}$ , we may assume that there exists some $r_{p}\in v_{j}\setminus v_{k}$ , and therefore, there exists $r^{\prime}\in R_{p}$ , such that $r^{\prime}\in v^{\prime}_{j}$ . Since $r_{p}\not\in v_{k}$ we have $R_{p}\cap v^{\prime}_{k}=\emptyset$ , contrary to the fact that $r^{\prime}\in R_{p}\cap v^{\prime}_{j}=R_{p}\cap v^{\prime}_{k}$ . We conclude that $B$ is a branching.

Next, we prove that $M^{B}$ can be obtained from $M^{\prime}$ by removing some rows, or, equivalently, that there exists a one-to-one mapping assigning to each row of $M^{B}$ an identical row of $M^{\prime}$ . Every row of $M^{B}$ is indexed by an element of $U(B)$ . Every element of $U(B)$ is of the form $(r_{i},v_{k})$ with $(r_{i},v_{k})\in U_{B}(r_{i})$ for some $i\in\{1,\ldots,m\}$ and $r_{i}\in v_{k}$ . To define a mapping as above, it suffices to show that there exists a row $r^{\prime}$ of $M^{\prime}$ such that $r^{\prime}\in R_{i}$ and $r^{\prime}$ is equal to the row of $M^{B}$ indexed by $(r_{i},v_{k})$ , or more precisely that $M^{\prime}_{r^{\prime},j}=1$ if and only if $v_{j}\in B^{+}(v_{k})$ . First, observe that $r_{i}\in v_{k}$ implies that there exists some $r^{\prime}\in R_{i}$ such that $M^{\prime}_{r^{\prime},k}=1$ .

Assume that $M^{\prime}_{r^{\prime},j}=1$ . Since $M^{\prime}_{r^{\prime},k}=M^{\prime}_{r^{\prime},j}=1$ and $M^{\prime}$ is conflict-free, it follows that either $v^{\prime}_{j}\subseteq v^{\prime}_{k}$ or $v^{\prime}_{k}\subseteq v^{\prime}_{j}$ . Suppose that $v^{\prime}_{j}$ is a proper subset of $v^{\prime}_{k}$ and therefore there exists a non-trivial $v^{\prime}_{j},v^{\prime}_{k}$ -path $P^{\prime}$ consisting only of elementary arcs of $D_{M^{\prime}}$ . Since $r^{\prime}$ is an element of $v^{\prime}_{j}$ , the set $v^{\prime}_{j}$ is non-empty, which implies that the path $P^{\prime}$ corresponds to a non-trivial $v_{j},v_{k}$ -path $P$ in $B$ , therefore $v_{k}\in B^{+}(v_{j})$ . Since $M^{\prime}_{r^{\prime},j}=1$ , and $r^{\prime}\in R_{i}$ , it follows that $r_{i}\in v_{j}$ . However, this contradicts the fact that $(r_{i},v_{k})\in U_{B}(r_{i})$ . Therefore, $v^{\prime}_{k}\subseteq v^{\prime}_{j}$ and consequently $v_{j}\in B^{+}(v_{k})$ . We proved that $M^{\prime}_{r^{\prime},j}=1$ implies that $v_{j}\in B^{+}(v_{k})$ .

Suppose now that $v_{j}\in B^{+}(v_{k})$ . If $v_{j}=v_{k}$ , then $j=k$ and $M^{\prime}_{r^{\prime},j}=M^{\prime}_{r^{\prime},k}=1$ , as desired. If $v_{j}\neq v_{k}$ , then since $v_{j}\in B^{+}(v_{k})$ , it follows that $v^{\prime}_{k}\subseteq v^{\prime}_{j}$ . Combining this with the fact that $M^{\prime}_{r^{\prime},k}=1$ , we conclude that $M^{\prime}_{r^{\prime},j}=1$ . This completes the proof that the row $r^{\prime}$ of $M^{\prime}$ is equal to the row of $M^{B}$ indexed by $(r_{i},v_{k})$ .

The above considerations imply the existence of a mapping assigning to each row of $M^{B}$ an identical row of $M^{\prime}$ . In fact, any mapping as defined above is also one-to-one, which can be seen as follows. First, two rows of $M^{B}$ indexed by elements of $U(B)$ with distinct first coordinates, say $r_{i}$ and $r_{j}$ , will be mapped to rows of $M^{\prime}$ from $R_{i}$ and $R_{j}$ , respectively, and by construction the sets $R_{i}$ and $R_{j}$ are disjoint. Second, suppose we have two rows of $M^{B}$ indexed by elements of $U(B)$ with identical first coordinates but distinct second coordinates, say $(r_{i},v_{j})$ and $(r_{i},v_{k})$ . The last part of the proof of Lemma 2.2 implies that no two rows of $M^{B}$ indexed by pairs that differ in the values of their second coordinates are identical. Consequently, the images of rows of $M^{B}$ indexed by $(r_{i},v_{j})$ and $(r_{i},v_{k})$ are also not identical (as binary vectors), and therefore they correspond to different rows of $M^{\prime}$ .

We conclude that $M^{B}$ can be obtained from $M^{\prime}$ by deleting some rows.

It remains to estimate the time complexity of computing branching $B$ . First, we compute the containment digraph $D_{M^{\prime}}$ in time ${\mathcal{O}}(m^{\prime}n^{2})$ . Second, we compute the set $A^{\prime}$ of elementary arcs of $D_{M^{\prime}}$ in time ${\mathcal{O}}(n^{\omega})$ using the algorithm of Aho et al. [1]. Finally, branching $B$ can be computed from $A^{\prime}$ in time ${\mathcal{O}}(|A^{\prime}|)={\mathcal{O}}(n^{2})$ . The claimed running time follows. ∎

Now we have everything ready to prove Theorem 2.1.

Theorem 2.1 (restated).

For every binary matrix $M\in\{0,1\}^{m\times n}$ with exactly $k$ distinct columns, the following holds:

1. Any branching $B$ of $D_{M}$ can be transformed in time ${\mathcal{O}}(kmn)$ to a conflict-free row split of $M$ with exactly $|U(B)|$ rows and with exactly $|I(B)|$ distinct rows.

2. Any conflict-free row split $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ of $M$ can be transformed in time ${\mathcal{O}}(mn+m^{\prime}k^{2}+k^{\omega})$ to a branching $B$ of $D_{M}$ such that $|U(B)|$ is at most the number of rows of $M^{\prime}$ and $|I(B)|$ is at most the number of distinct rows of $M^{\prime}$ .

Consequently, for every binary matrix $M$ , we have $\gamma(M)=\beta(M)$ and $\eta(M)=\zeta(M)$ .

Proof.

Let $B$ be a branching of $D_{M}$ . By Lemma 2.2, it suffices to show that $M^{B}$ , the $B$ -split of $M$ , can be computed in time ${\mathcal{O}}(kmn)$ . This can be achieved as follows. First, we compute the reduced matrix $\operatorname*{{Red}}(M)$ in time ${\mathcal{O}}(mn)$ using radix sort. Second, we compute the containment digraph $D_{M}$ in time ${\mathcal{O}}(k^{2}m)={\mathcal{O}}(kmn)$ by performing pairwise comparisons of columns of $\operatorname*{{Red}}(M)$ . Third, we compute the set $U(B)$ in time ${\mathcal{O}}(k^{2}m)={\mathcal{O}}(kmn)$ by checking for each of the $k$ vertices $v\in V(D_{M})$ , each of the ${\mathcal{O}}(m)$ elements $r\in v$ , and each of the ${\mathcal{O}}(k)$ in-neighbors $u$ of $v$ in $D_{M}$ whether $r\in u$ . Fourth, in time ${\mathcal{O}}(k^{2})$ we compute for each $v\in V(D_{M})$ the set $B^{+}(v)$ . Finally, in time ${\mathcal{O}}(|U(B)|n)$ we compute the matrix $M^{B}$ using the definition. Note that $|U(B)|\leq km$ , hence ${\mathcal{O}}(|U(B)|n)={\mathcal{O}}(kmn)$ and the claimed time complexity follows.
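The construction of $M^{B}$ outlined above can be illustrated with a short Python sketch. It is only an illustration under stated assumptions, not the procedure with the ${\mathcal{O}}(kmn)$ bound: the name `b_split` and the encoding of a branching as a dictionary mapping each tail vertex to its unique head are hypothetical, and the notions $U(B)$ , $I(B)$ and $B^{+}(v)$ are taken as they are used in the proofs of Lemmas 2.2 and 2.3 (a pair $(r,v)$ belongs to $U(B)$ when $r\in v$ lies in no in-neighbor of $v$ in $B$ , and column $j$ of the row indexed by $(r,v)$ is $1$ exactly when $\operatorname*{supp}_{M}(c_{j})\in B^{+}(v)$ ).

```python
def b_split(M, B):
    """Sketch: return (U, I, rows) for a binary matrix M and a branching B.

    M is a list of 0/1 rows. B is a dict {tail: head} on frozensets of row
    indices (hypothetical encoding; each vertex has at most one out-arc).
    """
    m, n = len(M), len(M[0])
    # Support of column j: the set of rows having a 1 in that column.
    supp = [frozenset(i for i in range(m) if M[i][j]) for j in range(n)]
    # Vertices of the containment digraph: the distinct column supports.
    vertices = sorted(set(supp), key=sorted)

    def reach(v):
        # B^+(v): v together with everything reachable from v along B.
        out = {v}
        while v in B:
            v = B[v]
            out.add(v)
        return out

    U, I, rows = [], [], []
    for v in vertices:
        covered = set()
        for tail, head in B.items():
            if head == v:          # tail is an in-neighbor of v in B
                covered |= tail
        uncovered = sorted(v - covered)
        if uncovered:
            I.append(v)
        bplus = reach(v)
        for r in uncovered:
            U.append((r, v))
            rows.append([1 if supp[j] in bplus else 0 for j in range(n)])
    return U, I, rows


# Column supports {0}, {0,1}, {0,2}; the branching keeps the arc {0} -> {0,1}.
M = [[1, 1, 1], [0, 1, 0], [0, 0, 1]]
B = {frozenset({0}): frozenset({0, 1})}
U, I, rows = b_split(M, B)
```

In this small instance the split has $|U(B)|=4$ rows, of which $|I(B)|=3$ are distinct, and the bitwise OR of the rows indexed by pairs with first coordinate $r_{i}$ gives back row $r_{i}$ of $M$ .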

Now, let $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ be a conflict-free row split of $M$ .

Consider first the case when $M$ is without duplicated columns. In this case $k=n$ and by Lemma 2.3, in time ${\mathcal{O}}(m^{\prime}n^{2}+n^{\omega})$ a branching $B$ of $D_{M}$ can be computed such that $M^{B}$ , the $B$ -split of $M$ , can be obtained from $M^{\prime}$ by removing some rows. Since $|U(B)|$ (resp., $|I(B)|$ ) equals the number of rows (resp., the number of distinct rows) of $M^{B}$ , this implies that $|U(B)|$ is at most the number of rows of $M^{\prime}$ and $|I(B)|$ is at most the number of distinct rows of $M^{\prime}$ .

Consider now the general case. Let $X$ be a set of $k$ columns of $M$ such that the matrix $\operatorname*{{Red}}(M)$ can be identified with the submatrix of $M$ obtained by considering only the columns in $X$ . Let $M^{\prime\prime}\in\{0,1\}^{m^{\prime}\times k}$ be the submatrix of $M^{\prime}$ obtained by considering only the $k$ columns in $X$ . Then, $M^{\prime\prime}$ is a conflict-free row split of $\operatorname*{{Red}}(M)$ . Note that matrix $M^{\prime\prime}$ can be computed in time proportional to its size, $m^{\prime}k$ , plus the number of columns, $n$ , plus the time it takes to compute $\operatorname*{{Red}}(M)$ , which can be done in time ${\mathcal{O}}(mn)$ using radix sort. Since $\operatorname*{{Red}}(M)$ is without duplicated columns, we have by the previous case that in time ${\mathcal{O}}(m^{\prime}k^{2}+k^{\omega})$ a branching $B$ of $D_{\operatorname*{{Red}}(M)}=D_{M}$ can be computed such that $|U(B)|$ is at most the number of rows of $M^{\prime\prime}$ and $|I(B)|$ is at most the number of distinct rows of $M^{\prime\prime}$ . Since the number of rows of $M^{\prime\prime}$ equals the number of rows of $M^{\prime}$ , this immediately implies that $|U(B)|$ is at most the number of rows of $M^{\prime}$ . Also, by construction, any two distinct rows of $M^{\prime\prime}$ correspond to a pair of distinct rows of $M^{\prime}$ and hence the number of distinct rows of $M^{\prime\prime}$ is at most the number of distinct rows of $M^{\prime}$ . This implies that $|I(B)|$ is at most the number of distinct rows of $M^{\prime}$ . The total time complexity is ${\mathcal{O}}(m^{\prime}k+mn+m^{\prime}k^{2}+k^{\omega})={\mathcal{O}}(mn+m^{\prime}k^{2}+k^{\omega})$ , which establishes the second part of the theorem.

Finally, we show that $\gamma(M)=\beta(M)$ and $\eta(M)=\zeta(M)$ . Let $B$ be a branching of $D_{M}$ such that $|U(B)|=\beta(M)$ . By the first part of the theorem, there exists a conflict-free row split $M^{\prime}$ of $M$ with $|U(B)|$ rows, therefore $\gamma(M)\leq|U(B)|=\beta(M)$ . Conversely, if $M^{\prime}$ is a conflict-free row split of $M$ with $\gamma(M)$ rows, then there exists a branching $B$ of $D_{M}$ such that $|U(B)|$ is at most the number of rows of $M^{\prime}$ (that is, $\gamma(M)$ ). This implies $\beta(M)\leq|U(B)|\leq\gamma(M)$ . Therefore, $\gamma(M)=\beta(M)$ . The proof of equality $\eta(M)=\zeta(M)$ is analogous. ∎

3 A Strengthening of Dilworth’s Theorem and its Connection to the Minimum Conflict-Free Row Split Problem

By Theorem 2.1, the MCRS problem can be concisely formulated in terms of a problem on branchings in a derived digraph. As shown by Hujdurović et al. in [20], the MCRS problem is NP-hard; consequently, the MUB problem is also NP-hard. In this section we show that a related problem in which we examine only a subset of all the branchings of the containment digraph of the input binary matrix is polynomially solvable. This will be achieved by deriving, in Section 3.1, a min-max theorem generalizing the classical Dilworth’s theorem on partially ordered sets, which may be of independent interest. The resulting heuristic algorithm will be described in Section 3.2 (see also Remark 4.13).

3.1 A Min-Max Relation Strengthening Dilworth’s Theorem

This section can be read independently of the rest of the paper.

Consider a pair $(D,f)$ where $D=(V,A)$ is a DAG and $f:V\to{{\mathbb{Z}}_{+}}$ is a weight function of $D$ . (We use ${{\mathbb{Z}}_{+}}$ for the set of non-negative integers.) The weight function $f$ is called monotone if $f_{u}\leq f_{v}$ for every $(u,v)\in A$ .

In $D$ , a non-trivial path is a directed path with at least one arc. We denote by $D^{t}$ the transitive closure of $D$ , that is, the DAG $(V,A^{t})$ on the same vertex set as $D$ having an arc $(u,v)\in A^{t}$ if and only if there exists a non-trivial path in $D$ from $u$ to $v$ . A chain in $D$ is a sequence of vertices $C=(v_{1},v_{2},\ldots,v_{s})$ such that $(v_{i},v_{i+1})\in A^{t}$ for all $i\in\{1,\ldots,s-1\}$ ; sometimes we regard $C$ as the set of its vertices $C=\{v_{1},v_{2},\ldots,v_{s}\}$ . The price of chain $C$ is given by $\Pi(C)=\max_{v\in C}f_{v}$ . A family of vertex-disjoint chains $P=\{C_{1},\ldots,C_{p}\}$ is called a chain partition of $D$ if every vertex of $D$ is contained in precisely one chain of $P$ . The price of chain partition $P$ is defined as $\Pi(P)=\sum_{i=1}^{p}\Pi(C_{i})$ . Consider the following problem.

MinimumPriceChainPartition:

Input: A DAG $D=(V,A)$ and a monotone weight function $f:V\to{{\mathbb{Z}}_{+}}$ of $D$ .

Task: Compute a chain partition $P$ of $D$ such that the price $\Pi(P)$ is minimum possible.

In this section we give a polynomial-time algorithm and a min-max characterization for the above problem. As can be expected, the notion of antichain will play a main role in this min-max characterization. An antichain of $D$ is a set of vertices $N\subseteq V$ such that $N$ is an independent set (that is, a set of pairwise non-adjacent vertices) in $D^{t}$ ; in other words, no non-trivial path of $D$ has both endpoints in $N$ . Note that $|C\cap N|\leq 1$ for any chain $C$ and any antichain $N$ . The width of $D$ , denoted by ${\it wdt}(D)$ , is the maximum cardinality of an antichain in $D$ .

A classical theorem of Dilworth states that ${\it wdt}(D)$ equals the minimum number of chains in a chain partition of $D$ [7]. Moreover, a chain partition of $D$ into ${\it wdt}(D)$ chains can be computed in time $\widetilde{\mathcal{O}}(n^{\omega})$ where $n=|V(D)|$ , $\omega$ is any real number such that there exists an ${\mathcal{O}}(n^{\omega})$ algorithm for multiplying two $n\times n$ binary matrices (e.g., $\omega=2.373$ ), and the $\widetilde{\mathcal{O}}(\cdot)$ notation ignores logarithmic factors. Indeed, by applying the approach of Fulkerson [12] (see also [26, 24]), a minimum chain partition of $D$ can be computed by solving a maximum matching problem in a derived bipartite graph having $2n$ vertices. This can be done in time $\widetilde{\mathcal{O}}(n^{\omega})$ using the algorithm of [21]. (Alternatively, one could use the bipartite matching algorithm from [10] to obtain the (incomparable) running time of ${\mathcal{O}}(\sqrt{n}m\log_{n}(n^{2}/m))$ where $m=|A^{t}|$ is the number of edges in the transitive closure of $D$ . For the sake of simplicity of presentation, we state the theorem with the running time resulting from using the Ibarra-Moran algorithm.)

For later use, we summarize these facts as follows.

Theorem 3.1 (Dilworth’s theorem).

Every DAG $D$ admits a chain partition of size ${\it wdt}(D)$ . Such a chain partition can be computed in time $\widetilde{\mathcal{O}}(|V(D)|^{\omega})$ .
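The matching-based approach mentioned above can be sketched as follows. This is a minimal illustration, not the algorithm behind the stated running time: it uses a plain augmenting-path bipartite matching (cubic time in the worst case) instead of the matrix-multiplication-based algorithm of [21], and the function name `min_chain_partition` and the integer vertex labels are assumptions made for the example. Each vertex gets a left and a right copy, every arc $(u,v)$ of $D^{t}$ becomes an edge from the left copy of $u$ to the right copy of $v$ , and a matching with $k$ edges corresponds to a chain partition with $n-k$ chains.

```python
def min_chain_partition(n, arcs):
    """Minimum chain partition of a DAG on vertices 0..n-1 (sketch)."""
    # Transitive closure, since chains may skip over intermediate vertices.
    reach = [[False] * n for _ in range(n)]
    for u, v in arcs:
        reach[u][v] = True
    for k in range(n):
        for i in range(n):
            if reach[i][k]:
                for j in range(n):
                    if reach[k][j]:
                        reach[i][j] = True

    succ = [-1] * n  # succ[u]: vertex following u on its chain (matched edge)
    pred = [-1] * n  # pred[v]: vertex preceding v on its chain

    def augment(u, seen):
        # Try to match the left copy of u, re-routing earlier matches if needed.
        for v in range(n):
            if reach[u][v] and not seen[v]:
                seen[v] = True
                if pred[v] == -1 or augment(pred[v], seen):
                    succ[u], pred[v] = v, u
                    return True
        return False

    for u in range(n):
        augment(u, [False] * n)

    # Every vertex without a predecessor starts a chain; follow succ pointers.
    chains = []
    for v in range(n):
        if pred[v] == -1:
            chain = [v]
            while succ[chain[-1]] != -1:
                chain.append(succ[chain[-1]])
            chains.append(chain)
    return chains
```

For instance, on the DAG with arcs $(0,2)$ , $(1,2)$ , $(1,3)$ the sketch returns two chains, matching the width of that DAG.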

Our characterization will be a refinement of Dilworth’s theorem and its algorithmic proof makes use of Dilworth’s theorem as a subroutine. We must introduce one further notion however. A tower of antichains of $D$ is a sequence of antichains of $D$ , $T=(N_{1},N_{2},\ldots,N_{{\it wdt}(D)})$ , with $|N_{i}|=i$ . The value of an antichain $N$ is given by ${\it val}(N)=\min_{v\in N}f_{v}$ and the value of tower $T=(N_{1},N_{2},\ldots,N_{{\it wdt}(D)})$ is defined as ${\it val}(T)=\sum_{i=1}^{{\it wdt}(D)}{\it val}(N_{i})$ .

To appreciate the purpose of this notion, we begin with a simple observation.

Lemma 3.2.

Let $D$ be a DAG, let $P=\{C_{1},\ldots,C_{p}\}$ be a chain partition of $D$ , and let $T=(N_{1},N_{2},\ldots,N_{{\it wdt}(D)})$ be a tower of antichains of $D$ . Then, $\Pi(P)\geq{\it val}(T)$ even if the weight function $f$ is not monotone.

Proof.

For every chain $C$ and every antichain $N$ we have that $|C\cap N|\leq 1$ . Moreover, if $|C\cap N|=1$ , then $\Pi(C)\geq{\it val}(N)$ . Indeed, if $C\cap N=\{z\}$ , then

$$\Pi(C)=\max_{v\in C}f_{v}\geq f_{z}\geq\min_{v\in N}f_{v}={\it val}(N).$$

Since $P$ is a chain partition of $D$ , then $|P|\geq{\it wdt}(D)$ , and we can always rename its chains as $\tilde{C}_{1},\tilde{C}_{2},\ldots,\tilde{C}_{p}$ in such a way that, for every $i=1,\ldots,{\it wdt}(D)$ , chain $\tilde{C}_{i}$ intersects the antichain $N_{i}$ . At this point,

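$$\Pi(P)=\sum_{i=1}^{p}\Pi(\tilde{C}_{i})\geq\sum_{i=1}^{{\it wdt}(D)}\Pi(\tilde{C}_{i})\geq\sum_{i=1}^{{\it wdt}(D)}{\it val}(N_{i})={\it val}(T).$$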

∎

For the case of monotone weight functions, the following min-max strengthening of Dilworth’s theorem holds.

Theorem 3.3.

Let $D$ be a DAG and let $f$ be a monotone weight function of $D$ . Then $D$ admits a chain partition $P=\{C_{1},\ldots,C_{{\it wdt}(D)}\}$ and a tower of antichains $T=(N_{1},N_{2},\ldots,N_{{\it wdt}(D)})$ such that $\Pi(P)={\it val}(T)$ . Such a pair $(P,T)$ can be computed in time $\widetilde{\mathcal{O}}(|V(D)|^{\omega+1})$ .

Proof.

The proof is by induction on $n=|V(D)|$ . Clearly, the statement holds for $n=1$ . As for the inductive step, let $n>1$ and consider a vertex $v\in V(D)$ without any incoming arcs and such that $f_{v}\leq f_{v^{\prime}}$ for all $v^{\prime}\in V(D)$ . Such a vertex exists since the subgraph of $D$ induced by the set of vertices achieving the minimum value of $f$ is acyclic. Let $D^{\prime}=D-v$ , and consider a chain partition $P^{\prime}=\{C_{1},\ldots,C_{{\it wdt}(D^{\prime})}\}$ of $D^{\prime}$ and a tower of antichains $T^{\prime}=(N_{1},N_{2},\ldots,N_{{\it wdt}(D^{\prime})})$ of $D^{\prime}$ such that $\Pi(P^{\prime})={\it val}(T^{\prime})$ .

Two cases are possible. If ${\it wdt}(D)>{\it wdt}(D^{\prime})$ then let $P$ be obtained from $P^{\prime}$ by adding a chain $C$ comprising the sole vertex $v$ and let $T$ be obtained from $T^{\prime}$ by adding any antichain $N_{{\it wdt}(D)}$ of $D$ such that $|N_{{\it wdt}(D)}|={\it wdt}(D)$ . Since $|N_{{\it wdt}(D)}|>{\it wdt}(D^{\prime})$ , we infer that $v\in N_{{\it wdt}(D)}$ and hence $\min_{u\in N_{{\it wdt}(D)}}f_{u}=f_{v}=\Pi(C)$ . Therefore, $\Pi(P)={\it val}(T)$ closing the induction in this case.

Assume therefore that ${\it wdt}(D)={\it wdt}(D^{\prime})$ . Let $T$ be an antichain in $D^{\prime}$ with $|T|={\it wdt}(D)$ and let $\widehat{T}$ be the set of vertices of $D$ from which there is a non-trivial path to a vertex of $T$ . Notice that $v\in\widehat{T}$ since $T\cup\{v\}$ is not an antichain and $v$ is a source vertex of $D$ . The DAG $D[V(D)\setminus\widehat{T}]$ is an acyclic subgraph of $D^{\prime}$ of width at least $|T|=wdt(D^{\prime})$ , since $T$ is a subset of its vertex set. Moreover, while the width of an arbitrary induced subgraph can in general increase with vertex removal, this is not the case for $D[V(D)\setminus\widehat{T}]$ , because any path in $D$ between two vertices of $V(D)\setminus\widehat{T}$ is also a path in $D[V(D)\setminus\widehat{T}]$ , by the choice of $\widehat{T}$ . It follows that the DAG $D[V(D)\setminus\widehat{T}]$ is of width $|T|={\it wdt}(D^{\prime})$ ; hence, by the inductive hypothesis, it admits a chain partition $P^{T}=\{C^{T}_{1},\ldots,C^{T}_{{\it wdt}(D^{\prime})}\}$ with $\Pi(P^{T})\leq\Pi(P^{\prime})={\it val}(T^{\prime})$ . (Indeed, we could just take $C^{T}_{i}:=C_{i}\setminus\widehat{T}$ for every $i\in\{1,\ldots,{\it wdt}(D^{\prime})\}$ .) Also the acyclic subgraph $D[T\cup\widehat{T}]$ of $D$ has width $|T|={\it wdt}(D^{\prime})$ ; hence, by Dilworth’s theorem it admits a chain partition $P^{\widehat{T}}=\{C^{\widehat{T}}_{1},C^{\widehat{T}}_{2},\ldots,C^{\widehat{T}}_{{\it wdt}(D^{\prime})}\}$ covering all its vertices. Now we construct our chain partition for $D$ : let $T=\{t_{1},t_{2},\ldots,t_{|T|}\}$ . After a possible renaming of the chains in the two chain partitions, we can assume that $C^{T}_{i}\cap T=\{t_{i}\}$ and $C^{\widehat{T}}_{i}\cap T=\{t_{i}\}$ for every $i=1,\ldots,{\it wdt}(D^{\prime})={\it wdt}(D)=|T|$ , and hence define the chain $\tilde{C}_{i}=C^{\widehat{T}}_{i}\cup C^{T}_{i}$ . 
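(Indeed, $t_{i}$ will be the last vertex of $C^{\widehat{T}}_{i}$ and the first vertex of $C^{T}_{i}$ , thus this chaining of chains can be performed.) Note that $\tilde{P}:=\{\tilde{C}_{1},\ldots,\tilde{C}_{{\it wdt}(D)}\}$ is a chain partition of $D$ with $\Pi(\tilde{P})=\Pi(P^{T})\leq\Pi(P^{\prime})={\it val}(T^{\prime})$ . Clearly, $T^{\prime}$ is a valid tower of antichains for $D$ , since ${\it wdt}(D)={\it wdt}(D^{\prime})$ . By Lemma 3.2, $\Pi(\tilde{P})\geq{\it val}(T^{\prime})$ , and hence $\Pi(\tilde{P})={\it val}(T^{\prime})$ , closing the induction in this case as well.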

The above proof of the existence of a pair $(P,T)$ of a chain partition and a tower of antichains of $D$ satisfying $\Pi(P)={\it val}(T)$ is constructive and can be turned into a $\widetilde{\mathcal{O}}(|V(D)|^{\omega+1})$ time algorithm for computing such a pair $(P,T)$ . Indeed, at each step of the algorithm, we delete one vertex, make one recursive call to the algorithm, compute the set $\widehat{T}$ and the acyclic subgraph $D[T\cup\widehat{T}]$ together with a chain partition $P^{\widehat{T}}=\{C^{\widehat{T}}_{1},C^{\widehat{T}}_{2},\ldots,C^{\widehat{T}}_{{\it wdt}(D^{\prime})}\}$ covering all its vertices. The time complexity of each step is dominated by computing $P^{\widehat{T}}$ . By Theorem 3.1, this can be done in time $\widetilde{\mathcal{O}}(|V(D)|^{\omega})$ . The claimed time complexity of $\widetilde{\mathcal{O}}(|V(D)|^{\omega+1})$ follows. ∎

To see that Theorem 3.3 is a strengthening of Dilworth’s theorem, consider an arbitrary DAG $D=(V,A)$ and let $f$ be the weight function of $D$ that is constantly equal to $1$ . Then, the price of any chain $C$ is $\Pi(C)=\max_{v\in C}f_{v}=1$ and the price of a chain partition $P$ equals its cardinality. Moreover, the value of any antichain $N$ is ${\it val}(N)=\min_{v\in N}f_{v}=1$ , and consequently the value of any tower $T=(N_{1},N_{2},\ldots,N_{{\it wdt}(D)})$ of antichains is ${\it val}(T)=\sum_{i=1}^{{\it wdt}(D)}{\it val}(N_{i})={\it wdt}(D)$ . Since ${\it wdt}(D)$ is a lower bound on the cardinality of any chain partition, applying Theorem 3.3 to $(D,f)$ gives exactly the statement of Dilworth’s theorem for $D$ .

We would also like to emphasize that due to the non-linearity of the definitions of the price of a chain and the value of an antichain, Theorem 3.3 is incomparable with the classical weighted generalization of Dilworth’s theorem due to Frank [11].

Remark 3.4.

The monotonicity assumption in Theorem 3.3 is necessary. If we drop it, the price $\min_{P}\Pi(P)$ and the value $\max_{T}{\it val}(T)$ may diverge. Consider the DAG on vertex set $\{a\},\{b\},\{a,b\},\{b,c\}$ , with $f_{\{a\}}=f_{\{b,c\}}=z$ and $f_{\{b\}}=f_{\{a,b\}}=Z$ , with $0<z<Z<2z$ , and where the arcs are according to set inclusion. Here, $\min_{P}\Pi(P)=2Z$ whereas $\max_{T}{\it val}(T)=Z+z$ .

On the other hand, a simple application of Dilworth’s theorem shows that the monotonicity assumption is not necessary in the case of $0,1$ -weight functions.
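The example of Remark 3.4 is small enough to be checked exhaustively. The following Python sketch does so by brute force, enumerating all chain partitions and all antichains; it is exponential and meant only as a sanity check. The helper names and the concrete values $z=2$ , $Z=3$ (which satisfy $0<z<Z<2z$ ) are choices made for this illustration. Run on the same DAG with the monotone weights $f_{v}=|v|$ , it also illustrates the equality of Theorem 3.3.

```python
from itertools import combinations


def set_partitions(items):
    """Yield all partitions of a list into non-empty blocks."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in set_partitions(rest):
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        yield [[first]] + part


def comparable(u, v):
    # Arcs follow set inclusion, so comparability is proper containment.
    return u < v or v < u


def min_price_and_max_tower(vertices, f):
    """Return (min price of a chain partition, max value of a tower)."""
    def is_chain(S):
        return all(comparable(u, v) for u, v in combinations(S, 2))

    def is_antichain(S):
        return not any(comparable(u, v) for u, v in combinations(S, 2))

    min_price = min(
        sum(max(f[v] for v in C) for C in P)
        for P in set_partitions(vertices)
        if all(is_chain(C) for C in P)
    )
    # The levels of a tower can be chosen independently of each other,
    # so the best tower takes the best antichain of each size 1..width.
    tower = 0
    for size in range(1, len(vertices) + 1):
        vals = [min(f[v] for v in N)
                for N in combinations(vertices, size) if is_antichain(N)]
        if not vals:
            break
        tower += max(vals)
    return min_price, tower


a, b, ab, bc = map(frozenset, ["a", "b", "ab", "bc"])
V = [a, b, ab, bc]
non_monotone = {a: 2, bc: 2, b: 3, ab: 3}   # z = 2, Z = 3
monotone = {v: len(v) for v in V}            # f_v = |v|
print(min_price_and_max_tower(V, non_monotone))  # (6, 5), i.e. (2Z, Z + z)
print(min_price_and_max_tower(V, monotone))      # (4, 4)
```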

Lemma 3.2 and Theorem 3.3 imply the following.

Corollary 3.5.

MinimumPriceChainPartition can be solved optimally in time $\widetilde{\mathcal{O}}(|V(D)|^{\omega+1})$ . More specifically, in the stated time a minimum price chain partition $P$ of $D$ can be found with the additional property that $|P|={\it wdt}(D)$ (hence $P$ is simultaneously a minimum price chain partition and a minimum size chain partition of $D$ ).

Two remarks are in order here, showing that the result of Corollary 3.5 is sharp in two ways. First, let us note that the variant of the MinimumPriceChainPartition problem in which the chains used in the partition have to be of bounded size was studied by Moonen and Spieksma in [25], who described a practical application encountered at Bruynzeel Storage Systems, a manufacturing company in the Netherlands, to a problem of optimally loading pallets on a truck. (The upper bound on the size of chains relates to the fact that trucks are of bounded height.) Moonen and Spieksma referred to the problem as “Minimum Weight Partition into $B$ -chains” (where $B$ is the upper bound on the size of the chains) and showed that the problem is APX-hard even in the case of unit weights, strengthening the previous NP-hardness result from [29].

Second, the variant of MinimumPriceChainPartition where the weight function $f$ is not restricted to be monotone is NP-hard. This follows from the fact that the Weighted Coloring problem is NP-hard in the class of interval graphs, as shown by Escoffier et al. [8]. The input to the Weighted Coloring problem is a graph $G=(V,E)$ and a weight function $f:V\to{{\mathbb{Z}}_{+}}$ and the task is to find a partition $\mathcal{I}$ of $V$ into independent sets minimizing the value of $\sum_{I\in\mathcal{I}}\max_{v\in I}f_{v}$ . The Weighted Coloring problem in interval graphs finds applications in distributed computing in transportation networks and in dynamic storage allocation in computer processes [15]. Given an interval graph $G=(V,E)$ represented by an interval model $(I_{v}=[a_{v},b_{v}]:v\in V)$ and a weight function $f:V\to{{\mathbb{Z}}_{+}}$ , the Weighted Coloring problem given $(G,f)$ is equivalent to the problem of finding a chain partition of the DAG with vertex set $V$ and arc set $\{(u,v):b_{u}<a_{v}\}$ of minimum price with respect to $f$ . The claimed NP-hardness follows.

3.2 Connection with the Minimum Conflict-Free Row Split Problem

We will now describe a heuristic algorithm for the MCRS problem based on Theorem 3.3 and its algorithmic proof. The basic idea is to search for an optimal solution only among linear branchings, where a branching of $D_{M}$ is said to be linear if it defines a subgraph of maximum in- and out-degree at most one, that is, a disjoint union of directed paths. Note that such branchings correspond bijectively to chain partitions of $D_{M}$ .

We denote with $\beta_{\ell}(M)$ the minimum number of elements in $U(B)$ over all linear branchings $B$ of $D_{M}$ . We now introduce the following problem, referred to as MinimumUncoveringLinearBranching: Given a binary matrix $M$ , compute a linear branching $B$ of $D_{M}$ such that $|U(B)|=\beta_{\ell}(M)$ .

For a binary matrix $M$ , define a function $f:V(D_{M})\to{{\mathbb{Z}}_{+}}$ with $f(v)=|v|$ (recall that vertices of $D_{M}$ are pairwise distinct subsets of $R_{M}$ ). By definition of $D_{M}$ , we have $u\subset v$ whenever $(u,v)$ is an arc in $D_{M}$ . This implies that $f$ is a monotone weight function of $D_{M}$ . It is not difficult to see that for a linear branching $B$ and its corresponding chain partition $P$ , we have $\Pi(P)=|U(B)|$ . Since linear branchings correspond bijectively to chain partitions, it follows that MinimumUncoveringLinearBranching is a special case of MinimumPriceChainPartition. Using Theorem 3.3, we obtain that a linear branching $B$ of $D_{M}$ with $|U(B)|=\beta_{\ell}(M)$ can be computed in time $\widetilde{\mathcal{O}}(|V(D_{M})|^{\omega+1})$ . This proves the following theorem.
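The identity $\Pi(P)=|U(B)|$ can be verified chain by chain. As a sketch, assume (as in the proofs of Section 2) that $U(B)$ consists of the pairs $(r,v)$ with $r\in v$ not contained in any in-neighbor of $v$ in $B$ . Let $C=(v_{1},\ldots,v_{s})$ be a chain of $P$ , so that $v_{1}\subset v_{2}\subset\cdots\subset v_{s}$ , the vertex $v_{1}$ has no in-neighbor in $B$ , and the only in-neighbor of $v_{i}$ in $B$ is $v_{i-1}$ for $i\geq 2$ . The number of elements of $U(B)$ with second coordinate in $C$ is then

$$|v_{1}|+\sum_{i=2}^{s}|v_{i}\setminus v_{i-1}|=|v_{1}|+\sum_{i=2}^{s}\bigl(|v_{i}|-|v_{i-1}|\bigr)=|v_{s}|=\max_{v\in C}f(v)=\Pi(C).$$

Summing over all chains of $P$ gives $|U(B)|=\sum_{C\in P}\Pi(C)=\Pi(P)$ .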

Theorem 3.6.

MinimumUncoveringLinearBranching can be solved to optimality in time $\widetilde{\mathcal{O}}(|V(D_{M})|^{\omega+1})$ .

Note that Theorem 3.6 yields a heuristic polynomial-time algorithm for the MUB problem, and consequently for the MCRS problem. We are now going to explain why this algorithm improves on the heuristic for the latter problem by Hujdurović et al. from [20]. For the sake of simplicity of exposition, suppose that the input matrix $M$ does not have any pairs of identical columns. (It is not difficult to see that this assumption is without loss of generality.) In this case, the algorithm from [20] returns a row split of the input matrix naturally derived from an optimal coloring of the complement of the underlying undirected graph of $D_{M}$ , which is a cocomparability graph and thus an optimal coloring can be computed efficiently, see, e.g., [14]. Such optimal colorings correspond bijectively to minimum chain partitions of $D_{M}$ ; each color class corresponds to a chain. In the terminology of branchings, the conflict-free row split of the input matrix $M$ returned by the heuristic from [20] is exactly the $B$ -split of $M$ (cf. Definition 2.2) where $B$ is the linear branching of $D_{M}$ corresponding to a minimum chain partition of $D_{M}$ .

In the above approach, any proper coloring could be used instead of an optimal coloring of the derived cocomparability graph. In branching terminology, choosing a proper coloring of the derived cocomparability graph so that the number of rows of the output row split is minimized corresponds exactly to MinimumUncoveringLinearBranching, which can be solved optimally by Theorem 3.6. Thus, the heuristic algorithm for the MCRS problem that returns the $B$ -split of $M$ where $B$ is an optimal solution to MinimumUncoveringLinearBranching always returns solutions that are at least as good as those computed by the algorithm by Hujdurović et al. from [20]. Moreover, note that by Corollary 3.5, digraph $D_{M}$ has a minimum price chain partition that is also minimum with respect to size. This implies the existence of an optimal solution to MinimumUncoveringLinearBranching on $M$ such that the corresponding chain partition is of size ${\it wdt}(M)$ and, equivalently, the existence of an optimal coloring of the derived cocomparability graph that minimizes the number of rows in the derived conflict-free row split of $M$ over all proper colorings of the derived graph.

Remark 3.7.

As discussed in [18, 20], the main motivation for the MCRS problem comes from cancer genomics, with the goal to reconstruct, from a set of given mixed tumor samples, a simplest possible mutational history of the tumor, represented by a rooted tree (without any restriction on the shape of the tree). Without going into details, let us note that the output of the heuristic algorithm for the MCRS problem given by Theorem 3.6 corresponds to a simplest possible reconstruction of the mutational history within a restricted space of rooted trees, namely within the space of rooted trees such that the root is the only node that is allowed to have more than one non-leaf child.

4 (In)approximability Issues

In this section we will discuss (in)approximability properties of the four problems studied in this paper, giving both APX-hardness results and approximation algorithms. The approximation ratios of some of our algorithms will be described in terms of the following parameters of the input matrix. Recall that the width of a DAG $D$ is the maximum cardinality of an antichain in $D$ . The height of a DAG $D$ is the maximum number of vertices in a directed path contained in $D$ . The width and the height of a binary matrix $M$ are denoted by ${\it wdt}(M)$ and by $h(M)$ , respectively, and defined as the width, resp. the height, of the containment digraph of $M$ .

4.1 Hardness Results

Our main inapproximability results are summarized in the following theorem, which shows hardness already for very restricted input instances.

Theorem 4.1.

The MUB and the MIB problems (and consequently the MCRS and the MDCRS problems) are APX-hard, even for instances of height $2$ .

The above result implies that none of the four problems admits a polynomial-time approximation scheme (PTAS), unless P = NP. Proving that a problem is APX-hard also provides a different proof of NP-hardness.

The APX-hardness for the two branching problems is established by developing $L$ -reductions from the vertex cover problem in cubic graphs, which is known to be APX-hard [2]. The APX-hardness of the other two problems then follows from Theorem 2.1. Recall that ${\sf APX}$ is a class of problems approximable to within a constant factor in polynomial time. A problem $\Pi$ is said to be APX-hard if every problem in ${\sf APX}$ reduces to $\Pi$ by an approximation-preserving reduction. Another way to prove that a problem $\Pi$ is APX-hard is to show that an APX-complete problem $\Pi^{\prime}$ is $L$ -reducible to $\Pi$ . For the sake of self-containment, we recall the definition of $L$ -reducibility; for further background on APX-hardness, we refer to [4].

Definition 4.1.

Let $\Pi$ and $\Pi^{\prime}$ be two NP-hard optimization problems. Problem $\Pi$ is said to be $L$ -reducible to problem $\Pi^{\prime}$ if there exists a polynomial-time transformation $f$ mapping instances of $\Pi$ to instances of $\Pi^{\prime}$ and constants $a,b\in\mathbb{R}_{+}$ such that for every instance $x$ of $\Pi$ the following conditions hold:

• $opt_{\Pi^{\prime}}(f(x))\leq a\cdot opt_{\Pi}(x)$ ,

• for every feasible solution $y^{\prime}$ of $f(x)$ with objective value $c_{2}$ we can compute in polynomial time a solution $y$ for $x$ with objective value $c_{1}$ such that $|opt_{\Pi}(x)-c_{1}|\leq b\cdot|opt_{\Pi^{\prime}}(f(x))-c_{2}|$ .
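To see why the two conditions preserve approximability, consider the following standard calculation, given here for minimization problems as a sketch. Suppose that $y^{\prime}$ is a solution of $f(x)$ with $c_{2}\leq(1+\varepsilon)\cdot opt_{\Pi^{\prime}}(f(x))$ for some $\varepsilon>0$ . Then the solution $y$ computed from $y^{\prime}$ satisfies

$$c_{1}-opt_{\Pi}(x)\leq b\cdot\bigl(c_{2}-opt_{\Pi^{\prime}}(f(x))\bigr)\leq b\,\varepsilon\cdot opt_{\Pi^{\prime}}(f(x))\leq a\,b\,\varepsilon\cdot opt_{\Pi}(x),$$

that is, $c_{1}\leq(1+ab\varepsilon)\cdot opt_{\Pi}(x)$ . Hence a polynomial-time approximation scheme for $\Pi^{\prime}$ would yield one for $\Pi$ .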

To simplify the description of the hardness reductions of this section, we will use the notion of a column hypergraph of a given binary matrix $M$ . This notion is closely related to the containment digraph of $M$ and will find a further application in Section 4.2. Recall that a set family (or a hypergraph) is a pair $\mathcal{H}=(V,\mathcal{E})$ where $V=V(\mathcal{H})$ is a set and $\mathcal{E}=E(\mathcal{H})$ is a subset of the power set $\mathcal{P}(V)$ . Elements of $V(\mathcal{H})$ are the vertices of $\mathcal{H}$ ; elements of $E(\mathcal{H})$ are its hyperedges. The column hypergraph $\mathcal{H}_{M}$ of a binary matrix $M$ is the hypergraph having the rows of $M$ as vertices and the support sets of the columns of $M$ as hyperedges. Formally, $\mathcal{H}_{M}$ has vertex set $V(\mathcal{H}_{M})=R_{M}$ and hyperedge set $E(\mathcal{H}_{M})=\{\operatorname*{supp}_{M}(c):c\in C_{M}\}$ . Note that the set of hyperedges of the column hypergraph of $M$ equals the vertex set of the containment digraph $D_{M}$ .
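As a small illustration of these definitions, the following Python sketch builds the hyperedges of the column hypergraph and the arcs of the containment digraph from a 0/1 matrix. It assumes that $D_{M}$ has an arc $(u,v)$ whenever $u$ is a proper subset of $v$; the function names and the list-of-rows representation are illustrative choices, not part of the paper.

```python
def column_supports(M):
    """Hyperedges of the column hypergraph: the distinct column supports of M.

    M is a list of rows, each a list of 0/1 entries; rows are identified
    by their index. Duplicate columns collapse into a single hyperedge.
    """
    n_cols = len(M[0]) if M else 0
    return {frozenset(r for r, row in enumerate(M) if row[c] == 1)
            for c in range(n_cols)}


def containment_arcs(supports):
    """Arcs (u, v) of the containment digraph, assuming u -> v iff u is a proper subset of v."""
    return {(u, v) for u in supports for v in supports if u < v}


# Rows 0, 1, 2; the three columns have supports {0,1,2}, {0} and {1}.
M = [[1, 1, 0],
     [1, 0, 1],
     [1, 0, 0]]
supports = column_supports(M)
arcs = containment_arcs(supports)
```

Here the vertex set of the digraph is exactly the hyperedge set of the hypergraph, as noted above, and the two arcs lead from $\{0\}$ and $\{1\}$ to $\{0,1,2\}$.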

We split the proof of Theorem 4.1 into two parts.

Proposition 4.2.

MinimumUncoveringBranching is APX-hard, even for instances of height $2$ . Consequently, MinimumConflict-FreeRowSplit is APX-hard, even for instances of height $2$ .

Proof.

We will prove the proposition using the fact that the vertex cover problem is APX-hard on cubic graphs [2]. Recall that a vertex cover of a graph $G$ is a subset $C\subseteq V(G)$ such that every edge $e=\{v_{1},v_{2}\}\in E(G)$ satisfies $v_{1}\in C$ or $v_{2}\in C$ . For all $v\in V(G)$ , we define $E(v)$ as the set of all edges in $E(G)$ incident with $v$ . In symbols, $E(v)=\{e\in E(G):v\in e\}$ . A graph $G$ is cubic if every vertex of $G$ is incident with exactly three edges, that is, if $|E(v)|=3$ holds for every $v\in V(G)$ .

We will construct an $L$ -reduction from the vertex cover problem in cubic graphs to the MUB problem on instances of height $2$ . Let $G$ be a cubic graph. Let $x$ and $y$ be two new vertices not in $V(G)\cup E(G)$ . Let $R=E(G)\cup\{x,y\}$ and let $\mathcal{H}$ be the hypergraph with vertex set $R$ and edge set

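$$E(\mathcal{H})=\{E(v)\cup\{x\}:v\in V(G)\}\;\cup\;\{E(v)\cup\{y\}:v\in V(G)\}\;\cup\;\{E(v)\cup\{x,y\}:v\in V(G)\}\;\cup\;\{E(G)\cup\{x\}\}\,.$$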

Let $M$ be a binary matrix without duplicated columns such that the column hypergraph of $M$ is isomorphic to $\mathcal{H}$ . Note that $M$ is of height $2$ . See Fig. 5 for an example construction, representing the containment digraph $D_{M}$ of the binary matrix derived from the complete graph $K_{4}$ .

We denote by $\tau(G)$ the vertex cover number of $G$ , that is, the minimum size of a vertex cover in $G$ . The APX-hardness of MinimumUncoveringBranching will be a consequence of the following claim and its proof.

Claim. $\tau(G)=\beta(M)-8|V(G)|$ .

Proof of the claim.

We split the proof of the equality into two parts, proving each of the two inequalities separately.

First, we prove the inequality $\beta(M)\leq\tau(G)+8|V(G)|$ . Let $C$ be a minimum vertex cover of $G$ . Define a branching $B$ of $D_{M}$ as follows:

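$$B=\{(E(v)\cup\{y\},E(v)\cup\{x,y\}):v\in V(G)\}\;\cup\;\{(E(v)\cup\{x\},E(G)\cup\{x\}):v\in C\}\;\cup\;\{(E(v)\cup\{x\},E(v)\cup\{x,y\}):v\in V(G)\setminus C\}\,.$$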

See Fig. 6 for an example.

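It is clear from the construction that $B$ is indeed a branching. Since $C$ is a vertex cover, every $e\in E(G)$ is covered in $E(G)\cup\{x\}$ with respect to $B$ . It is now not difficult to see that the set of uncovered pairs with respect to $B$ equals

$$U(B)=\{(r,E(v)\cup\{x\}):v\in V(G),\,r\in E(v)\cup\{x\}\}\;\cup\;\{(r,E(v)\cup\{y\}):v\in V(G),\,r\in E(v)\cup\{y\}\}\;\cup\;\{(x,E(v)\cup\{x,y\}):v\in C\}\,.$$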

Since we have $|E(v)|=3$ for all $v\in V(G)$ , this implies $\beta(M)\leq|U(B)|=8|V(G)|+|C|=8|V(G)|+\tau(G)$ , as claimed.

Now we prove the inequality $\tau(G)\leq\beta(M)-8|V(G)|$ . Let $B$ be a branching of $D_{M}$ such that $|U(B)|=\beta(M)$ . For every source vertex $u$ in $D_{M}$ and every element $r\in u$ it holds that $r$ is uncovered in $u$ . Since the source vertices are exactly the vertices of the form $E(v)\cup\{x\}$ and $E(v)\cup\{y\}$ , we have exactly $8|V(G)|$ uncovered pairs corresponding to the source vertices. The minimality of $B$ implies that all arcs of the form $(E(v)\cup\{y\},E(v)\cup\{x,y\})$ are in $B$ . Therefore, for every $v\in V(G)$ , element $x$ is the only possibly uncovered element in vertex $E(v)\cup\{x,y\}$ .

We show that we may assume that vertex $E(G)\cup\{x\}$ is not irreducible, that is, that all its elements are covered in $E(G)\cup\{x\}$ . Suppose first that $x$ is not covered in $E(G)\cup\{x\}$ . Then $B$ does not contain any arc of the form $(E(v)\cup\{x\},E(G)\cup\{x\})$ , and therefore, by minimality, contains all arcs of the form $(E(v)\cup\{x\},E(v)\cup\{x,y\})$ . Replacing one of these arcs with the arc $(E(v)\cup\{x\},E(G)\cup\{x\})$ results in a branching $B^{\prime}$ such that $|U(B^{\prime})|\leq|U(B)|$ , hence in an optimal branching covering $x$ . Now, suppose that there exists some $e\in E(G)$ such that $e\not\in\cup N^{-}_{B}(E(G)\cup\{x\})$ . Let $v$ be an endpoint of $e$ in $G$ and consider the vertex $E(v)\cup\{x\}$ . Since $e$ is not covered in $E(G)\cup\{x\}$ , the arc $(E(v)\cup\{x\},E(G)\cup\{x\})$ is not in $B$ . The optimality of $B$ implies that $(E(v)\cup\{x\},E(v)\cup\{x,y\})\in B$ . Now, replace the arc $(E(v)\cup\{x\},E(v)\cup\{x,y\})$ with the arc $(E(v)\cup\{x\},E(G)\cup\{x\})$ . This results in a branching $B^{\prime}$ such that $e\in\cup N^{-}_{B^{\prime}}(E(G)\cup\{x\})$ . Moreover, $|U(B^{\prime})|\leq|U(B)|$ since removing the arc $(E(v)\cup\{x\},E(v)\cup\{x,y\})$ makes $x$ uncovered in $E(v)\cup\{x,y\}$ , but adding the arc $(E(v)\cup\{x\},E(G)\cup\{x\})$ makes element $e$ covered in $E(G)\cup\{x\}$ . Therefore, repeating the above procedure will eventually result in an optimal branching with respect to which $E(G)\cup\{x\}$ is not irreducible, as claimed.

Define $C=\{v\in V(G):(E(v)\cup\{x\},E(G)\cup\{x\})\in B\}$ . The fact that every $e\in E(G)$ is covered in $E(G)\cup\{x\}$ implies that $C$ is a vertex cover of $G$ . Moreover, for every $v\in C$ , element $x$ is the only uncovered element in vertex $E(v)\cup\{x,y\}$ , and for every $v\in V(G)\setminus C$ , all elements in $E(v)\cup\{x,y\}$ are covered. This implies that the total number of uncovered pairs by $B$ equals $8|V(G)|+|C|$ , implying $|C|=\beta(M)-8|V(G)|$ , which proves the claimed inequality $\tau(G)\leq\beta(M)-8|V(G)|$ .

This completes the proof of the claim. ∎

We now complete the proof by showing that the above reduction is an $L$ -reduction. Since $G$ is cubic, every vertex in a vertex cover of $G$ covers exactly $3$ edges, hence $\tau(G)\geq\frac{|E(G)|}{3}=\frac{|V(G)|}{2}$ . This implies that $\beta(M)=\tau(G)+8|V(G)|\leq 17\tau(G)$ , hence the first condition in the definition of $L$ -reducibility is satisfied with $a=17$ . The second condition in the definition of $L$ -reducibility states that for every branching $B$ of $D_{M}$ we can compute in polynomial time a vertex cover $C$ of $G$ such that $|C|-\tau(G)\leq b\cdot(|U(B)|-\beta(M))$ for some $b>0$ . We claim that this can be achieved with $b=1$ . Indeed, the second part of the proof of the above claim shows how one can transform in polynomial time any branching $B$ of $D_{M}$ into a vertex cover $C$ of $G$ such that $|C|\leq|U(B)|-8|V(G)|$ . Therefore, $|C|-\tau(G)\leq|U(B)|-8|V(G)|-\tau(G)=|U(B)|-\beta(M)$ . This shows that the vertex cover problem in cubic graphs is $L$ -reducible to MinimumUncoveringBranching and completes the proof. ∎
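The identity $\tau(G)=\beta(M)-8|V(G)|$ can be checked by brute force on the complete graph $K_{4}$. The sketch below is only a sanity check under stated assumptions: the vertices of $D_{M}$ are taken to be the sets $E(v)\cup\{x\}$, $E(v)\cup\{y\}$, $E(v)\cup\{x,y\}$ and $E(G)\cup\{x\}$ appearing in the proof, a branching is taken to be an arc set in which every vertex has at most one outgoing arc, and an element $r$ counts as covered in $v$ when it belongs to some in-neighbour of $v$ in $B$. All function names are hypothetical.

```python
from itertools import combinations, product


def mub_instance(vertices, edges):
    """Vertices of D_M for the reduction, as read off from the proof (an assumption)."""
    E = [frozenset(e) for e in edges]
    fam = {"top": frozenset(E) | {"x"}}                # E(G) u {x}
    for v in vertices:
        Ev = frozenset(e for e in E if v in e)         # E(v): edges incident with v
        fam[("x", v)] = Ev | {"x"}
        fam[("y", v)] = Ev | {"y"}
        fam[("xy", v)] = Ev | {"x", "y"}
    return fam


def min_uncovered(fam):
    """Minimum number of uncovered pairs over all branchings (exhaustive search)."""
    names = list(fam)
    # Each vertex picks no arc or exactly one arc to a proper superset.
    out_choices = [[None] + [t for t in names if fam[s] < fam[t]] for s in names]
    best = None
    for choice in product(*out_choices):
        covered = {t: set() for t in names}
        for s, t in zip(names, choice):
            if t is not None:
                covered[t] |= fam[s]
        uncovered = sum(len(fam[t] - covered[t]) for t in names)
        best = uncovered if best is None else min(best, uncovered)
    return best


def vertex_cover_number(vertices, edges):
    """Size of a minimum vertex cover, by exhaustive search."""
    for size in range(len(vertices) + 1):
        for C in combinations(vertices, size):
            if all(set(e) & set(C) for e in edges):
                return size


V = list(range(4))
K4 = list(combinations(V, 2))
tau = vertex_cover_number(V, K4)
beta = min_uncovered(mub_instance(V, K4))
```

For $K_{4}$ the search space has only $3^{4}\cdot 2^{4}$ branchings, and the computed values satisfy `beta == tau + 8 * len(V)`.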

Proposition 4.3.

The MIB problem is APX-hard, even for instances of height $2$ . Consequently, the MDCRS problem is APX-hard, even for instances of height $2$ .

Proof.

We construct an $L$ -reduction from the vertex cover problem in cubic graphs to the MIB problem. Let $G$ be a cubic graph. Let $M$ be a binary matrix without duplicated columns such that its column hypergraph is isomorphic to $\mathcal{H}$ , where $\mathcal{H}=(E,E\cup\{E(x):x\in V\})$ . See Fig. 7 for an example construction, representing the containment digraph $D_{M}$ of the binary matrix derived from the complete graph $K_{4}$ .

To prove APX-hardness, we will show that $\zeta(M)=|E(G)|+\tau(G)$ . This will suffice: since every vertex in a vertex cover covers at most three edges, we have $\tau(G)\geq|E(G)|/3$ , which will imply that $\zeta(M)\leq 4\tau(G)$ . Arguments similar to those used at the end of the proof of Proposition 4.2 can then be used to infer that the given reduction is an $L$ -reduction, thus completing the proof of the proposition.

We split the proof of $\zeta(M)=|E(G)|+\tau(G)$ into two parts. First we show that $\zeta(M)\leq|E(G)|+\tau(G)$ . Let $C$ be any minimum vertex cover of $G$ . Define a set of arcs $B$ of $D_{M}$ as $B=\{(e,E(x)):x\in e\wedge x\in V(G)\setminus C\}$ . We first claim that $B$ is a branching of $D_{M}$ . Indeed, if this were not the case, then there would exist an edge $e\in E(G)$ and two distinct vertices $x,y\in V(G)$ such that $(e,E(x)),(e,E(y))\in B$ . This would imply that $e\in E(x)$ and $e\in E(y)$ and consequently $e=xy$ . By definition of $B$ , neither $x$ nor $y$ is in $C$ , contradicting the fact that $C$ is a vertex cover.

Let $x\in V(G)$ . We claim that $E(x)\in I(B)$ implies that $x\in C$ . Suppose for a contradiction that $E(x)\in I(B)$ with $x\not\in C$ . Since $x\in V(G)\setminus C$ , the definition of $B$ implies that $(e,E(x))\in B$ , for every $e\in E(x)$ , in particular, every element of $E(x)$ is $B$ -covered in $E(x)$ . Hence $E(x)\not\in I(B)$ , a contradiction. This shows that $|I(B)\cap\{E(x):x\in V(G)\}|\leq|C|$ . Together with $I(B)=(I(B)\cap E(G))\cup(I(B)\cap\{E(x):x\in V(G)\})$ this implies that $|I(B)|\leq|E(G)|+|C|$ . It follows that $\zeta(M)\leq|I(B)|\leq|E(G)|+|C|=|E(G)|+\tau(G)$ , as claimed.

Next we show that $\zeta(M)\geq|E(G)|+\tau(G)$ by showing that $\tau(G)\leq\zeta(M)-|E(G)|$ . Let $B$ be a branching of $D_{M}$ such that $|I(B)|=\zeta(M)$ . Define a set $C$ with $C=\{x\in V(G):E(x)\in I(B)\}$ . We claim that $C$ is a vertex cover of $G$ . Suppose that this does not hold, that is, that there exists $e\in E(G)$ , such that $e=xy$ and $x,y\in V(G)\setminus C$ . Since $x,y\not\in C$ , it follows that $E(x),E(y)\not\in I(B)$ . By construction, every element of $D_{M}$ of the form $E(z)$ is $B$ -irreducible, unless $B$ contains all the three arcs leading to $E(z)$ . Consequently, $B$ contains all the three arcs leading to $E(x)$ , and similarly for $E(y)$ . In particular, we infer that $(e,E(x)),(e,E(y))\in B$ , contradicting the fact that $B$ is a branching in $D_{M}$ . Since $I(B)$ is the disjoint union of $I(B)\cap E(G)$ and $I(B)\cap\{E(x):x\in V(G)\}$ and $E(G)\subseteq I(B)$ we have $|I(B)|=|I(B)\cap E(G)|+|I(B)\cap\{E(x):x\in V(G)\}|=|E(G)|+|C|$ , implying that $\tau(G)\leq|C|=|I(B)|-|E(G)|=\zeta(M)-|E(G)|$ . ∎
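The equality $\zeta(M)=|E(G)|+\tau(G)$ can likewise be checked exhaustively on $K_{4}$, where $|E(G)|=6$ and $\tau(G)=3$. The sketch assumes that each edge $e$ enters the hypergraph as the singleton hyperedge $\{e\}$, that a branching allows at most one outgoing arc per vertex, and that a vertex is irreducible when at least one of its elements is not contained in any of its in-neighbours in $B$. Function names are hypothetical.

```python
from itertools import combinations, product


def mib_instance(vertices, edges):
    """Vertices of D_M: one singleton {e} per edge and one set E(x) per graph vertex."""
    E = [frozenset(e) for e in edges]
    fam = {("edge", e): frozenset([e]) for e in E}
    for x in vertices:
        fam[("vertex", x)] = frozenset(e for e in E if x in e)   # E(x)
    return fam


def min_irreducible(fam):
    """Minimum number of irreducible vertices over all branchings (exhaustive search)."""
    names = list(fam)
    out_choices = [[None] + [t for t in names if fam[s] < fam[t]] for s in names]
    best = len(names)
    for choice in product(*out_choices):
        covered = {t: set() for t in names}
        for s, t in zip(names, choice):
            if t is not None:
                covered[t] |= fam[s]
        # A vertex is irreducible if some element of it remains uncovered.
        irreducible = sum(1 for t in names if fam[t] - covered[t])
        best = min(best, irreducible)
    return best


V = list(range(4))
K4 = list(combinations(V, 2))
zeta = min_irreducible(mib_instance(V, K4))
```

Each singleton has three options (no arc, or an arc to one of the two endpoints), so only $3^{6}$ branchings are enumerated. The six singletons are always irreducible, and at most one set $E(x)$ can receive all three of its incoming arcs, which matches the value $6+3$.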

Theorem 4.1 (restated).

The MUB and the MIB problems (and consequently the MCRS and the MDCRS problems) are APX-hard, even for instances of height $2$ .

Proof.

The theorem combines the statements of Propositions 4.2 and 4.3. ∎

4.2 $2$ -Approximating $\eta$ and $\zeta$ via Laminar Set Families

The result of Theorem 4.1 raises the question whether the four problems (MCRS, MDCRS, MUB, and MIB) admit constant factor approximations. In this section, we show that this is the case for the MDCRS and the MIB problems. This will be achieved by proving a lower and an upper bound for $\eta(M)$ , which will together imply a simple $2$ -approximation algorithm.

The lower bound is based on a connection between conflict-free matrices and laminar set families and an upper bound on the size of a laminar family in terms of the size of the ground set. Recall that a hypergraph $\mathcal{H}$ is said to be laminar if every two hyperedges $e_{1},e_{2}\in E(\mathcal{H})$ satisfy $e_{1}\cap e_{2}=\emptyset$ , $e_{1}\subseteq e_{2}$ , or $e_{2}\subseteq e_{1}$ . Recall also that the column hypergraph $\mathcal{H}_{M}$ of a binary matrix $M$ is the hypergraph with vertex set $V(\mathcal{H}_{M})=R_{M}$ and hyperedge set $E(\mathcal{H}_{M})=\{\operatorname*{supp}_{M}(c):c\in C_{M}\}$ .

The following observation follows immediately from definitions.

Observation 4.4.

A binary matrix $M$ is conflict-free if and only if its column hypergraph $\mathcal{H}_{M}$ is laminar.
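Observation 4.4 suggests a direct test for conflict-freeness: compute the column supports and check every pair for disjointness or containment. The following sketch does this in quadratic time in the number of distinct columns; the helper names and the list-of-rows matrix representation are illustrative assumptions.

```python
from itertools import combinations


def column_supports(M):
    """Distinct column supports of a 0/1 matrix given as a list of rows."""
    n_cols = len(M[0]) if M else 0
    return {frozenset(r for r, row in enumerate(M) if row[c] == 1)
            for c in range(n_cols)}


def is_laminar(family):
    """True if every two sets in the family are disjoint or comparable by inclusion."""
    return all(a.isdisjoint(b) or a <= b or b <= a
               for a, b in combinations(family, 2))


def is_conflict_free(M):
    """Conflict-freeness via Observation 4.4: the column hypergraph must be laminar."""
    return is_laminar(column_supports(M))


# Supports {0,1}, {0}, {2}: pairwise nested or disjoint.
good = [[1, 1, 0],
        [1, 0, 0],
        [0, 0, 1]]
# Supports {0,1} and {0,2}: they overlap in row 0 without containment.
bad = [[1, 1],
       [1, 0],
       [0, 1]]
```

In the second matrix the rows exhibit the patterns $(1,1)$, $(1,0)$ and $(0,1)$ on the two columns, which is exactly the situation that laminarity excludes.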

The following upper bound on the size of a laminar hypergraph is well known, see, e.g., [28].

Theorem 4.5.

Every laminar hypergraph $\mathcal{H}$ satisfies $|E(\mathcal{H})|\leq 2|V(\mathcal{H})|$ .

Observation 4.4 and Theorem 4.5 imply the following.

Corollary 4.6.

Every conflict-free binary matrix $M$ with $m$ rows satisfies $k\leq 2m$ , where $k$ is the number of distinct columns of $M$ .

The claimed $2$ -approximation will be based on three lemmas.

Lemma 4.7.

If $M^{\prime}$ is a conflict-free row split of $M$ , then the number of distinct columns of $M^{\prime}$ is at least as large as the number of distinct columns of $M$ .

Proof.

It suffices to prove that each two distinct columns of $M$ are still distinct after performing the row split. Let $c_{i},c_{j}$ be two distinct columns of $M$ and $c_{i}^{\prime},c_{j}^{\prime}$ the corresponding columns of $M^{\prime}$ . Then, without loss of generality, there exists a row $r$ of $M$ such that, $M_{r,i}=0$ and $M_{r,j}=1$ . Let $R(r)$ be the set of split rows of $r$ with respect to $M^{\prime}$ . Then for every $r^{\prime}\in R(r)$ it holds $M^{\prime}_{r^{\prime},i}=0$ . Since the rows in $R(r)$ split $r$ , there exists some $r^{\prime\prime}\in R(r)$ with $M^{\prime}_{r^{\prime\prime},j}=1$ . This gives us $M^{\prime}_{r^{\prime\prime},i}=0$ and $M^{\prime}_{r^{\prime\prime},j}=1$ , showing that columns $c_{i}^{\prime}$ and $c_{j}^{\prime}$ are distinct. ∎

The following lemma shows that the value of $\eta$ is invariant under deleting one of a pair of identical columns.

Lemma 4.8.

For every binary matrix $M$ it holds that

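$$\gamma(M)=\gamma(\operatorname*{{Red}}(M)),\quad\eta(M)=\eta(\operatorname*{{Red}}(M)),\quad\beta(M)=\beta(\operatorname*{{Red}}(M)),\quad\text{and}\quad\zeta(M)=\zeta(\operatorname*{{Red}}(M))\,.$$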

Proof.

Since $\operatorname*{{Red}}(M)$ is a submatrix of $M$ , it follows that $\gamma(\operatorname*{{Red}}(M))\leq\gamma(M)$ and, similarly, that $\eta(\operatorname*{{Red}}(M))\leq\eta(M)$ . Conversely, since any conflict-free row split of $\operatorname*{{Red}}(M)$ can be transformed to a conflict-free row split of $M$ with the same number of rows (by duplicating some columns) it follows that $\gamma(M)\leq\gamma(\operatorname*{{Red}}(M))$ and $\eta(M)\leq\eta(\operatorname*{{Red}}(M))$ . We have shown that $\gamma(M)=\gamma(\operatorname*{{Red}}(M))$ and $\eta(M)=\eta(\operatorname*{{Red}}(M))$ . Moreover, since the containment digraphs $D_{M}$ and $D_{\operatorname*{{Red}}(M)}$ are the same, we infer that $\beta(M)=\beta(\operatorname*{{Red}}(M))$ and $\zeta(M)=\zeta(\operatorname*{{Red}}(M))$ . ∎

Corollary 4.6 and Lemmas 4.7 and 4.8 together with a simple row splitting strategy imply the following.

Lemma 4.9.

For every binary matrix $M$ , we have $k/2\leq\eta(M)\leq k$ , where $k$ is the number of distinct columns of $M$ .

Proof.

Let $M\in\{0,1\}^{m\times n}$ . First, we prove that $k/2\leq\eta(M)$ or, equivalently, that $k\leq 2\eta(M)$ . Let $M^{\prime}\in\{0,1\}^{m^{\prime}\times n}$ be a row split of $M$ with exactly $\eta(M)$ distinct rows. Let $k^{\prime}$ be the number of distinct columns of $M^{\prime}$ . Let $N\in\{0,1\}^{\eta(M)\times n}$ be a new matrix obtained from $M^{\prime}$ by taking one row from each set of identical rows. It is not difficult to see that $N$ is conflict-free, with exactly $k^{\prime}$ distinct columns. Further on, by Corollary 4.6 it holds $k^{\prime}\leq 2\eta(M)$ and hence by Lemma 4.7 it holds $k\leq k^{\prime}\leq 2\eta(M)$ , as claimed.

It remains to show $\eta(M)\leq k$ . By Lemma 4.8 it suffices to show that $\eta(\operatorname*{{Red}}(M))\leq k$ . Let $M^{\prime}$ be the row split of $\operatorname*{{Red}}(M)$ obtained by splitting each row $r$ with $t$ ones into $t$ rows, each with exactly one non-zero entry. By construction, $M^{\prime}$ has exactly $k$ columns and therefore at most $k$ distinct rows. It follows that $\eta(\operatorname*{{Red}}(M))\leq k$ , as desired. ∎

Now we have everything ready to state and prove the announced approximation result.

Theorem 4.10.

There is a $2$ -approximation algorithm for the MDCRS (and consequently for the MIB) problem running in time ${\mathcal{O}}(mnk)$ on a given matrix $M\in\{0,1\}^{m\times n}$ where $k$ is the number of distinct columns of $M$ .

Proof.

Let $M$ be a binary matrix with $m$ rows and $n$ columns, exactly $k$ of which are distinct. The proof of Lemma 4.9 is constructive and leads to the following algorithm to compute a row split of $M$ with at most $k$ distinct rows:

1. Compute $\operatorname*{{Red}}(M)$ . (This can be done in time ${\mathcal{O}}(mn)$ using radix sort.)

2. Compute a row split $M^{\prime}$ of $\operatorname*{{Red}}(M)$ obtained by splitting each row $r$ with $t$ ones into $t$ rows, each with exactly one non-zero entry. (This can be done in time ${\mathcal{O}}(mk^{2})$ .)

3. Transform $M^{\prime}$ into a row split of $M$ by an appropriate duplication of some columns. (This can be done in time ${\mathcal{O}}(kmn)$ , since $M^{\prime}$ has at most $km$ rows and the constructed matrix will have exactly $n$ columns.)

Clearly, the algorithm produces a row split of $M$ with at most $k$ distinct rows. Since $\eta(M)\geq k/2$ , it follows that this is a $2$ -approximation. Moreover, using the fact that $k\leq n$ , we infer that the total time complexity of the algorithm is ${\mathcal{O}}(mn+mk^{2}+mnk)={\mathcal{O}}(mnk)$ , as stated. ∎
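The three steps of the algorithm can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: it groups identical columns with a dictionary instead of radix sort, and it fuses steps 2 and 3 by writing each unit row of the split of $\operatorname*{{Red}}(M)$ directly with its duplicated columns. The function name and the return format are assumptions, and rows are assumed to contain at least one non-zero entry.

```python
def two_approx_row_split(M):
    """Row split of M with at most k distinct rows, k = number of distinct columns.

    Returns (split, k), where split is a list of pairs
    (index of the original row, new row of length n).
    """
    n = len(M[0])
    # Step 1: group column indices by content; each group is one column of Red(M).
    classes = {}
    for c in range(n):
        column = tuple(row[c] for row in M)
        classes.setdefault(column, []).append(c)

    split = []
    for r, row in enumerate(M):
        for members in classes.values():
            # Step 2: one unit row per non-zero entry of row r in Red(M).
            if row[members[0]] == 1:
                # Step 3: duplicate the single 1 across all identical columns of M.
                new_row = [0] * n
                for c in members:
                    new_row[c] = 1
                split.append((r, new_row))
    return split, len(classes)


M = [[1, 1, 0, 1],
     [0, 0, 1, 0],
     [1, 1, 1, 1]]
split, k = two_approx_row_split(M)
```

In this example columns 0, 1 and 3 coincide, so $k=2$ and the split uses only the two distinct rows $(1,1,0,1)$ and $(0,0,1,0)$; the bitwise OR of the rows produced for each original row recovers that row.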

Note that Theorems 4.1 and 4.10 imply that the MDCRS and the MIB problems are APX-complete.

4.3 Two Approximation Algorithms for Computing $\gamma$ and $\beta$

While the question of whether the MCRS (and consequently the MUB) problem admits a constant factor approximation algorithm on general instances remains open, we give in this section two partial results in this direction. We show that the two problems admit constant factor approximation algorithms on instances of bounded height or width.

Roughly speaking, the following theorem shows that for instances of bounded height any algorithm for the MCRS problem based on branchings is a constant factor approximation algorithm.

Theorem 4.11.

Let $M$ be a binary matrix and let $B$ be an arbitrary branching of $D_{M}$ . Then, the number of rows in the $B$ -split of $M$ is at most $h(M)\gamma(M)$ .

Proof.

Let $h=h(M)$ and let $B_{opt}$ be a branching of $D_{M}$ with $|U(B_{opt})|=\beta(M)$ . Recall that the number of rows in the $B$ -split of $M$ is $|U(B)|$ . Since by Theorem 2.1 $\beta(M)=\gamma(M)$ , it suffices to prove that $|U(B)|\leq h\beta(M)$ . For $(r,v)\in U(B_{opt})$ , define the set $\Omega(r,v)$ with

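$$\Omega(r,v)=\{(r,v^{\prime}):v^{\prime}\in B_{opt}^{+}(v)\}\,,$$

where $B_{opt}^{+}(v)$ denotes the set of vertices reachable from $v$ along arcs of $B_{opt}$ (including $v$ itself).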

We claim that

$$U(B)\subseteq\bigcup_{(r,v)\in U(B_{opt})}\Omega(r,v)\,.\tag{1}$$

(In fact, since $U(B)=U(\emptyset)=\{(r,v):r\in v\in V\}$ , equality holds in (1), but we will not need it in the proof.) Let $(r,v^{\prime})\in U(B)$ . We will show that there exists some $(r,v)\in U(B_{opt})$ such that $(r,v^{\prime})\in\Omega(r,v)$ . If $(r,v^{\prime})\in U(B_{opt})$ , then $(r,v^{\prime})\in\Omega(r,v^{\prime})$ , since $v^{\prime}\in B_{opt}^{+}(v^{\prime})$ . If $(r,v^{\prime})\not\in U(B_{opt})$ , it follows that $r$ is covered in $v^{\prime}$ with respect to $B_{opt}$ , and therefore, there exists some $v^{\prime\prime}$ such that $(v^{\prime\prime},v^{\prime})\in B_{opt}$ , and $r\in v^{\prime\prime}$ . If $(r,v^{\prime\prime})\in U(B_{opt})$ , then it is clear that $(r,v^{\prime})\in\Omega(r,v^{\prime\prime})$ . If $(r,v^{\prime\prime})\not\in U(B_{opt})$ , then we repeat the described procedure, which has to terminate after finitely many steps. Therefore, there exists some $(r,v)\in U(B_{opt})$ such that $(r,v^{\prime})\in\Omega(r,v)$ , as claimed. This establishes inclusion (1).

Since the height of $D_{M}$ is $h$ , it follows that the height of $B_{opt}$ is at most $h$ . Moreover, since $B_{opt}$ is a branching, it follows that $|\Omega(r,v)|\leq h$ , for every $(r,v)\in U(B_{opt})$ . Combining this with (1), we have

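$$|U(B)|\leq\Big|\bigcup_{(r,v)\in U(B_{opt})}\Omega(r,v)\Big|\leq\sum_{(r,v)\in U(B_{opt})}|\Omega(r,v)|\leq h\cdot|U(B_{opt})|=h\beta(M)\,.$$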

Since $\beta(M)=\gamma(M)$ , it follows that $|U(B)|\leq h\gamma(M)$ . This completes the proof. ∎

Remark 4.12.

Since in Theorem 4.11, there is no restriction on the branching $B$ , an $h(M)$ -approximation to $\gamma(M)$ can be obtained simply by taking $B=\emptyset$ and returning the resulting row split.
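For $B=\emptyset$ every pair $(r,v)$ with $r\in v$ is uncovered, so the resulting row split has one row per such pair. The sketch below computes this count together with $h(M)$ on a small conflict-free matrix with no zero rows, for which $\gamma(M)$ equals the number of rows. Helper names are illustrative, and the containment digraph is assumed to have an arc $(u,v)$ whenever $u$ is a proper subset of $v$.

```python
def column_supports(M):
    """Distinct column supports of a 0/1 matrix given as a list of rows."""
    n_cols = len(M[0]) if M else 0
    return {frozenset(r for r, row in enumerate(M) if row[c] == 1)
            for c in range(n_cols)}


def empty_branching_rows(M):
    """|U(empty set)|: the number of pairs (r, v) with r in v, v a distinct column support."""
    return sum(len(v) for v in column_supports(M))


def height(M):
    """Maximum number of vertices on a directed path of the containment digraph."""
    longest = {}
    # Proper subsets are strictly smaller, so they are processed first.
    for v in sorted(column_supports(M), key=len):
        longest[v] = 1 + max((longest[u] for u in longest if u < v), default=0)
    return max(longest.values(), default=0)


# Supports {0,1,2,3}, {0,1}, {2}: a laminar family, so M is conflict-free.
M = [[1, 1, 0],
     [1, 1, 0],
     [1, 0, 1],
     [1, 0, 0]]
rows_of_empty_split = empty_branching_rows(M)   # 4 + 2 + 1
h = height(M)
```

Here the empty branching yields $7$ rows, while $h(M)\gamma(M)=2\cdot 4=8$, in line with the bound of Theorem 4.11.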

Remark 4.13.

The following example shows that for every $h>1$ and every $\epsilon>0$ , the algorithm for the MCRS problem given by Theorem 3.6 is not an $(h-\epsilon)$ -approximation when restricted to instances of height $h$ .

Example 4.14.

Fix a positive integer $h\geq 2$ . For all $d\geq 2$ , let $M_{d}$ be a binary matrix with $d^{h-1}$ rows (indexed by $1,\ldots,d^{h-1}$ ) and $1+d+d^{2}+\ldots+d^{h-1}$ columns. Entries of $M_{d}$ are defined so that the supports of columns of $M_{d}$ are given by $A_{j}^{i}=\{(j-1)d^{i-1}+k:1\leq k\leq d^{i-1}\}$ for all $i\in\{1,\ldots,h\}$ and all $j\in\{1,\ldots,d^{h-i}\}$ (see Figs. 8 and 9 for an example). The height of $M_{d}$ is $h$ . Matrix $M_{d}$ is conflict-free, therefore $\gamma(M_{d})=\beta(M_{d})=d^{h-1}$ . It can be seen that $\beta_{\ell}(M_{d})=h\cdot d^{h-1}-(h-1)\cdot d^{h-2}$ . Therefore, $\beta_{\ell}(M_{d})/\beta(M_{d})=h-\frac{h-1}{d}$ . It follows that for every $\epsilon>0$ , we have $\beta_{\ell}(M_{d})/\beta(M_{d})>h-\epsilon$ for all large enough $d$ .
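The construction can be reproduced for small parameters. The sketch below generates the supports $A_{j}^{i}$, taking $j$ to range over $1,\ldots,d^{h-i}$ (the range that yields the stated number $1+d+\ldots+d^{h-1}$ of columns), checks laminarity, and evaluates the stated ratio exactly. The value of $\beta_{\ell}(M_{d})$ is taken from the formula in the example and is not recomputed; function names are hypothetical.

```python
from fractions import Fraction
from itertools import combinations


def example_supports(h, d):
    """Column supports A_j^i of M_d for i = 1..h and j = 1..d^(h-i)."""
    return [frozenset(range((j - 1) * d ** (i - 1) + 1, j * d ** (i - 1) + 1))
            for i in range(1, h + 1)
            for j in range(1, d ** (h - i) + 1)]


def is_laminar(family):
    """True if every two sets are disjoint or comparable by inclusion."""
    return all(a.isdisjoint(b) or a <= b or b <= a
               for a, b in combinations(family, 2))


def stated_ratio(h, d):
    """beta_l(M_d) / beta(M_d), using the closed forms stated in the example."""
    beta_linear = h * d ** (h - 1) - (h - 1) * d ** (h - 2)
    beta = d ** (h - 1)
    return Fraction(beta_linear, beta)


supports = example_supports(3, 2)     # {1},{2},{3},{4},{1,2},{3,4},{1,2,3,4}
ratios = [stated_ratio(3, d) for d in (2, 10, 100)]
```

For $h=3$ the ratios are $2$, $14/5$ and $149/50$, approaching $h=3$ from below as $d$ grows, which is the behaviour used in Remark 4.13.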

For instances of bounded width, a constant factor approximation can be obtained by considering any $B$ -split resulting from a linear branching $B$ of $D_{M}$ consisting of ${\it wdt}(M)$ paths. Note that such a branching can be computed in polynomial time using Dilworth’s theorem (Theorem 3.1).

Theorem 4.15.

Any algorithm that, given a binary matrix $M$ , computes a linear branching $B$ of $D_{M}$ consisting of ${\it wdt}(M)$ paths and returns the corresponding $B$ -split of $M$ is a ${\it wdt}(M)$ -approximation algorithm for the MCRS problem.

Proof.

Let $M$ be a binary matrix and let $w={\it wdt}(M)$ . Let $P=\{C_{1},\ldots,C_{w}\}$ be a chain partition of $D_{M}$ (the existence of such a partition is guaranteed by Dilworth’s theorem) and let $B$ be the linear branching of $D_{M}$ corresponding to $P$ . We will prove that $|U(B)|\leq w|R_{M}|$ , where $R_{M}$ denotes the set of rows of $M$ . We claim that the number of elements in $U(B)$ with fixed first coordinate is at most $w$ . For a row $r$ of $M$ , let $N(r)=\{v\in V(D_{M}):(r,v)\in U(B)\}$ . We claim that $|N(r)\cap C_{i}|\leq 1$ , for every $i\in\{1,\ldots,w\}$ . Suppose that $v_{1}\neq v_{2}$ and $v_{1},v_{2}\in N(r)\cap C_{i}$ for some $i\in\{1,\ldots,w\}$ . Since $C_{i}$ is a chain, we may assume without loss of generality that $v_{1}\subset v_{2}$ . Moreover, since $v_{1},v_{2}$ are both in $C_{i}$ it follows that there exists a path in $B$ from $v_{1}$ to $v_{2}$ . Since $v_{1}\in N(r)$ , it follows that $r\in v_{1}$ , and since there exists a path in $B$ from $v_{1}$ to $v_{2}$ , it follows that $r$ is covered in $v_{2}$ with respect to $B$ . This contradicts the assumption that $v_{2}\in N(r)$ . The obtained contradiction shows that $|N(r)\cap C_{i}|\leq 1$ , as claimed. Since $|N(r)\cap C_{i}|\leq 1$ , and $P$ is a chain partition of $D_{M}$ , it follows that $|N(r)|=\sum_{i=1}^{w}|N(r)\cap C_{i}|\leq w$ . It is now easy to see that $|U(B)|=\sum_{r\in R_{M}}|N(r)|\leq w|R_{M}|$ .

Since matrix $M$ is assumed to have no row whose all entries are $0$ , every row split of $M$ contains at least $|R_{M}|$ rows, that is, $|R_{M}|\leq\gamma(M)$ . It follows that $|U(B)|\leq w\gamma(M)$ and since the $B$ -split of $M$ has exactly $|U(B)|$ rows (by Lemma 2.2), the claimed approximation ratio follows. ∎
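One standard way to obtain a minimum chain partition of a poset is via maximum bipartite matching on the comparability relation. The sketch below uses a simple augmenting-path matching on the strict containment order of the column supports and then counts the uncovered pairs of the corresponding linear branching. It is an illustration under these assumptions, not necessarily the procedure the authors have in mind, and all names are hypothetical.

```python
def min_chain_partition(supports):
    """Minimum chain partition of a family of sets under strict inclusion.

    Uses augmenting paths on the bipartite graph with an edge (u, v) whenever
    u is a proper subset of v; the number of chains is n minus the matching size.
    """
    V = list(supports)
    succ, pred = {}, {}   # matched successor / predecessor inside a chain

    def try_augment(u, seen):
        for v in V:
            if u < v and v not in seen:
                seen.add(v)
                if v not in pred or try_augment(pred[v], seen):
                    pred[v] = u
                    succ[u] = v
                    return True
        return False

    for u in V:
        try_augment(u, set())

    chains = []
    for v in V:
        if v not in pred:            # v starts a chain
            chain = [v]
            while chain[-1] in succ:
                chain.append(succ[chain[-1]])
            chains.append(chain)
    return chains


def uncovered_pairs(chains):
    """Pairs (r, v) with r in v but not in the predecessor of v on its chain."""
    pairs = set()
    for chain in chains:
        previous = frozenset()
        for v in chain:
            pairs |= {(r, v) for r in v - previous}
            previous = v
    return pairs


supports = [frozenset({0}), frozenset({1}), frozenset({0, 1}),
            frozenset({0, 1, 2}), frozenset({2})]
chains = min_chain_partition(supports)
U = uncovered_pairs(chains)
```

In this family $\{0\}$, $\{1\}$ and $\{2\}$ form an antichain of size $3$, so three chains are needed, and the number of uncovered pairs stays below $w|R_{M}|=3\cdot 3$, as in the proof.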

5 Conclusion

In this paper, we revisited the minimum conflict-free row split problem and a variant of it. We formulated the two problems as optimization problems on branchings in a derived directed acyclic graph and, building on these formulations, obtained several new algorithmic and complexity insights about the two problems, including APX-hardness results and approximation algorithms. Moreover, we proved a min-max result on digraphs strengthening the classical Dilworth’s theorem and leading to a new heuristic for the MCRS problem. In Figure 10 we summarize the relations between several problems discussed in this paper, along with known complexity results and some applications. The relations are described informally; for instance, we say that problem $P_{1}$ reduces to problem $P_{2}$ if a polynomial-time algorithm for problem $P_{2}$ can be used to develop a polynomial-time algorithm for problem $P_{1}$ .

The main problem left open by our work is the determination of the exact (in)approximability status of the MCRS problem. In particular, does the problem admit a constant factor approximation? Other possibilities for related future research include: i) the study of the approximability properties of the closely related Minimum-Split-Row problem [18] (our preliminary investigations show that the problem, while being APX-hard, admits a $(2h(M)-1)$ -approximation); ii) a parameterized complexity study of the considered problems (along with identification of meaningful parameterizations), and iii) a study of extensions of the model that could be relevant for the biological application, such as the case when the input binary matrix may contain errors or has partially missing data. Finally, it would be interesting to find further applications of the polynomially solvable MinimumPriceChainPartition problem, as well as of the two branching problems, MinimumUncoveringBranching and MinimumIrreducingBranching, introduced in this paper.

Acknowledgments

The authors are grateful to the two anonymous reviewers for helpful remarks. This work is supported in part by the Slovenian Research Agency under research programs I0-0035, P1-0285, and research projects N1-0032, N1-0038, N1-0062, J1-6720, J1-7051, and by the Academy of Finland under Grant No. 274977.

Bibliography

[1] A. V. Aho, M. R. Garey, and J. D. Ullman. The transitive reduction of a directed graph. SIAM J. Comput., 1(2):131–137, 1972.
[2] P. Alimonti and V. Kann. Some APX-completeness results for cubic graphs. Theoret. Comput. Sci., 237(1-2):123–134, 2000.
[3] J. Araujo, N. Nisse, and S. Pérennes. Weighted coloring in trees. SIAM J. Discrete Math., 28(4):2029–2041, 2014.
[4] G. Ausiello, P. Crescenzi, G. Gambosi, V. Kann, A. Marchetti-Spaccamela, and M. Protasi. Complexity and Approximation. Springer-Verlag, Berlin, 1999.
[5] J. Cheriyan, T. Jordán, and R. Ravi. On $2$-coverings and $2$-packings of laminar families. In Algorithms - ESA ’99 (Prague), volume 1643 of Lecture Notes in Comput. Sci., pages 510–520. Springer, Berlin, 1999.
[6] D. de Werra, M. Demange, B. Escoffier, J. Monnot, and V. T. Paschos. Weighted coloring on planar, bipartite and split graphs: complexity and approximation. Discrete Appl. Math., 157(4):819–832, 2009.
[7] R. P. Dilworth. A decomposition theorem for partially ordered sets. Ann. of Math. (2), 51:161–166, 1950.
[8] B. Escoffier, J. Monnot, and V. T. Paschos. Weighted coloring: further complexity and approximability results. Inform. Process. Lett., 97(3):98–103, 2006.
