A $(1.4 + \epsilon)$-approximation algorithm for the $2$-Max-Duo problem
Yao Xu, Yong Chen, Guohui Lin, Tian Liu, Taibo Luo, Peng Zhang

TL;DR
This paper introduces a new approximation algorithm for the 2-Max-Duo problem, achieving a ratio close to 1.4, improving upon previous results and leveraging a vertex-degree reduction technique.
Contribution
The paper presents a novel vertex-degree reduction method that enables a $(1.4 + ext{epsilon})$-approximation for the 2-Max-Duo problem, surpassing prior approximation ratios.
Findings
Achieved a $(1.4 + ext{epsilon})$-approximation for 2-Max-Duo.
Developed a vertex-degree reduction technique.
Improved approximation ratio from previous 1.6+epsilon.
Abstract
The maximum duo-preservation string mapping (Max-Duo) problem is the complement of the well studied minimum common string partition (MCSP) problem, both of which have applications in many fields including text compression and bioinformatics. -Max-Duo is the restricted version of Max-Duo, where every letter of the alphabet occurs at most times in each of the strings, which is readily reduced into the well known maximum independent set (MIS) problem on a graph of maximum degree . In particular, -Max-Duo can then be approximated arbitrarily close to using the state-of-the-art approximation algorithm for the MIS problem. -Max-Duo was proved APX-hard and very recently a -approximation was claimed, for any . In this paper, we present a vertex-degree reduction technique, based on which, we show that -Max-Duo can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Genome Rearrangement Algorithms · DNA and Biological Computing
\Copyright
Yao Xu, Yong Chen, Guohui Lin, Tian Liu, Taibo Luo, Peng Zhang
A -approximation algorithm for the -Max-Duo problem111This work was partially supported by NSERC Canada and NSF China.222An extended abstract appears in Proceedings of the 28th International Symposium on Algorithms and Computation (ISAAC 2017).
LIPICS 92, Article No. 66, pp. 66:1–66:12.
Yao Xu
Department of Computing Science, University of Alberta. Edmonton, Alberta T6G 2E8, Canada. {xu2,taibo,guohui}@ualberta.ca
Yong Chen
Department of Mathematics, Hangzhou Dianzi University. Hangzhou, Zhejiang 310018, China. [email protected]
Department of Computing Science, University of Alberta. Edmonton, Alberta T6G 2E8, Canada. {xu2,taibo,guohui}@ualberta.ca
Guohui Lin333Correspondence authors.
Department of Computing Science, University of Alberta. Edmonton, Alberta T6G 2E8, Canada. {xu2,taibo,guohui}@ualberta.ca
Tian Liu
Key Laboratory of High Confidence Software Technologies (MOE), Institute of Software, School of Electronic Engineering and Computer Science, Peking University. Beijing 100871, China. [email protected]
Taibo Luo
Business School, Sichuan University. Chengdu, Sichuan 610065, China.
Department of Computing Science, University of Alberta. Edmonton, Alberta T6G 2E8, Canada. {xu2,taibo,guohui}@ualberta.ca
Peng Zhang‡
School of Computer Science and Technology, Shandong University. Jinan, Shandong 250101, China. [email protected]
Abstract.
The maximum duo-preservation string mapping (Max-Duo) problem is the complement of the well studied minimum common string partition (MCSP) problem, both of which have applications in many fields including text compression and bioinformatics. -Max-Duo is the restricted version of Max-Duo, where every letter of the alphabet occurs at most times in each of the strings, which is readily reduced into the well known maximum independent set (MIS) problem on a graph of maximum degree . In particular, -Max-Duo can then be approximated arbitrarily close to using the state-of-the-art approximation algorithm for the MIS problem. -Max-Duo was proved APX-hard and very recently a -approximation was claimed, for any . In this paper, we present a vertex-degree reduction technique, based on which, we show that -Max-Duo can be approximated arbitrarily close to .
Key words and phrases:
Approximation algorithm, duo-preservation string mapping, string partition, independent set
1991 Mathematics Subject Classification:
F.2.2 Pattern matching; G.2.1 Combinatorial algorithms; G.4 Algorithm design and analysis
1. Introduction
The minimum common string partition (MCSP) problem is a well-studied string comparison problem in computer science, with applications in fields such as text compression and bioinformatics. MCSP was first introduced by Goldstein et al. [16], and can be defined as follows: Consider two length- strings and over some alphabet , such that is a permutation of . Let be a partition of , which is a multi-set of substrings whose concatenation in a certain order becomes . The cardinality of is the number of substrings in . The MCSP problem asks to find a minimum cardinality partition of which is also a partition of . -MCSP denotes the restricted version of MCSP where every letter of the alphabet occurs at most times in each of the two strings.
Goldstein et al. [16] have shown that the MCSP problem is NP-hard and APX-hard, even when . There have been several approximation algorithms [10, 11, 12, 16, 18, 19] presented since 2004, among which the current best result is an -approximation for the general MCSP and an -approximation for -MCSP. On the other hand, MCSP is proved to be fixed parameter tractable (FPT), with respect to and/or the cardinality of the optimal partition [13, 17, 7, 8].
An ordered pair of consecutive letters in a string is called a duo of the string [16], which is said to be preserved by a partition if the pair resides inside a substring of the partition. Therefore, a length- substring in the partition preserves duos of the string. With the complementary objective to that of MCSP, the problem of maximizing the number of duos preserved in the common partition is referred to as the maximum duo-preservation string mapping problem by Chen et al. [9], denoted as Max-Duo. Analogously, -Max-Duo is the restricted version of Max-Duo where every letter of the alphabet occurs at most times in each string. In this paper, we focus on -Max-Duo, to design an improved approximation algorithm.
Along with Max-Duo, Chen et al. [9] introduced the constrained maximum induced subgraph (CMIS) problem, in which one is given an -partite graph with each having vertices arranged in an matrix, and the goal is to find vertices in each from different rows and different columns such that the number of edges in the induced subgraph is maximized. -CMIS is the restricted version of CMIS where for all . Given an instance of Max-Duo, we may construct an instance of CMIS by setting to be the number of distinct letters in the string , and to be the number of occurrences of the -th distinct letter; the vertex in the -entry of the matrix “means” mapping the -th occurrence of the -th distinct letter in the string to its -th occurrence in the string ; and there is an edge between a vertex of and a vertex of if the two corresponding mappings together preserve a duo. Therefore, Max-Duo is a special case of CMIS, and furthermore -Max-Duo is a special case of -CMIS. Chen et al. [9] presented a -approximation for -CMIS and a -approximation for -CMIS, based on a linear programming and randomized rounding techniques. These imply that -Max-Duo can also be approximated within a ratio of and -Max-Duo can be approximated within a ratio of .
Alternatively, an instance of the -Max-Duo problem with the two strings and can be viewed as a bipartite graph , constructed as follows: The vertices in and are in order and in order, respectively, and there is an edge between and if they are the same letter. The two edges are called a pair of parallel edges. This way, a common partition of the strings and corresponds one-to-one to a perfect matching in , and the number of duos preserved by the partition is exactly the number of pairs of parallel edges in the matching.
Moreover, from the bipartite graph , we can construct another graph in which every vertex of corresponds to a pair of parallel edges of , and there is an edge between two vertices of if the two corresponding pairs of parallel edges of cannot co-exist in any perfect matching of (called conflicting, which can be determined in constant time; see Section 2 for more details). This way, one easily sees that a set of duos that can be preserved together, by a perfect matching of , corresponds one-to-one to an independent set of [16, 5]. Therefore, the Max-Duo problem can be cast as a special case of the well-known maximum independent set (MIS) problem [15]; furthermore, Boria et al. [5] showed that in such a reduction, an instance of -Max-Duo gives rise to a graph with a maximum degree . It follows that the state-of-the-art \big{(}(\Delta+3)/{5}+\epsilon\big{)}-approximation algorithm for MIS [2], for any , is a {\big{(}(6k-3)}/{5}+\epsilon\big{)}-approximation algorithm for -Max-Duo. Especially, -Max-Duo can now be better approximated within a ratio of . Boria et al. [5] proved that -Max-Duo is APX-hard, similar to -MCSP [16], via a linear reduction from MIS on cubic graphs. For MIS on cubic graphs, it is NP-hard to approximate within [3]. Besides, Boria et al. [5] claimed that -Max-Duo can be approximated within , for any .
Recently, Boria et al. [4] presented a local search -approximation for the general Max-Duo problem. In the meantime, Brubach [6] presented a -approximation using a novel combinatorial triplet matching. Max-Duo has also been proved to be FPT by Beretta et al. [1], with respect to the number of preserved duos in the optimal partition. Most recently, two local search algorithms were independently designed for the general Max-Duo problem at the same time, achieving approximation ratios of [20] and [14] for any , respectively. They both exceed the previously the best {\big{(}(6k-3)}/{5}+\epsilon\big{)}-approximation algorithm for -Max-Duo, when . In this paper, we focus on the -Max-Duo problem; using the above reduction to the MIS problem, we present a vertex-degree reduction scheme and design an improved -approximation, for any .
The rest of the paper is organized as follows. We provide some preliminaries in Section 2, including several important structural properties of the graph constructed from the two given strings. The vertex-degree reduction scheme is also presented as a separate subsection in Section 2. The new approximation algorithm, denoted as Approx, is presented in Section 3, where we show that it is a -approximation for -Max-Duo. We conclude the paper in Section 4.
2. Preliminaries
Consider an instance of the -Max-Duo problem with two length- strings and such that is a permutation of . Recall that we can view the instance as a bipartite graph , where the vertices in and are in order and in order, respectively, and there is an edge between and if they are the same letter, denoted as . See Figure 1(a) for an example, where and . Note that , and so can be constructed in time.
The two edges are called a pair of parallel edges (and they are said to be parallel to each other); when both are included in a perfect matching of , the corresponding duo of is preserved. Two pairs of parallel edges are conflicting if they cannot co-exist in any perfect matching of . This motivates the following reduction from the -Max-Duo problem to the MIS problem: From the bipartite graph , we construct another graph in which a vertex of corresponds to the pair of parallel edges of ; two vertices of are conflicting if and only if the two corresponding pairs of parallel edges are conflicting, and two conflicting vertices of are adjacent in . One can see that a set of duos of that can be preserved all together, a set of pairwise non-conflicting pairs of parallel edges of , and an independent set in , are equivalent to each other. See Figure 1(b) for an example of the graph constructed from the bipartite graph shown in Figure 1(a). We note that and thus can be constructed in time from the instance of the -Max-Duo problem.
In the graph , for any , we use to denote the set of its neighbors, that is, the vertices adjacent to . The two ordered letters in the duo corresponding to the vertex is referred to as the letter content of . For example, in Figure 1(b), the letter content of is “” and the letter content of is “”.
Recall from the construction that there is an edge in the graph if , and there is a vertex in the graph if the parallel edges and are in .
Lemma 2.1**.**
The graph has the following properties.
- (1)
If , , then . 2. (2)
Given any subset of vertices , let , , and . If the subgraph in is connected, then all the vertices of have the same letter content; and consequently for any two vertices , we have both . 3. (3)
For any , we have
[TABLE]
Proof 2.2**.**
By definition, if and only if .
- (1)
If also , that is, , , then leading to . 2. (2)
Note that an edge if and only if the two vertices and are the same letter, and clearly each connected component in is complete bipartite and all the vertices are the same letter. It follows that if the induced subgraph in is connected, then all its vertices are the same letter; furthermore, all the duos starting with these vertices have the same letter content; and therefore for any two vertices , both . 3. (3)
For any vertex , or equivalently the pair of parallel edges in , which are incident at four vertices , a conflicting pair of parallel edges can be one of the six kinds: to share exactly one of the four vertices , to share exactly two vertices and , and to share exactly two vertices and . The sets of these six kinds of conflicting pairs are as described in the lemma, for example, is the set of conflicting pairs each sharing only the vertex with the pair .
From Lemma 2.1 and its proof, we see that for any vertex of there are at most conflicting vertices of each kind (corresponding to a set in Equation 1). We thus have the following corollary.
Corollary 2.3**.**
The maximum degree of the vertices in is .
2.1. When
We examine more properties for the graph when . First, from Corollary 2.3 we have .
Berman and Fujito [2] have presented an approximation algorithm with a performance ratio arbitrarily close to for the MIS problem, on graphs with maximum degree . This immediately implies a -approximation for -Max-Duo. Our goal is to reduce the maximum degree of the graph to achieve a better approximation algorithm. To this purpose, we examine all the degree- and degree- vertices in the graph , and show a scheme to safely remove them from consideration when computing an independent set. This gives rise to a new graph with maximum degree at most , leading to a desired -approximation for -Max-Duo.
We remark that, in our scheme we first remove the degree- vertices from to compute an independent set, and later we add half of these degree- vertices to the computed independent set to become the final solution. Contrary to the claim that there always exists a maximum independent set in containing no degree- vertices [5, Lemma 1], the instance in Figure 2.1 shows that any maximum independent set for the instance must contain some degree- vertices, thus invalidating the -approximation for -Max-Duo proposed in [5].
In more details, the instance of -Max-Duo, illustrated in Figure 2.1, consists of two length- strings and . The bipartite graph is shown in Figure 1(a) and the instance graph of the MIS problem is shown in Figure 1(b). In the graph , we have six degree- vertices: and . One can check that , , , is an independent set in , of size . On the other hand, if none of these degree- vertices is included in an independent set, then because the four vertices form a square implying that at most two of them can be included in the independent set, the independent set would be of size at most , and thus can never be a maximum independent set in .
Consider a duo of the string and for ease of presentation assume its letter content is “”. If no duo of the string has the same letter content “”, then this duo of the string can never be preserved; in fact this duo does not even become (a part of) a vertex of of the graph . If there is exactly one duo of the string having the same letter content “”, then these two duos make up a vertex , and from Lemma 2.1 we know that the degree of the vertex is at most , since there is no such vertex with sharing exactly the two letters and with . Therefore, if the degree of the vertex is six, then there must be two duos of the string and two duos of the string having the same letter content “”. Assume the other duo of the string and the other duo of the string having the same letter content “” are and , respectively. Then all four vertices exist in . We call the subgraph of induced on these four vertices a square, and denote it as , where and due to their conflicting relationships. One clearly sees that every square has a unique letter content, which is the letter content of its four member vertices.
In Figure 1(b), there are three squares , and , with their letter contents “”, “” and “”, respectively. The above argument says that every degree- vertex of must belong to a square, but the converse is not necessarily true, for example, all vertices of the square have degree . We next characterize several properties of a square.
The following lemma is a direct consequence of how the graph is constructed and the fact that .
Lemma 2.4**.**
In the graph constructed from an instance of -Max-Duo,
- (1)
for each index , there are at most two distinct and such that ; 2. (2)
if where , and (or symmetrically, ), then either or .
Lemma 2.5**.**
For any square in the graph , , , and . (Together, these imply that every vertex of is adjacent to either none or exactly two of the four member vertices of a square.)
Proof 2.6**.**
Consider the two vertices and , which have common neighbors and in the square.
Note that and share both the letters and . If there is a vertex adjacent to by sharing but not , then this vertex is with , and thus it has to be (by Lemma 2.4). We consider two subcases: If , then due to . Thus, this vertex actually shares and with ; also, it shares and with ; and therefore it is adjacent to too, but not adjacent to or . If , then this vertex shares only with the vertex ; also it shares only with ; and therefore it is adjacent to too, but not adjacent to or .
The other three symmetric cases can be discussed exactly the same and the lemma is proved.
Corollary 2.7**.**
In the graph , the degree- vertices can be partitioned into pairs, where each pair of degree- vertices belong to a square in and they are adjacent to the same six other vertices, two inside the square and four outside of the square.
Proof 2.8**.**
We have seen that every degree- vertex in the graph must be in a square. The above Lemma 2.5 states that the four vertices of a square can be partitioned into two pairs, and , and the two vertices inside each pair are non-adjacent to each other and have the same neighbors. In particular, if the vertex in the square has degree , then Lemma 2.1 states that it is adjacent to the six vertices (see an illustration in Figure 2.2).
Corollary 2.9**.**
If there is no square in the graph , then every degree- vertex is adjacent to a degree- vertex.
Proof 2.10**.**
Assume the vertex has degree . Due to the non-existence of any square in the graph and Lemma 2.1, either there is no vertex sharing exactly the two letters and with , or there is no vertex sharing exactly the two letters and with . We assume without loss of generality that there is no vertex sharing exactly the two letters and with , and furthermore assume , , is the vertex sharing exactly the two letters and with .
It follows that , for some and . Due to , this implies that and . Therefore, there is no vertex of sharing exactly the letter (, respectively) with the vertex , neither a vertex of sharing exactly the two letters and with the vertex . That is, the vertex is adjacent to only in the graph .
We say the two vertices and of are consecutive; and we say the two squares and in are consecutive. Clearly, two consecutive squares contain four pairs of consecutive vertices. The following Lemma 2.11 summarizes the fact that when two consecutive vertices belong to two different squares, then these two squares are also consecutive (and thus contain the other three pairs of consecutive vertices).
Lemma 2.11**.**
In the graph , if there are two consecutive vertices and belonging to two different squares and respectively, then , i.e., these two squares are consecutive.
Proof 2.12**.**
This is a direct result of the fact that no two distinct squares have any member vertex in common.
A series of consecutive squares in the graph , where , is maximal if none of the square and the square exists in the graph . Note that the non-existence of the square in does not rule out the existence of some of the four vertices in ; in fact by Lemma 2.1 there can be as many as two of these four vertices existing in (however, more than two would imply the existence of the square). Similarly, there can be as many as two of the four vertices existing in . In the sequel, a maximal series of consecutive squares starting with is denoted as , where . See for an example in Figure 3(b) where there is a maximal series of consecutive squares , where the instance of the -Max-Duo is expanded slightly from the instance shown in Figure 2.1.
Lemma 2.13**.**
Suppose , where , exists in the graph . Then,
- (1)
the two substrings and of the string and the two substrings and of the string are identical and do not overlap; 2. (2)
if a maximum independent set of contains less than vertices from , then it must contain either the four vertices or the four vertices .
Proof 2.14**.**
By the definition of the square , we have and ; we thus conclude that the two substrings and are identical. In Figure 3(b), for the two substrings are “”. If these two substrings overlapped, then there would be three occurrences of at least one letter, contradicting the fact that . This proves the first item.
Note that the square does not exist in the graph , and thus at most two of its four vertices (which are and ) exist in . We claim that if no vertex of the square is in , then there are exactly two of the four vertices and exist in and they both are in . Suppose otherwise there is at most one of the four vertices in , say ; we may increase the size of by removing while adding either the two vertices and or the two vertices and (depending on which vertices of the square are in ), a contradiction.
Assume next that a vertex of the square is in , say ; then due to maximality of and Lemma 2.5 both and are in . We claim and prove similarly as in the last paragraph that if no vertex of the square is in , then there are exactly two of the four vertices and exist in and they both are in . If there is a vertex of the square in , then it must be one of and ; and due to maximality and Lemma 2.5 both and are in . And so on; repeatedly applying this argument, we claim and prove similarly that if no vertex of the square is in , then there are exactly two of the four vertices and exist in and they both are in . If there is a vertex of the square in , then it must be one of and ; and due to maximality and Lemma 2.5 both and are in .
To summarize, we proved in the above two paragraphs that if contains less than vertices from , then there are exactly two of the four vertices and exist in and they both are in ; and these two vertices are either and or and . Symmetrically, there are exactly two of the four vertices and exist in and they both are in ; and these two vertices are either and or and . Clearly from the above, when the combination is and versus and , we may increase the size of to contain exactly vertices from without affecting any vertex outside of , a contradiction. Therefore, the only possible combinations are and versus and , and and versus and . This proves the second item of the lemma.
Suppose , where , exists in the graph . Let denote the string obtained from by removing the two substrings and and concatenating the remainder together, and denote the string obtained from by removing the two substrings and and concatenating the remainder. Let the graph denote the instance graph of the MIS problem constructed from the two strings and . See for an example in Figure 3(d), where there is a maximal series of consecutive squares in the graph .
Corollary 2.15**.**
Suppose , where , exists in the graph . Then, the union of a maximum independent set in the graph and certain vertices from becomes a maximum independent set in the graph , where these certain vertices are and if or is in the maximum independent set in , or they are , and if or is in the maximum independent set in .
Proof 2.16**.**
Consider the construction of the graph from the two strings and . Equivalently, starting with the graph , if we contract the vertices into the vertex if it exists or otherwise into a void vertex, contract the vertices into the vertex if it exists or otherwise into a void vertex, contract the vertices into the vertex if it exists or otherwise into a void vertex, and contract the vertices into the vertex if it exists or otherwise into a void vertex, then we obtain a graph that is exactly . In the graph , the vertices and , if both exist in , become adjacent to each other; so are the vertices and , if both exist in . It follows that the maximum independent set in the graph does not contain both vertices and , or both vertices and . Therefore, starting with the maximum independent set in the graph , we can add exactly vertices from to form an independent set in , of which the maximality can be proved by a simple contradiction.
We remark that in the extreme case where none of the vertices of and none of the vertices of are in the maximum independent set in , we may add either of the two sets of vertices from to form a maximum independent set in .
Iteratively applying the above string shrinkage process, or equivalently the vertex contracting process, associated with the elimination of a maximal series of consecutive squares. In iterations, we achieve the final graph containing no squares, which we denote as .
3. An approximation algorithm for -Max-Duo
A high-level description of the approximation algorithm, denoted as Approx, for the -Max-Duo problem is depicted in Figure 3.1.
In more details, given an instance of the -Max-Duo problem with two length- strings and , the first step of our algorithm is to construct the graph , which is done in time. In the second step (Lines 2–7 in Figure 3.1), it iteratively applies the vertex contracting process presented in Section 2 at the existence of a maximal series of consecutive squares, and at the end it achieves the final graph which does not contain any square. This second step can be done in time too since each iteration of vertex contracting process is done in time and there are iterations. In the third step (Lines 8–10 in Figure 3.1), let denote the set of singletons (degree-[math] vertices) and leaves (degree- vertices) in the graph ; our algorithm removes all the vertices of and their neighbors from the graph to obtain the remainder graph . This step can be done in time too due to , and the resultant graph has maximum degree by Corollaries 2.7 and 2.9. (See for an example illustrated in Figure 2(a).) In the fourth step (Lines 11–12 in Figure 3.1), our algorithm calls the state-of-the-art approximation algorithm for the MIS problem [2] on the graph to obtain an independent set in ; and returns as an independent set in the graph . The running time of this step is dominated by the running time of the state-of-the-art approximation algorithm for the MIS problem, which is a high polynomial in and . In the last step (Line 13 in Figure 3.1), using the independent set in , our algorithm adds vertices from each maximal series of consecutive squares according to Corollary 2.15, to produce an independent set in the graph . (For an illustrated example see Figure 2(b).) The last step can be done in time.
The state-of-the-art approximation algorithm for the MIS problem on a graph with maximum degree has a performance ratio of , for any [2].
Lemma 3.1**.**
In the graph , let denote the cardinality of a maximum independent set in , and let denote the cardinality of the independent set returned by the algorithm Approx. Then, , for any .
Proof 3.2**.**
Let denote the set of singletons (degree-[math] vertices) and leaves (degree- vertices) in the graph ; our algorithm Approx removes all the vertices of and their neighbors from the graph to obtain the remainder graph . The graph has maximum degree by Corollaries 2.7 and 2.9. Let denote the cardinality of a maximum independent set in , and let denote the cardinality of the independent set returned by the state-of-the-art approximation algorithm for the MIS problem. We have and , for any . Therefore,
[TABLE]
This proves the lemma.
Theorem 3.3**.**
The -Max-Duo problem can be approximated within a ratio arbitrarily close to , by a linear reduction to the MIS problem.
Proof 3.4**.**
We prove by induction. At the presence of maximal series of consecutive squares, we perform the vertex contracting process iteratively. In each iteration to handle one maximal series of consecutive squares, let and denote the graph before and after the contracting step, respectively. Let denote the cardinality of a maximum independent set in , and let denote the cardinality of the independent set returned by the algorithm Approx. Given any , from Lemma 3.1, we may assume that .
Let OPT denote the cardinality of a maximum independent set in , and let SOL denote the cardinality of the independent set returned by the algorithm Approx, which adds vertices from the maximal series of consecutive squares to the independent set in , according to Corollary 2.15, to produce an independent set in the graph . Lemma 2.13 states that . Therefore,
[TABLE]
This proves that for the original graph we also have accordingly. That is, the worst-case performance ratio of our algorithm Approx is , for any . The time complexity of the algorithm Approx has been determined to be polynomial at the beginning of the section, and it is dominated by the time complexity of the state-of-the-art approximation algorithm for the MIS problem. The theorem is thus proved.
4. Conclusion
In this paper, we examined the Max-Duo problem, the complement of the well studied minimum common string partition problem. Based on an existing linear reduction to the maximum independent set (MIS) problem [16, 5], we presented a vertex-degree reduction technique for the -Max-Duo to reduce the maximum degree of the constructed instance graph to . Along the way, we uncovered many interesting structural properties of the constructed instance graph. This degree reduction enables us to adopt the state-of-the-art approximation algorithm for the MIS problem on low degree graphs [2] to achieve a -approximation for -Max-Duo, for any .
It is worth mentioning that our vertex-degree reduction technique can be applied for -Max-Duo with . In fact, we had worked out the details for , to reduce the maximum degree of the constructed instance graph from to , leading to a -approximation for -Max-Duo, for any . Nevertheless, the -approximation is superseded by the -approximation for the general Max-Duo [14].
It would be worthwhile to investigate whether the maximum degree can be further reduced to , by examining the structural properties associated with the degree- vertices. On the other hand, it is also interesting to examine whether a better-than- approximation algorithm can be designed directly for the MIS problem on those degree- graphs obtained at the end of the vertex contracting process.
Acknowledgements.
All authors are supported by NSERC Canada. Additionally, Chen is supported by the NSFC Grants No. 11401149, 11571252 and 11571087, and the China Scholarship Council Grant No. 201508330054; Liu is supported by the NSFC Grant Nos. 61370052 and 61370156. Luo is supported by the NSFC Grant No. 71371129 and the PSF China Grant No. 2016M592680; Lin and Zhang are supported by the NSFC Grant No. 61672323.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] S. Beretta, M. Castelli, and R. Dondi. Parameterized tractability of the maximum-duo preservation string mapping problem. Theoretical Computer Science , 646:16–25, 2016.
- 2[2] P. Berman and T. Fujito. On approximation properties of the independent set problem for low degree graphs. Theory of Computing Systems , 32:115–132, 1999.
- 3[3] P. Berman and M. Karpinski. On some tighter inapproximability results. In Proceedings of the of 26th International Colloquium on Automata, Languages and Programming (ICALP’99) , pages 200–209, 1999.
- 4[4] N. Boria, G. Cabodi, P. Camurati, M. Palena, P. Pasini, and S. Quer. A 7/2-approximation algorithm for the maximum duo-preservation string mapping problem. In Proceedings of the 27th Annual Symposium on Combinatorial Pattern Matching (CPM 2016) , volume 54 of LIP Ics , pages 11:1–11:8, 2016.
- 5[5] N. Boria, A. Kurpisz, S. Leppänen, and M. Mastrolilli. Improved approximation for the maximum duo-preservation string mapping problem. In Proceedings of the 14th International Workshop on Algorithms in Bioinformatics (WABI 2014) , volume 8701 of LNBI , pages 14–25, 2014.
- 6[6] B. Brubach. Further improvement in approximating the maximum duo-preservation string mapping problem. In Proceedings of the 16th International Workshop on Algorithms in Bioinformatics (WABI 2016) , volume 9838 of LNBI , pages 52–64, 2016.
- 7[7] L. Bulteau, G. Fertin, C. Komusiewicz, and I. Rusu. A fixed-parameter algorithm for minimum common string partition with few duplications. In Proceedings of the 13th International Workshop on Algorithms in Bioinformatics (WABI 2013) , volume 8126 of LNBI , pages 244–258, 2013.
- 8[8] L. Bulteau and C. Komusiewicz. Minimum common string partition parameterized by partition size is fixed-parameter tractable. In Proceedings of the Twenty-Fifth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’14) , pages 102–121, 2014.
