Transposition diameter on circular binary strings
Misa Nakanishi

TL;DR
This paper investigates the transposition diameter of circular binary strings, establishing bounds and conditions for the minimal number of transpositions needed to transform one string into another, considering their circular nature.
Contribution
It introduces bounds and a characterization for transposition distances specifically on circular binary strings, extending previous work on linear strings.
Findings
Lower bound on transposition distance via partitions
Upper bound based on covering partitions
Necessary and sufficient condition for transposition diameter
Abstract
On the string of finite length, a (genomic) transposition is defined as the operation of exchanging two consecutive substrings. The minimum number of transpositions needed to transform one into the other is the transposition distance, that has been researched in recent years. In this paper, we study transposition distances on circular binary strings. A circular binary string is the string that consists of symbols and and regards its circular shifts as equivalent. The property of transpositions which partition strings is observed. A lower bound on the transposition distance is represented in terms of partitions. An upper bound on the transposition distance follows covering of the set of partitions. The transposition diameter is given with a necessary and sufficient condition.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Transposition diameter on circular binary strings
Misa Nakanishi E-mail address : [email protected]
Abstract
On the string of finite length, a (genomic) transposition is defined as the operation of exchanging two consecutive substrings. The minimum number of transpositions needed to transform one into the other is the transposition distance, that has been researched in recent years. In this paper, we study transposition distances on circular binary strings. A circular binary string is the string that consists of symbols [math] and and regards its circular shifts as equivalent. The property of transpositions which partition strings is observed. A lower bound on the transposition distance is represented in terms of partitions. An upper bound on the transposition distance follows covering of the set of partitions. The transposition diameter is given with a necessary and sufficient condition.
keyword : strings, sorting by transpositions, partitions, NP-hard problems
1 Introduction
Let be a finite set, called an alphabet. For the string of finite length, which is , a transposition is defined as the operation of exchanging two consecutive substrings and .
For two strings and that have the same multiplicity of symbols, the minimum number of transpositions needed to transform into is called the transposition distance.
A transposition is studied as one of operations on genome rearrangement [1]. On the other hand, a transposition is concerned with a combinatorial problem such as sorting a bridge hand [4]. Both of the studies have the object to find the minimum transposition sequence which transforms one into the other.
In the 1990s, transpositions on permutations were investigated. Bafna and Pevzner [2] gave a lower bound and an upper bound on the transposition distance between given pairs of permutations, and the transposition diameter, which is the maximum transposition distance taken over all pairs of permutations. Afterwards, Eriksson et al. [4] improved the upper bound and introduced circular permutations they call toric permutations. Hartman [6] showed circular permutations correspond to linear ones of which the length reduces by 1.
Then, transpositions on strings, which can contain redundant symbols, were researched as an expansion. Christie and Irving [3] determined the transposition diameter and its string pairs on binary strings. Radcliffe, Scott and Wilmer [7] led NP-hardness of transposition distances on binary strings from 3-PARTITION [5]. Therefore, on arbitrary strings, the complexity is also NP-hard.
In this paper, we study transpositions on circular binary strings. A circular binary string is the string that consists of symbols [math] and and regards its circular shifts as equivalent. In Section 2, terminology and definitions are introduced. In Section 3, the property of transpositions is observed, which partition strings. A partition is constituted by parts between ’s where weights as the number of [math]’s belong. Several functions are introduced that sum up the minimal or maximal weights and give a lower bound on transposition distances. A consecutive part is covered by finite number of parts. In this regard, an upper bound on transposition distances follows. Finally, the transposition diameter is represented by a necessary and sufficient condition. The distance between two strings is determined in polynomial time when it is a diameter.
2 Preliminary
For an alphabet , a binary string of length , that is , is denoted by
[TABLE]
For a binary string , a transposition is marked down as
[TABLE]
that is transformed into
[TABLE]
In the case that binary strings and have the same multiplicity of symbols, the transposition distance is defined as the minimum number of transpositions needed to transform into , that is denoted by . In this paper, transposition distances are defined to such string pairs.
Let be consecutive run of [math] of length and let be consecutive run of [math] of length 0 or more. For a string , (or ) represents the string that concatenates times. or simply represents the concatenation of and . As the symbol of a concatenation of strings, we use . For example, . In this paper, it is assumed that the number of ’s is less than or equal to the number of [math]’s on a binary string, otherwise the symbols are interchanged.
A circular binary string sees circular shifts of the string as equivalent. A equivalence relation is defined as
[TABLE]
A circular binary string as an equivalence type is denoted by
[TABLE]
A transposition for a circular binary string is defined in the same manner as a transposition for a linear string. Note that it separates a circular binary string into 3 segments and connects them in different order.
In this paper, all variable numbers are within range of the natural numbers.
3 Transposition distances on circular binary strings
A pair of circular binary strings is represented by
[TABLE]
A transposition is classified into type (T1) or type (T2) in terms of the number of parts it operates on. The index of each part is taken as modulo .
- (T1)
The transposition of this type operates on 3 parts.
For parts on , and , and ,
[TABLE]
is transformed into
[TABLE]
If there exists on such that
[TABLE]
then call it 3-transposition. If and then call and relative. 2. (T2)
The transposition of this type operates on 2 parts.
For parts on , and ,
[TABLE]
is transformed into
[TABLE]
If there exists on such that
[TABLE]
then call it 2-transposition. If then call and relative.
Observation 1. For , , and , there is 3-transposition if and only if , , and .
Proof. The given conditions of 3-transposition follow the definition. Conversely, let for any and , then this does not form 3-transposition. The contradiction follows for other forms in the same way.
For and , we define the following functions.
[TABLE]
[TABLE]
For , set and . Exceptionally for , set .
Theorem 2. For circular binary strings and , if then
[TABLE]
Proof. This is shown by induction on . In the case of , is transformed into by relative 3-transposition or 2-transposition. Set and . Let .
- (i)
In the case of 3-transposition, from Observation 1, for and , it forms that , , and . Let such that arbitrarily. Set . There are 4 cases with respect to whether , and are the elements of or not. In the case of , each for is equal to some for , that leads . In the case of and , , that leads . In the case of and , , that leads . In the case of , , that leads . For other combinations, the inequality holds similarly. 2. (ii)
In the case of 2-transposition, for and , it forms that , and . Let such that arbitrarily. Set . There are 4 cases with respect to whether and are the elements of or not. In the case of , each for is equal to some for , that leads . In the case of and , , that leads . In the case of and , , that leads . In the case of , , that leads .
In the case of , let be the first transformation from by 1 transposition. is transformed into by transpositions and suppose . is transformed into by 1 transposition so that . We obtain . For , it forms from the definition.
is led in the same way.
Lemma 3. For , , and , let for some and for all , . If for some , then .
Proof. It is obvious for . Let . Let . Suppose for all , . Take , and . Let and . We act transpositions on and .
()
Step (i) Let (mod ). If then from the side of act
[TABLE]
Label as . If then from the side of act
[TABLE]
Label as .
Step (ii) Let (mod ). If then from the side of act
[TABLE]
Label as . If then from the side of act
[TABLE]
Label as .
Step (iii) Let (mod ). If then from the side of act
[TABLE]
Label as . If then from the side of act
[TABLE]
Label as .
and was transformed into and respectively. . It forms , , and . From Observation 1 by relative 3-transposition, . That is, .
Suppose for some , . Let and . We act transpositions on and as Case (*). and was transformed into and respectively. . It forms , , and . From Observation 1 by relative 3-transposition, . That is, .
Otherwise, for some , . Let and . We label the next as .
[TABLE]
[TABLE]
[TABLE]
Take minimum such that
[TABLE]
We act transpositions on and .
Step 1. Act transpositions on .
Case (i) Let . For and , act
[TABLE]
For and act
[TABLE]
Case (ii) Let . For and , act (1). For and , act (2). For and , act
[TABLE]
Let . For , act
[TABLE]
Case (iii) Let . For and , act (1). For and , act (2). For and , act (3). Let and . For , act
[TABLE]
For , act
[TABLE]
Step 2. Label as . We act transpositions on and as ().
After the transpositions of Step 1 and 2, and was transformed into and respectively. . It forms and . Suppose then that contradicts the minimality of . So . From Observation 1 by relative 3-transposition, . That is, .
The maximum transposition distance on strings is called transposition diameter. Theorem 6 shows a necessary and sufficient condition of transposition diameter on circular binary strings.
Theorem 4. For circular binary strings and of parts, if and only if .
Proof. It is assumed that . For it is obvious. Let . From Theorem 2, if then , and necessarily, so . Conversely, let , and is shown by Lemma 3.
4 Conclusions
This study took up transposition distances on circular binary strings. In this paper, the distance bounds were shown in terms of partitions and the diameter was represented by a necessary and sufficient condition, that is related to a NP-hard problem. It can expand to resolve transposition distances on arbitrary strings and the rest of decision problems. They could be generalized with combinatorial analysis for perspective.
Acknowledgement
This study is supported by Ota and Oda laboratory at Keio University. I appreciate their general advice of mathematics, especially for Prof. Katsuhiro Ota.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bruce Alberts, Alexander Johnson, Julian Lewis, Martin Raff, Keith Roberts and Peter Walter, Molecular Biology of the Cell, 4th edition , New York, Garland Science, 2002.
- 2[2] V. Bafna and P. A. Pevzner, Sorting by transpositions , SIAM J. Discrete Math., 11(1998), pp. 224-240.
- 3[3] D. A. Christie and R. W. Irving, Sorting strings by reversals and by transpositions , SIAM J. Discrete Math., 14(2001), pp. 193-206.
- 4[4] H. Eriksson, K. Eriksson, J. Karlander, L. Svensson, and J. Wastlund, Sorting a bridge hand , Discrete Math., 241(2001), pp. 289-300.
- 5[5] M. R. Garey and D. S. Johnson, Computers and Intractibility , W. H. Freeman, San Francisco, 1979.
- 6[6] T. Hartman, A simpler 1.5-approximation algorithm for sorting by transpositions , In Combinatorial Pattern Matching (CPM ’03), volume 2676, pages 156-169, 2003.
- 7[7] A. J. Radcliffe, A. D. Scott, and E. L. Wilmer, Reversals and transpositions over finite alphabets , SIAM J. Discrete Math., 19(2005), pp. 224-244.
