This paper investigates the Ulam metric for multipermutation codes, analyzing sphere sizes and bounds to enhance understanding of code capacity for applications like flash memory.
Contribution
It extends the Ulam metric analysis from permutations to multipermutations, providing new bounds and insights for code size optimization.
Findings
01
Sphere sizes for multipermutations under the Ulam metric are characterized.
02
Bounds on maximum code size are derived for multipermutation codes.
03
Differences between permutation and multipermutation Ulam metrics are identified.
Abstract
Permutation codes, in the form of rank modulation, have shown promise for applications such as flash memory. One of the metrics recently suggested as appropriate for rank modulation is the Ulam metric, which measures the minimum translocation distance between permutations. Multipermutation codes have also been proposed as a generalization of permutation codes that would improve code size (and consequently the code rate). In this paper we analyze the Ulam metric in the context of multipermutations, noting some similarities and differences between the Ulam metric in the context of permutations. We also consider sphere sizes for multipermutations under the Ulam metric and resulting bounds on code size.
i∗:=i−min{k∈Z>0:(mσr(i)=mσr(i−k−1)) or (i−k=1)}.
i∗:=i−min{k∈Z>0:(mσr(i)=mσr(i−k−1)) or (i−k=1)}.
E(m):={
E(m):={
(m⋅ϕ(i,j)=m⋅ϕ(j,k))}.
(m⋅ϕ(i,k+3))[k,k+3]=
(m⋅ϕ(i,k+3))[k,k+3]=
=
=
E∗(m):={(i,j)∈[n]×[n]
E∗(m):={(i,j)∈[n]×[n]
f(i):=⎩⎨⎧i+min{p∈Z≥0:(m(i)=m(i+p+1))∨(i+p=n)}\hfill(if m(i)=m(i−1) or i=1)n\hfill(otherwise)
f(i):=⎩⎨⎧i+min{p∈Z≥0:(m(i)=m(i+p+1))∨(i+p=n)}\hfill(if m(i)=m(i−1) or i=1)n\hfill(otherwise)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCoding theory and cryptography · graph theory and CDMA systems · Cellular Automata and Applications
Permutation codes, in the form of rank modulation,
have shown promise for applications such as flash memory.
One of the metrics recently suggested as appropriate for rank
modulation is the Ulam metric, which measures the minimum
translocation distance between permutations. Multipermutation
codes have also been proposed as a generalization of
permutation codes that would improve code size (and
consequently the code rate). In this paper we analyze the
Ulam metric in the context of multipermutations, noting some
similarities and differences between the Ulam metric in the
context of permutations. We also consider sphere sizes for
multipermutations under the Ulam metric and resulting bounds
on code size.
I Introduction
Permutation (and multipermutation) codes
were invented as early as
the 1960’s, when Slepian proposed constructing a code by permuting the
order of the numbers of an initial sequence [13]. More recently,
Jiang et al. proposed permutation codes utilizing the Kendall-τ metric
for use in flash memory via the rank modulation scheme [8].
Since then, permutation codes
and their generalization to multipermutation codes
have been a hot topic in the research community with various
related schemes being suggested
[1, 2, 3, 4, 9, 11].
One scheme of particular interest was
the proposal of Farnoud et al. to utilize the Ulam metric in place of the Kendall-τ metric [3]
and subsequent study
expounded upon code size bounds [7].
The Ulam metric measures the minimum number of translocations
needed to transform one permutation into another, whereas
the Kendall-τ metric measures the minimum
adjacent transpositions needed to transform one permutation into another.
Errors in flash memory devices occur when cell charges leak
or when rewriting operations cause overshoot errors resulting
in inaccurate charge levels.
While the Kendall-τ metric is suitable for correcting
relatively small errors of this nature, the
Ulam metric would be more robust to large
charge leakages or overshoot errors within a cell.
However, there is a trade-off in code size when
rank modulation is used in conjunction with
the Ulam metric instead of the Kendall-τ metric.
The Ulam distance between permutations is
always less than or equal to the Kendall-τ
distance between permutations, which implies that
the maximum code size for a permutation code utilizing
the Ulam metric is less than or equal to the maximum
code size of a permutation code utilizing the
Kendall-τ metric [3].
One possible compensation for this trade-off is the
generalization from permutation codes to multipermutation
codes, which improves the maximum possible code size [4].
In flash memory devices, permutations or multipermutations
may be modeled physically by relative rankings of cell charges.
The number of possible messages is limited by the number
of distinguishable relative rankings. However,
it was shown in [4] that multipermutations
may significantly increase the total possible messages
compared to ordinary permutations.
For example, if only k different charge levels are possible,
then permutations of length k can be stored.
Hence, in r blocks of length k, one may store (k!)r
potential messages. On the other hand, if one uses r-regular
multipermutations in the same set of blocks, then
(kr)!/(r!)k potential messages are possible.
Bounds on permutation codes in the Ulam metric were studied in
[3] and [7].
In [10], the nonexistence of nontrivial perfect permutation
codes in the Ulam metric was proven by examining the size of
Ulam spheres, spheres comprised of all permutations within
a given Ulam distance of a particular permutation.
However, no similar study of Multipermutation Ulam spheres
exists, and currently known bounds on code size
do not always consider the problem of differing sphere sizes.
The current paper examines Ulam sphere sizes in the context of
multipermutations and provides new bounds on code size.
The paper is organized as follows:
First, Section II defines notation and basic concepts
used in the paper. Next, Section III compares
properties of the Ulam metric as defined for permutations
and multipermutations, and then provides a simplification of
the r-regular Ulam metric for multipermutations
(Lemmas 1 and 2).
Section IV considers an application of
Young Tableaux and the RSK-correspondence
to calculate r-regular Ulam sphere sizes
(Lemma 5 and Prop. 6).
Section V then discusses duplicate translocation
sets and a method of calculating the size of
spheres of radius t=1 for any center (Thm. 13).
Section VI follows, demonstrating minimal and
maximal sphere sizes
(Lemmas 14 and 16)
and providing
both lower and upper bounds on code size
(Lemmas 15,
17, and 18).
Finally Section VII gives some concluding remarks.
II Preliminaries and Notation
In this section we introduce notation and definitions
used in this paper. Unless otherwise stated, definitions are
based on conventions established in
[3], [4], and [10].
Throughout this paper n and r are assumed to be
positive integers, r dividing n.
The notation [n] denotes the set {1,2,…,n} and
Sn denotes the set of permutations
on [n], i.e. the symmetric group of size n!.
For σ∈Sn,
we write σ=[σ(1),σ(2),…,σ(n)],
where for all i∈[n],σ(i) is the image of i
under σ.
Throughout this paper we assume
σ, π∈Sn.
With a slight abuse of notation, we may also use
σ to mean the sequence
(σ(1),σ(2),…,σ(n))∈Zn
associated with σ∈Sn.
Multiplication of permutations is defined by composition so that
for all i∈[n], we have
(στ)(i)=σ(τ(i)).
The identity permutation, [1,2,…,n]∈Sn
is denoted by e.
An r-regular multiset is a multiset such that
each of its elements is repeated r times.
A multipermutation is an ordered tuple of the
elements of a multiset, and in the instance of an r-regular multiset, is called
an r-regular multipermutation.
Following the work of [4], this study focuses
on r-regular multipermutations, although many results
are extendible to general multipermutations.
For each σ∈Sn we
define a corresponding r-regular
multipermutation mσr
as follows:
for all i∈[n] and j∈[n/r],
[TABLE]
and mσr:=(mσr(1),mσr(2),…,mσr(n))∈Zn.
For example, if n=6,r=2, and σ=[1,5,2,4,3,6], then
mσr=(1,3,1,2,2,3).
This definition differs slightly from the
correspondence defined in [4], which was
defined in terms of the inverse permutation. This is so that
properties of the Ulam metric for permutations
will carry over to the Ulam metric for multipermutations
(Lemmas 1 and 2 of
Section III).
With the correspondence above, we may define
an equivalence relation between elements of Sn.
We say that σ≡rπ if and only if
mσr=mπr.
The equivalence class Rr(σ) of σ∈Sn
is defined as Rr(σ):={π∈Sn:π≡rσ}.
For a subset S⊆Sn, the notation
Mr(S):={mσr:σ∈S},
i.e. the set of r-regular multipermutations corresponding
to elements of S.
We say that σ≡rπ if and only if
mσr=mπr.
The equivalence class Rr(σ) of σ∈Sn
is defined by Rr(σ):={π∈Sn:π≡rσ}.
For a subset S⊆Sn, the notation
Mr(S):={mσr:σ∈S},
i.e. the set of r-regular multipermutations corresponding
to elements of S.
The following definition is our own.
For any m∈Zn, and
σ∈Sn, we define the product
(a right group action)
m⋅σ by composition,
similarly to the definition of multiplication
of permutations.
More precisely, for all i∈[n], let
(m⋅σ)(i):=m(σ(i)).
It is easily confirmed that
m⋅e=m
for all m∈Zn.
It is also easily confirmed that for all
σ,π∈Sn, we have
m⋅(σπ)=(m⋅σ)⋅π.
With this definition, notice that
mσr⋅π=mσπr.
It is possible for different permutations to
correspond to the same multipermutation, but
for τ∈Sn, it is clear that
mσr=mπr
implies
mσr⋅τ=mπr⋅τ.
We finish this section by defining what
a multipermutation code is.
A subset C⊆Sn is called an
r-regular multipermutation code if and only if
for all σ∈C, we also have Rr(σ)⊆C.
Such a code is denoted by MPC(n,r), and
we say that C is an MPC(n,r).
If C is an MPC(n,r) then
whenever a permutation is a member of
C its entire equivalence
class is also contained within C.
Thus if C is an
MPC(n,r) it can be represented by
the set of r-regular multipermutations associated
with elements of C, i.e. the set
Mr(C).
Moreover, we define the cardinality ∣C∣r
of an MPC(n,r)C as
∣C∣r:=∣Mr(C)∣
(this notation and definition differs slightly from
[4]).
III Multipermutation Ulam Metric
In this section we discuss some similarities and differences
between the Ulam metric for permutations and the Ulam
metric for multipermutations.
We begin by defining the Ulam metric for permutations.
For any two sequences u,v∈Zn,ℓ(u,v) denotes the length of the longest
common subsequence of u and v.
In other words,
ℓ(u,v) is the largest integer k∈Z>0
such that there exists a sequence (a1,a2,…,ak)
where for all p∈[k], we have ap=u(ip)=v(jk)
with
1≤i1<i2<⋯<ik≤n and 1≤j1<j2<⋯<jk≤n.
The Ulam distanced∘(σ,π) between permutations
σ,π∈Sn is defined as
d∘(σ,π):=n−ℓ(σ,π).
It is also known that the Ulam distance d∘(σ,π)
between σ,π∈Sn is equivalent to the
minimum number of translocations needed to transform
σ into π [3]. Here, for distinct i,j∈[n],
the translocation ϕ(i,j)∈Sn
is defined as follows:
[TABLE]
The notation ϕ(i,i) is understood to mean the identity
permutation, e.
When it is not necessary to specify any index, a translocation
may be written simply as ϕ.
Intuitively, when multiplied on the right of a permutation
σ∈Sn,
the translocation ϕ(i,j)∈Sn deletes
σ(i) from the
ith position of σ and then inserts it in the new
jth position
(shifting positions between i and j in the process).
The r-regular Ulam distanced∘r(σ,π)
between permutations σ,π∈Sn
is defined as the minimum Ulam distance among all
members of Rr(σ) and Rr(π). That is,
d∘r(σ,π):=σ′∈Rr(σ),π′∈Rr(π)mind∘(σ′,π′).
Notice that the r-regular Ulam distance is
defined over equivalence classes.
Although technically a distance between equivalence
classes, it is convenient to think of the r-regular
Ulam distance instead as a distance between
multipermutations.
Viewed this way, the property of the Ulam metric for
permutations, that it can be defined in terms of
longest common subsequences or equivalently
in terms of translocations, carries over to the
r-regular Ulam distance.
The next lemma states that the r-regular Ulam
distance between permutations σ and π
is equal to n minus the length of the longest
common subsequence of their corresponding
r-regular multipermutations.
Lemma 1**.**
d∘r(σ,π)=n−ℓ(mσr,mπr).**
Proof.
We will first show that
d∘r(σ,π)≥n−ℓ(mσr,mπr).
By definition of d∘r(σ,π), there
exist σ′∈Rr(σ) and
π′∈Rr(π) such that d∘r(σ,π)=d∘(σ′,π′)=n−ℓ(σ′,π′).
Hence if
for all σ′∈Rr(σ) and π′∈Rr(π)
we have ℓ(σ′,π′)≤ℓ(mσr,mπr),
then d∘r(σ,π)≥n−ℓ(mσr,mπr)
(subtracting a larger value from n
results in a smaller overall value).
Therefore it suffices to show that
that for all σ′∈Rr(σ) and
π′∈Rr(π), that ℓ(σ′,π′)≤ℓ(mσr,mπr).
This is simple to prove
because if two permutations have a common subsequence,
then their corresponding r-regular multipermutations
will have a related common subsequence.
Let σ′∈Rr(σ), π′∈Rr(π), and
ℓ(σ′,π′)=k. Then there exist indexes
1≤i1<i2<⋯<ik≤n and
1≤j1<j2<⋯<jk≤n such that
for all p∈[k],σ′(ip)=π′(jp).
Of course, whenever σ′(i)=π′(j), then
mσ′r(i)=mπ′r(j).
Therefore ℓ(σ′,π′)=k≤ℓ(mσ′r,mπ′r)=ℓ(mσr,mπr).
Next, we will show that d∘r(σ,π)≤n−ℓ(mσr,mπr).
Note that
[TABLE]
Here if σ′∈Rr(σ),π′∈Rr(π)maxℓ(σ′,π′)≥ℓ(mσr,mπr),
then
d∘r(σ,π)≤n−ℓ(mσr,mπr)
(subtracting a smaller value from n results in
a larger overall value).
It is enough to show that
there exist σ′∈Rr(σ) and
π′∈Rr(π) such that
ℓ(σ′,π′)≥ℓ(mσr,mπr).
To prove this fact, we take a longest common subsequence
of mσr and mπr and then
carefully choose σ′∈Rr(σ) and
π′∈Rr(π) to have an equally long common subsequence.
The next paragraph describes how this can be done.
Let ℓ(mσr,mπr)=k and
let (1≤i1<i2<⋯<ik≤n) and
(1≤j1<j2<⋯<jk≤n) be integer sequences
such that for all p∈[k],mσr(ip)=mπr(jp).
The existence of such sequences is guaranteed
by the definition of ℓ(mσr,mπr).
Now for all p∈[k], define
σ′(ip) to be the smallest integer l∈[n]
such that mσ(l)=mσ(ip)
and if q∈[k] with q<p, then
mσr(iq)=mπr(ip) implies
σ′(iq)<σ′(ip)=l.
For all p∈[k], define π(jp) similarly.
Then for all p∈[k],σ′(ip)=π′(jp).
The remaining terms of σ′ and π′ may
easily be chosen in such a manner that
σ′∈Rr(σ) and π′∈Rr(π).
Thus there exist σ′∈Rr(σ) and
π′∈Rr(π) such that
ℓ(σ′,π′)≥ℓ(mσr,mπr).
∎
The following example helps to illuminate
the choice of σ′ and π′ in the proof above.
If mσr=(2,1,2,1,3,3), and
mπr=(3,2,2,1,3,1), then we
have ℓ(mσr,mπr)=4,
with the common subsequence (2,2,1,3) of maximal length.
Here (1,3,4,6) and (2,3,4,5) are
sequences with
mσr(1)=mπr(2),
mσr(3)=mπr(3),
mσr(4)=mπr(4), and
mσr(6)=mπr(5).
Then following the convention outlined in the proof above,
σ′(1)=π′(2)=3,
σ′(3)=π′(3)=4,
σ′(4)=π′(4)=1, and
σ′(6)=π′(5)=5,
so that ℓ(σ′,π′)≥4.
The other elements of σ′ and π′ can
be chosen as follows so that σ′∈Rr(σ) and
π′∈Rr(π):
set σ′(2)=1, σ′(5)=6,
π′(1)=1, and π′(6)=6.
If two multipermutations mσr
and mπr have a common subsequence of
length k, then mσr can
be transformed into mπr with n−k (but
no fewer) delete/insert operations.
Delete/insert operations correspond to applying
(multiplying on the right) a translocation.
Hence by Lemma 1 we can
state the following lemma about the r-regular
Ulam distance.
There exists a translocation ϕ∈Sn
such that ℓ(mσr⋅ϕ,mπr)=ℓ(mσr,mπr)+1,
since it is always possible to arrange one element with a
single translocation. This then implies that
min{k∈Z:there exists (ϕ1,…,ϕk)s.t.mσr⋅ϕ1⋯ϕk=mπr}≤n−ℓ(mσr,mπr)=d∘r(σ,π).
At the same time, given
ℓ(mσr,mπr)≤n,
then for all translocations ϕ∈Sn,
we have that
ℓ(mσr⋅ϕ,mπr)≤ℓ(mσr,mπr)+1,
since a single translocation can only arrange one
element at a time. Therefore
min{k∈Z:there exists (ϕ1,…,ϕk) s.t mσr⋅ϕ1⋯ϕk=mπr}≥n−ℓ(mσr,mπr)=d∘r(σ,π), by
Lemma 1.
∎
Lemmas 1 and 2 allow us to view the
Ulam metric for r-regular multipermutations
similarly to the way we view the Ulam metric
for permutations; in terms of longest common
subsequences or in terms of the minimum number
of translocations.
Another known
property of the Ulam metric for permutations is left
invariance, i.e. given τ∈Sn,
we have d∘(σ,π)=d∘(τσ,τπ).
However, left invariance does not hold in general
for multipermutations, as the next lemma indicates.
Lemma 3**.**
Let n/r≥2 and r≥2.
Then there exist σ′,π′∈Sn such that
d∘r(e,σ′)=d∘r(π′e,π′σ′).
Proof.
Let
n/r≥2 and r≥2.
Define σ′,π′∈Sn by
[TABLE]
[TABLE]
First, consider d∘r(e,σ′).
Note that for mer and mσ′r,
for any integer i such that 2r<i≤n we have
e(i)=σ′(i), which implies mer(i)=mσ′r(i).
Meanwhile, the first 2r elements of
mer and mσ′r are
(r1,1,…,1,r2,2,…,2) and
(r2,2,…,2,r1,1,…,1) respectively,
so that the longest common subsequence of the first
2r elements of mer and mσ′r
is comprised of r1’s or r2’s.
Hence ℓ(mer,mσ′r)=(n−2r)+r=n−r, which by lemma 1 implies
that d∘r(e,σ′)=r≥2.
Next, consider d∘r(π′e,π′σ′).
Multiplying π′ and σ′ yields
[TABLE]
For all integers i
such that 2r<i≤n, we then have π′e(i)=π′(i)=π′σ′(i)⟹mπ′er(i)=mπ′σ′r(i).
Meanwhile, the first 2r elements of mπ′er
and mπ′σ′ are
(1,2,1,2,…,1,2) and (2,1,2,1,…,2,1) respectively.
Thus the longest common subsequence of the first 2r
elements of mπ′er and mπ′σ′r
is any length 2r−1 sequence of alternating 1’s and 2’s.
Hence ℓ(mπ′er,mπ′σ′r)=(n−2r)+(2r−1)=n−1, which by lemma 1 implies
that d∘r(π′e,π′σ′)=1.
∎
The fact that left invariance does not hold for
the r-regular Ulam metric has implications
on r-regular Ulam sphere sizes, defined and
discussed in the next section.
Left invariance implies sphere size does not depend upon
the center. However, we will demonstrate that in the
multipermutation case sizes may differ
depending upon the center, a
fact previously unknown.
IV Young Tableaux Sphere Size Calculation
In [10], Young tableaux and the
RSK-Correspondence were utilized to calculate
Ulam Sphere sizes. A similar approach can be applied
to r-regular Ulam spheres of arbitrary radius centered at
mer.
It is first necessary to introduce some basic notation and
definitions regarding Young tableaux. Additional information on the subject
can be found in [6], [12], and [14].
A Young diagram is a left-justified collection of cells with a
(weakly) decreasing number of cells in each row below. Listing the
number of cells in each row gives a partition
λ=(λ1,λ2,…,λk)
of n, where n is the total
number of cells in the Young diagram.
The notation λ⊢n indicates λ is a partition of n.
Because the partition λ⊢n
defines a unique Young diagram and vice versa,
a Young diagram may be referred to by its associated
partition λ⊢n.
For example, the partition λ:=(4,3,1)⊢8 has the
corresponding Young diagram pictured below.
[TABLE]
A Young tableau, or simply a tableau, is
a filling of a Young diagram λ⊢n
such that values in all cells are weakly increasing across
each row and strictly increasing down each column.
If each of the integers
1 through n appears exactly once in a tableau T that
is a filling of a Young diagram λ⊢n, then we
call T a standard Young tableau, abbreviated SYT.
The Schensted algorithm is an
algorithm for obtaining a tableau T←x from
a tableau T and a real number x.
The algorithm may be defined as follows:
1) Set i:=1.
2) If row i (of T) is empty or if x is greater or equal to
each of the entries in the ith row then input x
in a new box at the end of row i
and terminate the algorithm.
Otherwise, proceed to step 3.
3) Find the minimum entry y in row i such that y>x
and swap y and x. That is, replace y with x in its box
and set x:=y.
4) Set i:=i+1, and return to step 2.
As an example,
let T be the tableau pictured below on the far left.
The following diagrams illustrate the stages of the
Schensted algorithm applied to T to obtain T←2.
[TABLE]
The Schensted algorithm may be applied to the sequence
(mσr(1),mσr(2),…,mσr(n)) of an r-regular multipermutation mσr
to obtain a unique tableau
P:=(…(mσr(1)←mσr(2))←…)←mσr(n).
Meanwhile, a unique standard tableau results
from recording where each new box appears in the construction
of P. This recording is accomplished by inputing the
value i in the new box that appears when mσr(i)
is added (via the Schendsted algorithm) to
(…(mσr(1)←(mσr(2)←…)←mσr(i−1).
Following conventions, we denote
the standard tableau resulting from this recording
method by Q.
As an example, the two tableaux pictured below
are the respective P and Q resulting from
the multipermutation (2,3,2,1,3,1).
Notice that P and Q have the same shape.
Intermediate steps are omitted for brevity.
[TABLE]
The RSK-correspondence ([6, 14])
provides a bijection between r-regular multipermutations
mσr and ordered pairs (P,Q) on the same
Young diagram λ⊢n, where
P is a tableaux whose members come from
mσr and Q is a SYT.
A stronger form of the following lemma appears in [6].
Lemma 4**.**
Let σ∈Sn and
P be the tableau resulting from running
the Schendsted algorithm on the entries of σ.
Then the number of columns in P is equal to
ℓ(mer,mσr),
the length of the longest non-decreasing subsequence of
mσr.
The above lemma, in conjunction with the RSK-correspondence, means that
for all k∈[n], the size of the set
{mσr∈Mr(Sn):ℓ(mer,mσr)=k} is equal to the
sum of the number of ordered pairs
(P,Q) on each Young diagram λ⊢n such that
λ1=k, where P is a tableaux whose members come from
mσr and Q is a SYT.
The number of SYT on a particular λ⊢n is
denoted by fλ. We denote by Krλ
(our own notation) the
number of Young tableaux on λ⊢n such that
each i∈[n/r] appears exactly r times.
The next lemma states the relationship between
∣S(mer,t)∣, fλ, and Krλ.
Lemma 5**.**
Let t∈[0,n−1], and
Λ:={λ⊢n:λ1≥n−t}.
Then
∣S(mer,t)∣=λ∈Λ∑(fλ)(Krλ).
Proof.
Assume t∈[0,n−1].
Let Λ:={λ⊢n:λ1≥n−t}.
Furthermore, let
Λ(l):={λ⊢n:λ1=l},
the set of all partitions of n
having exactly l columns.
By the RSK-Correspondence, and Lemma 4,
there is a bijection between the set
{mσr:ℓ(mer,mσr)=l} and
the set of ordered pairs (P,Q) where both P and Q have
exactly l columns.
This implies that
∣{mσr:ℓ(mer,mσr)=l}∣=λ∈Λ(l)∑(fλ)(Krλ).
Note that by Lemma 1,
[TABLE]
Hence it follows that
∣S(mer,t)∣=λ∈Λ∑(fλ)(Krλ).
∎
The formula below, known as the
hook length formula, is due to Frame, Robinson, and Thrall [5, 6].
In the formula, the notation (i,j)∈λ
is used to refer to the cell in the ith
row and jth column of a Young diagram
λ⊢n. The notation h(i,j) denotes the
hook length of (i,j)∈λ, i.e.,
the number of boxes below or to the right of (i,j), including the box (i,j)
itself. More formally,
h(i,j):=∣{(i,j∗)∈λ:j∗≥j}∪{(i∗,j)∈λ:i∗≥i}∣.
The formula is as follows:
[TABLE]
Thus by applying Lemma 5,
it is possible to calculate r-regular sphere size
by using the hook length formula.
We will use this strategy to treat the sphere of radius r=1.
However, because sphere sizes are calculated
recursively, we must first calculate the sphere size
when r=0.
Remark**.**
∣S(mer,0)=1∣.
Although this is an obvious fact, we wish to consider why it is true
from the perspective of Lemma 5.
Note first that there is only one partition λ⊢n
such that λ1=n, namely
λ′:=(n) with the associated Young diagram below.
[TABLE]
It is clear that there is only one possible Young tableau on λ′
so that (fλ′)=1, and thus by Lemma 5∣S(mer,0)∣ = 1.
The following proposition is an application of
Lemma 5.
Proposition 6**.**
∣S(mer,1)∣=1+(n−1)(n/r−1).**
Proof.
First note that ∣S(mer,0)∣=1.
There is only one possible partition
λ⊢n such that λ1=n−1,
namely λ:=(n−1,1), with its Young diagram pictured below.
[TABLE]
Therefore by Lemma 5,
∣S(mer,1)∣=1+(fλ′)(Krλ′).
Applying the well-known hook length formula
([5, 6]), we obtain
fλ′=n−1. The value Krλ′ is characterized
by possible fillings of row 2 with the stipulation that each i∈[n/r]
must appear
exactly r times in the diagram. In this case, since there is only a single
box in row 2, the possible fillings are i∈[n/r−1],
each of which yields a unique Young tableau
of the desired type. Hence Krλ′=n/r−1,
which implies that ∣S(mer,1)∣=1+(n−1)(n/r−1).
∎
Proposition 6 demonstrates how
Young Tableaux may be used to calculate r-regular
Ulam spheres centered at mer.
V r-Regular Ulam Spheres and Duplication Sets
In the previous section we showed how multipermutation
Ulam spheres may be calculated when the center is
mer. In this section we
provide a way to calculate sphere sizes
for any center when the radius is t=1.
The r-regular Ulam sphere sizes play an
important role in understanding the
potential code size for a given minimum distance.
For example, the well-known sphere-packing
and Gilbert-Varshamov bounds rely on
calculating, or at least bounding sphere sizes.
In the case of permutations, recall that the
Ulam sphere S(σ,t) centered at
σ of radius t was defined as
S(σ,t):={π∈Sn:d∘(σ,π)≤t}, which is
equivalent by definition to the set
{π∈Sn:n−ℓ(σ,π)≤t}.
In the case of r-regular multipermutations, for
t∈Z>0,
we introduce the following analogous definition
of a sphere.
Definition**.**
Define
[TABLE]
We call S(mσr,t)
the r-regular Ulam sphere
centered at mσr of radius t.
By Lemma 1,
S(mσr,t)={mπr∈Mr(Sn):n−ℓ(mσr,mπr)≤t}.
It should be noted, however, that the notation mπr
is a bit misleading because given
mπr∈M(Sn),
we cannot uniquely determine π.
The r-regular Ulam sphere definition can also be viewed
in terms of translocations. Lemma 2 implies
that S(mσr,t) is equivalent to
{mπr∈Mr(Sn):there exists k∈[t] and (ϕ1,…,ϕk) s.t. mσr⋅ϕ1⋯ϕk=mπr}.
Lemma 5 provided a way to
calculate r-regular Ulam spheres centered at
mer.
Unfortunately, the choice of center has an impact
on the size of the sphere, as
is easily confirmed by comparing Proposition 6
to Lemma 16 (Section VI).
Hence the applicability of Lemma 5
is limited.
We begin to address the issue of differing sphere
sizes by considering the radius t=1 case.
To aid with calculating such sphere sizes,
we introduce
(as our own definition)
the following subset of the set of translocations.
Definition**.**
Let n∈Z>0.
Define
[TABLE]
We call Tn the unique set of translocations.
By definition, Tn is the set of all translocations in
Sn, except
translocations of the form ϕ(i,i−1). We exclude translocations
of this form because they can be modeled by translocations of the form
ϕ(i−1,i), and are therefore redundant.
We claim that the set Tn is precisely the set of translocations
needed to obtain all unique permutations within the
Ulam sphere of radius 1 via multiplication.
Moreover, there is no redundancy in the set,
that is, there is no smaller set of translocations
yielding the entire Ulam sphere of radius 1
when multiplied with a given center permutation.
These facts are stated in the next lemma.
Lemma 7**.**
S(σ,1)={σϕ∈Sn:ϕ∈Tn}*
and
∣Tn∣=∣S(σ,1)∣.*
Proof.
We will first show that
S(σ,1)={σϕ∈Sn:ϕ∈Tn}.
Note that
[TABLE]
It is trivial that
[TABLE]
Therefore
{σϕ∈Sn:ϕ∈Tn}⊆S(σ,1).
To see why S(σ,1)⊆{σϕ∈Sn:ϕ∈Tn},
consider any σϕ(i,j)∈{σϕ(i,j)∈Sn:i,j∈[n]}=S(σ,1).
If i−j=1, then ϕ(i,j)∈Tn, and thus
σϕ(i,j)∈{σϕ∈Sn:ϕ∈Tn}.
Otherwise, if i−j=1, then σϕ(i,j)=σϕ(j,i), and
i−j=1⟹j−i=−1=1, so ϕ(j,i)∈Tn.
Hence σϕ(i,j)=σϕ(j,i)∈{σϕ∈Sn:ϕ∈Tn}.
Next we show that ∣Tn∣=∣S(σ,1)∣.
By Proposition 6
(in the case that r=1), ∣S(σ,1)∣=1+(n−1)2.
On the other hand, ∣Tn∣=∣{ϕ(i,j)∈Sn:i−j=1}∣.
If i=1, then there are n values j∈[n]
such that i−j=1. Otherwise, if i∈[n] but
i=1, then there are n−1 values j∈[n] such
that i−j=1. However, for all i,j∈[n],ϕ(i,i)=ϕ(j,j)=e so that there are n−1
redundancies. Therefore
∣Tn∣=n+(n−1)(n−1)−(n−1)=1+(n−1)2.
∎
In the case
of permutations, the set
Tn has no redundancies. If
ϕ1,ϕ2∈Tn, then
σϕ1=σϕ2 implies ϕ1=ϕ2.
Alternatively, in the case of multipermutations,
the set Tn can generally be shrunken
to exclude redundancies.
Notice that
S(mσr,1)={mπr∈Mr(Sn):there exists ϕ s.t. mσr⋅ϕ=mπ}, which is equal to
{mσr⋅ϕ∈Mr(Sn):ϕ∈Tn}.
However, it is possible that there exist
ϕ1,ϕ2∈Tn such that ϕ1=ϕ2,
but mσr⋅ϕ1=mσr⋅ϕ2.
In such an instance we may refer to either ϕ1 or ϕ2
as a duplicate translocation for mσr.
If we remove all duplicate translocations for mσr
from Tn, then the resulting set will have the same
cardinality as the r-regular Ulam sphere of radius 1 centered
at mσr. The next definition (our own)
is the set of standard duplicate translocations.
For the remainder of the paper, assume that
m is an n-length integer tuple, i.e.
m∈Zn.
Definition**.**
Define
[TABLE]
We call D(m) the standard
duplicate translocation set for m.
For each i∈[n], also define
Di(m):={ϕ(i,j)∈Dn:j∈[n]}.
If we take an r-regular multipermutation mσr,
then removing D(mσr) from Tn
equates to removing a set
of duplicate translocations.
These duplications come in two varieties.
The first variety corresponds to the
first condition of the D(m) definition,
when m(i)=m(j).
For example, if σ∈S6 such that
mσ2=(1,3,2,2,3,1), then we have
mσ2⋅ϕ(1,5)=(3,2,2,3,1,1)=mσ2⋅ϕ(1,6),
since mσ2(2)=3=mσ2(4).
This is because moving the first 1 to the left or to the right
of the last 1 results in the same tuple.
The second variety corresponds to the second condition of
the D(m) definition above,
when m(i)=m(i−1).
For example, if
mσ2=(1,3,2,2,3,1) as before, then for all
j∈{1,2,3,4,5,6}, we have
mσ2⋅ϕ(3,j)=mσ2⋅ϕ(4,j).
This is because any translocation that deletes and inserts
the second of the two adjacent 2’s does not result in
a different tuple when compared to deleting and
inserting the first of the
two adjacent 2’s.
Lemma 8**.**
S(mσr,1)={mσr⋅ϕ∈Mr(Sn):ϕ∈Tn\D(mσr)}.**
Proof.
Notice
S(mσr,1)={mσr⋅ϕ∈Mr(Sn):ϕ∈Tn}.
Hence it suffices to show that for all
ϕ(i,j)∈D(mσr), there exists some
i′,j′∈[n] such that ϕ(i′,j′)∈Tn\D(mσr) and
mσr⋅ϕ(i,j)=mσr⋅ϕ(i′,j′).
We proceed by dividing the proof into two
main cases. Case I is when
(mσr(i)=mσr(i−1) or i=1.
Case II is when
(mσr(i)=mσr(i−1).
Case I (when (mσr(i)=mσr(i−1) or i=1)
can be split into two subcases:
[TABLE]
We can ignore the instance when i=j, since
ϕ(i,j)∈D(mσr) implies
i=j.
For case IA, if for all p∈[i,j]
(for a,b∈Z with
a<b, the notation [a,b]:={a,a+1,…,b}) we have
mσr(i)=mσr(p), then
mσr⋅ϕ(i,j)=mσr⋅e.
Thus setting i′=j′=1 yields the desired result.
Otherwise, if there exists p∈[i,j] such that
mσr(i)=mσr(p),
then let
[TABLE]
Then ϕ(i,j∗)∈Tn\D(mσr)
and mσr⋅ϕ(i,j)=mσr⋅ϕ(i,j∗).
Thus setting i′=i and j′=j∗ yields the desired result.
Case IB is similar to Case IA.
Case II (when
mσr(i)=mσr(i−1)),
can also be divided into two subcases.
[TABLE]
As in Case I, we can ignore the instance
when i=j.
For Case IIA, if for all p∈[i,j] we have
mσr(i)=mσr(p),
then mσr⋅ϕ(i,j)=mσr⋅e,
so setting i=j=1 achieves the desired result.
Otherwise, if there exists p∈[i,j] such that
mσr(i)=mσr(p),
then let
[TABLE]
Then
mσr⋅ϕ(i,j)=mσr⋅ϕ(i∗,j)
and either one of the following is true: (1)
ϕ(i∗,j)∈/Di∗(mσr)⟹ϕ(i∗,j)∈/D(mσr), so set
i′=i∗ and j′=j; or (2)
by Case IA there exist i′,j′∈[n] such that
ϕ(i′,j′)∈Tn\D(mσr) and
mσr⋅ϕ(i′,j′)=mσr⋅ϕ(i∗,j)=mσr⋅ϕ(i,j).
Case IIB is similar to Case IIA.
∎
While Lemma 8 shows
that D(mσr) is a set of duplicate translocations for
mσr,
we have not shown
that Tn\D(mσr) is the
set of minimal size having the quality that
S(mσr,1)={mσr⋅ϕ∈Mr(Sn):ϕ∈Tn\D(mσr)}.
In fact it is not minimal.
In some instances it is possible to remove
further duplicate translocations to reduce the set size.
We will define another set of duplicate translocations, but
a few preliminary definitions are first necessary.
We say that m is
alternating if for all odd integers 1≤i≤n,
m(i)=m(1) and
for all even integers 2≤i′≤n,
m(i′)=m(2) but
m(1)=m(2).
In other words, any alternating tuple is
of the form
(a,b,a,b,…,a,b) or (a,b,a,b,…,a)
where a,b∈Z and
a=b.
Any singleton is also said to be alternating.
Now for integers
1≤i≤n and 0≤k≤n−i, the substringm[i,i+k] of m is defined as
m[i,i+k]:=(m(i),m(i+1),…m(i+k)).
Given a substring m[i,j] of m,
the length∣m[i,j]∣ of m[i,j]
is defined as ∣m[i,j]∣:=j−i+1.
As an example, if m′:=(1,2,2,4,2,4,3,1,3),
then m′[3,6]=(2,4,2,4) is an alternating
substring of m′ of length 4.
Definition**.**
Next define
[TABLE]
We call E(m) the
alternating duplicate translocation set for m
because it is only nonempty when
m contains an alternating substring
of length at least 4.
For each i∈[n], also define
Ei(m):={ϕ(i,j)∈E(m):j∈[n]}.
In the example of m′:=(1,2,2,4,2,4,3,1,3) above,
m∗⋅ϕ(2,6)=m′⋅ϕ(6,3) and
ϕ(2,6),ϕ(6,3)∈T9\D(m′),
implying that ϕ(2,6)∈E(m′).
In fact, it can easily be shown that
E(m′)={ϕ(2,6)}.
Lemma 9**.**
*Let i∈[n].
Then
Ei(m)=∅
if and only if
m(i)=m(i−1)
There exists j∈[i+1,n] and k∈[i,j−2] such that*
i) For all p∈[i,k−1], m(p)=m(p+1)
ii) m[k,j] is alternating
iii) ∣m[k,j]∣≥4.
Proof.
Let i∈[n].
We will first assume 1) and 2) in the lemma statement
and show that Ei(m) is not empty.
Suppose m(i)=m(i−1), and
that there exists j∈[i+1,n] and k∈[i,j−2] such
that for all p∈[i,k−1], we have
m(p)=m(p+1). Suppose also
that m[k,j] is alternating with
∣m[k,j]∣≥4.
For ease of notation, let
a:=m(k)=m(k+2) and
b:=m(k+1)=m(k+3)
so that m[k,k+3]=(a,b,a,b)∈Z4.
Then
[TABLE]
Moreover, for all p∈/[k,k+3], we have
(m⋅ϕ(i,k+3))(p)=m(p)=(m⋅ϕ(k+3,k))(p). Therefore
m⋅ϕ(i,k+3)=m⋅ϕ(k+3,k). Also notice
that m(i)=m(i−1)
implies that mϕ(i,k+3)∈/D(m).
Hence ϕ(i,k+3)∈Ei(m).
We now prove the second half of the lemma.
That is, we assume that Ei(m)=∅ and then show that
and 2) necessarily hold.
Suppose that Ei(m) is nonempty.
Then m(i)=m(i−1), since
otherwise there would not exist any
ϕ(i,j)∈Tn\D(m).
Let j∈[i+1,n] and k∈[i,j−2] such that
ϕ(j,k)∈Tn\D(m) and m⋅ϕ(i,j)=m(j,k).
Existence of such j, k, and ϕ(j,k) is
guaranteed by definition of Ei(m) and
the fact that Ei(m)
was assumed to be nonempty.
Then for all p∈[i,k−1], we have
m(p)=m(p+1) and
for all p∈[k,j−2], we have
m(p)=m(p+2).
Hence either m[k,j] is
alternating, or else
for all p,q∈[k,j], we have
m(p)=m(q).
However, the latter case is impossible,
since it would imply that for all
p,q∈[i,j] that m(p)=m(q),
which would mean ϕ(j,k)∈/Tn\D(m), a contradiction.
Therefore m[k,j] is alternating.
It remains only to show that ∣m[k,j]∣≥4.
Since k∈[i,j−2], it must be the case that
∣m[k,j]∣≥3. However, if
∣m[k,j]∣=3 (which occurs when k=j−2),
then (m⋅ϕ(i,j))(j)=m(i)=m(k)=m(k+1)=(m⋅ϕ(j,k)(j),
which implies that
m⋅ϕ(i,j)=m⋅ϕ(j,k),
a contradiction.
Hence ∣m[k,j]∣≥4.
∎
One implication of Lemma 9 is that
there are only two possible forms for
m[i,j] where ϕ(i,j)∈Ei(m).
The first possibility is that
m[i,j] is an alternating substring
of the form
(a,b,a,b,…,a,b)
(here a,b∈Z),
so that m[i,j]⋅ϕ(i,j)
is of the form (b,a,b,a…,b,a).
In this case, as long as m[i,j]∣≥4,
then setting k=i implies that
k∈[i,j−2], that
ϕ(j,k)∈Tn\D(m), and that
m[i,j]⋅ϕ(i,j)=m[i,j]⋅ϕ(j,k).
The other possibility is that m[i,j]
is of the form
(ka,a,a,…,a,n−kb,a,b,…,a,b)
(again a,b∈Z), so that
m[i,j]⋅ϕ(i,j) is of the form
(k−1a,…,a,n−k+1b,a,b,…,b,a).
Again in this case, as long as ∣m[i,j]∣≥4, then
k∈[i,j−2],
ϕ(j,k)∈Tn\D(m), and
m[i,j]⋅ϕ(i,j)=m[i,j]⋅ϕ(j,k).
Remark**.**
*If
m is alternating and
n is even*
then m⋅ϕ(1,n)=m⋅ϕ(n,1).
Remark**.**
*If
m is alternating
n≥3
n is odd,*
then m⋅ϕ(1,n)=m⋅ϕ(n,1).
To calculate ∣E(mσr∣, we define a
set of equal size that is easier to calculate.
Definition**.**
Define
[TABLE]
For each i∈[n], also define
Ei∗(m):={(i,j)∈E∗(m):j∈[n]}.
Notice that E∗(m)=i∈[n]⋃Ei∗(m).
Lemma 10**.**
∣E(m)∣=∣E∗(m)∣**
Proof.
The idea of the proof is simple.
Each element ϕ(i,j)∈E(m)
involves exactly one alternating sequence
of length greater or equal to 4, so the set
sizes must be equal. We formalize the
argument by showing that
∣E(m)∣≤∣E∗(m)∣
and then that
∣E∗(m)∣≤∣E(m)∣.
To see why
∣E(m)∣≤∣E∗(m)∣,
we define a mapping
f:[n]→[n], which maps
index values either to the
beginning of the nearest alternating
subsequence to the right, or else
to n. For all i∈[n], let
[TABLE]
Notice by definition of f, if
i,i′∈[n] such that i=i′, and
if m(i)=m(i−1) or i=1
and at the same time
m(i′)=m(i′−1) or i′=1,
then f(i)=f(i′).
Now for each i∈[n],
if m(i)=m(i−1) or
i=1, then
∣Ei(m)∣ = ∣Ef(i)∗(m)∣
by Lemma 9 and the two previous remarks.
Otherwise, if
m(i)=m(i−1),
then
∣Ei(m)∣=∣Ef(i)∗(m)∣=0.
Therefore
∣Ei(m)∣≤∣Ei∗(m)∣.
This is true for all i∈[n], so
∣E(m)∣≤∣E∗(m)∣.
The argument to show that
∣E∗(m)∣≤∣E(m)∣
is similar, except it uses the following
function g:[n]→[n] instead of f.
For all i∈[n], let
[TABLE]
∎
By definition, calculating ∣E∗(m)∣ equates
to calculating the number of alternating substrings
m[i,j] of m such that the length
of the substring is both even and longer than 4. The
following lemma helps to simplify this calculation further.
Lemma 11**.**
Let m be an alternating string. Then
[TABLE]
Proof.
Assume m is an alternating string.
By Lemma 10,
∣E(m)∣=∣E∗(m)∣=∣i∈[n]⋃Ei∗(m)∣.
Since m was assumed to be alternating,
[TABLE]
where K is the set of even integers between
4 and n, i.e.
K:={k∈[4,n]:k is even}.
For each k∈K, we have
[TABLE]
Therefore
∣E(m)∣=k∈K∑(n−k+1).
In the case that n is even, then
[TABLE]
In the case that n is odd, then
[TABLE]
∎
Notice that by Lemma 11,
it suffices to calculate
∣E(m)∣ for locally maximal length
alternating substrings of m.
An alternating substring m[i,j]
is of locally maximal length if and only if
m[i−1] is not alternating or i=1; and
m[i,j+1] is not alternating or j=n.
Finally, we define the general set of duplications. The
lemma that follows the definition also shows that
removing the set D∗(mσr) from Tn
removes all duplicate translocations
associated with mσr.
Definition** (D∗(m), duplication set).**
Define
[TABLE]
We call D∗(m) the duplication
set for m. For each i∈[n], we also define
Di∗(m):={ϕ(i,j)∈D∗(m):j∈[n]}.
Lemma 12**.**
Let ϕ1,ϕ2∈Tn\D∗(mσr).
Then ϕ1=ϕ2 if and only if
mσr⋅ϕ1=mσr⋅ϕ2.
Proof.
Let
ϕ1,ϕ2∈Tn\D∗(mσr).
If ϕ1=ϕ2 then
mσr⋅ϕ1=mσr⋅ϕ2
trivially. It remains to prove that
mσr⋅ϕ1=mσr⋅ϕ2⟹ϕ1=ϕ2. We proceed by contrapositive.
Suppose that ϕ1=ϕ2.
We want to show that
mσr⋅ϕ1=mσrϕ2.
Let ϕ1:=ϕ(i1,j1) and ϕ2:=ϕ(i2,j2).
The remainder of the proof can be split into two main cases:
Case I is if i1=i2 and Case II is if i1=i2.
Case I (when i1=i2), can be further divided into two subcases:
[TABLE]
Case IA is easy to prove. We have
Di1∗(mσr)=Di2∗(mσr)={ϕ(i1,j)∈Tn\{e}:j∈[n]},
so ϕ1=e=ϕ2, a contradiction.
For Case IB, we can first assume without loss of
generality that j1<j2 and then split
into the following smaller subcases:
[TABLE]
However, subcase iv) is unnecessary since
it was assumed that j1<j2,
so j1>i1⟹j2>j1>i1.
Subcase ii) can also be reduced to
(j1<i1) and (j2<i1) since
j2=i2=i1.
Each of the remaining subcases is proven
by noting that there is some element in the
multipermutation mσr⋅ϕ1 that is
necessarily different from mσr⋅ϕ2.
For example, in subcase i), we have
mσr⋅ϕ1(j1)=mσr(i1)=mσr(j1)=mσr⋅ϕ2(j1).
Subcases ii) and iii) are solved similarly.
Case II (when i1=i2) can be divided into three subcases:
[TABLE]
Case IIA is easily solved by mimicking the proof of Case IA.
Case IIB is also easily solved as follows.
First, without loss of generality,
we assume that
mσr(i1)=mσr(i1−1) and mσr(i2)=mσr(i2−1).
Then Di1∗(mσr)={ϕ(i1,j)∈Tn\{e}:j∈[n]}, so ϕ1=e.
Therefore we have
mσr⋅ϕ1(j2)=mσr(j2)=mσr(i2)=mσrϕ2(i2−1).
Finally, for Case IIC, without loss of generality we may
assume that i1<i2 and then split into the following four subcases:
[TABLE]
.
However, since
ϕ(i2,j2)∈Tn\D∗(mσr)
implies i2=j2,
subcases i) and iii) can be reduced to
(j1<i2) and (j2>i2) and
(j1≥i2) and (j2>i2) respectively.
For subcase i), we have
mσr⋅ϕ1(j1)=mσr(i1)=mσr(j1)=mσr⋅ϕ2(j1).
Subcases ii) and iii) are solved in a similar manner.
For subcase iv),
if j1>i2, then mσr⋅ϕ1(j1)=mσr(i1)=mσr(j1)=mσr⋅ϕ2(j1).
Otherwise, if j1=i2, then
ϕ1=ϕ(i1,i2) and
ϕ1=ϕ(i2,j2).
Thus if mσr⋅ϕ1=mσr⋅ϕ2 then
ϕ1∈Di1∗(mσr),
which implies that
ϕ1∈/Tn\D∗(mσr),
a contradiction.
∎
Lemma 12 implies that we can
calculate r-regular Ulam sphere sizes
of radius 1 whenever we can calculate the
appropriate duplication set. This calculation
can be simplified by noting that for a sequence
m∈Zn that
D(m)∩E(m)=∅
(by the definition of E(m))
and then decomposing the duplication set into these
components.
This idea is stated in the next theorem
This implies ∣Tn\D∗(mσr)∣≥∣S(mσr,1)∣.
By lemma 12, for ϕ1,ϕ2∈Tn\D∗(mσr),
if ϕ1=ϕ2, then mσr⋅ϕ1=mσr⋅ϕ2.
Hence we have ∣Tn\D∗(mσr)∣≤∣S(mσr,1)∣, which implies that
∣Tn\D∗(mσr)∣=∣S(mσr,1)∣.
It remains to show that
∣Tn\D∗(mσr)∣=(n−1)2+1−∣D(mσr)∣−∣E(mσr)∣.
This is an immediate consequence of the fact that
∣Tn∣=(n−1)2+1 and
D(mσr)∩E(mσr)=∅.
∎
Theorem 13 reduces the calculation of
∣S(mσr,1)∣ to calculating
∣D(mσr)∣ and ∣E(mσr∣.
It is an easy matter to calculate ∣D(mσr)∣,
since it is exactly equal to (n−2) times the number of
i∈[n] such that
mσr(i)=mσr(i−1) plus
(r−1) times the number of i∈[n] such that
mσr(i)=mσr(i−1).
We also showed how to calculate ∣E(m)∣ earlier.
The next example is an application of Theorem 13
Example**.**
Suppose σ:=[1,2,3,4,9,6,7,11,5,10,12,8]. Then
mσ3:=(1,1,1,2,3,2,3,2,4,4,3,4).
There are 3 values of i∈[12]
such that mσ3(i)=mσ3(i−1),
which implies that ∣D(mσ3∣=(3)(12−2)+(12−3)(3−1)=48. Meanwhile, by
Lemmas 10 and 11,
∣E(mσ3)∣=((5−3)/2)((5−1)/2))=2.
By Theorem 13,
∣S(mσ3),1∣=(12−1)2−48−2=71.
VI Min/Max Spheres and Code Size Bounds
In this section we show choices of center achieving
minimum and maximum r-regular Ulam sphere sizes
for the radius t=1 case. The minimum and maximum values
are explicitly given. We then discuss
resulting bounds on code size. First let us consider
the r-regular Ulam sphere of minimal size.
Lemma 14**.**
∣S(mer,1)∣≤∣S(mσr,1)∣**
Proof.
In the case that n/r=1, then
mer=e and mσr=σ,
so that
∣S(mer,1)∣=∣S(mσr,1)∣.
Therefore we may assume that n/r≥2.
By Theorem 13,
σ∈Snmin(∣S(mσr,1)∣)=1+(n−1)2−σ∈Snmax(∣D(mσr)∣+∣E(mσr)∣).
Since n/r≥2, we know that
n−2>r−1, which implies that for all
σ∈Sn, that
∣D(mσr)∣ is maximized by
maximizing the number of integers i∈[n] such that
mσr(i)=mσr(i−1).
This is accomplished by choosing σ=e, and
hence for all
σ∈Sn, we have
∣D(mer)∣≥∣D(mσr)∣.
We next will show that for any increase in the size of
∣E(mσr)∣ compared to
∣E(mer)∣, that ∣D(mσr)∣
is decreased by a larger value compared to
∣D(mer)∣, so that
(∣D(mσr)∣+∣E(mσr)∣) is
maximized when σ=e.
Suppose σ∈Sn.
By Lemmas 10 and 11,
∣E(mσr∣ is characterized by
the lengths of its locally maximal alternating substrings.
For every locally maximal alternating substring
mσr[a,a+k−1] of
mσr of length k,
there are at least k−2 fewer instances where
mσr=mσr(i−1)
when compared to instances where
mer(i)=mer(i−1).
This is because for all i∈[a+1,a+k−1],
mσr(i)=mσr(i−1).
Hence for each locally maximal alternating substring
mσr(a,a+k−1), then
∣D(mσr)∣ is decreased by at least
(k−2)(n−2−(r−1))≥(k−2)(r−1) when compared to
∣D(mer)∣.
Meanwhile, ∣E(mσr)∣
is increased by the same
locally maximal alternating substring by at most
(k−2)((k−2)/4) by Lemma 11.
However, since k≤2r, we have
(k−2)((k−2)/4)≤(k−2)(r−1)/2, which is
of course less than (k−2)(r−1).
∎
Lemma 14, along with
Proposition 6 implies that
the r-regular Ulam sphere size of radius t=1
is bounded (tightly) below by (1+(n−1)(n/r−1)).
This in turn implies the
following sphere-packing type upper bound on
any single error-correcting code.
Lemma 15**.**
Let C be a single-error correcting
MPC∘(n,r) code.
Then
[TABLE]
Proof.
Let C be a single-error correcting
MPC∘(n,r) code.
A standard sphere-packing bound argument
implies that
∣C∣r≤(r!)n/r(σ∈Snmin(∣S(mσ,1)∣)n!.
The remainder of the proof follows from
Proposition 6 and Lemma 14.
∎
We have seen that ∣S(mσr)∣ is minimized
when σ=e. We now discuss the choice of center
yielding the maximal sphere size.
Let
ω∈Sn be defined as follows:
ω(i):=((i−1)mod(n/r))r+⌈ir/n⌉
and ω:=[ω(1),ω∗(2),…ω∗(n)].
With this definition, for all i∈[n], we have
mωr(i)=imod(n/r)
For example, if r=3 and n=12, then
ω=[1,4,7,10,2,5,8,11,3,6,9,12] and
mωr=(1,2,3,4,1,2,3,4,1,2,3,4).
We can use Theorem 13 to calculate
∣S(mωr,1)∣, and then
show that this is the
maximal r-regular Ulam sphere size
(except for the case when n/r=2).
Lemma 16**.**
Let
n/r=2.
Then
∣S(mσr,1)∣≤∣S(mωr,1)∣
and if n/r>2, then
∣S(mωr,1)∣=(1+(n−1)2)−(r−1)n.
Proof.
Assume n/r=2.
First notice that if n/r=1 then for any π∈Sn
(including π=ω), the sphere
S(mπr,1)
contains exactly one
element (the tuple of the form (1,1,…,1)).
Hence the lemma holds trivially in this instance.
Next, assume that n/r>2.
We will first prove that ∣S(mωr,1)∣=(1+(n−1)2)−(r−1)n.
Since n/r>2, it is clear that
mωr contains no
alternating subsequences of length greater than 2.
Thus by Lemma 9,
E(mωr)=∅
and therefore by Theorem 13,
∣S(mωr,1)∣=1+(n−1)2−∣D(mωr)∣.
Since there does not exist i∈[n] such that
mωr(i)=mωr(i−1),
we have ∣D(mωr)∣=(r−1)n,
completing the proof of the first statement in the lemma.
We now prove that
∣S(mσr,1)∣≤∣S(mωr,1)∣.
Recall that
∣D(mσr)∣ is equal to
(n−2) times the number of
i∈[n] such that
mσr(i)=mσr(i−1) plus
(r−1) times the number of i∈[n] such that
mσr(i)=mσr(i−1).
But n/r>2 implies that r−1<n−2, which implies
π∈Snmin∣D(mπr,1)∣=(r−1)n. Therefore
[TABLE]
∎
Extending the concept of perfect permutation codes
discussed in [10], we define a
perfect multipermutation code.
Let C be an MPC(n,r) code. Then C is a perfect t-error
correcting code if and only if for all
σ∈Sn, there exists a unique
mcr∈Mr(C)
such that mσr∈S(mcr,t).
We call such C a **perfect t-error correcting **
MPC(n,r).
With this definition the upper bound of lemma 16
implies a lower bound on a perfect single-error correcting
MPC(n,r).
Lemma 17**.**
Let n/r=2 and let C be a perfect single-error
correcting MPC(n,r).Then
[TABLE]
Proof.
Suppose n/r=2 and C is a perfect single-error
correcting MPC(n,r).
Then c∈C∑∣S(mcr,1)∣=(r!)n/rn!.
This means
A more general lower bound is easily
obtained by applying Lemma 16 with a
standard Gilbert-Varshamov bound argument.
In the lemma statement, C is an MPC∘(n,r,d)
if and only if C is an MPC(n,r) such that
σ,π∈C,σ=πmind∘r(σ,π)=d.
Lemma 18**.**
Let n/r=2 and C be an
MPC∘(n,r,d) code of maximal cardinality. Then
[TABLE]
Proof.
Suppose that n/r=2 and that
C is an MPC∘(n,r,d) code
of maximal cardinality.
For all σ∈Sn,
there exists
c∈C such that
d∘r(σ,c)≤d−1.
Otherwise, we could add
σ∈/C (and its entire equivalence class
Rr(σ)) to C while maintaining
a minimum distance of d, contradicting
the assumption that ∣C∣r is maximal.
Therefore
c∈C⋃S(mcr,d−1)=Mr(Sn).
This in turn implies that
[TABLE]
Of course, the left hand side of the
above inequality is less than or equal to
(∣C∣r)⋅(c∈Cmax(∣S(mcr,d−1)∣)).
Hence Lemma 16 implies that
[TABLE]
so the
conclusion holds.
∎
VII Conclusion
This paper compared the Ulam metric for the permutation
and multipermutation cases, providing a simplification of
the r-regular Ulam metric. The surprising fact that
r-regular Ulam sphere sizes differ depending upon the
center was also shown. New methods for calculating the
size of r-regular Ulam sphere sizes were provided, first
using Young Tableaux for spheres of any radius centered at
mer. Another method used duplicate
translocation sets to calculate sphere sizes for a radius
of t=1 for any center. Resulting bounds on Code size
were also provided. Many open questions remain, including
the existence of perfect codes, sphere size calculation
methods for more general parameters, and tighter
bounds on code size.
Acknowledgment
This paper is partially supported by
KAKENHI 16K12391 and 26289116.
Bibliography14
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] A. Barg and A. Mazumdar, “Codes in Permutations and Error Correction for Rank Modulation,” in Proc. IEEE ISIT, pp 854-858, 2010
2[2] I.F. Blake, G. Cohen, and M. Deza, ”Coding with permutations,” in Inform. Contr., vol. 43, pp. 1-19,1979.
3[3] F. Farnoud, V. Skachek, and O. Milenkovic, “Error-correction in flash memories via codes in the Ulam metric,” in IEEE Trans. Inf. Theory, Vol. 59 No. 5, pp. 3003-3030, 2013.
4[4] F. Farnoud and O. Milenkovic, “Multipermutation Codes in the Ulam Metric,” in Proc. IEEE ISIT, pp. 2754-2758, 2014.
5[5] J. Frame, G. Robinson, and R. Thrall, “The hook graphs of the symmetric group,” in Canad. J. Math., 6, pp. 316-324, 1954.
6[6] W. Fulton, “Young Tableaux,” London Mathematical Society Student Texts. Cambridge University Press, 1997.
7[7] F. Gologlu, J. Lember, A. Riet, V. Skachek, “New Bounds for Permutation Codes in Ulam Metric,” in Proc. IEEE ISIT, pp. 1726-1730, 2015.
8[8] A. Jiang, R. Mateescu, M. Schwartz, J. Bruck, “Rank Modulation for Flash Memories,” in Proc. IEEE ISIT, pp. 1731-1735, 2008.