On the Sparseness of Certain MRD Codes
Heide Gluesing-Luerssen

TL;DR
This paper investigates the rarity of certain MRD codes over finite fields, showing that as the field size grows, these codes become increasingly sparse within the space of all rank-metric codes.
Contribution
It provides the first known example where MRD codes are proven to be sparse, using a novel reduction to bases and classical semifield results.
Findings
MRD codes are sparse for parameters [3×3;3] as q increases
Proportion of MRD codes tends to 0 when q→∞
Reduction to basis spaces and semifield theory underpins the analysis
Abstract
We determine the proportion of -MRD codes over within the space of all -dimensional -rank-metric codes over the same field. This shows that for these parameters MRD codes are sparse in the sense that the proportion tends to as . This is so far the only parameter case for which MRD codes are known to be sparse. The computation is accomplished by reducing the space of all such rank-metric codes to a space of specific bases and subsequently making use of a result by Menichetti (1973) on 3-dimensional semifields.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCoding theory and cryptography · Cooperative Communication and Network Coding · graph theory and CDMA systems
On the Sparseness of Certain MRD Codes
Heide Gluesing-Luerssen111HGL was partially supported by the grant #422479 from the Simons Foundation. Department of Mathematics, University of Kentucky, Lexington KY 40506-0027, USA; [email protected].
Abstract: We determine the proportion of -MRD codes over within the space of all -dimensional -rank-metric codes over the same field. This shows that for these parameters MRD codes are sparse in the sense that the proportion tends to [math] as . This is so far the only parameter case for which MRD codes are known to be sparse. The computation is accomplished by reducing the space of all such rank-metric codes to a space of specific bases and subsequently making use of a result by Menichetti (1973) on 3-dimensional semifields.
Keywords: Rank-metric codes, MRD codes, semifields.
MSC: 11T71, 94B60, 17A35, 12K10.
1 Introduction
Rank-metric codes play a crucial role for error correction in single-source networks with adversarial noise; see [22, 21]. For this reason they have been studied in great detail during the last decade (and even longer) in the context of coding theory. The main focus has been on MRD codes, that is, rank-metric codes with the largest possible dimension for the given rank distance. Such codes do exist for all possible parameters, and constructions have been provided independently by Delsarte [5] and later Gabidulin [9]. The latter codes are even -linear when identifying the ambient matrix space with . Further constructions of MRD codes (not -linear in general) have been found and studied by Sheekey [19], Cossidente et al. [3], Csajbók et al. [4], Lunardon et al. [15], Trombetti/Zhou [23] and others.
In this paper we focus on the proportion of -MRD codes, that is, the number of such MRD codes compared to the number of all -rank-metric codes over of the same dimension. Bounds on this proportion have been given already by Byrne/Ravagnani [2] and Antrobus/Gluesing-Luerssen [1]. More precisely, in [2, Cor. 6.2] it is shown that, for all parameter sets and all field sizes, this proportion is always at most 1/2, while in [1, Thm. VII.6] a much smaller upper bound is derived via entirely different methods. Both results leave the question open whether for some parameters the asymptotic proportion is zero as , in which case the family of -MRD codes is called sparse. Besides the trivial case , the only known case for the exact asymptotic proportion is . In that case the limit is nonzero and has been determined in [1, Cor. VII.5]. Further details are provided in the next section after introducing the necessary notation and terminology.
We will explicitly compute the proportion of -MRD codes. We will see that this proportion approaches [math] as , and thus -MRD codes are sparse. This is the only parameter case known so far for which MRD codes are sparse. The main part of our proof consists of using a characterization of -dimensional semifields over finite fields by Menichetti [16]. This is also the reason why our results only apply to the parameter set .
The proportion will be obtained in two steps, formulated in Theorem 2.3 and Theorem 2.4 in the next section. In Theorem 2.3 (proven in Section 3) we reduce the given problem to the problem of counting a certain set of matrix triples. This set consists of bases of MRD codes, and in fact all MRD codes up to isometry are covered by this set. However, the set may also contain isometric codes, but since we aim at counting all codes there is no need or use in studying isometry (and in fact, this has been done by Menichetti in [17]). In Theorem 2.4 (established in Section 4) we observe that the set of matrix triples has been parametrized by Menichetti in [16]. A closer look at those results will provide us with the exact count. While we formulate the results and proofs in Sections 2 – 4 in a purely matrix-theoretical language, the main result in Section 4, Theorem 4.2, by Menichetti is best understood in the language of semifields. For this reason, we will give in Section 5 an account of the main ideas in [16] leading to Theorem 4.2. We believe it is worth to present that beautiful material in a slightly reorganized and more extended form that suits the current interest in MRD codes. Among other things, this will allow us to detect the commutative non-associative semifields within our set of matrix triples.
2 Preliminaries and Statement of Main Results
Throughout, denotes a finite field of order . We endow the matrix space with the rank metric, defined as d(A,B)=\mbox{{\rm rk}\,}(A-B). A rank-metric code is an -linear subspace of the metric space . The rank distance of a rank-metric code is defined as d_{R}(C):=\min\{\mbox{{\rm rk}\,}(M)\mid M\in C\backslash\{0\}\}. An -code is a rank-metric code in of dimension and rank distance . Assuming (without loss of generality) that , the Singleton bound tells us that the dimension of an -code is at most , and codes attaining this bound are called MRD codes (maximum rank-distance codes), denoted as -MRD codes. It is well known that MRD codes exist for all parameters and all fields . For further details on rank-metric codes we refer to the vast literature.
In this paper we focus on the proportion of MRD codes in the following sense.
Definition 2.1**.**
Let . Consider the spaces
[TABLE]
Thus is the set of -MRD codes. The fraction is called the proportion of MRD-codes (within the space of all -dimensional subspaces of ).
- (a)
The family of -MRD codes is called sparse if .
- (b)
The family of -MRD codes is called generic if .
The above defined proportion is related to the probability that randomly and independently chosen matrices in (endowed with the uniform distribution) generate an MRD code. Indeed, in [1, Prop. VI.2] it is shown that the two quantities differ by a factor which tends to as . See also Remark 3.2 in the next section.
We summarize the results known so far about the proportion of MRD codes.
Remark 2.2**.**
- (a)
It follows from [1, Thm. VII.6] that the asymptotic proportion satisfies
[TABLE]
(if the limit exists). Hence -MRD codes are not generic if . For , the full matrix space is the only -MRD code and thus trivially generic.
- (b)
In [1, Cor. VII.5] it has been shown that for the proportion attains the upper bound given in (a). Hence, in this case MRD codes are neither sparse nor generic.
- (c)
Certain maximal Ferrers diagram codes are generic; see [1, Thm. VI.8]. These are rank-metric codes where all matrices are zero in prespecified positions and satisfy a generalized Singleton bound. For instance, genericity holds for upper triangular matrices. We refer to [1] for further details.
- (d)
Identifying with the field extension we obtain an -vector space isomorphism between the matrix space and . This allows us to consider rank-metric codes in the latter space, in which case one may also require the codes to be -linear. In [18] it is shown that -linear MRD codes are generic within the class of all -linear rank-metric codes of the same dimension.
In this paper we add another case to the above list. It is in stark contrast to those earlier results. Indeed, we will prove that -MRD codes are sparse in the sense of Definition 2.1.
From now on we restrict ourselves to and set
[TABLE]
We will prove the two theorems below. The first result reduces the count of to determining the cardinality of the set defined below, which consists of very specific matrix triples generating MRD codes. The second result provides the actual cardinality of .
The following notation is needed. Set
[TABLE]
and for define its companion matrix as
[TABLE]
For any matrix we let be its entry at position . Finally, denotes the subspace of generated by the listed matrices.
Theorem 2.3**.**
Define the set
[TABLE]
Then the proportion of all -MRD codes is given by
[TABLE]
From the fraction it is clear that if is at most of order . Our second result shows that is actually of order .
Theorem 2.4**.**
The set has cardinality . As a consequence, the proportion of -MRD codes is
[TABLE]
and . That is, -MRD codes are sparse.
Note that is simply the number of -dimensional subspaces in , thus (the Gaussian coefficient). As a consequence, we can determine the absolute number of -MRD codes. One obtains for instance , and , which coincides with the numerical results by Sheekey [20, Sec. 5].
Theorem 2.3 will be proven in the next section, while Theorem 2.4 will be established in Section 4.
3 Reduction Step: Proof of Theorem 2.3
In order to prove Theorem 2.3 we replace the subspaces in and , see (2.1), by ordered bases. This will be done in several steps.
Proposition 3.1**.**
Define the spaces
[TABLE]
Then
[TABLE]
As a consequence, if the limit exists.
Proof.
Consider the matrix spaces
[TABLE]
Every code in (and thus in ) has ordered bases. Since the matrix triples in and form all ordered bases of the codes in and , respectively, we obtain . Next, we clearly have
[TABLE]
In order to relate the cardinalities of and , consider the map
[TABLE]
This map is well-defined because every matrix of a triple is invertible thanks to d_{R}\mbox{\langle{A_{1},A_{2},A_{3}}\rangle}=3, and clearly d_{R}\mbox{\langle{A_{1},A_{2},A_{3}}\rangle}=d_{R}\mbox{\langle{I,A_{1}^{-1}A_{2},A_{1}^{-1}A_{3}}\rangle}. Furthermore, is surjective and each fiber is of the form . All of this shows that . Using (3.1) we obtain
[TABLE]
Remark 3.2**.**
The last result can be interpreted in terms of the probability that we obtain an MRD code when drawing random matrices. Indeed, fix the probability distribution on where all entries of a matrix are independent and uniformly distributed. That is, for all and all . Let
[TABLE]
where are randomly chosen matrices, i.e., chosen independently and randomly according to the above distribution. Then the identity of the previous proof tells us that is exactly the proportion of MRD codes. Furthermore, Proposition 3.1 states that up to the factor , this proportion equals the probability P_{q}^{\prime}:=\mbox{\rm Prob}\,\big{(}\mbox{\langle{I,A_{2},A_{3}}\rangle}\text{ is a [3\times 3;3]-MRD code}\big{)} when choosing at random. The factor is less than , i.e., , which simply reflects the fact that in the latter drawing one matrix is already invertible. This probabilistic interpretation can be applied accordingly to all further results of this section.
So far we have reduced the problem of determining the proportion to determining the cardinality . In the next step we will make use of a similarity action in order to bring the matrix of the triples into companion form. Using the description
[TABLE]
and the rational canonical form for matrices in we observe the following fact.
Remark 3.3**.**
Denote by the characteristic polynomial of the matrix . Then for every triple the characteristic polynomial is irreducible and thus is similar to the companion matrix ; see (2.3).
We need the following result about the stabilizer under the conjugation action. The first part about can be found in [10], which in turn goes back to [8, § 217], while the second part is an obvious consequence. One should also note that for every nonzero the matrix appearing below is indeed invertible. This follows from the fact that is a field whenever is irreducible. In other words, it is a -MRD code.
Proposition 3.4** ([10, Cor. 2 and Cor. 3]).**
Let be as in (2.2) and set . Denote by the stabilizer of w.r.t. the conjugation action
[TABLE]
Then for any companion matrix we have
[TABLE]
As a consequence, for all .
In our next step toward the proof of Theorem 2.3 we will transform the triples in into the form via an according group action. For a precise count of the resulting orbits the following lemma is needed.
Lemma 3.5**.**
Let and X\in\mathbb{F}_{q}^{3\times 3}\setminus\mbox{\langle{I,C_{f},C_{f}^{2}}\rangle}. Furthermore, let . Then
[TABLE]
Proof.
Write and . Then the matrices and are given by
[TABLE]
and thus is of the form
[TABLE]
The assumptions imply and .
Let now where s=(s_{1},s_{2},s_{3})\mbox{!{}^{\sf T}}\in{\mathbb{F}}_{q}^{3}\setminus 0; see Proposition 3.4. Hence
[TABLE]
We have to show that is a multiple of the identity matrix, which means . In other words we have to prove that the solution space of the linear system is given by . This is now a matter of basic linear algebra. Indeed, the identity yields a system of equations taking the form
[TABLE]
where
[TABLE]
Thus we have to show that the last two columns of are linearly independent. This is clearly the case if either \mbox{{\rm rk}\,}(Z)=2 or . Hence it remains to consider the case
[TABLE]
Now the last two columns of take the form
[TABLE]
One computes
[TABLE]
where
[TABLE]
Furthermore,
[TABLE]
where . Since and is irreducible, we conclude and thus \mbox{{\rm rk}\,}(\hat{M}(t))=3 for all . Using that , we arrive at \mbox{{\rm rk}\,}(\tilde{M}(t))=2 for all . All of this shows that (3.3) has solution space , and this concludes the proof. ∎
Now we are ready to prove the following result, which lets us reduce the count of MRD codes to those where one basis matrix is in companion form. Recall the sets and from Proposition 3.1.
Theorem 3.6**.**
Consider the matrix sets
[TABLE]
Then
[TABLE]
Proof.
Define the group action
[TABLE]
For every triple , the orbit \mbox{{\rm Orb}{\tau}}(I,A_{1},A_{2}) contains a triple from because is similar to , where . In other words, \mbox{{\rm Orb}{\tau}}(I,A_{1},A_{2})=\mbox{{\rm Orb}_{\tau}}(I,C_{f},Z) for some . This implies
[TABLE]
It remains to determine the sizes of the orbits \mbox{{\rm Orb}{\tau}}(I,C_{f},Z) for as well as which of these orbits are distinct. Not surprisingly, the orbits behave differently depending on whether or not Z\in\mbox{\langle{I,C{f},C_{f}^{2}}\rangle}. We thus define
[TABLE]
Using the orbit-stabilizer theorem along with Proposition 3.4 and Lemma 3.5 we obtain
[TABLE]
As for the number of distinct orbits, we clearly have \mbox{{\rm Orb}{\tau}}(I,C_{f},Z)\neq\mbox{{\rm Orb}{\tau}}(I,C_{g},Z^{\prime}) whenever . Thus it remains to characterize when \mbox{{\rm Orb}{\tau}}(I,C_{f},Z)=\mbox{{\rm Orb}{\tau}}(I,C_{f},Z^{\prime}). To this end fix some and consider the conjugation action
[TABLE]
Then Lemma 3.5 tells us that the stabilizers of this action are of the form
[TABLE]
Hence we conclude that
[TABLE]
whereas
[TABLE]
Now we can determine the cardinality of . Recalling that there are monic irreducible polynomials of degree over (see [14, Thm. 3.25]) we have
[TABLE]
The first identity follows from the fact that for the set the matrix is any element in \mathbb{F}_{q}^{3\times 3}\setminus\mbox{\langle{I,C_{f}}\rangle}, while for the set it is any element in \mbox{\langle{I,C_{f},C_{f}^{2}}\rangle}\setminus\mbox{\langle{I,C_{f}}\rangle}. Now (3.6), (3.7), and (3.5) along with lead to
[TABLE]
Notice further that . Thus thanks to (3.4) we have
[TABLE]
Using the cardinalities from (3.1) and from (3.8) we finally obtain the desired identity
[TABLE]
Now we can prove Theorem 2.3. The difference between the triples in the sets and is the shape of the first column of , which we will now take into account.
Proof of Theorem 2.3. Consider the sets
[TABLE]
Note that in the set , the matrix may have a zero entry at position as well, whereas this is not possible for the set thanks to the MRD property. This also means that is obtained from by normalizing the entry at position . Indeed, using the obvious fact that for any the triple generates an MRD code iff generates an MRD code, we obtain
[TABLE]
Furthermore, by the form of the first columns, every nonzero matrix with and every gives rise to a linearly independent triple . Using (again) that there are monic, irreducible, cubic polynomials over and that has free entries (not all of which are zero), we conclude
[TABLE]
It remains to relate the sets and . This is easily achieved by the surjective map
[TABLE]
Clearly, every fiber has cardinality . Since we obtain and . With the aid of (3.10) and (3.9) we conclude
[TABLE]
Now the result follows from Proposition 3.1 and Theorem 3.6.
4 Parametrization of the set : Proof of Theorem 2.4
In this section we determine the precise cardinality of the set defined in Theorem 2.3. It will follow from a detailed study of the papers [16, 17] by Menichetti on three-dimensional division algebras over finite fields. While those papers [16, 17] had the goal to count the isomorphism classes of these division algebras, we need to make sure that we count all division algebras, i.e. determine the cardinality of . This makes it necessary to go into the details of the arguments in [16].
We present the results in the matrix-theoretical form needed for determining and without the notion of semifields. In the next section we will elaborate on the connection to semifields and outline the ideas of [16, 17]. This will also provide us with a sketch of the proof of Theorem 4.2 below. Recall the set from Theorem 2.3, which in short we may write as
[TABLE]
This indicates that may be of order (in the worst case). However, in order to establish our ultimate goal, , Theorem 2.3 requires us to show that is of order at most . In this section we will prove Theorem 2.4, which shows that is actually of order .
From now on we use the standard notation k\mbox{{}^{[i]}}:=k^{q^{i}} for . Thus k=k\mbox{{}^{[0]}}. Recall that k\mapsto k\mbox{{}^{[i]}} is an -linear automorphism of . We need the following maps.
Definition 4.1**.**
Define the maps
[TABLE]
Obviously, is the trace map of the field extension and the norm. More generally, are the symmetric functions evaluated at k\mbox{{}^{[0]}},k\mbox{{}^{[1]}},k\mbox{{}^{[2]}} and thus
[TABLE]
This also implies for all
[TABLE]
Furthermore, we need the map
[TABLE]
For further use we record
[TABLE]
Now we are ready to present the crucial step in our count. The following result has been established by Menichetti [16] and amounts to a parametrization of the set . In the next section we will outline [16] and sketch a proof of this result. This will also provide some insight into the special cases and \hat{k}\in\{k\mbox{{}^{[1]}},k\mbox{{}^{[2]}}\}. For instance, Proposition 5.9 will show that leads to the only non-associative commutative semifield for the given (provided that are linearly independent).
Theorem 4.2** ([16, Prop. 10]).**
For define
[TABLE]
Then the set from (4.1) is given by
[TABLE]
Let us briefly comment on this result. Given as in (4.1). Then it is clear from (4.2) that for the identity (C_{f},Z)=\big{(}\Sigma_{1}(k,\hat{k}),\Sigma_{2}(k,\hat{k})\big{)} to be true, the parameters and must be roots of the irreducible polynomials and in , respectively. The main part of the proof consists of showing that the last columns of and coincide, and that the MRD property translates into the linear independence of . Details will be given in the next section.
Note that the result above shows that is of order at most . In order to determine the exact cardinality, we need to investigate when two pairs give rise to the same matrix pair . One easily verifies that the linear independence of implies that , and therefore one may restrict oneself to that situation. This is dealt with in the next proposition, part of which has been proven in [16, Lem. 12.1]. For sake of completeness we include a proof of all statements below.
Proposition 4.3**.**
Let . Then
[TABLE]
Furthermore,
- (1)
Let \hat{k}\not\in\{k\mbox{{}^{[1]}},k\mbox{{}^{[2]}}\}. Then
[TABLE]
- (2)
Let \hat{k}=k\mbox{{}^{[n]}} for some . Then
[TABLE]
Proof.
The first implication follows immediately from (4.3) and the definition of the matrices . The additional backwards implication in (2), where \hat{k}=k\mbox{{}^{[n]}} and (\ell,\,\hat{\ell})=(k\mbox{{}^{[n+r]}},\,k\mbox{{}^{[r]}}) follows again from (4.3) together with the fact that in this case \ell+\hat{\ell}=k\mbox{{}^{[n+r]}}+k\mbox{{}^{[r]}}=(\hat{k}+k)\mbox{{}^{[r]}} and \ell\hat{\ell}=k\mbox{{}^{[n+r]}}k\mbox{{}^{[r]}}=(\hat{k}k)\mbox{{}^{[r]}}.
Let us now assume \big{(}\Sigma_{1}(k,\hat{k}),\,\Sigma_{2}(k,\hat{k})\big{)}=\big{(}\Sigma_{1}(\ell,\hat{\ell}),\Sigma_{2}(\ell,\hat{\ell})\big{)}. Then
[TABLE]
Indeed: The first two identities are clear from the matrices; for the identity for the product also follows from comparing the matrices, while for it is a consequence of the multiplicativity of ; finally, for the identity for the sum follows again from the matrices, for from the additivity of , and for from the identity , see [16, Lem. 10.3].
With the aid of (4.3) all of this tells us that
[TABLE]
We have to show that . Let us assume . Then by the third identity in (4.6) and by the fourth identity. Since attain only possible values, we conclude . Thus
[TABLE]
This implies (x-k\mbox{{}^{[i]}})(x-\hat{k}\mbox{{}^{[j]}})=(x-k\mbox{{}^{[r]}})(x-\hat{k}\mbox{{}^{[r]}}) and therefore k\mbox{{}^{[i]}},\hat{k}\mbox{{}^{[j]}}\in\{k\mbox{{}^{[r]}},\hat{k}\mbox{{}^{[r]}}\}. Since we thus have
[TABLE]
This in turn implies \hat{k}=k\mbox{{}^{[i+3-r]}}. Now we may argue as follows.
- (i)
If \hat{k}\not\in\{k,k\mbox{{}^{[1]}},k\mbox{{}^{[2]}}\}, then the last identity is a contradiction. Hence in this case we arrive at .
- (ii)
If then (4.7) leads to the contradiction . Thus again by virtue of (4.6). This and (i) establishes Part (1) of the proposition.
- (iii)
Consider now Part (2), thus \hat{k}=k\mbox{{}^{[n]}} for some . The case leads to the first option in the implication. If , then (4.7) yields and . This means (\ell,\hat{\ell})=(k\mbox{{}^{[n+r]}},k\mbox{{}^{[r]}}), which is the second option. ∎
Corollary 4.4**.**
The set has cardinality , where
[TABLE]
Proof.
The summand follows from Proposition 4.3(1) along with the simple fact that, for any , the linear independence of implies the linear independence of 1,k\mbox{{}^{[r]}},\phi(k\mbox{{}^{[r]}},\hat{k}\mbox{{}^{[r]}}). As for the second summand, note first that (4.5) guarantees that 1,k,\phi(k,k\mbox{{}^{[n]}}) are linearly independent for any and . Thus Proposition 4.3(2) leads to the summand . ∎
The following result appears implicitly in [17].
Proposition 4.5**.**
Consider the set from Corollary 4.4. Then . In particular iff .
Proof.
We follow [17, pp. 404]. If are linearly independent, then . Fix any such . Then forms a basis of the -vector space . Write . We want to determine the number of elements such that are linearly independent. By [16, Cor. 10.2] the linear independence of is equivalent to the linear independence of . We determine the coordinate vectors of w.r.t. the basis . Let be the minimal polynomial of . Thus , and . Then
[TABLE]
Hence the coordinate vectors of are the columns of the matrix
[TABLE]
and we need to determine the number of points such that . It is easy to verify that the curve given by
[TABLE]
has exactly solutions ( solutions for and solutions for ). All of this tells us that for every there exist elements such that are linearly independent. Finally, if \hat{k}=k\mbox{{}^{[n]}} for some , then are linearly independent by (4.5). Thus we have to subtract these two elements from our count. This yields the desired result. ∎
Now the proof of Theorem 2.4 follows quickly.
Proof of Theorem 2.4. Since , the stated cardinality of the set follows from Proposition 4.5 and Corollary 4.4. The proportion of -MRD codes is now immediate with Theorem 2.3.
5 Relation to -Dimensional Semifields
In this section we recount the relation between MRD codes and semifields and go on to survey the paper [16] by Menichetti as needed for the proof of Theorem 4.2. This will also provide further insight into the various types of semifields parametrized by as in Theorem 4.2.
Menichetti used this theorem to determine the number of isomorphism classes of the related non-associative semifields, which are in this case -dimensional over . It turns out [17, p. 400] that for the number of isomorphism classes of non-associative semifields is
[TABLE]
Since these cardinalities agree with the number of isomorphism classes of -dimensional twisted fields, determined by Kaplansky [11, p. 77], Menichetti thereby proved that all -dimensional non-associative semifields are twisted fields.
We start by recalling the notion of a semifield.
Definition 5.1**.**
A finite semifield is a triple with a finite set and binary operations and on satisfying the following conditions.
- (i)
is an abelian group,
- (ii)
has no left or right zero divisors, that is: .
- (iii)
Both distributivity laws hold: and .
- (iv)
There exists an identity element: for all .
A semifield without identity element is called a pre-semifield.
It is easy to see that every finite semifield contains a field, say , and is an algebra over . Thus for all and . If is -dimensional over , we call an -dimensional division algebra.
Note that a (pre-) semifield is not necessarily associative or commutative. Clearly, an associative finite semifield is a finite skew field and thus a finite field by Wedderburn’s Theorem, hence commutative. Later in Proposition 5.9 we will encounter instances of commutative non-associative semifields.
It is well-known from finite geometry that -MRD codes are in one-one-correspondence to -dimensional division algebras over . For sake of completeness and further use we provide the correspondence below. Most properties are easy to verify. For details see also [6, p. 220] or [13, p. 134] or other sources on finite geometry.
Proposition 5.2**.**
- (a)
Let generate an -MRD code. Define the linear map {\mathbb{F}}_{q}^{n}\longrightarrow\mbox{\langle{A_{1},\ldots,A_{n}}\rangle},\ x\longmapsto M_{x}, where is the unique matrix in whose first column is . On we define
[TABLE]
where the right hand side is ordinary matrix-vector multiplication. Then is a pre-semifield. It is a semifield, and hence an -dimensional division algebra, iff the code contains the identity matrix.
- (b)
Let be an -dimensional division algebra over , and be a basis of over . For every define the ( -linear) map . Let be the matrix representation of w.r.t. to the basis . Then the subspace is an -MRD code containing the identity matrix. The matrices are the structure matrices of .
Note that the matrices in (a) are well-defined. Indeed, since every nonzero linear combination has rank , the first columns of the matrices in are distinct and thus assume each vector of exactly once. More generally, the matrix could be replaced by a matrix , where is any isomorphism between and . In that case multiplication would read as .
The above relationship becomes particularly nice for the MRD codes generated by the matrix triples in the set from Theorem 2.3. Denote by the standard basis vectors.
Definition 5.3**.**
Let and
[TABLE]
We denote the associated -dimensional division algebra by . For the matrix from Proposition 5.2(a) is given by . Furthermore, is the identity of and
[TABLE]
Note in particular (e_{2}\circ e_{2})\circ e_{2}=(z_{1},z_{2},z_{3})\mbox{!{}^{\sf T}}, while e_{2}\circ(e_{2}\circ e_{2})=(a,b,c)\mbox{!{}^{\sf T}}, hence we have no well-defined notion of third (or higher) powers unless the semifield is associative. The latter as well as commutativity are easily characterized.
Proposition 5.4**.**
Let and be as in (5.1).
- (a)
* is commutative iff .*
- (b)
* is associative iff , in which case is the field .*
Proof.
(a) is immediate because the products between the standard basis vectors fully determine the multiplication in .
(b) We compute
[TABLE]
Suppose that is associative. Comparing the last two expressions we obtain and
[TABLE]
Note that . Hence and further
[TABLE]
Conversely, if , then is the field , and thus closed under multiplication. As a consequence, the products are in for any . Since the first column of is given by , we conclude that the map ({\mathbb{F}}_{q}^{3},+,\circ)\longrightarrow\mbox{\langle{I,C_{f},C_{f}^{2}}\rangle},\ x\longmapsto M_{x} is multiplicative, i.e., an isomorphism of semifields, and thus is associative. ∎
In the relation between semifields and MRD codes we made use of the multiplication maps , based on the fixed left factor . It will be crucial to also consider the multiplication maps with given right factors. This leads to a dual matrix triple (not to be confused with the trace-dual of MRD codes).
Proposition 5.5** (see also [16, Prop. 5]).**
Consider the division algebra from Definition 5.3 and the -linear maps
[TABLE]
Then the matrix representations of and w.r.t. the standard basis are and , respectively, and the matrix representations of and are
[TABLE]
respectively (using column vector notation throughout). As a consequence, is an MRD code as well. Thus and is irreducible. We call the dual triple to .
Proof.
The matrix representations of and are clear from the definition of . For the matrix representation of notice that its columns are the coordinate vectors of , and . This results in the matrix . Similarly one verifies . The MRD property of follows from the void of zero divisors in . The latter then also implies that has no roots in , thus is irreducible. ∎
Note that, by definition, and and all . Strictly speaking, the dual triple are the structure matrices of the opposite semifield defined by . This leads to and and avoids changing the order of the factors.
For the rest of this section we will also have to consider values for in the extension field . We denote by and the projective planes over and , respectively. Then , and we call the points in the -rational points.
One easily verifies [16, Eq. (3)] that a triple and its dual triple satisfy
[TABLE]
We are now ready to give an outline of the proof of Theorem 4.2. For the next results we fix the following setting.
Notation 5.6**.**
We fix a triple with dual triple and where are as in (5.1) and (5.2).
- (i)
Denote by \alpha\mbox{{}^{[i]}},\,\beta\mbox{{}^{[i]}},\hat{\alpha}\mbox{{}^{[i]}}\in{\mathbb{F}}_{q^{3}},\,i=0,1,2, the eigenvalues of , respectively. The eigenvalues do indeed appear in conjugate triples because the characteristic polynomial of every nonzero matrix in a -MRD code is irreducible.
- (ii)
Let be an eigenvector of w.r.t. the eigenvalue \hat{\alpha}\mbox{{}^{[0]}}. Then v\mbox{{}^{[i]}}:=(v_{1}^{[i]},v_{2}^{[i]},v_{3}^{[i]}) is an eigenvector of w.r.t. \hat{\alpha}\mbox{{}^{[i]}} for .
- (iii)
Let be the curve in given by the equation , where
[TABLE]
We have the following simple properties.
Proposition 5.7** ([16, Props. 3, 6, 7]).**
Consider the setting from Notation 5.6.
- (a)
The curve has no -rational points.
- (b)
The points w\mbox{{}^{[i]}}:=(-\alpha\mbox{{}^{[i]}},1,0) and \tilde{w}\mbox{{}^{[i]}}=(-\beta\mbox{{}^{[i]}},0,1),\,i=0,1,2, are the intersection points of with the curve given by and , respectively.
- (c)
The points v\mbox{{}^{[i]}},\,i=0,1,2, are on . Furthermore, .
Proof.
(a) is clear because every nontrivial linear combination with coefficients in is invertible thanks to the MRD property.
(b) is obvious and the first part of (c) follows from (5.3). For the second statement in (c) consider C_{g}(v_{1},v_{2},v_{3})\mbox{!{}^{\sf T}}=(z_{1}v_{3},\,v_{1}+z_{2}v_{3},\,v_{2}+z_{3}v_{3})\mbox{!{}^{\sf T}}=\hat{\alpha}(v_{1},v_{2},v_{3})\mbox{!{}^{\sf T}}. If , then also , a contradiction. If , then , which is a contradiction because is in whereas is not. Thus . ∎
The next result is a crucial step for the proof of Theorem 4.2 and involves a geometric argument.
Proposition 5.8** ([16, Lem. 9.1, Prop. 9]).**
Consider the setting from Notation 5.6. The curve is the union of three conjugate lines (with coefficients in ). More precisely,
[TABLE]
Sketch of Proof. (See [16]) The polynomial has coefficients in . If was irreducible in , then the Hasse-Weil inequality , where is the number of -rational points and the genus of , would imply , contradicting Proposition 5.7(a). Hence is reducible in . Suppose has a quadratic irreducible factor, and thus also a linear factor. Since is invariant under the Frobenius homomorphism, extended coefficientwise to , the same is true for these irreducible factors (up to a constant factor). Hence the factors are in , in contradiction to Proposition 5.7(a). Thus factors into (mutually conjugate) lines. Now we consider the points on given in Proposition 5.7(b) and argue as follows.
- (i)
w\mbox{{}^{[0]}},w\mbox{{}^{[1]}},w\mbox{{}^{[2]}} are on the line , and this line is clearly not a factor of thanks to Proposition 5.7(a).
- (ii)
\tilde{w}\mbox{{}^{[0]}},\tilde{w}\mbox{{}^{[1]}},\tilde{w}\mbox{{}^{[2]}} are on the line , which is also not a factor.
- (iii)
Since is the union of three lines, each w\mbox{{}^{[i]}} must be on the same line as a point \tilde{w}\mbox{{}^{[j]}}. After appropriate re-indexing this results in the three lines given in the statement.
Proof of Theorem 4.2. (See [16, pp. 291–293]) We have to establish the stated identity for the set .
“” Let with dual triple and where are as in (5.1) and (5.2). Recall the map from (4.4). We have to show that there exist such that are linearly independent and . Recall that and are the companion matrices of the irreducible polynomials and , respectively. Let k\mbox{{}^{[i]}} and \hat{k}\mbox{{}^{[i]}},\,i=0,1,2, be the roots of and in , respectively. Furthermore, let \beta\mbox{{}^{[i]}}\in{\mathbb{F}}_{q^{3}},\,i=0,1,2, be the eigenvalues of . Now we are in the setting of Notation 5.6 with \alpha\mbox{{}^{[i]}}=k\mbox{{}^{[i]}} and \hat{\alpha}\mbox{{}^{[i]}}=\hat{k}\mbox{{}^{[i]}}. Thanks to (4.2) we have
[TABLE]
In particular . Since \hat{k}\mbox{{}^{[i]}} are the eigenvalues of , the matrices
[TABLE]
are singular. Using g(\hat{k}\mbox{{}^{[i]}})=0 one checks that
[TABLE]
is an eigenvector of to the eigenvalue \hat{k}\mbox{{}^{[i]}}. By Proposition 5.7(c), the points v\mbox{{}^{[i]}} are on the curve given by the equation (see Notation 5.6). Furthermore, thanks to Proposition 5.8 this polynomial factors over into
[TABLE]
Without loss of generality let v=v\mbox{{}^{[0]}} be on the line . The point is also on the line
[TABLE]
because thanks to (5.4). This implies . Since \phi(k\mbox{{}^{[i]}},\hat{k}\mbox{{}^{[i]}})=\phi(k,\hat{k})\mbox{{}^{[i]}} we thus have
[TABLE]
Thanks to Proposition 5.7(a) the curve has no points in . But this means that are linearly independent over .
Consider now
[TABLE]
Since is similar to \text{diag}(\beta\mbox{{}^{[0]}},\beta\mbox{{}^{[1]}},\beta\mbox{{}^{[2]}}), we obtain with the aid of (4.2) and (5.4)
[TABLE]
These are exactly the identities at the top of [16, p. 292]222In [16] the quantities and play the role of our and , respectively, for , and is our . In the formula for in the middle of p. 292 some terms are missing: In the notation of [16] it has to read ., and extended use of identities for the symmetric functions, derived in [16, Lem. 10.3], leads to
[TABLE]
see [16, pp. 292]. This means that , as desired.
“” We have to show that, whenever are linearly independent, generates an MRD code. This can be checked in two ways. On the one hand, a very tedious computation (with the help of a CAS) shows that
[TABLE]
Since are by assumption linearly independent over , the linear forms on the right hand side have no nonzero roots in and thus the triple generates an MRD code. An alternative proof is given in [16] where a geometric argument is used in combination with more general versions of Propositions 5.5 – 5.8. Those versions apply to more general triples , where the resulting -dimensional algebras may have zero divisors, and lead to a characterization of the void of zero divisors.
We close this section with rewriting the characterizations of commutativity and associativity in Proposition 5.4 in terms of the parameters from Theorem 4.2.
Proposition 5.9** ([16, Prop. 11]).**
Consider as in Theorem 4.2. Thus are linearly independent. Let .
- (a)
* is commutative iff \hat{k}=k\mbox{{}^{[i]}} for some . The case only occurs if . For \hat{k}\in\{k\mbox{{}^{[1]}},\,k\mbox{{}^{[2]}}\} we have*
[TABLE]
whereas for we have
[TABLE]
- (b)
* is associative (and thus commutative) iff \hat{k}=k\mbox{{}^{[i]}} for some .*
The case , describing the unique commutative, non-associative semifield for a given (in odd characteristic), appears already in [7, p. 374] by Dickson. The case \hat{k}\in\{k\mbox{{}^{[1]}},k\mbox{{}^{[2]}}\}, given by the set in Corollary 4.4, parametrizes the semifields that are isomorphic to . On the other hand, the set of the same corollary parameterizes the proper semifields (i.e., those that are not a field), including the commutative ones. Proposition 4.5 thus reflects the well-known fact [12, p. 208] that there are no proper semifields of order .
Finally note that Part (a) above states that \Sigma_{j}(k,k\mbox{{}^{[1]}})=\Sigma_{j}(k,k\mbox{{}^{[2]}}) for . This agrees with the second option in Proposition 4.3(2).
Proof.
(a) Using Proposition 5.4(a) and (4.2) we obtain that is commutative iff the minimal polynomials of and are identical. This is the case iff \hat{k}=k\mbox{{}^{[i]}} for some . It remains to see for which of these cases are linearly independent. We certainly need . From (4.5) we know that the elements are linearly independent if \hat{k}\in\{k\mbox{{}^{[1]}},k\mbox{{}^{[2]}}\}. If , then . Thus, if , then are linearly independent because and . If, however, , then and thus are linearly dependent. The rest of (a) follows from the very definition of the matrices in Theorem 4.2 and some basic identities for the symmetric functions, given in [16, Lem. 10.3].
(b) By Proposition 5.4(b), is associative iff . Since commutativity follows from associativity for finite semifields, the statement is a consequence of (a). ∎
Outlook
It is natural to ask whether the methods can be extended to more general -MRD codes. To address this question let us recall where the particular case played a role. Firstly, implies that (after some normalization) all nonzero matrices in the MRD code have no eigenvalues in . Secondly, since , the latter is equivalent to the irreducibility of their characteristic polynomials. As a consequence, each such matrix is similar to a companion matrix. Since -MRD codes are -dimensional, all of this has led to the set consisting of matrix triples , and it remained to investigate which matrices lead to MRD codes. This by itself was a major feat accomplished in Theorem 4.2 by Menichetti. Let us briefly discuss the general square case of -MRD codes. The first property given above (no eigenvalues in ) remains obviously true. The second property (similarity to a companion matrix) needs to be replaced by general rational canonical forms (or other canonical forms). While these two steps may just scale up in technicality, the main difficulty for this larger case, however, arises from the larger dimension of the MRD code. A “normalized” basis may now be of the form , and one is left with the task of estimating the number of matrix tuples leading to an MRD code. It seems that entirely new methods are needed for this problem. We thus close the paper with
Open Problem: For which are -MRD codes sparse?
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Antrobus and H. Gluesing-Luerssen. Maximal Ferrers diagram codes: Constructions and genericity considerations. Preprint 2018; ar Xiv: 1804.00624 v 2. Accepted for publication in IEEE Trans. Inform. Theory . DOI 10.1109/TIT.2019.2926256.
- 2[2] E. Byrne and A. Ravagnani. Partition-balanced families of codes and asymptotic enumeration in coding theory. Preprint 2018. ar Xiv:1805.02049, 2018.
- 3[3] A. Cossidente, G. Marino, and F. Pavese. Non-linear maximum rank distance codes. Des. Codes Cryptogr. , 79:597–609, 2016.
- 4[4] B. Csajbók, G. Marino, O. Polverino, and F. Zullo. Maximum scattered linear sets and MRD-codes. J. Algebr. Comb. , 46:517–531, 2017.
- 5[5] P. Delsarte. Bilinear forms over a finite field, with applications to coding theory. J. Combin. Theory Ser. A , 25:226–241, 1978.
- 6[6] P. Dembowski. Finite Geometries . Ergebnisse der Mathematik und ihrer Randgebiete, 44. Springer, Berlin, 1968.
- 7[7] L. E. Dickson. Linear algebras in which division is always uniquely possible. §Trans. Amer. Math. Soc. , 7:370–390, 1906.
- 8[8] L. E. Dickson. Linear groups with an exposition of the Galois field theory. Dover Publ., New York, 1958.
