Codes for Updating Linear Functions over Small Fields

Suman Ghosh; Lakshmi Natarajan

arXiv:1901.02816·cs.IT·January 10, 2019

Codes for Updating Linear Functions over Small Fields

Suman Ghosh, Lakshmi Natarajan

PDF

TL;DR

This paper studies efficient linear coding schemes for updating linear functions of sparse message changes over small finite fields, with applications to distributed data storage, providing tighter bounds and constructions that reduce field size requirements.

Contribution

It offers a field-size aware analysis of the function update problem, tighter codelength bounds, and new codes that balance codelength reduction with smaller field sizes.

Findings

01

Derived a tighter lower bound on codelength independent of field size

02

Constructed codes for striped message vectors with reduced field size requirements

03

Established equivalence between function update and generalized index coding problems

Abstract

We consider a point-to-point communication scenario where the receiver maintains a specific linear function of a message vector over a finite field. When the value of the message vector undergoes a sparse update, the transmitter broadcasts a coded version of the modified message while the receiver uses this codeword and the current value of the linear function to update its contents. It is assumed that the transmitter has access to the modified message but is unaware of the exact difference vector between the original and modified messages. Under the assumption that the difference vector is sparse and that its Hamming weight is at the most a known constant, the objective is to design a linear code with as small a codelength as possible that allows successful update of the linear function at the receiver. This problem is motivated by applications to distributed data storage systems.…

Equations79

l \geq min (m, 2 ϵ) .

l \geq min (m, 2 ϵ) .

A = I_{a} \otimes C = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C

A = I_{a} \otimes C = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C

E : \mathds F_{q}^{n} \leavevmode ⟶ \leavevmode \mathds F_{q}^{l}

E : \mathds F_{q}^{n} \leavevmode ⟶ \leavevmode \mathds F_{q}^{l}

l_{q, opt} \leq m .

l_{q, opt} \leq m .

H (x - x^{'}) \neq = H (e^{'} - e)

H (x - x^{'}) \neq = H (e^{'} - e)

H z \neq = H y

H z \neq = H y

I (A, ϵ) = {y \in \mathds F_{q}^{n} \leavevmode ∣ \leavevmode A y \neq = 0, \leavevmode 0 < wt (y) \leq 2 ϵ} .

I (A, ϵ) = {y \in \mathds F_{q}^{n} \leavevmode ∣ \leavevmode A y \neq = 0, \leavevmode 0 < wt (y) \leq 2 ϵ} .

H y \neq = 0, \leavevmode \leavevmode \leavevmode \forall y \in I (A, ϵ) .

H y \neq = 0, \leavevmode \leavevmode \leavevmode \forall y \in I (A, ϵ) .

I_{FU} (A, ϵ) = {A y \leavevmode ∣ \leavevmode 0 < wt (y) \leq 2 ϵ} \ {0} = {A y \leavevmode ∣ \leavevmode y \in I (A, ϵ)} .

I_{FU} (A, ϵ) = {A y \leavevmode ∣ \leavevmode 0 < wt (y) \leq 2 ϵ} \ {0} = {A y \leavevmode ∣ \leavevmode y \in I (A, ϵ)} .

S z \neq = 0, \leavevmode \leavevmode \leavevmode \forall z \in I_{FU} (A, ϵ) .

S z \neq = 0, \leavevmode \leavevmode \leavevmode \forall z \in I_{FU} (A, ϵ) .

A = 1010110000010111010100010010101011110111 .

A = 1010110000010111010100010010101011110111 .

S = 10010101010110111010 .

S = 10010101010110111010 .

H = S A = 01101001010001101011111011011101 .

H = S A = 01101001010001101011111011011101 .

q^{m - 2 ϵ}

q^{m - 2 ϵ}

or, \leavevmode \leavevmode \frac{q ^{m}}{q ^{2 ϵ}}

\frac{q ^{m} - 1}{q ^{2 ϵ} - 1}

\frac{q ^{m} - 1}{q ^{2 ϵ} - 1}

or, \leavevmode \leavevmode \frac{q ^{m} - 1}{q ^{2 ϵ} - 1}

or, \leavevmode \leavevmode q^{m} - 1

A^{S} = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C

A^{S} = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C

l_{q, opt} = m - k_{q} (m, 2 ϵ + 1) .

l_{q, opt} = m - k_{q} (m, 2 ϵ + 1) .

A^{S} = \setcounter M a x M a t r i x C o l s 12100010001000010001000100001000100010000100010001 .

A^{S} = \setcounter M a x M a t r i x C o l s 12100010001000010001000100001000100010000100010001 .

z = A y = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C y_{1} y_{2} ⋮ y_{a} = C y_{1} C y_{2} ⋮ C y_{a}

z = A y = C 0 ⋮ 0 0 C ⋮ 0 \dots \dots ⋱ \dots 00 ⋮ C y_{1} y_{2} ⋮ y_{a} = C y_{1} C y_{2} ⋮ C y_{a}

S z \neq = 0

S z \neq = 0

\Rightarrow

\displaystyle\Rightarrow\leavevmode\nobreak\

A^{S} = C 00 0 C 0 00 C

A^{S} = C 00 0 C 0 00 C

C = 100010001111 .

C = 100010001111 .

S = 100000010000001000000100000010000001100100010010001001 .

S = 100000010000001000000100000010000001100100010010001001 .

M = 010 ⋮ 0 001 ⋮ 0 \dots \dots \dots ⋱ \dots 000 ⋮ 1 - p_{0} - p_{1} - p_{2} ⋮ - p_{t - 1} .

M = 010 ⋮ 0 001 ⋮ 0 \dots \dots \dots ⋱ \dots 000 ⋮ 1 - p_{0} - p_{1} - p_{2} ⋮ - p_{t - 1} .

{\textbf{S}}_{i,j}=\left\{\begin{array}[]{ccl}\boldsymbol{0}_{t\times t}&\mbox{if}&\hat{s}_{i,j}=0\\ {\bf{I}}_{t\times t}&\mbox{if}&\hat{s}_{i,j}=1\\ {\textbf{M}}^{k}&\mbox{if}&\hat{s}_{i,j}=\alpha^{k},\leavevmode\nobreak\ k\in\{1,2\dots,q^{t}-2\}\end{array}\right.

{\textbf{S}}_{i,j}=\left\{\begin{array}[]{ccl}\boldsymbol{0}_{t\times t}&\mbox{if}&\hat{s}_{i,j}=0\\ {\bf{I}}_{t\times t}&\mbox{if}&\hat{s}_{i,j}=1\\ {\textbf{M}}^{k}&\mbox{if}&\hat{s}_{i,j}=\alpha^{k},\leavevmode\nobreak\ k\in\{1,2\dots,q^{t}-2\}\end{array}\right.

A^{S} = C 0000 0 C 000 00 C 00 000 C 0 0000 C

A^{S} = C 0000 0 C 000 00 C 00 000 C 0 0000 C

C = 100010001111 .

C = 100010001111 .

M = 010001110 .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Codes for Updating Linear Functions over

Small Fields

Suman Ghosh and Lakshmi Natarajan The authors are with the Department of Electrical Engineering, Indian Institute of Technology Hyderabad, Sangareddy 502 285, India (email: {ee16resch11006, lakshminatarajan}@iith.ac.in).

Abstract

We consider a point-to-point communication scenario where the receiver intends to maintain a specific linear function of a message vector over a finite field. When the value of the message vector changes, which is modelled as a sparse update, the transmitter broadcasts a coded version of the modified message while the receiver uses this codeword and the current value of the linear function to update its contents. It is assumed that the transmitter has access to only the modified message and is unaware of the exact difference vector between the original and modified messages. Under the assumption that the difference vector is sparse and that its Hamming weight is at the most a known constant, the objective is to design a linear code with as small a codelength as possible that allows successful update of the linear function at the receiver. This problem is motivated by applications to distributed data storage systems. Recently, Prakash and Médard derived a lower bound on the codelength, which is independent of the size of the underlying finite field, and provided constructions that achieve this bound if the size of the finite field is sufficiently large. However, this requirement on the field size can be prohibitive for even moderate values of the system parameters. In this paper, we provide a field-size aware analysis of the function update problem, including a tighter lower bound on the codelength, and design codes that trade-off the codelength for a smaller field size requirement. We also show that the problem of designing codes for updating linear functions is related to functional index coding or generalized index coding. We first characterize the family of function update problems where linear coding can provide reduction in codelength compared to a naive transmission scheme. We then provide field-size dependent bounds on the optimal codelength, and construct coding schemes based on error correcting codes and subspace codes when the receiver maintains linear functions of striped message vector. These codes provide a trade-off between the codelength and the size of the operating finite field, and whenever the achieved codelengths equal those reported by Prakash and Médard the requirements on the size of the finite field are matched as well. Finally, for any given function update problem, we construct an equivalent functional index coding or generalized index coding problem such that any linear coding scheme is valid for the function update problem if and only if it is valid for the constructed functional index coding problem.

I Introduction

We consider a point-to-point communication scenario as shown in Fig. 1 where the receiver maintains a linear function ${\textbf{A}}{\bf{x}}$ of a message vector ${\bf{x}}$ . The message ${\bf{x}}$ is an $n$ -length column vector over a finite field $\mathds{F}_{q}$ , where $q$ is any prime power, and A is an $m\times n$ matrix over $\mathds{F}_{q}$ with $m\leq n$ and rank $({\textbf{A}})=m$ . Suppose the value of the message vector is updated to ${\bf{x}}+{\bf{e}}$ , where ${\bf{e}}$ represents a sparse update to the message, i.e., we assume that $\mathrm{wt}({\bf{e}})\leq\epsilon$ where $\mathrm{wt}$ denotes the Hamming weight of a vector and $\epsilon$ is a known constant. In other words at the most $\epsilon$ entries of the original message ${\bf{x}}$ are updated to new values. We assume that the transmitter has access to the updated message ${\bf{x}}+{\bf{e}}$ , but is unaware of the original message ${\bf{x}}$ or the sparse update ${\bf{e}}$ . Note that the message update is modelled here as substitutions only and not as insertions or deletions. The objective is to design a linear encoder that uses an $l\times n$ matrix ${\bf{H}}$ to generate the codeword ${\bf{c}}={\bf{H}}({\bf{x}}+{\bf{e}})$ , with as small a codelength $l$ as possible, such that the receiver can decode ${\textbf{A}}({\bf{x}}+{\bf{e}})$ using the transmitted codeword ${\bf{c}}$ and the older version of its content ${\textbf{A}}{\bf{x}}$ .

The problem is motivated by distributed storage systems (DSS) where information is stored in linearly coded form across a number of nodes to provide resilience against storage node failures [1]. In the scenario where multiple users can simultaneously edit a single file stored in a DSS, it is possible that a user who wishes to apply his update ${\bf{x}}+{\bf{e}}$ is unaware of the current version of the message ${\bf{x}}$ stored in the DSS, for instance when another user has recently edited this file. Letting the user first learn the version ${\bf{x}}$ stored in the DSS, and then apply his update will incur additional communication cost. As an alternative, if it is known that the update vector ${\bf{e}}$ is sparse, it is possible to design schemes that do not require the knowledge of the value of ${\bf{e}}$ at the transmitter [1, 2, 3].

The function update problem was considered in [2, 3] for DSS’s for updating one of the storage nodes with the help of the other nodes in the system. Note that each node in a DSS stores a linear function of the message. A node can become stale in such systems, for instance if the node goes offline while the message and the corresponding linear functions stored in the other nodes undergo an update. Once it is back online, the stale node connects to the other nodes in the distributed storage system to update its own linear function, and the stale data already stored in this node acts as side information. The authors of [2, 3] design both the code for distributed storage and the code for function update to minimize the amount of data downloaded by the stale node to update its contents. This is unlike the problem statement considered in [1] as well as this paper, where it is assumed that an arbitrary matrix A is given and a code for updating the function ${\textbf{A}}{\bf{x}}$ is to be designed.

The authors in [1] also consider a broadcast scenario where a codeword is broadcast to multiple nodes in order to update the (different) linear functions stored in each of the nodes. Problems related to updating linear functions have been considered in [4, 5, 6]. In [4], codes for updating linear functions are used in cache-aided networks to reduce the cost of multicasting a sequence of correlated data frames. The problem of efficiently storing multiple versions of a file in a DSS while ensuring a property called consistency is considered in [5, 6].

In the study of the point-to-point function update problem given in [2, 3, 1] the authors derive the following field-size independent lower bound on the codelength

[TABLE]

Note that, if $m\leq 2\epsilon$ , the lower bound on the codelength $l\geq m$ can be trivially achieved by transmitting ${\textbf{A}}({\bf{x}}+{\bf{e}})$ . Hence, we will always assume that $m>2\epsilon$ . The results in [1] show that codelength $l=2\epsilon$ is achievable using maximally recoverable subcodes of $\mathscr{C}_{A}$ , the subspace spanned by the rows of A, which are guaranteed to exist if the field size $q\geq 2\epsilon n^{2\epsilon}$ . Note that this requirement imposed on the field size can be large even for moderate values of $\epsilon$ and $n$ . The authors of [1] also consider the special case where the matrix A is striped, i.e.,

[TABLE]

where ${\bf{I}}_{a}$ is the $a\times a$ identity matrix, $\mathbf{C}\in\mathds{F}_{q}^{t\times K}$ and $\otimes$ denotes the Kronecker product. Note that $m=at$ and $n=aK$ . This structure frequently arises in distributed storage systems where the $n$ -length data ${\bf{x}}$ is partitioned into $a$ subvectors ${\bf{x}}_{1},\dots,{\bf{x}}_{a}$ , each of length $K$ , each subvector is encoded independently by multiplying with $\mathbf{C}$ , and all the encoded vectors are stored in a single storage node, see Examples 1–3 of [1]. In [1, Section IV], a code is constructed for the case $t=1$ that achieves the codelength $l=2\epsilon$ using an $[m,m-2\epsilon]$ MDS code, which is guaranteed to exist if the field size $q\geq m$ . In Remark 4 of [1] the authors consider a modified system model for the function update problem which we show in Section V-A3 of this paper to be equivalent to the case where A is striped with the number of stripes $a=t$ . Construction 1 and Remark 4 of [1] provide a code construction for this modified system model, and hence for the case $a=t$ , that achieves codelength of $2t\epsilon$ over any field.

In this paper we provide a field-size aware characterization of the point-to-point function update problem. In particular, we provide bounds on the achievable codelength that take into account the effect of the field size and we provide constructions that trade-off the codelength for a smaller field size requirement. This is unlike the point-to-point results in [1] which provide constructions only for the case $l=2\epsilon$ but assume that the field size $q$ is sufficiently large. To the best of our knowledge, no prior analysis of this problem as a function of the field size $q$ is available in the literature except [1] which assumes that the field size $q$ is large enough for a maximally recoverable code to exist.

We characterize the family of point-to-point function update problems where linear coding scheme is useful to save at least one transmission, i.e., $l\leq m-1$ is achievable (Theorem 3, Section III). This characterization is analyzed in terms of the covering radius of $\mathscr{C}^{\perp}_{A}$ , the dual of the code $\mathscr{C}_{A}$ , in Section III-B. We provide a lower bound (Theorem 4, Section IV) and an upper bound (Theorem 5, Section V) on optimal codelength based on linear error correcting codes. Similar to [1] we also provide code constructions when A is striped (Section V-A1,V-A2) but our focus is on the general case where $t\geq 1$ and $a\geq 1$ . For the case when $t=1$ we provide a construction (Section V-A1) which achieves the optimal codelength for the respective operating field size $q$ , for any prime power $q\geq 2$ . For the special case $q\geq m$ this code construction achieves codelength $2\epsilon$ and this matches the achieved codelength in Construction 2 of [1] for $t=1$ which also requires $q\geq m$ . Section V-A2 provides code constructions for $t\geq 1$ using subspace codes and error correcting codes over field extensions. All these code constructions yield a trade-off between the chosen field size and achieved codelength where operating over a smaller field size results in a larger codelength than operating over a larger field size (for instance, see Example 3). When restricted to the special case $a=t$ our construction provides a valid coding scheme for the modified function update problem mentioned in [1, Remark 4] that matches codelength $2t\epsilon$ over any field $\mathds{F}_{q}$ reported in [1] (Section V-A3). The performance comparison of the constructed codes are discussed in Section V-A4. Finally, we show that the point-to-point function update problem is equivalent a functional index coding or a generalized index coding problem [7, 8, 9]. Given a point-to-point function update problem we construct a functional index coding problem (Algorithm 1, Section VI-B) such that a coding scheme is valid for the function update problem if and only if it is valid for the constructed functional index coding problem (Theorem 9, Section VI-B). This paper starts with describing the system model and providing relevant preliminary results in Section II.

Notation: Matrices and column vectors are denoted by bold uppercase and lowercase letters, respectively. For any positive integer $n$ , the symbol $[n]$ denotes the set $\{1,\dots,n\}$ . The Hamming weight of a vector ${\bf{x}}$ is denoted as $\mathrm{wt}({\bf{x}})$ . The symbol $\mathds{F}_{q}$ denotes the finite field of size $q$ and $\mathds{F}_{q}^{n}$ denotes a column vector of $n$ elements over $\mathds{F}_{q}$ where $q$ is a prime power. The $n\times n$ identity matrix is denoted as ${\bf I}_{n}$ .

II System Model and Preliminaries

We consider a noiseless communication scenario with single transmitter and single receiver. The transmitter knows a column vector ${\bf{x}}$ of $n$ information symbols where each information symbol is an element over finite field $\mathds{F}_{q}$ . The receiver stores the coded message ${\textbf{A}}{\bf{x}}\in\mathds{F}_{q}^{m}$ where ${\textbf{A}}\in\mathds{F}_{q}^{m\times n}$ ( $m\leq n$ ) and rank(A) = $m$ . Now suppose the information symbol vector ${\bf{x}}$ is updated to ${\bf{x}}+{\bf{e}}$ where ${\bf{e}}$ is the update vector which is also a column vector of length $n$ over $\mathds{F}_{q}$ with $\mathrm{wt}({\bf{e}})\leq\epsilon$ , where $\mathrm{wt}$ denotes the Hamming weight of a vector. The objective is to generate a codeword ${\bf{c}}=(c_{1},c_{2},\dots,c_{l})^{T}$ with codelength $l$ as small as possible such that the receiver can update its content to ${\textbf{A}}({\bf{x}}+{\bf{e}})$ using the transmitted codeword ${\bf{c}}$ and the older version of its content ${\textbf{A}}{\bf{x}}$ . We assume the transmitter doesn’t know about original information symbol vector ${\bf{x}}$ or update vector ${\bf{e}}$ but only knows the updated information symbol vector $({\bf{x}}+{\bf{e}})$ . The problem of designing coding scheme to update the coded data ${\textbf{A}}{\bf{x}}$ available at the receiver to ${\textbf{A}}({\bf{x}}+{\bf{e}})$ with $\mathrm{wt}({\bf{e}})\leq\epsilon$ will be called as $({\textbf{A}},\epsilon)$ function update problem.

Definition 1.

A valid encoding function of codelength $l$ for the $({\textbf{A}},\epsilon)$ function update problem over the field $\mathds{F}_{q}$ is a function

[TABLE]

such that there exists a decoding function $\mathfrak{D}:\mathds{F}_{q}^{l}\times\mathds{F}_{q}^{m}\leavevmode\nobreak\ \longrightarrow\leavevmode\nobreak\ \mathds{F}_{q}^{m}$ satisfying the following property: $\mathfrak{D}(\mathfrak{E}({\bf{x}}+{\bf{e}}),{\textbf{A}}{\bf{x}})={\textbf{A}}({\bf{x}}+{\bf{e}})$ for every ${\bf{x}}\in\mathds{F}_{q}^{n}$ and ${\bf{e}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{e}})\leq\epsilon$ .

The objective of the code construction is to design a pair $(\mathfrak{E},\mathfrak{D})$ of encoding and decoding functions that minimizes the codelength $l$ and to calculate the optimal codelength over $\mathds{F}_{q}$ which is the minimum codelength among all valid coding schemes.

A coding scheme $(\mathfrak{E},\mathfrak{D})$ is said to be linear if the encoding function is an $\mathds{F}_{q}$ -linear transformation. For a linear coding scheme, the codeword ${\bf{c}}={\bf{H}}({\bf{x}}+{\bf{e}})$ , where ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ . The matrix ${\bf{H}}$ is the encoder matrix of the linear coding scheme. The minimum codelength among all valid linear coding schemes for the $({\textbf{A}},\epsilon)$ function update problem over the field $\mathds{F}_{q}$ will be denoted as $l_{q,\mathrm{opt}}$ .

The trivial coding scheme that transmits the updated coded information symbols ${\textbf{A}}({\bf{x}}+{\bf{e}})$ i.e., ${\bf{c}}={\textbf{A}}({\bf{x}}+{\bf{e}})$ is a valid coding scheme with codelength $m$ since the receiver can directly update its content using ${\bf{c}}$ . We refer to this trivial coding scheme as naive scheme where ${\bf{H}}={\textbf{A}}$ . Thus, we have the following trivial upper bound on the optimum linear codelength

[TABLE]

In [1] the authors provided a necessary and sufficient condition for a matrix ${\bf{H}}$ to be a valid encoder matrix for $({\textbf{A}},\epsilon)$ function update problem. In Theorem 2 of [1] the proof is given only for necessary condition for a matrix ${\bf{H}}$ to be a valid encoder matrix for $({\textbf{A}},\epsilon)$ function update problem. For the sake of completeness here we first prove that the criterion 1 in [1, Theorem 2] is a necessary and sufficient condition for a matrix ${\bf{H}}$ to be a valid encoder matrix for $({\textbf{A}},\epsilon)$ function update problem and then state the relevant results which will be helpful to derive other results of this paper. Let $\mathscr{C}_{A}$ and $\mathscr{C}_{H}$ denote the linear codes generated by the rows of A and ${\bf{H}}$ respectively. Also let $\mathscr{C}=\mathscr{C}_{A}\cap\mathscr{C}_{H}$ and let ${\bf{P}}$ be a generator matrix of $\mathscr{C}$ .

Theorem 1 (Theorem 2, [1]).

A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if ${\bf{P}}{\bf{y}}\neq\boldsymbol{0}$ for any ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ and ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ .

Proof.

A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if the receiver can uniquely determine ${\textbf{A}}({\bf{x}}+{\bf{e}})$ from the received codeword ${\bf{H}}({\bf{x}}+{\bf{e}})$ and the side information ${\textbf{A}}{\bf{x}}$ . Hence for two pairs of information symbol vectors and update vectors $({\bf{x}},{\bf{e}})$ and $({\bf{x}}^{\prime},{\bf{e}}^{\prime})$ such that the coded information symbol vectors available at the receiver are identical i.e., ${\textbf{A}}{\bf{x}}={\textbf{A}}{\bf{x}}^{\prime}$ but updated coded information symbol vectors are distinct i.e., ${\textbf{A}}({\bf{x}}+{\bf{e}})\neq{\textbf{A}}({\bf{x}}^{\prime}+{\bf{e}}^{\prime})$ then the transmitted codeword ${\bf{H}}({\bf{x}}+{\bf{e}})$ must be distinct from ${\bf{H}}({\bf{x}}^{\prime}+{\bf{e}}^{\prime})$ to distinguish the two different updated coded information symbol vectors. Equivalently, the condition ${\bf{H}}({\bf{x}}+{\bf{e}})\neq{\bf{H}}({\bf{x}}^{\prime}+{\bf{e}}^{\prime})$ should hold for every choice of ${\bf{x}},{\bf{x}}^{\prime},{\bf{e}},{\bf{e}}^{\prime}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{e}}),\mathrm{wt}({\bf{e}}^{\prime})\leq\epsilon$ satisfying ${\textbf{A}}{\bf{x}}={\textbf{A}}{\bf{x}}^{\prime}$ and ${\textbf{A}}({\bf{x}}+{\bf{e}})\neq{\textbf{A}}({\bf{x}}^{\prime}+{\bf{e}}^{\prime})$ . Therefore ${\bf{H}}$ is a valid encoder matrix if and only if

[TABLE]

for all ${\bf{x}},{\bf{x}}^{\prime}\in\mathds{F}_{q}^{n}$ such that ${\textbf{A}}{\bf{x}}={\textbf{A}}{\bf{x}}^{\prime}$ and ${\textbf{A}}({\bf{x}}-{\bf{x}}^{\prime})\neq{\textbf{A}}({\bf{e}}^{\prime}-{\bf{e}})$ . Now denoting ${\bf{z}}={\bf{x}}-{\bf{x}}^{\prime}$ and ${\bf{y}}={\bf{e}}^{\prime}-{\bf{e}}$ we have

[TABLE]

for all ${\bf{z}},{\bf{y}}\in\mathds{F}_{q}^{n}$ and $\mathrm{wt}({\bf{y}})=\mathrm{wt}({\bf{e}}^{\prime}-{\bf{e}})\leq 2\epsilon$ such that ${\textbf{A}}{\bf{z}}=\boldsymbol{0}$ and ${\textbf{A}}{\bf{z}}\neq{\textbf{A}}{\bf{y}}$ . Now reformulating the condition given in (2) we obtain ${\bf{H}}({\bf{z}}-{\bf{y}})\neq\boldsymbol{0}$ for all ${\bf{z}},{\bf{y}}\in\mathds{F}_{q}^{n}$ that satisfy $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ , ${\textbf{A}}{\bf{z}}=\boldsymbol{0}$ and ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ . Therefore ${\bf{H}}$ is a valid encoder matrix if and only if for all ${\bf{z}},{\bf{y}}\in\mathds{F}_{q}^{n}$ and $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ if ${\bf{z}}\in\mathscr{C}_{A}^{\perp}$ and ${\bf{y}}\notin\mathscr{C}_{A}^{\perp}$ then $({\bf{y}}-{\bf{z}})\notin\mathscr{C}_{H}^{\perp}$ . Hence ${\bf{H}}$ is a valid encoder matrix if and only if for all ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ if ${\bf{y}}\notin\mathscr{C}_{A}^{\perp}$ then ${\bf{y}}\notin\mathscr{C}_{A}^{\perp}+\mathscr{C}_{H}^{\perp}$ . Now using the fact that $\mathscr{C}_{A}^{\perp}+\mathscr{C}_{H}^{\perp}=(\mathscr{C}_{A}\cap\mathscr{C}_{H})^{\perp}=\mathscr{C}^{\perp}$ we deduce that ${\bf{H}}$ is a valid encoder matrix if and only if for all ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ such that ${\bf{y}}\notin\mathscr{C}_{A}^{\perp}$ also satisfies ${\bf{y}}\notin\mathscr{C}^{\perp}$ . Hence the statement of the theorem follows. ∎

Lemma 1 (Remark 2, [1]).

Let ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ be a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem. Let ${\bf{P}}$ be a generator matrix of the code $\mathscr{C}=\mathscr{C}_{A}\cap\mathscr{C}_{H}$ . Then ${\bf{P}}$ is also a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem.

If we consider a valid encoder matrix ${\bf{H}}^{\prime}\in\mathds{F}_{q}^{l^{\prime}\times n}$ such that $\mathscr{C}_{H^{\prime}}\nsubseteq\mathscr{C}_{A}$ , then we can find another valid encoder matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ as the generator matrix of the code $\mathscr{C}_{A}\cap\mathscr{C}_{H^{\prime}}$ . Since $\mathscr{C}_{H}$ is a subcode of $\mathscr{C}_{A}\cap\mathscr{C}_{H^{\prime}}$ , we have $l>l^{\prime}$ . Therefore the encoder matrix ${\bf{H}}^{\prime}$ has sub-optimal codelength. So from now we only consider encoder matrices ${\bf{H}}$ such that $\mathscr{C}_{H}\subseteq\mathscr{C}_{A}$ . Since we assume $\mathscr{C}_{H}\subseteq\mathscr{C}_{A}$ we can write ${\bf{H}}={\textbf{S}}{\textbf{A}}$ for some matrix ${\textbf{S}}\in\mathds{F}_{q}^{l\times m}$ .

Now using ${\bf{P}}={\bf{H}}$ we restate Theorem 1 as follows. A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ such that $\mathscr{C}_{H}\subseteq\mathscr{C}_{A}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if for any ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ and ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ satisfies ${\bf{H}}{\bf{y}}\neq\boldsymbol{0}$ . We define the collection $\mathcal{I}({\textbf{A}},\epsilon)$ as the set of all vectors ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $\mathrm{wt}({\bf{y}})\leq 2\epsilon$ such that ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ i.e.,

[TABLE]

Theorem 2.

A matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}$ for some matrix ${\textbf{S}}\in\mathds{F}_{q}^{l\times m}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if

[TABLE]

Now we define the collection $\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ as the set of all non-zero linear combinations of $2\epsilon$ or fewer columns of A over $\mathds{F}_{q}$ i.e.,

[TABLE]

Note that $|\mathcal{I}_{\text{FU}}|\leq q^{m}-1$ since $\boldsymbol{0}\notin\mathcal{I}_{\text{FU}}$ .

Corollary 1.

${\bf{H}}={\textbf{S}}{\textbf{A}}$ * is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if*

[TABLE]

III Necessary and Sufficient Condition for $l_{q,\mathrm{opt}}<m$

In this section we will characterize the family of point-to-point function update problems where linear coding is useful to save at least one transmission compared to the naive scheme i.e., $l_{q,\mathrm{opt}}<m$ . First we will derive some preliminary results which will be helpful to derive the main result of this section.

Lemma 2.

The collection $\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ is closed under non-zero scalar multiplication.

Proof.

Suppose ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ . There exists a ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ such that ${\bf{z}}={\textbf{A}}{\bf{y}}$ . For any $\alpha\in\mathds{F}_{q}^{\ast}$ , $\alpha{\bf{z}}=\alpha{\textbf{A}}{\bf{y}}={\textbf{A}}(\alpha{\bf{y}})={\textbf{A}}{\bf{y}}^{\prime}$ , where ${\bf{y}}^{\prime}=\alpha{\bf{y}}$ . Now as $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ , it follows that $0<\mathrm{wt}({\bf{y}}^{\prime})\leq 2\epsilon$ . Again ${\textbf{A}}{\bf{y}}^{\prime}={\textbf{A}}(\alpha{\bf{y}})=\alpha{\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ as ${\textbf{A}}{\bf{y}}\in\mathcal{I}_{\text{FU}}$ and $\alpha\neq 0$ . Therefore for any $\alpha\in\mathds{F}_{q}^{\ast}$ , $\alpha{\bf{z}}\in\mathcal{I}_{\text{FU}}$ . Hence the lemma holds. ∎

III-A A coding scheme for a family of $({\textbf{A}},\epsilon)$ function update problems

Consider any $({\textbf{A}},\epsilon)$ function update problem where there exists a non-zero ${\bf{u}}\in\mathds{F}_{q}^{m}$ such that ${\bf{u}}\notin\mathcal{I}_{\text{FU}}$ . Let $\mathscr{C}_{u}$ be the subspace of $\mathds{F}_{q}^{m}$ generated by ${\bf{u}}$ . Therefore dim $(\mathscr{C}_{u})=1$ . Note that dim $(\mathscr{C}_{u}^{\perp})=m-1$ . Let ${\textbf{S}}\in\mathds{F}_{q}^{(m-1)\times m}$ be a generator matrix of the code $\mathscr{C}_{u}^{\perp}$ . The matrix S is a parity check matrix of the code $\mathscr{C}_{u}$ .

Lemma 3.

The matrix S satisfies ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ .

Proof.

Proof by contradiction. Let there exist a ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ such that ${\textbf{S}}{\bf{z}}=\boldsymbol{0}$ . This implies that ${\bf{z}}\in\mathscr{C}_{u}$ . Therefore there exists an $\alpha\in\mathds{F}_{q}^{\ast}$ such that ${\bf{z}}=\alpha{\bf{u}}$ . Now as $\alpha\in\mathds{F}_{q}^{\ast}$ , $\alpha^{-1}$ exists and hence ${\bf{u}}=\alpha^{-1}{\bf{z}}$ . Now as ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ and $\mathcal{I}_{\text{FU}}$ is closed under non-zero scalar multiplication (using Lemma 3), ${\bf{u}}\in\mathcal{I}_{\text{FU}}$ which is a contradiction. Hence the lemma holds. ∎

Now using Corollary 1, we obtain a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem over $\mathds{F}_{q}$ as ${\bf{H}}={\textbf{S}}{\textbf{A}}$ with codelength $l=m-1$ , whenever there exists a non-zero vector in $\mathds{F}_{q}^{m}\backslash\mathcal{I}_{\text{FU}}$ . We do not claim that this coding scheme yields the optimal codelength $l_{q,\mathrm{opt}}$ .

Example 1.

Consider the $({\textbf{A}},1)$ function update problem over binary field $\mathds{F}_{2}$ where $m=5$ , $n=8$ , $\epsilon=1$ and the matrix ${\textbf{A}}\in\mathds{F}_{2}^{5\times 8}$ is given by

[TABLE]

Note that rank $({\textbf{A}})=5$ over $\mathds{F}_{2}$ . The non-zero vector ${\bf{u}}=[0\leavevmode\nobreak\ 1\leavevmode\nobreak\ 1\leavevmode\nobreak\ 0\leavevmode\nobreak\ 0]\in\mathds{F}_{2}^{5}$ satisfies ${\bf{u}}\notin\mathcal{I}_{\text{FU}}({\textbf{A}},1)$ . The parity check matrix of the code $\mathscr{C}_{u}$ , generated by ${\bf{u}}$ is given by

[TABLE]

Therefore we obtain a valid encoder matrix ${\bf{H}}\in\mathds{F}_{2}^{4\times 8}$ with codelength $l=4$ for the function update problem as

[TABLE]

∎

Now we derive a necessary and sufficient condition for any $({\textbf{A}},\epsilon)$ function update problem to save at least one transmission using linear coding scheme compared to the naive scheme.

Theorem 3.

For an $({\textbf{A}},\epsilon)$ function update problem, $l_{q,\mathrm{opt}}=m$ if and only if $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|=q^{m}-1$ .

Proof.

To prove the theorem we first show that for any $({\textbf{A}},\epsilon)$ function update problem if $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|=(q^{m}-1)$ then $l_{q,\mathrm{opt}}=m$ . Next we show that if $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|<(q^{m}-1)$ then $l_{q,\mathrm{opt}}\leq(m-1)$ .

Proof of first part i.e., if $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|=(q^{m}-1)$ then $l_{q,\mathrm{opt}}=m$ :

Let ${\bf{H}}$ be an optimal encoder matrix with $l=l_{q,\mathrm{opt}}$ . Then there exists a matrix ${\textbf{S}}\in\mathds{F}_{q}^{l_{q,\mathrm{opt}}\times m}$ such that ${\bf{H}}={\textbf{S}}{\textbf{A}}$ . From Corollary 1 we obtain ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ . Since $\mathcal{I}_{\text{FU}}$ contains all non-zero vectors from $\mathds{F}_{q}^{m}$ , the columns of S are linearly independent. Hence $l_{q,\mathrm{opt}}\geq m$ . Again from (1), we have $l_{q,\mathrm{opt}}\leq m$ . Hence $l_{q,\mathrm{opt}}=m$ .

Proof of second part i.e., if $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|<(q^{m}-1)$ then $l_{q,\mathrm{opt}}\leq(m-1)$ :

If $|\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)|<(q^{m}-1)$ then there exists a non-zero vector ${\bf{u}}\in\mathds{F}_{q}^{m}$ such that ${\bf{u}}\notin\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ . Therefore using the technique described in Section III-A we can construct a valid encoder matrix such that we save one transmission compared to the naive scheme, i.e., $l_{q,\mathrm{opt}}\leq(m-1)$ . Hence the lemma holds. ∎

Now we provide a sufficient condition on the field size $q$ to save at least one transmission compared to the naive scheme for any $({\textbf{A}},\epsilon)$ function update problem.

Corollary 2.

For any ${\textbf{A}}\in\mathds{F}_{q}^{m\times n}$ with rank $({\textbf{A}})=m$ where $m>2\epsilon$ ,

$l_{q,\mathrm{opt}}\leq(m-1)$ * if $q\geq\binom{n}{2\epsilon}^{1/(m-2\epsilon)}$ *

.

Proof.

If $q\geq\binom{n}{2\epsilon}^{1/(m-2\epsilon)}$ then

[TABLE]

Using the fact that if $a>b$ then $\frac{a-1}{b-1}>\frac{a}{b}$ , we have

[TABLE]

Now for any ${\textbf{A}}\in\mathds{F}_{q}^{m\times n}$ with rank $({\textbf{A}})=m$ , the number of distinct non-zero linear combinations of $2\epsilon$ or fewer columns of A is at the most $\binom{n}{2\epsilon}(q^{2\epsilon}-1)$ . Therefore $|\mathcal{I}_{\text{FU}}|\leq\binom{n}{2\epsilon}(q^{2\epsilon}-1)$ . Hence $(q^{m}-1)>\binom{n}{2\epsilon}(q^{2\epsilon}-1)\geq|\mathcal{I}_{\text{FU}}|$ . Now using Theorem 3 we have $l_{q,\mathrm{opt}}\leq(m-1)$ . ∎

III-B Relation with covering radius

The covering radius of an $[n,k]$ linear code $\mathscr{C}$ over $\mathds{F}_{q}$ , denoted by $r_{\mathrm{cov}}(\mathscr{C})$ , is defined as the smallest integer $r$ such that the spheres of radius $r$ centered at each codeword of $\mathscr{C}$ cover the whole space $\mathds{F}_{q}^{n}$ . We can determine covering radius of a linear code in terms of the cosets of the code. For any vector ${\bf{a}}\in\mathds{F}_{q}^{n}$ , the set ${\bf{a}}+\mathscr{C}=\{{\bf{a}}+{\bf{c}}\leavevmode\nobreak\ |\leavevmode\nobreak\ {\bf{c}}\in\mathscr{C}\}$ is called a coset of the code $\mathscr{C}$ and in any coset, a vector with minimum Hamming weight is called a coset leader. The covering radius $r_{\mathrm{cov}}(\mathscr{C})$ of the code $\mathscr{C}$ is the largest among the Hamming weight of all the coset leaders. Upon denoting ${\bf{H}}^{\prime}$ as a parity check matrix of $\mathscr{C}$ , ${\bf{H}}^{\prime}{\bf{u}}$ is the syndrome of the vector ${\bf{u}}\in\mathds{F}_{q}^{n}$ . Two vectors ${\bf{u}},\textbf{v}\in\mathds{F}_{q}^{n}$ have the same syndrome if and only if they belong to the same coset of $\mathscr{C}$ . Hence there is an one-to-one correspondence between syndromes and cosets [10].

For the $({\textbf{A}},\epsilon)$ function update problem let $\mathscr{C}_{A}$ be the linear code generated by A. Hence A is a parity check matrix of the code $\mathscr{C}_{A}^{\perp}$ which is the dual code of $\mathscr{C}_{A}$ . Now considering a vector ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ , ${\bf{z}}$ can be expressed as ${\textbf{A}}{\bf{y}}$ where ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Therefore the vector ${\bf{z}}$ denotes the syndrome of a vector ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ that belongs to some coset of $\mathscr{C}_{A}^{\perp}$ . Note that any vector that belongs to $\mathcal{I}_{\text{FU}}$ is non-zero, hence can not be the syndrome of the codewords of $\mathscr{C}^{\perp}_{A}$ . Note that ${\bf{z}}$ is the syndrome of the coset leader of the coset ${\bf{y}}+\mathscr{C}_{A}^{\perp}$ . Since ${\bf{y}}$ is a vector that belongs to the coset and $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ , the Hamming weight of the coset leader of the coset is at the most $2\epsilon$ .

Corollary 3.

For an $({\textbf{A}},\epsilon)$ function update problem, $l_{q,\mathrm{opt}}=m$ if and only if $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ .

Proof.

From Theorem 3 we have $l_{q,\mathrm{opt}}=m$ if and only if $|\mathcal{I}_{\text{FU}}|=q^{m}-1$ . Hence to prove the corollary we prove that for any $({\textbf{A}},\epsilon)$ function update problem $|\mathcal{I}_{\text{FU}}|=q^{m}-1$ if and only if $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ .

Proof of $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ if $|\mathcal{I}_{\text{FU}}|=q^{m}-1$ : Since the collection $\mathcal{I}_{\text{FU}}$ contains all non-zero vectors over $\mathds{F}_{q}^{m}$ , each non-zero vector ${\bf{z}}\in\mathds{F}_{q}^{m}$ is the syndrome of some vector ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ that belongs to some coset of $\mathscr{C}_{A}^{\perp}$ . Since there exists a one-to-one correspondence between the syndromes and cosets, for each vector ${\bf{z}}\in\mathcal{I}_{\text{FU}}$ there exists a coset of $\mathscr{C}_{A}^{\perp}$ that contains a vector ${\bf{y}}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Hence the coset leader of each coset has Hamming weight at the most $2\epsilon$ . Therefore the largest Hamming weight of the coset leaders among all cosets of $\mathscr{C}_{A}^{\perp}$ is at the most $2\epsilon$ . Hence $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ .

Proof of $|\mathcal{I}_{\text{FU}}|=q^{m}-1$ if $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ : Since $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ , the largest Hamming weight of the coset leaders among all cosets of $\mathscr{C}_{A}^{\perp}$ is at the most $2\epsilon$ . Since there exists a one-to-one correspondence between the syndromes and cosets, each syndrome ${\bf{z}}\in\mathds{F}_{q}^{m}$ can be expressed as ${\textbf{A}}{\bf{y}}$ for some coset leader ${\bf{y}}\in\mathds{F}_{q}^{n}$ satisfies $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . We know that the syndromes of a particular linear code covers the whole space. Hence any vector ${\bf{z}}\in\mathds{F}_{q}^{m}\backslash\{\boldsymbol{0}\}$ can be expressed as ${\textbf{A}}{\bf{y}}$ for some ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Since $\mathcal{I}_{\text{FU}}$ consists only non-zero vectors that satisfies the above property, $|\mathcal{I}_{\text{FU}}|=q^{m}-1$ .

Hence $l_{q,\mathrm{opt}}=m$ if and only if $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\leq 2\epsilon$ . ∎

Example 2.

In this example we calculate the minimum number of rows of ${\textbf{A}}_{m\times n}$ such that $l_{q,\mathrm{opt}}\leq(m-1)$ is guaranteed for $q=2$ , $\epsilon=1$ and $n=8$ . Now $l_{q,\mathrm{opt}}\leq(m-1)$ if and only if $r_{\mathrm{cov}}(\mathscr{C}_{A}^{\perp})\geq 2\epsilon+1=3$ . From Table I of [11] we observe that for any binary code of length $8$ and dimension up to $3$ , covering radius is at least $3$ . Thus dim $(\mathscr{C}_{A}^{\perp})\geq 3$ implies $l_{q,\mathrm{opt}}\leq(m-1)$ . Hence $(n-m)\leq 3$ and $m\geq(n-3)=5$ . Therefore for any matrix ${\textbf{A}}\in\mathds{F}_{2}^{5\times 8}$ with rank $({\textbf{A}})=5$ we can save one transmission compared to the naive scheme. One such example of A is given in Example 1. ∎

IV Lower Bound on Optimal Codelength

In this section we derive a lower bound on the optimal codelength $l_{q,\mathrm{opt}}$ over $\mathds{F}_{q}$ . First we derive two preliminary lemmas which will help to derive the lower bound.

Lemma 4.

For any $({\textbf{A}},\epsilon)$ function update problem and for any invertible matrix ${\textbf{K}}\in\mathds{F}_{q}^{m\times m}$ , $\mathcal{I}({\textbf{A}},\epsilon)=\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)$ .

Proof.

To prove the lemma we first show that $\mathcal{I}({\textbf{A}},\epsilon)\subseteq\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)$ and then $\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)\subseteq\mathcal{I}({\textbf{A}},\epsilon)$ .

Proof for $\mathcal{I}({\textbf{A}},\epsilon)\subseteq\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)$ : Suppose ${\bf{y}}\in\mathcal{I}({\textbf{A}},\epsilon)$ . Then from (3) we have ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ . Now left multiplying both side by K we obtain ${\textbf{K}}{\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ since K is invertible. Hence ${\bf{y}}\in\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)$ .

Proof for $\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)\subseteq\mathcal{I}({\textbf{A}},\epsilon)$ : Suppose ${\bf{y}}\in\mathcal{I}({\textbf{K}}{\textbf{A}},\epsilon)$ . Then from (3) we have ${\textbf{K}}{\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ . Since K is invertible, ${\textbf{K}}^{-1}$ exists. Now left multiplying both side by ${\textbf{K}}^{-1}$ we obtain ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ . Hence ${\bf{y}}\in\mathcal{I}({\textbf{A}},\epsilon)$ .

Hence the lemma holds. ∎

For any $({\textbf{A}},\epsilon)$ function update problem ${\textbf{A}}\in\mathds{F}_{q}^{m\times n}$ with rank(A)= $m$ . Hence A contains $m$ linearly independent columns. Now consider a matrix ${\textbf{K}}^{\prime}$ which contains $m$ linearly independent columns of A. Note that ${\textbf{K}}^{\prime}$ is an $m\times m$ full rank matrix and hence invertible. Denote ${\textbf{K}}={\textbf{K}}^{\prime-1}$ and ${\textbf{A}}^{\prime}={\textbf{K}}{\textbf{A}}$ . From Lemma 4 we observe that $({\textbf{A}},\epsilon)$ and $({\textbf{A}}^{\prime},\epsilon)$ are equivalent function update problems and any matrix ${\bf{H}}$ is a valid encoder matrix of $({\textbf{A}},\epsilon)$ function update problem if and only if ${\bf{H}}$ is a valid encoder matrix of $({\textbf{A}}^{\prime},\epsilon)$ function update problem. Hence we conclude that the linear code generated by the rows of ${\bf{H}}$ is a subcode of the linear code generated by the rows of ${\textbf{A}}^{\prime}$ i.e., $\mathscr{C}_{H}\subseteq\mathscr{C}_{A^{\prime}}$ . Hence there exists a matrix ${\textbf{S}}^{\prime}\in\mathds{F}_{q}^{l\times m}$ such that ${\bf{H}}={\textbf{S}}^{\prime}{\textbf{A}}^{\prime}$ . Now using the equivalence between $({\textbf{A}},\epsilon)$ and $({\textbf{A}}^{\prime},\epsilon)$ function update problems and using Corollary 1 we say that ${\bf{H}}={\textbf{S}}^{\prime}{\textbf{A}}^{\prime}$ is a valid encoder matrix of the $({\textbf{A}},\epsilon)$ function update problem if and only if ${\textbf{S}}^{\prime}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{\prime},\epsilon)$ .

Let $\mathcal{B}^{\ast}(m,2\epsilon)=\{{\bf{z}}\in\mathds{F}_{q}^{m}\leavevmode\nobreak\ |\leavevmode\nobreak\ 0<\mathrm{wt}({\bf{z}})\leq 2\epsilon\}$ be the set of all non-zero vectors in $\mathds{F}_{q}^{m}$ of Hamming weight at the most $2\epsilon$ .

Lemma 5.

For any $({\textbf{A}},\epsilon)$ function update problem

$\mathcal{B}^{\ast}(m,2\epsilon)\subseteq\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ .

Proof.

For an $({\textbf{A}},\epsilon)$ function update problem, ${\textbf{A}}^{\prime}={\textbf{K}}{\textbf{A}}$ where ${\textbf{K}}={\textbf{K}}^{\prime-1}$ and ${\textbf{K}}^{\prime}$ consists of $m$ linearly independent columns of A. Note that the sub-matrix of ${\textbf{A}}^{\prime}$ that contains the corresponding columns forms an $m\times m$ identity matrix. Now if we consider any non-zero linear combination of $2\epsilon$ of fewer columns of this sub-matrix we obtain all non-zero vectors over $\mathds{F}_{q}^{m}$ with Hamming weight at the most $2\epsilon$ . Hence $\mathcal{B}^{\ast}(m,2\epsilon)\subseteq\mathcal{I}_{\text{FU}}({\textbf{A}}^{\prime},\epsilon)=\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ . The last equality holds due to Lemma 4. ∎

Let $k_{q}(m,2\epsilon+1)$ be the maximum dimension among all linear codes over $\mathds{F}_{q}$ with blocklength $m$ and minimum distance $d_{\mathrm{min}}\geq 2\epsilon+1$ .

Theorem 4.

The optimal codelength of the $({\textbf{A}},\epsilon)$ function update problem over $\mathds{F}_{q}$ satisfies

$l_{q,\mathrm{opt}}\geq m-k_{q}(m,2\epsilon+1)$ .

Proof.

Let ${\bf{H}}$ be an optimal encoder matrix of $({\textbf{A}},\epsilon)$ function update problem with codelength $l=l_{q,\mathrm{opt}}$ . Then there exists a matrix ${\textbf{S}}\in\mathds{F}_{q}^{l_{q,\mathrm{opt}}\times m}$ such that ${\bf{H}}={\textbf{S}}{\textbf{A}}$ . Now using Corollary 1 we have ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ . Since $\mathcal{B}^{\ast}(m,2\epsilon)\subseteq\mathcal{I}_{\text{FU}}({\textbf{A}},\epsilon)$ , it follows that ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{B}^{\ast}(m,2\epsilon)$ . Therefore any set of $2\epsilon$ columns of S are linearly independent. Hence S is a parity check matrix of a linear code of block length $m$ and minimum distance at least $2\epsilon+1$ . Thus the dimension of this code satisfies $m-l_{q,\mathrm{opt}}\leq k_{q}(m,2\epsilon+1)$ . Then $l_{q,\mathrm{opt}}\geq m-k_{q}(m,2\epsilon+1)$ . ∎

Theorem 4 provides a lower bound that is aware of the field size $q$ . This is tighter than the bound $l\geq 2\epsilon$ given in [2, 3, 1] since from Singleton bound we know that $k_{q}(m,2\epsilon+1)\leq m-2\epsilon$ , and this combined with Theorem 4 yields $l\geq 2\epsilon$ . Hence, irrespective of the matrix A, a necessary condition for $l_{q,\mathrm{opt}}=2\epsilon$ is that an $[m,m-2\epsilon]$ MDS code over $\mathds{F}_{q}$ must exist.

V Code constructions

In this section we first derive an upper bound on the optimal codelength $l_{q,\mathrm{opt}}$ over $\mathds{F}_{q}$ and then provide code constructions for $({\textbf{A}},\epsilon)$ function update problem when A is in form given by (4). Define $\eta=\max\limits_{{\bf{z}}\in\mathcal{I}_{\text{FU}}}{\mathrm{wt}({\bf{z}})}$ .

Theorem 5.

The optimal codelength of the $({\textbf{A}},\epsilon)$ function update problem over $\mathds{F}_{q}$ satisfies

$l_{q,\mathrm{opt}}\leq m-k_{q}(m,\eta+1)$ .

Proof.

From Corollary 1 we have that a matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}\in\mathds{F}_{q}^{l\times n}$ for some matrix ${\textbf{S}}\in\mathds{F}_{q}^{l\times m}$ , is a valid encoder matrix if and only if ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0},\leavevmode\nobreak\ \forall{\bf{z}}\in\mathcal{I}_{\text{FU}}$ . To satisfy this condition it is sufficient that any set of $\eta$ columns of S are linearly independent. Now consider S as a parity check matrix of the largest linear code with blocklength $m$ and minimum distance $d_{\mathrm{min}}\geq\eta+1$ . The resulting codelength $l=m-k_{q}(m,\eta+1)$ . Hence the upper bound on the optimal codelength holds. ∎

V-A Code constructions for striped data

In this section we provide linear code construction of an $({\textbf{A}}^{S},\epsilon)$ function update problem where ${\textbf{A}}^{S}\in\mathds{F}_{q}^{m\times n}$ follows the structure given by

[TABLE]

where ${\bf{C}}\in\mathds{F}_{q}^{t\times K}$ and $\boldsymbol{0}$ is a $t\times K$ matrix over $\mathds{F}_{q}$ whose all elements are [math]. Let $a$ be the number of repetitions of ${\bf{C}}$ in the matrix ${\textbf{A}}^{S}$ . Hence we write $m=at$ and $n=aK$ . First we consider the family of $({\textbf{A}}^{S},\epsilon)$ function update problems where $t=1$ and show that for this case the lower bound on optimal codelength given in Theorem 4 and the upper bound on optimal codelength given in Theorem 5 exactly matches with each other. Hence we characterize the optimal codelength for this family of function update problems. Our code construction is based on an appropriately chosen linear error correcting code. Note that in Section IV of [1] the authors provided a linear code construction based on maximally recoverable subcodes (MRSC) which requires field size $q\geq m$ and uses an $[m,m-2\epsilon]$ MDS code. In comparison our code construction is suitable for any field size.

V-A1 Code Constructions for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems with $t=1$

In this sub-section we first calculate the optimal codelength for such family of function update problems and then provide a code construction based on an appropriately chosen linear error correcting code.

Theorem 6.

For the family of $({\textbf{A}}^{S},\epsilon)$ function update problems with $t=1$ the optimal codelength over $\mathds{F}_{q}$ is given by

[TABLE]

Proof.

Consider any ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)$ . Then ${\bf{z}}$ can be written as ${\bf{z}}={\textbf{A}}^{S}{\bf{y}}$ for some ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Hence we write ${\bf{z}}=[{\textbf{A}}^{S,1}\leavevmode\nobreak\ {\textbf{A}}^{S,2}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\textbf{A}}^{S,n}]{\bf{y}}$ where ${\textbf{A}}^{S,i}$ denotes the $i^{\mathrm{th}}$ column of ${\textbf{A}}^{S}$ and $\mathrm{wt}({\textbf{A}}^{S,i})=1$ for all $i\in[n]$ . Now $\mathrm{wt}({\bf{z}})=\mathrm{wt}({\textbf{A}}^{S,1}y_{1}+{\textbf{A}}^{S,2}y_{2}+\dots+{\textbf{A}}^{S,n}y_{n})\leq\mathrm{wt}({\textbf{A}}^{S,1}y_{1})+\mathrm{wt}({\textbf{A}}^{S,2}y_{2})+\dots+\mathrm{wt}({\textbf{A}}^{S,n}y_{n})$ . Since $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ , at the most $2\epsilon$ terms among ${\textbf{A}}^{S,1}y_{1},{\textbf{A}}^{S,2}y_{2},\dots,{\textbf{A}}^{S,n}y_{n}$ are non-zero and each ${\textbf{A}}^{S,i}y_{i},\leavevmode\nobreak\ i\in[n]$ has Hamming weight at the most $1$ . Hence for any ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)$ , we have $\mathrm{wt}({\bf{z}})\leq 2\epsilon$ . It is easy to observe that $\eta=\max\limits_{{\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)}{\mathrm{wt}({\bf{z}})}=2\epsilon$ . So using Theorem 5 we have $l_{q,\mathrm{opt}}\leq m-k_{q}(m,2\epsilon+1)$ . Again from Theorem 4 we have $l_{q,\mathrm{opt}}\geq m-k_{q}(m,2\epsilon+1)$ . Since the lower bound and the upper bound matches with each other we have $l_{q,\mathrm{opt}}=m-k_{q}(m,2\epsilon+1)$ . ∎

Now we provide a code construction for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems with $t=1$ . Since for any $({\textbf{A}}^{S},\epsilon)$ function update problem with $t=1$ the value of $\eta$ is $2\epsilon$ , it is sufficient that ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for any ${\bf{z}}$ with $0<\mathrm{wt}({\bf{z}})\leq 2\epsilon$ . Hence it is sufficient that any $2\epsilon$ columns of S are linearly independent. Now consider S as a parity check matrix of a linear code of maximum dimension with blocklength $m$ and minimum distance $d_{\text{min}}\geq 2\epsilon+1$ and set the encoder matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ . This code achieves the optimal codelength $l_{q,\mathrm{opt}}=m-k_{q}(m,2\epsilon+1)$ . Now if $q\geq m$ then there exists an MDS code over $\mathds{F}_{q}$ with blocklength $m$ and minimum distance $d_{\text{min}}=2\epsilon+1$ which has maximum dimension $k_{q}(m,2\epsilon+1)=m-2\epsilon$ among all linear codes over $\mathds{F}_{q}$ . Hence choosing S as a parity check matrix of an $[m,m-2\epsilon]$ MDS code $\mathds{F}_{q},\leavevmode\nobreak\ q\geq m$ and encoder matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ we achieve codelength $l_{q,\mathrm{opt}}=2\epsilon$ which matches the codelength achieved by the construction given in Section IV of [1] which also requires $q\geq m$ .

Example 3.

Consider an $({\textbf{A}}^{S},\epsilon)$ function update problem over $\mathds{F}_{2}$ where $\epsilon=1$ and ${\textbf{A}}^{S}$ is given by

[TABLE]

Now from [12] we have $k_{2}(4,3)=1$ . Hence choosing S as parity check matrix of a $[4,1]$ repetition code over $\mathds{F}_{2}$ we achieve codelength $l_{2,\mathrm{opt}}=3$ .

If we view the above matrix ${\textbf{A}}^{S}$ as over $\mathds{F}_{4}$ and the $({\textbf{A}}^{S},\epsilon)$ function update problem over $\mathds{F}_{4}$ where $\epsilon=1$ , from [12] we have $k_{4}(4,3)=2$ . Hence choosing S as parity check matrix of a $[4,2,3]$ MDS code over $\mathds{F}_{4}$ we achieve codelength $l_{4,\mathrm{opt}}=2$ . ∎

V-A2 Code constructions for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems where $t\geq 1$

In this sub-section we provide a linear code construction for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems where ${\textbf{A}}^{S}$ is given in (4) with $t\geq 1$ . A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is a valid encoder matrix if and only if there exists a matrix ${\textbf{S}}\in\mathds{F}_{q}^{l\times m}$ such that ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ satisfies ${\textbf{S}}{\bf{z}}\neq\boldsymbol{0}$ for all ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)$ . For any vector ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)$ we write ${\bf{z}}={\textbf{A}}{\bf{y}}$ for some ${\bf{y}}\in\mathds{F}_{q}^{n}$ with $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Hence

[TABLE]

where ${\bf{y}}=[{\bf{y}}_{1}^{T}\leavevmode\nobreak\ {\bf{y}}_{2}^{T}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\bf{y}}_{a}^{T}]^{T}$ with $a=\frac{n}{K}=\frac{m}{t}$ and each ${\bf{y}}_{i}\in\mathds{F}_{q}^{K},\leavevmode\nobreak\ \forall i\in[a]$ . Since $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ , at the most $2\epsilon$ vectors among ${\bf{y}}_{1},{\bf{y}}_{2},\dots,{\bf{y}}_{a}$ are non-zero. Hence at the most $2\epsilon$ vectors among ${\bf{C}}{\bf{y}}_{1},{\bf{C}}{\bf{y}}_{2},\dots,{\bf{C}}{\bf{y}}_{a}$ are non-zero. Denote ${\bf{z}}=[{\bf{z}}_{1}^{T}\leavevmode\nobreak\ {\bf{z}}_{2}^{T}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\bf{z}}_{a}^{T}]^{T}$ where each ${\bf{z}}_{i}={\bf{C}}{\bf{y}}_{i}\in\mathds{F}_{q}^{t},\leavevmode\nobreak\ \forall i\in[a]$ . Therefore we have that at the most $2\epsilon$ vectors among ${\bf{z}}_{1},{\bf{z}}_{2},\dots,{\bf{z}}_{a}$ are non-zero. Now for any ${\bf{z}}\in\mathcal{I}_{\text{FU}}({\textbf{A}}^{S},\epsilon)$ we write

[TABLE]

where ${\textbf{S}}_{i}\in\mathds{F}_{q}^{l\times t},\leavevmode\nobreak\ i\in[a]$ is the sub-matrix of S containing $(i-1)t+1^{\mathrm{th}}$ to $it^{\mathrm{th}}$ columns of S.

I. Case-1, $t\geq 1$ , $\epsilon=1$ : To satisfy the condition given in (7) for $\epsilon=1$ it is sufficient that the columns of any two or fewer sub-matrices among ${\textbf{S}}_{1},{\textbf{S}}_{2},\dots,{\textbf{S}}_{a}$ form linearly independent set. Hence the columns of each sub-matrix ${\textbf{S}}_{i},\leavevmode\nobreak\ i\in[a]$ are linearly independent. Let $\mathcal{S}_{i}$ be the $t$ -dimensional subspace of $\mathds{F}_{q}^{l}$ generated by the columns of ${\textbf{S}}_{i}$ over $\mathds{F}_{q}$ . Now to satisfy the linear independence property of the columns of two or fewer sub-matrices among ${\textbf{S}}_{1},{\textbf{S}}_{2},\dots,{\textbf{S}}_{a}$ , it is sufficient to have $\mathcal{S}_{i}\cap\mathcal{S}_{j}=\{\boldsymbol{0}\}$ for any $i,j\in[a]$ and $i\neq j$ . Our code construction for an $({\textbf{A}}^{S},1)$ function update problem where $t\geq 1$ is based on subspace codes.

Code Construction 1.

Our aim is to construct a matrix ${\textbf{S}}=[{\textbf{S}}_{1}\leavevmode\nobreak\ {\textbf{S}}_{2}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\textbf{S}}_{a}]\in\mathds{F}_{q}^{l\times m}$ where ${\textbf{S}}_{i}\in\mathds{F}_{q}^{l\times t},\leavevmode\nobreak\ i\in[a]$ is the sub-matrix of S containing $(i-1)t+1^{\mathrm{th}}$ to $it^{\mathrm{th}}$ columns of S such that the subspaces generated by the columns any two sub-matrices ${\textbf{S}}_{i}$ and ${\textbf{S}}_{j}$ for $i\neq j,\leavevmode\nobreak\ i,j\in[a]$ are trivially intersecting. Note that for any $i\neq j,\leavevmode\nobreak\ i,j\in[a]$ the subspaces $\mathcal{S}_{i}$ and $\mathcal{S}_{j}$ generated by the columns of ${\textbf{S}}_{i}$ and ${\textbf{S}}_{j}$ respectively are $t$ dimensional subspace of $\mathds{F}_{q}^{l}$ and satisfies $\mathcal{S}_{i}\cap\mathcal{S}_{j}=\{\boldsymbol{0}\}$ . Hence to construct such S matrix we utilize pairwise trivially intersecting $t$ -dimensional subspaces $\mathcal{S}_{1},\mathcal{S}_{2},\dots,\mathcal{S}_{a}$ of $\mathds{F}_{q}^{l}$ . From the literature on subspace codes [13, 14, 15], we know that if $l\geq 2t$ then there exist at least $q^{l-t}$ pairwise trivially intersecting $t$ -dimensional subspaces in $\mathds{F}_{q}^{l}$ . Hence if $q\geq a^{\frac{1}{l-t}}$ and provided $l\geq 2t$ it is possible to find pairwise trivially intersecting $t$ -dimensional subspaces $\mathcal{S}_{1},\mathcal{S}_{2},\dots,\mathcal{S}_{a}$ of $\mathds{F}_{q}^{l}$ . Now to construct ${\textbf{S}}=[{\textbf{S}}_{1}\leavevmode\nobreak\ {\textbf{S}}_{2}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\textbf{S}}_{a}]$ we choose a basis of $i^{\mathrm{th}}$ subspace $\mathcal{S}_{i},\leavevmode\nobreak\ i\in[a]$ which contains $t$ vectors over $\mathds{F}_{q}^{l}$ and these $t$ linearly independent vectors form the columns of the sub-matrix ${\textbf{S}}_{i}$ . After constructing such S matrix, we set ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ which is a valid encoder matrix for the $({\textbf{A}}^{S},1)$ function update problem with $t\geq 1$ . Using this code construction we achieve codelength $l\geq 2t$ for $({\textbf{A}}^{S},1)$ function update problem if $q\geq a^{\frac{1}{l-t}}$ .

Example 4.

Consider an $({\textbf{A}}^{S},\epsilon)$ function update problem over $\mathds{F}_{2}$ with $\epsilon=1$ where ${\textbf{A}}^{S}\in\mathds{F}_{2}^{9\times 12}$ is given by

[TABLE]

where ${\bf{C}}\in\mathds{F}_{2}^{3\times 4}$ is given by

[TABLE]

Now our aim is to construct a matrix ${\textbf{S}}=[{\textbf{S}}_{1}\leavevmode\nobreak\ {\textbf{S}}_{2}\leavevmode\nobreak\ {\textbf{S}}_{3}]\in\mathds{F}_{q}^{l\times 9}$ such that the subspaces $\mathcal{S}_{1},\leavevmode\nobreak\ \mathcal{S}_{2},\leavevmode\nobreak\ \mathcal{S}_{3}$ generated by the columns of ${\textbf{S}}_{1},\leavevmode\nobreak\ {\textbf{S}}_{2}$ and ${\textbf{S}}_{3}$ respectively are pairwise trivially intersecting. From our construction we have that it is possible to find $3$ pairwise trivially intersecting $3$ -dimensional subspaces of $\mathds{F}_{q}^{l}$ if $q\geq 3^{\frac{1}{l-3}}$ and provided $l\geq 6$ . If we let $l=6$ then $q\geq 3^{\frac{1}{3}}$ i.e., $q\geq 2$ . Hence over $\mathds{F}_{2}$ it is possible to construct a $6\times 9$ matrix S such that ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ is a valid encoder matrix for the $({\textbf{A}}^{S},1)$ function update problem. One possible choice of $3$ pairwise trivially intersecting $3$ -dimensional subspaces of $\mathds{F}_{q}^{6}$ is $\mathcal{S}_{1}=\text{span}\{(1,0,0,0,0,0),(0,1,0,0,0,0),(0,0,1,0,0,0)\}$ , $\mathcal{S}_{2}=\text{span}\{(0,0,0,1,0,0),(0,0,0,0,1,0),\\ (0,0,0,0,0,1)\}$ and $\mathcal{S}_{3}=\text{span}\{(1,0,0,1,0,0),(0,1,0,0,1,0),(0,0,1,0,0,1)\}$ . Hence the matrix ${\textbf{S}}\in\mathds{F}_{2}^{6\times 9}$ is given by

[TABLE]

∎

II. Case-2, $t\geq 1$ , $\epsilon\geq 1$ : Here we provide a linear code construction for the family of $({\textbf{A}}^{S},\epsilon)$ function update problem where $\epsilon\geq 1$ and ${\textbf{A}}^{S}$ is given in (4) with $t\geq 1$ . To satisfy the condition given in (7) for $\epsilon\geq 1$ it is sufficient that the columns of any $2\epsilon$ or fewer sub-matrices among ${\textbf{S}}_{1},{\textbf{S}}_{2},\dots,{\textbf{S}}_{a}$ form a linearly independent set.

Code Construction 2.

Our code construction uses a linear code over $\mathds{F}_{q^{t}}$ of maximum possible dimension with block length $a$ and minimum distance $d_{\mathrm{min}}\geq 2\epsilon+1$ . Let $\hat{{\textbf{S}}}\in\mathds{F}_{q^{t}}^{\hat{l}\times a}$ be a parity check matrix of such linear code with $\hat{l}=a-k_{q^{t}}(a,2\epsilon+1)$ where $k_{q^{t}}(a,2\epsilon+1)$ denotes the maximum dimension of a linear code over $\mathds{F}_{q^{t}}$ with block length $a$ and minimum distance $d_{\mathrm{min}}\geq 2\epsilon+1$ . Note that any $2\epsilon$ columns of $\hat{{\textbf{S}}}$ are linearly independent over $\mathds{F}_{q^{t}}$ . Let $\alpha$ be a primitive element of $\mathds{F}_{q^{t}}$ and $p(x)=p_{0}+p_{1}x+p_{2}x^{2}+\dots+p_{t-1}x^{t-1}+x^{t}$ be the primitive polynomial corresponding to $\alpha$ where each $p_{j}\in\mathds{F}_{q}$ for all $j\in\{0,1,\dots,t-1\}$ . The corresponding companion matrix is given by

[TABLE]

Now we define a matrix ${\textbf{S}}=[{\textbf{S}}_{1}\leavevmode\nobreak\ {\textbf{S}}_{2}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\textbf{S}}_{a}]\in\mathds{F}_{q}^{\hat{l}t\times at}$ where for each $j\in[a],\leavevmode\nobreak\ {\textbf{S}}_{j}\in\mathds{F}_{q}^{\hat{l}t\times t}$ is given by ${\textbf{S}}_{j}=[{\textbf{S}}_{1,j}^{T}\leavevmode\nobreak\ {\textbf{S}}_{2,j}^{T}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\textbf{S}}_{\hat{l},j}^{T}\leavevmode\nobreak\ ]^{T}$ . Now for each $i\in[\hat{l}]$ and $j\in[a]$ , ${\textbf{S}}_{i,j}\in\mathds{F}_{q}^{t\times t}$ is given by

[TABLE]

where $\hat{s}_{i,j}$ is the $(i,j)^{\mathrm{th}}$ entry of $\hat{{\textbf{S}}}$ . Since any $2\epsilon$ or fewer columns of $\hat{{\textbf{S}}}$ are linearly independent then using Theorem 3 in [13] we have that the columns of any $2\epsilon$ or fewer block matrices among ${\textbf{S}}_{1},{\textbf{S}}_{2},\dots,{\textbf{S}}_{a}$ are linearly independent. Hence the matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ is a valid encoder matrix over $\mathds{F}_{q}$ with codelength $l=\hat{l}t=t(a-k_{q^{t}}(a,2\epsilon+1))$ . Since any $2\epsilon$ or fewer columns of $\hat{{\textbf{S}}}$ are linearly independent we have $\hat{l}\geq 2\epsilon$ and hence $l\geq 2\epsilon t$ with equality if and only if $\hat{{\textbf{S}}}$ is a parity check matrix of an $[a,a-2\epsilon,2\epsilon+1]$ MDS code over $\mathds{F}_{q^{t}}$ . Such an MDS code is guaranteed to exist if $q^{t}\geq a$ . Hence using this code construction we achieve codelength $l=2\epsilon t$ if $q\geq a^{\frac{1}{t}}$ .

Example 5.

Consider an $({\textbf{A}}^{S},\epsilon)$ function update problem over $\mathds{F}_{2}$ with $\epsilon=2$ where ${\textbf{A}}^{S}\in\mathds{F}_{2}^{15\times 20}$ is given by

[TABLE]

where ${\bf{C}}\in\mathds{F}_{2}^{3\times 4}$ is given by

[TABLE]

Note that $x^{3}+x+1$ is a primitive polynomial corresponding to $\mathds{F}_{8}$ and companion matrix corresponding to the primitive polynomial $x^{3}+x+1=0$ is given by

[TABLE]

Now we set $\hat{{\textbf{S}}}$ as a parity check matrix of a $[5,1,5]$ MDS code over $\mathds{F}_{8}$ which is repetition code over $\mathds{F}_{8}$ . Hence $\hat{{\textbf{S}}}\in\mathds{F}_{8}^{4\times 5}$ is given by

[TABLE]

Now we obtain the matrix ${\textbf{S}}\in\mathds{F}_{2}^{12\times 15}$ from $\hat{{\textbf{S}}}$ using (8) as

[TABLE]

Now we obtain a valid encoder matrix ${\bf{H}}={\textbf{S}}{\textbf{A}}^{S}$ with codelength $12$ over $\mathds{F}_{2}$ . ∎

V-A3 Comparison with the code in Remark 4 of [1]

Let us first briefly describe about the system model given in Remark 4 in [1] using our notations. In Remark 4 of [1] the authors considered transmission of $t$ updated information symbol vectors ${\bf{x}}_{1}+{\bf{e}}_{1},{\bf{x}}_{2}+{\bf{e}}_{2},\dots,{\bf{x}}_{t}+{\bf{e}}_{t}$ , ${\bf{x}}_{i}+{\bf{e}}_{i}\in\mathds{F}_{q}^{K},\leavevmode\nobreak\ \forall i\in[t]$ . The receiver knows coded version of each information symbol vector denoted by ${\bf{C}}{\bf{x}}_{1},{\bf{C}}{\bf{x}}_{2},\dots,{\bf{C}}{\bf{x}}_{t}$ where ${\bf{C}}\in\mathds{F}_{q}^{t\times K}$ and ${\bf{C}}{\bf{x}}_{i}\in\mathds{F}_{q}^{t},\leavevmode\nobreak\ \forall i\in[t]$ and demands updated version of the coded demands i.e., ${\bf{C}}({\bf{x}}_{1}+{\bf{e}}_{1}),{\bf{C}}({\bf{x}}_{2}+{\bf{e}}_{2}),\dots,{\bf{C}}({\bf{x}}_{t}+{\bf{e}}_{t})$ . We can view this problem as an $({\textbf{A}}^{S},\epsilon)$ function update problem where ${\textbf{A}}^{S}\in\mathds{F}_{q}^{t^{2}\times tK}$ takes the form given in (4) with the number of repetitions of the matrix ${\bf{C}}$ along the block diagonal entries of ${\textbf{A}}^{S}$ being equal to $t$ . We denote the information symbol vector as ${\bf{x}}=[{\bf{x}}_{1}^{T}\leavevmode\nobreak\ {\bf{x}}_{2}^{T}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\bf{x}}_{t}^{T}]^{T}\in\mathds{F}_{q}^{tK}$ and the update vector as ${\bf{e}}=[{\bf{e}}_{1}^{T}\leavevmode\nobreak\ {\bf{e}}_{2}^{T}\leavevmode\nobreak\ \dots\leavevmode\nobreak\ {\bf{e}}_{t}^{T}]^{T}\in\mathds{F}_{q}^{tK}$ with $\mathrm{wt}({\bf{e}})\leq\epsilon$ . The authors of [1] provide a valid code construction with codelength $2t\epsilon$ based on an MRSC using the Construction 1 in [1]. This construction from [1] is valid over any field $\mathds{F}_{q}$ .

To construct a valid code for the above function update problem we choose $\hat{{\textbf{S}}}$ as a parity check matrix of a $[t,t-2\epsilon,2\epsilon+1]$ MDS code over $\mathds{F}_{q^{t}}$ and such a code exists if $q^{t}\geq t$ . Then we construct the matrix ${\textbf{S}}\in\mathds{F}_{q}^{2t\epsilon\times t^{2}}$ from $\hat{{\textbf{S}}}$ using (8). Hence if $q\geq t^{1/t}$ we construct a valid code with codelength $2t\epsilon$ for the $({\textbf{A}}^{S},\epsilon)$ function update problem. Note that for any positive integer $t$ , $t^{1/t}<2$ . Hence over any finite field $\mathds{F}_{q}$ our construction yields a valid encoder matrix with codelength $2t\epsilon$ for the $({\textbf{A}}^{S},\epsilon)$ function update problem described above.

V-A4 Comparison of Code Construction 1 and Code Construction 2 for $({\textbf{A}}^{S},1)$ function update problem with $t\geq 1$

In this sub-section we consider the Code Construction 2 for the special case of $\epsilon=1$ and then compare the performance with the performance of the Code Construction 1. Consider an $({\textbf{A}}^{S},1)$ function update problem where ${\textbf{A}}^{S}$ is of the form given in (4). To obtain a valid code for the $({\textbf{A}}^{S},1)$ function update problem using the Code Construction 2, we use a linear code over $\mathds{F}_{q^{t}}$ of maximum possible dimension with blocklength $a$ and minimum distance $d_{\text{min}}\geq 3$ . Let $\hat{{\textbf{S}}}\in\mathds{F}_{q^{t}}^{\hat{l}\times a}$ be a parity of such linear code with $\hat{l}=a-k_{q^{t}}(a,3)$ where $k_{q^{t}}(a,3)$ denotes the maximum possible dimension of a linear code over $\mathds{F}_{q^{t}}$ with blocklength $a$ and minimum distance is at least $3$ . We construct a matrix ${\textbf{S}}\in\mathds{F}_{q}^{\hat{l}t\times at}$ from $\hat{{\textbf{S}}}$ using (8) and obtain a valid encoder matrix ${\bf{H}}$ with code length $\hat{l}t$ by multiplying S with ${\textbf{A}}^{S}$ . Note that any two or fewer columns of $\hat{{\textbf{S}}}$ are linearly independent. Hence the subspace generated by each column of $\hat{{\textbf{S}}}$ are pairwise trivially intersecting. Therefore to construct such a matrix $\hat{{\textbf{S}}}$ it is necessary and sufficient that the number of trivially intersecting $1$ -dimensional subspaces of space $\mathds{F}_{q^{t}}^{\hat{l}}$ is at least $a$ . From [16] we know that the space $\mathds{F}_{q^{t}}^{\hat{l}}$ contains exactly $(q^{\hat{l}t}-1)/(q^{t}-1)$ trivially intersecting $1$ -dimensional subspaces. Hence to construct a matrix S it is necessary and sufficient that

[TABLE]

Now using the fact $(q^{\hat{l}t}-1)/(q^{t}-1)\geq q^{\hat{l}t}/q^{t}$ (since $\hat{l}t\geq t$ ) we observe that $q^{\hat{l}t}/q^{t}\geq a$ i.e., $q\geq a^{\frac{1}{t(\hat{l}-1)}}$ is a sufficient condition for such an encoder matrix to exist. Hence applying the Code Construction 2 for an $({\textbf{A}}^{S},1)$ function update problem over $\mathds{F}_{q}$ we achieve codelength $l=t(a-k_{q^{t}}(a,3))$ if the field size $q\geq a^{\frac{1}{t(\hat{l}-1)}}$ . Hence if $q\geq a^{1/t}$ we achieve codelength $l=2t$ using the Code Construction 2 by choosing a parity check matrix of an $[a,a-2,3]$ MDS code over $\mathds{F}_{q^{t}}$ and such a MDS code exists over $\mathds{F}_{q^{t}}$ since $q^{t}\geq a$ . Note that we also achieve codelength $l=2t$ for $({\textbf{A}}^{S},1)$ function update problem using the Code Construction 1 if $q\geq a^{1/t}$ . Note that in Code Construction 2, the achieved codelength $l=\hat{l}t$ is always an integer multiple of $t$ . But applying the Code Construction 1 for $({\textbf{A}}^{S},1)$ function update problem we can achieve any codelength $l\geq 2t$ provided the field size $q\geq a^{1/l-t}$ . Hence for $({\textbf{A}}^{S},1)$ function update problem the Code Construction 2 becomes a special case of the Code Construction 1. This also inspires us to study the Code Construction 1 separately for $({\textbf{A}}^{S},1)$ function update problem.

VI Equivalence with a Functional Index Coding problem

In this section we discuss a variation of the classical index coding problem where each user demands a coded version of the information symbols present at the transmitter and already knows a subset of the (uncoded) information symbols as side information. This is a special case of the Generalized Index Coding problem [7, 8] and the Functional Index Coding problem [9]. The authors of [7, 8] generalized the classical index coding problem where each receiver knows some linearly coded information symbols as side-information and demands some linearly coded information symbols. Additionally the authors of [7] assume that the information symbols present in the transmitter are also linearly coded information symbols. In [9], authors generalized the index coding problem, where the side-information as well as demanded messages can be arbitrary functions of information symbols, called functional index coding problem. Here we consider a special case of generalized index coding problem and functional index coding problem and then we introduce the relation between function update problem and this family of functional index coding problems.

VI-A Functional Index Coding with Coded Demand and Uncoded Side Information

Consider a broadcast network scenario with single transmitter and $\hat{K}$ receivers $u_{1},u_{2},\dots,u_{\hat{K}}$ . The transmitter has a vector of $n$ information symbols ${\bf{x}}=(x_{1},x_{2},\dots,x_{n})\in\mathds{F}_{q}^{n}$ . Each receiver knows a subset of the information symbols as side-information. Let ${\bf{x}}_{\mathcal{X}_{i}}$ be the side-information vector of $i^{\mathrm{th}}$ receiver $u_{i}$ where $\mathcal{X}_{i}\subseteq[n],\leavevmode\nobreak\ i\in[\hat{K}]$ . Each receiver demands a coded version of the information symbols vector ${\bf{x}}$ . Let ${\textbf{A}}_{i}{\bf{x}}$ be the coded demand of $i^{\mathrm{th}}$ receiver $u_{i}$ where ${\textbf{A}}_{i}\in\mathds{F}_{q}^{m\times n}$ with rank $({\textbf{A}}_{i})=m$ . Upon denoting $\mathcal{A}=({\textbf{A}}_{1},{\textbf{A}}_{2},\dots,{\textbf{A}}_{\hat{K}})$ and $\mathcal{X}=(\mathcal{X}_{1},\mathcal{X}_{1},\dots,\mathcal{X}_{\hat{K}})$ we describe the problem instance as $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem. A valid encoding function $\mathfrak{E}_{\mathrm{FIC}}$ over $\mathds{F}_{q}$ for an $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem is

[TABLE]

such that for each receiver $u_{i},$ $i\in[\hat{K}]$ there exists a decoding function $\mathfrak{D}_{i,\mathrm{FIC}}:\mathds{F}_{q}^{l}\times\mathds{F}_{q}^{|\mathcal{X}_{i}|}\rightarrow\mathds{F}_{q}^{m}$ satisfying the following property: $\mathfrak{D}_{i,\mathrm{FIC}}(\mathfrak{E}_{\mathrm{FIC}}({\bf{x}}),{\bf{x}}_{\mathcal{X}_{i}})={\textbf{A}}_{i}{\bf{x}}$ for every ${\bf{x}}\in\mathds{F}_{q}^{n}$ .

The design objective is to design a tuple $(\mathfrak{E}_{\mathrm{FIC}},\mathfrak{D}_{1,\mathrm{FIC}},\mathfrak{D}_{2,\mathrm{FIC}},\dots,\mathfrak{D}_{\hat{K},\mathrm{FIC}})$ of encoding and decoding functions that minimizes the codelength $l$ and determine the optimal codelength for the given functional index coding problem which is the minimum codelength among all coding schemes.

A linear code for an $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem is defined as a coding scheme where the encoding function $\mathfrak{E}_{\mathrm{FIC}}:\mathds{F}_{q}^{n}\rightarrow\mathds{F}_{q}^{l}$ is a linear transformation over $\mathds{F}_{q}$ described as $\mathfrak{E}_{\mathrm{FIC}}({\bf{x}})={\bf{H}}{\bf{x}}$ , where ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is the encoder matrix for linear functional index code. The minimum codelength among all valid linear coding schemes for the $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem over the field $\mathds{F}_{q}$ will be denoted as $l_{q,\mathrm{opt},\mathrm{FIC}}$ .

Now we derive a design criterion for a matrix ${\bf{H}}$ to be a valid encoder matrix for $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem. We define the set $\mathcal{I}_{\mathrm{FIC}}(\hat{K},n,\mathcal{X},\mathcal{A})$ , or equivalently $\mathcal{I}_{\mathrm{FIC}}$ , of vectors ${\bf{y}}$ of length $n$ such that ${\bf{y}}_{\mathcal{X}_{i}}={\bf{0}}\in\mathds{F}_{q}^{|\mathcal{X}_{i}|}$ and ${\textbf{A}}_{i}{\bf{y}}\neq\boldsymbol{0}$ for some choice of $i\in[\hat{K}]$ i.e.,

[TABLE]

Theorem 7.

The matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is a valid encoder matrix for the $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem if and only if

[TABLE]

Proof.

A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ is a valid encoder matrix for the $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem if and only if at each receiver $u_{i},\leavevmode\nobreak\ i\in[\hat{K}]$ , ${\textbf{A}}_{i}{\bf{x}}$ can be uniquely determined from the received codeword ${\bf{H}}{\bf{x}}$ and the side information ${\bf{x}}_{\mathcal{X}_{i}}$ . Hence for two distinct pair of the information symbol vectors $({\bf{x}},{\bf{x}}^{\prime})$ such that the side-information symbol vectors available at the $i^{\text{th}}$ receiver are identical i.e., ${\bf{x}}_{\mathcal{X}_{i}}={\bf{x}}^{\prime}_{\mathcal{X}_{i}}$ but demanded coded information symbol vectors are distinct i.e., ${\textbf{A}}_{i}{\bf{x}}\neq{\textbf{A}}_{i}{\bf{x}}^{\prime}$ then the transmitted codeword ${\bf{H}}{\bf{x}}$ must be distinct from ${\bf{H}}{\bf{x}}^{\prime}$ to distinguish two different demanded coded information symbol vectors. Equivalently, the condition ${\bf{H}}{\bf{x}}\neq{\bf{H}}{\bf{x}}^{\prime}$ should hold for every pair ${\bf{x}},{\bf{x}}^{\prime}\in\mathds{F}_{q}^{n}$ such that ${\textbf{A}}_{i}{\bf{x}}\neq{\textbf{A}}_{i}{\bf{x}}^{\prime}$ and ${\bf{x}}_{\mathcal{X}_{i}}={\bf{x}}^{\prime}_{\mathcal{X}_{i}}$ for some $i\in[K]$ . Therefore ${\bf{H}}$ is a valid encoder matrix if and only if

[TABLE]

for all ${\bf{x}},{\bf{x}}^{\prime}\in\mathds{F}_{q}^{n}$ such that ${\textbf{A}}_{i}{\bf{x}}\neq{\textbf{A}}_{i}{\bf{x}}^{\prime}$ and ${\bf{x}}_{\mathcal{X}_{i}}={\bf{x}}^{\prime}_{\mathcal{X}_{i}}$ for some $i\in[\hat{K}]$ . Now denoting ${\bf{y}}={\bf{x}}-{\bf{x}}^{\prime}$ we have

[TABLE]

for all ${\bf{y}}\in\mathds{F}_{q}^{n}$ such that ${\textbf{A}}_{i}{\bf{y}}\neq\boldsymbol{0}$ and ${\bf{y}}_{\mathcal{X}_{i}}=\boldsymbol{0}$ for some $i\in[\hat{K}]$ . Hence the statement of the theorem follows. ∎

VI-B Construction of a Equivalent Functional Index Coding Problem from a given Function Update problem

Now we construct an $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem starting from an $({\textbf{A}},\epsilon)$ function update problem. The number of receivers $\hat{K}$ , the tuple of the side information indices $\mathcal{X}$ and the tuple of coded demands $\mathcal{A}$ are obtained from Algorithm 1.

Algorithm 1 considers every possible choice of $Q\subseteq[n]$ such that $|Q|=\min(2\epsilon,n)$ and defines a new user $u_{j}$ in the functional index coding problem with demand matrix ${\textbf{A}}_{j}={\textbf{A}}$ and side information $\mathcal{X}_{j}=[n]\setminus Q$ .

Now we relate the set $\mathcal{I}({\textbf{A}},\epsilon)$ defined for the $({\textbf{A}},\epsilon)$ Function Update problem and the set $\mathcal{I}_{\mathrm{FIC}}$ defined in (9) for the constructed $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem.

Theorem 8.

For any given $({\textbf{A}},\epsilon)$ function update problem and its corresponding $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem, $\mathcal{I}({\textbf{A}},\epsilon)=\mathcal{I}_{\mathrm{FIC}}(\hat{K},n,\mathcal{X},\mathcal{A})$ .

Proof.

To show that $\mathcal{I}({\textbf{A}},\epsilon)=\mathcal{I}_{\mathrm{FIC}}(\hat{K},n,\mathcal{X},\mathcal{A})$ , we will show that $\mathcal{I}({\textbf{A}},\epsilon)\subseteq\mathcal{I}_{\mathrm{FIC}}$ and $\mathcal{I}_{\mathrm{FIC}}\subseteq\mathcal{I}({\textbf{A}},\epsilon)$ .

Proof for $\mathcal{I}({\textbf{A}},\epsilon)\subseteq\mathcal{I}_{\mathrm{FIC}}$ : Suppose a vector ${\bf{y}}\in\mathcal{I}({\textbf{A}},\epsilon)$ . Then from (3), we have ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ and $0<\mathrm{wt}({\bf{y}})\leq 2\epsilon$ . Hence there exists a $Q\subseteq[n]$ such that $|Q|=\min(2\epsilon,n)$ and ${\bf{y}}_{[n]\setminus Q}=\mathbf{0}$ . Now using the construction procedure described in Algorithm 1 we see that there exists a user $u_{j}$ in the constructed functional index coding problem such that $\mathcal{X}_{j}=[n]\setminus Q$ and ${\textbf{A}}_{j}={\textbf{A}}$ . The vector ${\bf{y}}$ satisfies ${\bf{y}}_{\mathcal{X}_{j}}=\boldsymbol{0}$ and ${\textbf{A}}_{j}{\bf{y}}\neq\boldsymbol{0}$ . Hence ${\bf{y}}\in\mathcal{I}_{\mathrm{FIC}}$ .

Proof for $\mathcal{I}_{\mathrm{FIC}}\subseteq\mathcal{I}({\textbf{A}},\epsilon)$ : Suppose a vector ${\bf{y}}\in\mathcal{I}_{\mathrm{FIC}}$ . Then there exists at least one user $j\in[\hat{K}]$ such that ${\textbf{A}}_{j}{\bf{y}}\neq\boldsymbol{0}$ and ${\bf{y}}_{\mathcal{X}_{j}}=\mathbf{0}$ . Since ${\textbf{A}}_{j}{\bf{y}}\neq\boldsymbol{0}$ we have ${\bf{y}}\neq\boldsymbol{0}$ . From Algorithm 1 we see that for any $j\in[\hat{K}]$ , $|\mathcal{X}_{j}|=n-\min(2\epsilon,n)$ . Note that $\mathrm{wt}({\bf{y}})=\mathrm{wt}({\bf{y}}_{\mathcal{X}_{j}})+\mathrm{wt}({\bf{y}}_{[n]\setminus\mathcal{X}_{j}})\leq 2\epsilon$ . Again from the construction we have ${\textbf{A}}_{i}={\textbf{A}},\leavevmode\nobreak\ \forall j\in[K]$ . Therefore ${\textbf{A}}{\bf{y}}\neq\boldsymbol{0}$ . Hence ${\bf{y}}\in\mathcal{I}({\textbf{A}},\epsilon)$ .

Hence the theorem holds. ∎

Now we relate the problem of constructing linear codes for function update problem to the problem of designing linear coding scheme for the corresponding functional index coding problem.

Theorem 9.

A matrix ${\bf{H}}\in\mathds{F}_{q}^{l\times n}$ such that ${\bf{H}}={\textbf{S}}{\textbf{A}}$ for some matrix ${\textbf{S}}\in\mathds{F}_{q}^{l\times m}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if ${\bf{H}}$ is a valid encoder matrix for the $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem.

Proof.

From Theorem 2 we know that ${\bf{H}}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem if and only if it satisfies

[TABLE]

Now from Theorem 8 we have $\mathcal{I}({\textbf{A}},\epsilon)=\mathcal{I}_{\mathrm{FIC}}(\hat{K},n,\mathcal{X},\mathcal{A})$ . Therefore using Theorem 7 we conclude that ${\bf{H}}$ is a valid encoder matrix for the $(\hat{K},n,\mathcal{X},\mathcal{A})$ functional index coding problem if and only if ${\bf{H}}$ is a valid encoder matrix for the $({\textbf{A}},\epsilon)$ function update problem. ∎

Acknowledgment

The authors thank Dr V. Lalitha for discussions regarding the topic of this paper.

Bibliography16

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] N. Prakash and M. Médard, “Communication Cost for Updating Linear Functions When Message Updates are Sparse: Connections to Maximally Recoverable Codes,” IEEE Transactions on Information Theory , vol. 64, no. 12, pp. 7557–7576, Dec 2018.
2[2] P. Nakkiran, N. B. Shah, and K. V. Rashmi, “Fundamental limits on communication for oblivious updates in storage networks,” in 2014 IEEE Global Communications Conference , Dec 2014, pp. 2363–2368.
3[3] P. Nakkiran, N. B. Shah, K. V. Rashmi, A. Sahai, and K. Ramchandran, “Optimal Oblivious Updates in Distributed Storage Networks.” [Online]. Available: www.cs.cmu.edu/%7Ervinayak/papers/Ob Up.pdf
4[4] M. Mahdian, N. Prakash, M. Médard, and E. Yeh, “Updating Content in Cache-Aided Coded Multicast,” Co RR , vol. abs/1805.00396, 2018. [Online]. Available: https://arxiv.org/abs/1805.00396
5[5] R. E. Ali and V. R. Cadambe, “Multi-version Coding for Consistent Distributed Storage of Correlated Data Updates,” Co RR , vol. abs/1708.06042, 2017. [Online]. Available: https://arxiv.org/abs/1708.06042
6[6] Z. Wang and V. R. Cadambe, “Multi-Version Coding–An Information-Theoretic Perspective of Consistent Distributed Storage,” IEEE Transactions on Information Theory , vol. 64, no. 6, pp. 4540–4561, June 2018.
7[7] M. Dai, K. W. Shum, and C. W. Sung, “Data Dissemination With Side Information and Feedback,” IEEE Transactions on Wireless Communications , vol. 13, no. 9, pp. 4708–4720, Sep. 2014.
8[8] N. Lee, A. G. Dimakis, and R. W. Heath, “Index Coding With Coded Side-Information,” IEEE Communications Letters , vol. 19, no. 3, pp. 319–322, March 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Codes for Updating Linear Functions over

Abstract

I Introduction

II System Model and Preliminaries

Definition 1**.**

Theorem 1** (Theorem 2, [1]).**

Proof.

Lemma 1** (Remark 2, [1]).**

Theorem 2**.**

Corollary 1**.**

III Necessary and Sufficient Condition for lq,opt<ml_{q,\mathrm{opt}}<mlq,opt​<m

Lemma 2**.**

Proof.

III-A A coding scheme for a family of (A,ϵ)({\textbf{A}},\epsilon)(A,ϵ) function update problems

Lemma 3**.**

Proof.

Example 1**.**

Theorem 3**.**

Proof.

Corollary 2**.**

Proof.

III-B Relation with covering radius

Corollary 3**.**

Proof.

Example 2**.**

IV Lower Bound on Optimal Codelength

Lemma 4**.**

Proof.

Lemma 5**.**

Proof.

Theorem 4**.**

Proof.

V Code constructions

Theorem 5**.**

Proof.

V-A Code constructions for striped data

V-A1 Code Constructions for the family of (AS,ϵ)({\textbf{A}}^{S},\epsilon)(AS,ϵ) function update problems with t=1t=1t=1

Theorem 6**.**

Proof.

Example 3**.**

V-A2 Code constructions for the family of (AS,ϵ)({\textbf{A}}^{S},\epsilon)(AS,ϵ) function update problems where t≥1t\geq 1t≥1

Code Construction 1**.**

Example 4**.**

Code Construction 2**.**

Example 5**.**

V-A3 Comparison with the code in Remark 4 of [1]

V-A4 Comparison of Code Construction 1 and Code Construction 2 for (AS,1)({\textbf{A}}^{S},1)(AS,1) function update problem with t≥1t\geq 1t≥1

VI Equivalence with a Functional Index Coding problem

VI-A Functional Index Coding with Coded Demand and Uncoded Side Information

Theorem 7**.**

Proof.

VI-B Construction of a Equivalent Functional Index Coding Problem from a given Function Update problem

Theorem 8**.**

Proof.

Theorem 9**.**

Proof.

Acknowledgment

Definition 1.

Theorem 1 (Theorem 2, [1]).

Lemma 1 (Remark 2, [1]).

Theorem 2.

Corollary 1.

III Necessary and Sufficient Condition for $l_{q,\mathrm{opt}}<m$

Lemma 2.

III-A A coding scheme for a family of $({\textbf{A}},\epsilon)$ function update problems

Lemma 3.

Example 1.

Theorem 3.

Corollary 2.

Corollary 3.

Example 2.

Lemma 4.

Lemma 5.

Theorem 4.

Theorem 5.

V-A1 Code Constructions for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems with $t=1$

Theorem 6.

Example 3.

V-A2 Code constructions for the family of $({\textbf{A}}^{S},\epsilon)$ function update problems where $t\geq 1$

Code Construction 1.

Example 4.

Code Construction 2.

Example 5.

V-A4 Comparison of Code Construction 1 and Code Construction 2 for $({\textbf{A}}^{S},1)$ function update problem with $t\geq 1$

Theorem 7.

Theorem 8.

Theorem 9.