Subspace-Aware Index Codes

Bhavya Kailkhura; Lakshmi Narasimhan Theagarajan; Pramod K. Varshney

arXiv:1702.03589·cs.IT·April 11, 2017


TL;DR

This paper introduces subspace-aware index coding, leveraging low-dimensional data structures to significantly improve system throughput, with an efficient algorithm for near-optimal coding, outperforming traditional methods.

Contribution

It generalizes index coding to exploit data subspace structures and proposes an efficient alternating minimization algorithm for near-optimal codes.

Findings

01. Achieves a throughput gain of about 90% over subspace-unaware index codes under certain conditions (in simulation).

02. Develops an algebraic framework for subspace-aware index coding.

03. Provides an efficient alternating minimization algorithm for near-optimal code design.

Abstract

In this paper, we generalize the well-known index coding problem to exploit the structure in the source-data to improve system throughput. In many applications, the data to be transmitted may lie (or can be well approximated) in a low-dimensional subspace. We exploit this low-dimensional structure of the data using an algebraic framework to solve the index coding problem (referred to as subspace-aware index coding) as opposed to the traditional index coding problem which is subspace-unaware. Also, we propose an efficient algorithm based on the alternating minimization approach to obtain near optimal index codes for both subspace-aware and -unaware cases. Our simulations indicate that under certain conditions, a significant throughput gain (about 90%) can be achieved by subspace-aware index codes over conventional subspace-unaware index codes.

Equations

$${\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf T}{\bf w}\\ {\bf y}\end{bmatrix}={\bf x}_{\mathcal{R}_{j}},\ \forall j,\quad\text{s.t. }{\bf y}={\bf C}{\bf x}={\bf C}{\bf T}{\bf w}.$$

$${\bf S}_{1[1,8,9]}=\begin{bmatrix}1&0\\ 0&h_{8}\\ 0&h_{9}\end{bmatrix}^{T},\quad{\bf S}_{2[2,5,7]}=\begin{bmatrix}1&0\\ 0&h_{5}\\ 0&h_{7}\end{bmatrix}^{T},\quad{\bf S}_{3[3,6,10]}=\begin{bmatrix}1&0\\ 0&h_{6}\\ 0&h_{10}\end{bmatrix}^{T},\quad\text{and}\quad{\bf S}_{4[4,11,12]}=\begin{bmatrix}1&0\\ 0&h_{11}\\ 0&h_{12}\end{bmatrix}^{T}.$$

$${\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf T}{\bf w}\\ {\bf y}\end{bmatrix}={\bf A}_{j}{\bf S}_{j}{\bf T}{\bf w}+{\bf B}_{j}{\bf y}={\bf x}_{\mathcal{R}_{j}}.$$

$${\bf B}{\bf C}{\bf T}=({\bf R}-{\bf A}{\bf S}){\bf T}=\widetilde{\bf R}{\bf T},$$

$${\bf B}\triangleq[{\bf B}_{1}^{T},\cdots,{\bf B}_{U}^{T}]^{T},\quad{\bf S}\triangleq[{\bf S}_{1}^{T},\cdots,{\bf S}_{U}^{T}]^{T},\quad{\bf A}\triangleq\operatorname{blkdiag}({\bf A}_{1},\cdots,{\bf A}_{U}),\quad{\bf R}\triangleq[{\bf R}_{1}^{T},\cdots,{\bf R}_{U}^{T}]^{T},\quad\widetilde{\bf R}\triangleq{\bf R}-{\bf A}{\bf S}.$$

$$\operatorname{rk}({\bf B}{\bf C}{\bf T})\leq\min(\operatorname{rk}({\bf B}),\operatorname{rk}({\bf C}{\bf T}))=\operatorname{rk}({\bf C}{\bf T}).$$

$$\operatorname{rk}({\bf B}{\bf C}{\bf T})\geq\operatorname{rk}({\bf B})+\operatorname{rk}({\bf C}{\bf T})-L=\operatorname{rk}({\bf C}{\bf T}).$$

$$\operatorname{rk}({\bf C})\leq\operatorname{rk}({\bf C}{\bf T})+N-D=\operatorname{rk}(\widetilde{\bf R}{\bf T})+N-D.$$

$$L^{*}=\min_{{\bf A}_{1},{\bf A}_{2},\dots,{\bf A}_{U}}\operatorname{rk}(\widetilde{\bf R}{\bf T}).$$

$$L^{*}=\min_{{\bf A}}\operatorname{rk}\big(({\bf R}-{\bf A}{\bf S}){\bf T}\big)$$

$$\min(\tilde{L}-(N-D),1)\leq L^{*}\leq\tilde{L}$$

$$\operatorname{rk}(\widetilde{\bf R}{\bf T})\geq\operatorname{rk}(\widetilde{\bf R})+D-N\geq\tilde{L}-(N-D).$$

$$\Big\|{\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf x}\\ {\bf y}\end{bmatrix}-{\bf x}_{\mathcal{R}_{j}}\Big\|\leq\epsilon,\ \forall j.$$

$$\min_{\{{\bf Z}_{j},{\bf A}_{j}\}_{j=1}^{U}}\ \sum_{j=1}^{U}\big\|{\bf Z}_{j}-({\bf R}_{j}-{\bf A}_{j}{\bf S}_{j}){\bf T}\big\|_{F}^{2}=\min_{{\bf Z},\{{\bf A}_{j}\}_{j=1}^{U}}\ \big\|{\bf Z}-\widetilde{\bf R}{\bf T}\big\|_{F}^{2}.$$

$$\|{\bf R}{\bf x}-{\bf R}\hat{{\bf x}}\|=\|{\bf R}{\bf T}{\bf w}-({\bf A}{\bf S}{\bf T}{\bf w}+{\bf B}{\bf C}{\bf T}{\bf w})\|,$$

$$\|{\bf R}{\bf x}-{\bf R}\hat{{\bf x}}\|\leq\|\widetilde{\bf R}{\bf T}-{\bf B}\widetilde{\bf Z}\|_{F}\,\|{\bf w}\|_{2}\leq\epsilon.$$

$${\bf S}_{1}=\begin{bmatrix}0&0\\ 1&0\\ 0&1\\ 0&0\end{bmatrix}^{T},\quad{\bf S}_{2}=\begin{bmatrix}1&0\\ 0&0\\ 0&1\\ 0&0\end{bmatrix}^{T},\quad{\bf S}_{3}=\begin{bmatrix}0&0\\ 1&0\\ 0&0\\ 0&1\end{bmatrix}^{T},\quad{\bf S}_{4}=\begin{bmatrix}1&0&0&0\end{bmatrix}.$$

$$\widetilde{\bf R}=\begin{bmatrix}1&-1.0962388&0&-0.8506375\\ -0.9122099&1&0.8499089&0\\ -1.0733032&1.1765966&1&0\\ 0&0&-1.0952999&1\end{bmatrix}^{T}.$$

$${\bf C}=\begin{bmatrix}1&-0.8506375\\ -0.9122099&0\\ -1.0733032&0\\ 0&1\end{bmatrix}^{T}.$$


Taxonomy

Topics: Cooperative Communication and Network Coding · Advanced MIMO Systems Optimization · Wireless Networks and Protocols

Full text

Subspace-Aware Index Codes

Bhavya Kailkhura∗†, Lakshmi Narasimhan Theagarajan∗‡, Pramod K. Varshney‡

This work was supported in part by NSF Grant no. ECCS 1609916. This work was performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344. LLNL-JRNL-718227.

∗ These authors contributed equally to this work.

† This author is presently affiliated to Lawrence Livermore National Laboratory, [email protected]

‡ These authors are at the Department of EECS, Syracuse University, New York, {ltheagar, varshney}@syr.edu.

Abstract

In this paper, we generalize the well-known index coding problem to exploit the structure in the source-data to improve system throughput. In many applications (e.g., multimedia), the data to be transmitted may lie (or can be well approximated) in a low-dimensional subspace. We exploit this low-dimensional structure of the data using an algebraic framework to solve the index coding problem (referred to as subspace-aware index coding) as opposed to the traditional index coding problem which is subspace-unaware. Also, we propose an efficient algorithm based on the alternating minimization approach to obtain near optimal index codes for both subspace-aware and -unaware cases. Our simulations indicate that under certain conditions, a significant throughput gain (about $90\%$ ) can be achieved by subspace-aware index codes over conventional subspace-unaware index codes.

Keywords – *Index coding, coded side-information, low-dimensional data, alternating minimization*

I Introduction

Index coding with side-information (ICSI) [1, 2, 3] is a problem in which a server has $N$ stored messages that it can broadcast over a noiseless channel to a set of receivers or clients. Each client has a subset of the $N$ messages as side information, and requests a subset of messages that it needs from the server. The objective of the ICSI problem is to devise an optimal coding strategy that minimizes the number of broadcast transmissions made by the server to satisfy the requirements of all the clients. The optimality criterion of an index code is its code length. The optimal index code length, i.e., the minimum number of transmissions required from the server for successful recovery of the desired information at the clients, was first characterized in [1, 2] as the minimum rank of a matrix that represents the side-information graph [2]. A method to construct index codes by solving a matrix completion problem was presented in [4]. In many practical scenarios, users possess coded side information (CSI) [5]; index codes for linear CSI were studied in [5, 6]. Index codes over the real field and their construction methods are investigated in [7, 8]. Considering index codes over the real field enables the use of efficient optimization techniques to construct near-optimal index codes. Further, it was shown in [9] that network codes are equivalent to index codes. Thus, one can construct an optimal network code by constructing an optimal index code for the equivalent problem. Network codes over the real field are discussed in [10, 11].

The source-data encountered in many practical systems, such as data caching, images, video streaming, and big-data storage and processing, can be well approximated using a lower-dimensional linear subspace [12, 13, 14] (and references therein). Motivated by such applications, we propose a technique to construct index codes when the source-data belongs to a lower-dimensional linear subspace. For example, it is well known that a facial image is a point in a high-dimensional image space that can be well approximated in a lower-dimensional linear subspace. The lower-dimensional subspace is found using Principal Component Analysis, which identifies the axes with maximum variance. In Figure 1(a), a set of eigenvectors (known as eigenfaces) is shown for the AT&T Face Database. There are ten different images of each of the 40 distinct subjects. The size of each image is $92\times 112$ pixels, with 256 grey levels per pixel. In Figure 1(b), we can see that a good reconstruction quality can be obtained using a very small number of eigenfaces. (These results are obtained from https://github.com/bytefish/facerec.)
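The low-dimensional approximation described above can be illustrated with a minimal sketch. It does not use the AT&T face data; it assumes synthetic data generated near a 5-dimensional subspace of a 50-dimensional space, and all sizes and variable names are chosen only for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in for an image set (an assumption, not the face database):
# 400 samples in a 50-dimensional space that lie near a 5-dimensional subspace.
n_samples, dim, true_rank = 400, 50, 5
basis = rng.standard_normal((dim, true_rank))
weights = rng.standard_normal((true_rank, n_samples))
data = basis @ weights + 1e-3 * rng.standard_normal((dim, n_samples))

# PCA via the SVD of the centered data; columns of U are the principal axes.
mean = data.mean(axis=1, keepdims=True)
U, s, _ = np.linalg.svd(data - mean, full_matrices=False)

def relative_error(k):
    """Relative Frobenius error after keeping the top-k principal axes."""
    T = U[:, :k]                                  # basis of the k-dim subspace
    approx = mean + T @ (T.T @ (data - mean))     # project and reconstruct
    return np.linalg.norm(data - approx) / np.linalg.norm(data)

errors = {k: relative_error(k) for k in (1, 3, 5, 10)}
```

In this synthetic setting the error drops sharply once `k` reaches the true subspace dimension, which is the behaviour the eigenface example relies on.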

More specifically, the main contributions of this paper can be summarized as follows.

• We generalize the index coding problem with coded (and/or uncoded) side information to exploit the low-dimensional structure that may be present in the source-data.

• We establish bounds on the gain achieved by subspace-aware index codes over the subspace-unaware case.

• We consider the design of subspace-aware/unaware index codes with coded/uncoded side information in a unified optimization framework and develop an efficient algorithm to construct near-optimal index codes.

• Finally, we provide theoretical guarantees and simulation results on the performance of the proposed techniques.

The notations followed in the rest of this paper are: $\operatorname{rk}(.)$ denotes the rank of a matrix, $\operatorname{span}(.)$ denotes a vector space spanned by a set of vectors, $(.)^{\dagger}$ denotes the pseudo-inverse of a matrix, and $\|.\|_{F}$ denotes the Frobenius norm of a matrix.

II Problem setup

Consider a network with $U$ users and a data source (DS). Let $N$ denote the total number of data packets involved in a transmission instance, $P$ denote the size of each data packet, ${\bf x}_{i}$ denote the data in the $i$ th packet, ${\bf x}_{i}\in\mathbb{R}^{P}$ for $i=1,2,\cdots,N$ , and ${\bf x}\triangleq[{\bf x}_{1},{\bf x}_{2},\cdots,{\bf x}_{N}]^{T}\in\mathbb{R}^{PN}$ . The $j$ th user requests $V_{j}$ number of packets from the DS, $\mathcal{R}_{j}$ is the set of all indices of the requested data packets by the $j$ th user, $|\mathcal{R}_{j}|=V_{j}$ for $j=1,2,\cdots,U$ , and ${\bf x}_{\mathcal{R}_{j}}$ denotes the $PV_{j}\times 1$ information vector requested by the $j$ th user. Each user possesses a linearly coded side information. Let $M_{j}$ denote the length of the CSI and ${\bf S}_{j}\in\mathbb{R}^{PM_{j}\times PN}$ denote the side information coding matrix for the user $j$ , $0\leq M_{j}<N$ . The CSI of the $j$ th user is given by the vector ${\bf S}_{j}{\bf x}$ . When the $j$ th user has uncoded side information (USI), the side information consists of $M_{j}$ data packets, and the non-zero columns of ${\bf S}_{j}$ form an identity matrix of dimension $PM_{j}\times PM_{j}$ .

If the vector ${\bf x}$ belongs to a low-dimensional subspace, then ${\bf x}={\bf T}{\bf w}$ , where ${\bf T}\in\mathbb{R}^{PN\times PD}$ ( $1\leq D<N$ ) is the matrix of basis vectors of the low-dimensional subspace, ${\bf w}\in\mathbb{R}^{PD}$ , and $\operatorname{rk}({\bf T})=PD$ .

Goal: Knowing $\mathcal{R}_{j}$ , matrices ${\bf S}_{j}$ and subspace structure $\mathbf{T}$ for $j=1,2,\cdots,U$ , the goal is to have the DS broadcast the least number of coded data packets to $U$ users such that each user is able to successfully decode the requested packets. ∎

Let ${\bf y}\triangleq[{\bf y}_{1},{\bf y}_{2},\cdots,{\bf y}_{L}]^{T}\in\mathbb{R}^{PL}$ be the data vector transmitted by the DS. Now, each user needs to decode ${\bf x}_{\mathcal{R}_{j}}\in\mathbb{R}^{PV_{j}}$ from $[({\bf S}_{j}{\bf x})^{T}\,{\bf y}^{T}]^{T}$ . Assuming linear decoding, the $j$ th user performs the decoding as $\widehat{{\bf x}}_{\mathcal{R}_{j}}={\bf D}_{j}[({\bf S}_{j}{\bf x})^{T}\,{\bf y}^{T}]^{T}$ , where ${\bf D}_{j}$ is the decoding matrix. For linear encoding, this problem can be stated as follows.

Problem: Find a matrix ${\bf C}\in\mathbb{R}^{PL\times PN}$ such that

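$${\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf T}{\bf w}\\ {\bf y}\end{bmatrix}={\bf x}_{\mathcal{R}_{j}},\ \forall j,\quad\text{s.t. }{\bf y}={\bf C}{\bf x}={\bf C}{\bf T}{\bf w}.\tag{1}$$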

We refer to the matrix ${\bf C}$ as the $L$ -length index code. For a given $\{\mathcal{R}_{j}\}$ , $\{{\bf S}_{j}\}$ , and ${\bf T}$ , the matrix ${\bf C}$ with the least number of rows $L^{*}$ satisfying the condition in (1) is the optimal index code and $L^{*}$ is the optimal index code length. Note that, in our proposed methodology, compression and index coding are performed in a unified framework. This helps to further simplify the receiver by relieving it of the separate decompression algorithm, reduces computational complexity, and improves overall system throughput.

Examples: The problem described above is often encountered in practical scenarios such as cloud networks, multicast video-streaming and content-sharing. Since the users are connected to multiple datacenters, each user may have different subsets of the same data and require different subsets. The data could be low-dimensional due to its inherent nature (e.g., videos, images, and sensory data [12, 13]) or the usage of redundancy-inducing error correcting codes [15]. Here, the datacenters employ index codes to serve the users’ requests to increase network efficiency and throughput. A similar problem is also encountered in distributed computing setups [16], distributed cognitive radio networks and satellite networks. Next, we describe these applications in more detail.

( $1$ ) Consider a wireless relay network represented in Fig. 2(a). Source nodes $s_{1}$ and $s_{2}$ wish to broadcast their data to user nodes $u_{1}$ , $u_{2}$ , and $u_{3}$ with the help of a relay node $r_{1}$ . The signals transmitted by $s_{1}$ and $s_{2}$ decay with distance and the SNR deteriorates to the extent that their signals are not decodable beyond a certain radius of transmission. Consequently, in a transmission time slot $t$ , nodes $u_{1}$ and $r_{1}$ successfully decode the data from $s_{1}$ , nodes $u_{3}$ and $r_{1}$ successfully decode the data from $s_{2}$ , and the node $u_{2}$ receives a linear combination of the data from $s_{1}$ and $s_{2}$ . In time slot $t+1$ , the relay node broadcasts a coded combination of data from $s_{1}$ and $s_{2}$ with which the nodes $u_{1}$ , $u_{2}$ , and $u_{3}$ are able to decode the data from both $s_{1}$ and $s_{2}$ . Here, the relay node comes up with an index code such that $\mathcal{R}_{1}=\{2\},\mathcal{R}_{2}=\{1,2\},\mathcal{R}_{3}=\{1\}$ , ${\bf S}_{1}=[1,0]$ (USI), ${\bf S}_{2}=[h_{1},h_{2}]$ (CSI), ${\bf S}_{3}=[0,1]$ (USI), where $h_{1}$ and $h_{2}$ are the linear coefficients at $u_{2}$ . When the transmitted data is encoded with the same linear channel code at $s_{1}$ and $s_{2}$ , the transmitted data belongs to a low-dimensional subspace. The columns of ${\bf T}$ are the bases of this subspace created by the linear channel code.

( $2$ ) Consider a collaborative cognitive radio (CR) network of multiple low-cost CRs $c_{i}$ , where $i=1,2,\cdots,12$ , as illustrated in Fig. 2(b). Each CR senses a disjoint band of a wide spectrum and broadcasts the information to all its nearest neighbors. The data collected by all the CRs are finally fused at the fusion center (FC) $f_{0}$ . The FC forms the complete map of the wideband spectrum. This complete map has to be conveyed back to the CRs. Since the CR network is power constrained, the FC conveys this information to its neighbors in the least number of broadcasts using an index code. Further, as the low-cost CRs have limited memory, each CR stores only a linear combination of all the data it receives. The CRs $c_{1},c_{2},c_{3}$ , and $c_{4}$ in turn develop index codes to broadcast to their neighbors. At the FC, $\mathcal{R}_{1}=\{1,8,9\}^{c}$ , $\mathcal{R}_{2}=\{2,5,7\}^{c}$ , $\mathcal{R}_{3}=\{3,6,10\}^{c}$ , $\mathcal{R}_{4}=\{4,11,12\}^{c}$ ,

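$${\bf S}_{1[1,8,9]}=\begin{bmatrix}1&0\\ 0&h_{8}\\ 0&h_{9}\end{bmatrix}^{T},\quad{\bf S}_{2[2,5,7]}=\begin{bmatrix}1&0\\ 0&h_{5}\\ 0&h_{7}\end{bmatrix}^{T},\quad{\bf S}_{3[3,6,10]}=\begin{bmatrix}1&0\\ 0&h_{6}\\ 0&h_{10}\end{bmatrix}^{T},\quad\text{and}\quad{\bf S}_{4[4,11,12]}=\begin{bmatrix}1&0\\ 0&h_{11}\\ 0&h_{12}\end{bmatrix}^{T}.$$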

Due to the inherent sparsity in the wideband spectrum activity, the spectrum data is low dimensional in nature [17].

( $3$ ) Consider a network of devices served by a central server of facial image databases, with each device requesting a few facial images while possessing images of other faces. Such a scenario commonly occurs in biometric verification systems and security monitoring applications. As described in the previous section, it is known that the facial image data of hundreds of pixels in dimension belong to a lower dimensional linear subspace [18]. Therefore, a subspace-aware index coding in this scenario will improve throughput, speed and scalability of the network.

( $4$ ) Consider a network of users connected to a cloud of datacenters hosting a common dataset. One such cloud network is illustrated in Fig. 2(c). The online users could be simultaneously performing operations such as document-editing, video-streaming, or file-sharing. Since the users are connected to multiple datacenters, each user may contain different subsets of the same data and require different subsets. The data could belong to a low-dimensional linear subspace due to either its inherent nature (e.g., videos [19]) or the usage of redundancy-inducing error correcting codes. For multicast transmissions, the datacenters employ index codes to serve the users’ requests. This increases the overall network efficiency and throughput.
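The relay scenario in example (1) can be checked numerically. The sketch below is only an illustration under stated assumptions: the coefficients `h1`, `h2`, the data vector, and the candidate code `C = [1, 1]` are hypothetical choices, and each decoder is built as a least-squares decoder rather than taken from the paper.

```python
import numpy as np

h1, h2 = 0.7, -1.3            # hypothetical channel coefficients at u2 (h1 != h2)
x = np.array([2.0, -5.0])     # packets from s1 and s2 (P = 1)

C = np.array([[1.0, 1.0]])    # candidate index code of length L = 1 (assumed)
y = C @ x                     # the single broadcast made by the relay

side = {1: np.array([[1.0, 0.0]]),   # u1 knows x1 (USI)
        2: np.array([[h1, h2]]),     # u2 knows h1*x1 + h2*x2 (CSI)
        3: np.array([[0.0, 1.0]])}   # u3 knows x2 (USI)
wants = {1: [1], 2: [0, 1], 3: [0]}  # zero-based indices of requested packets

decoded = {}
for j, S in side.items():
    G = np.vstack([S, C])             # everything user j observes equals G @ x
    R = np.eye(2)[wants[j]]           # selects the requested packets
    D = R @ np.linalg.pinv(G)         # linear decoder satisfying D @ G = R
    decoded[j] = D @ np.concatenate([S @ x, y])
```

With these choices one broadcast serves all three users, because each user's stacked observation matrix `G` is invertible; a different `h1`, `h2` pair with `h1 == h2` would break decoding at `u2`.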

III Optimal Index Code length

An important step towards solving the problem stated in Sec. II is to identify the minimum length of the index code. Without loss of generality, we assume $P=1$ . Let ${\bf R}_{j}$ be a $V_{j}\times N$ matrix such that ${\bf R}_{j}{\bf x}={\bf x}_{\mathcal{R}_{j}}$ . Splitting ${\bf D}_{j}$ into sub-matrices ${\bf A}_{j}\in\mathbb{R}^{V_{j}\times M_{j}}$ and ${\bf B}_{j}\in\mathbb{R}^{V_{j}\times L}$ , we can write (1) as

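$${\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf T}{\bf w}\\ {\bf y}\end{bmatrix}={\bf A}_{j}{\bf S}_{j}{\bf T}{\bf w}+{\bf B}_{j}{\bf y}={\bf x}_{\mathcal{R}_{j}}.\tag{2}$$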

Since ${\bf x}={\bf T}{\bf w}$ and ${\bf w}$ can be any arbitrary vector in $\mathbb{R}^{D}$ , from (1) and (2), we can write ${\bf B}_{j}{\bf C}{\bf T}=({\bf R}_{j}-{\bf A}_{j}{\bf S}_{j}){\bf T}$ , $\forall j$ . This can be expressed succinctly as

$${\bf B}{\bf C}{\bf T}=({\bf R}-{\bf A}{\bf S}){\bf T}=\widetilde{\bf R}{\bf T},\tag{3}$$

where

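$${\bf B}\triangleq[{\bf B}_{1}^{T},\cdots,{\bf B}_{U}^{T}]^{T},\quad{\bf S}\triangleq[{\bf S}_{1}^{T},\cdots,{\bf S}_{U}^{T}]^{T},\quad{\bf A}\triangleq\operatorname{blkdiag}({\bf A}_{1},\cdots,{\bf A}_{U}),\quad{\bf R}\triangleq[{\bf R}_{1}^{T},\cdots,{\bf R}_{U}^{T}]^{T},\quad\widetilde{\bf R}\triangleq{\bf R}-{\bf A}{\bf S}.\tag{4}$$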

Now, the optimal index code is the matrix ${\bf C}$ that satisfies (3) and has the least value of $L(>0)$ . Since ${\bf C}$ has only linearly independent rows, the rank of ${\bf C}$ is $L$ . Therefore, the goal is to minimize $\operatorname{rk}({\bf C})$ such that (3) is satisfied.

Note: When index coding is performed without the knowledge of the underlying subspace (we refer to this scenario as the subspace-unaware case) or when the data is not low-dimensional, we have ${\bf T}={\bf I}$ .

Lemma 1.

$\operatorname{rk}({\bf C}{\bf T})=\operatorname{rk}(\widetilde{\bf R}{\bf T})$ .

Proof.

If $\sum_{j}V_{j}<L$ , then the index code length is larger than the number of data packets required. Therefore, $\sum_{j}V_{j}\geq L$ ; hence, $\operatorname{rk}({\bf B})\leq L$ . As governed by (2) and (3), when the decoding is successful at the receivers, ${\bf C}{\bf x}\in\operatorname{span}({\bf B})$ ; hence, $\operatorname{rk}({\bf B})\geq\operatorname{dim}({\bf C}{\bf x})=L$ . This proves that $\operatorname{rk}({\bf B})=L$ .

Note that by choosing the index code as ${\bf C}=({\bf T}^{T}{\bf T})^{-1}{\bf T}^{T}$ (i.e., $L=D$ ), and the decoder matrices as ${\bf B}={\bf R}{\bf T}$ and ${\bf A}={\bf 0}$ , all the required packets can be trivially decoded at the receivers. Therefore, the index code is optimal only when $L\leq D$ . Now, $\operatorname{rk}({\bf C}{\bf T})\leq\min(L,D)=L$ , and we have

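$$\operatorname{rk}({\bf B}{\bf C}{\bf T})\leq\min(\operatorname{rk}({\bf B}),\operatorname{rk}({\bf C}{\bf T}))=\operatorname{rk}({\bf C}{\bf T}).\tag{5}$$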

Further, by Sylvester’s rank inequality,

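$$\operatorname{rk}({\bf B}{\bf C}{\bf T})\geq\operatorname{rk}({\bf B})+\operatorname{rk}({\bf C}{\bf T})-L=\operatorname{rk}({\bf C}{\bf T}).\tag{6}$$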

From (3), (5) and (6), $\operatorname{rk}({\bf B}{\bf C}{\bf T})=\operatorname{rk}(\widetilde{\bf R}{\bf T})=\operatorname{rk}({\bf C}{\bf T})$ . ∎
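The trivial code used in the proof of Lemma 1 can be checked numerically. The sketch below is a minimal illustration, assuming a randomly drawn basis ${\bf T}$ and a hypothetical request set; the sizes `N = 6`, `D = 3` and all variable names are not from the paper.

```python
import numpy as np

rng = np.random.default_rng(1)
N, D = 6, 3
T = rng.standard_normal((N, D))          # random basis, full column rank w.p. 1
w = rng.standard_normal(D)
x = T @ w                                # data in the D-dimensional subspace

C = np.linalg.inv(T.T @ T) @ T.T         # L = D rows, so that C @ T = I_D
R = np.eye(N)[[0, 2, 5]]                 # hypothetical request: packets 1, 3, 6
B = R @ T                                # decoder acting on y only (A = 0)

y = C @ x                                # the broadcast equals w
x_hat = B @ y                            # recovers the requested packets
```

Since `C @ T` is the identity, the broadcast carries exactly the $D$ coefficients in ${\bf w}$, which is why a code longer than $D$ can never be optimal.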

Now, from Sylvester’s rank inequality, we get

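$$\operatorname{rk}({\bf C})\leq\operatorname{rk}({\bf C}{\bf T})+N-D=\operatorname{rk}(\widetilde{\bf R}{\bf T})+N-D.\tag{7}$$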

Since $N-D$ is a fixed positive value, minimizing $\operatorname{rk}(\widetilde{\bf R}{\bf T})$ minimizes the upper bound on $\operatorname{rk}({\bf C})$ , thereby reducing $\operatorname{rk}({\bf C})$ . We use this approach of minimizing $\operatorname{rk}(\widetilde{\bf R}{\bf T})$ to construct index codes for low-dimensional data. Further, when ${\bf C}{\bf T}$ has full row-rank (i.e., $\operatorname{rk}({\bf C}{\bf T})=L$ ), we have $\operatorname{rk}({\bf C}{\bf T})=\operatorname{rk}({\bf C})$ . For the subspace-unaware case, we have $\operatorname{rk}({\bf C})=\operatorname{rk}(\widetilde{\bf R})$ [6].

III-A Throughput Gain

The length of the optimal subspace-aware index code is defined as

$$L^{*}=\min_{{\bf A}_{1},{\bf A}_{2},\dots,{\bf A}_{U}}\operatorname{rk}(\widetilde{\bf R}{\bf T}).\tag{8}$$

Next, we characterize the throughput gain obtained using subspace-aware index codes.

Theorem 1.

The length of the optimal linear index code obtained for the subspace-aware case is less than or equal to the length of the optimal linear index code obtained for the subspace-unaware case.

Proof. Let $\tilde{L}\triangleq\min_{{\bf A}}\operatorname{rk}(\widetilde{\bf R})$ be the optimal subspace-unaware linear index code length, and $\widetilde{{\bf A}}\triangleq\operatorname*{arg\,min}_{{\bf A}}\operatorname{rk}(\widetilde{\bf R})$ . Now,

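$$L^{*}=\min_{{\bf A}}\operatorname{rk}\big(({\bf R}-{\bf A}{\bf S}){\bf T}\big)\leq\operatorname{rk}\big(({\bf R}-\widetilde{{\bf A}}{\bf S}){\bf T}\big)\leq\operatorname{rk}({\bf R}-\widetilde{{\bf A}}{\bf S})=\tilde{L}.$$

The first inequality holds because $\widetilde{{\bf A}}$ is one feasible choice of ${\bf A}$ , and the second because the rank of a product cannot exceed the rank of either factor. ∎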

Corollary 1.

The length of the optimal linear index code obtained in the subspace-aware case can be bounded as

$$\min(\tilde{L}-(N-D),1)\leq L^{*}\leq\tilde{L}.$$

Proof.

By Sylvester’s rank inequality, for any matrix ${\bf A}$ ,

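$$\operatorname{rk}(\widetilde{\bf R}{\bf T})\geq\operatorname{rk}(\widetilde{\bf R})+D-N\geq\tilde{L}-(N-D).\tag{9}$$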

The proof follows from (9) and Theorem 1. ∎
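The two rank bounds behind Theorem 1 and Corollary 1 can be illustrated numerically. The sketch below assumes a hypothetical rank-4 matrix standing in for $\widetilde{\bf R}$ and two hypothetical subspaces: one in general position and one constructed to contain the null space of that matrix. None of these matrices come from the paper.

```python
import numpy as np

def rank(M, tol=1e-9):
    """Numerical rank: number of singular values above an absolute tolerance."""
    return int(np.linalg.matrix_rank(M, tol=tol))

rng = np.random.default_rng(2)
N, D = 8, 5
# Hypothetical stand-in for R~: a 6 x N matrix of rank 4.
R_tilde = rng.standard_normal((6, 4)) @ rng.standard_normal((4, N))

T_generic = rng.standard_normal((N, D))              # subspace in general position
# Subspace that contains the 4-dimensional null space of R_tilde.
null_basis = np.linalg.svd(R_tilde)[2][4:].T         # N x 4
T_aligned = np.hstack([null_basis, rng.standard_normal((N, 1))])

lower = rank(R_tilde) + D - N                        # Sylvester lower bound
upper = min(rank(R_tilde), D)                        # rank of a product
rk_generic = rank(R_tilde @ T_generic)
rk_aligned = rank(R_tilde @ T_aligned)
```

Both products stay between `lower` and `upper`; how much the rank actually drops depends on how the data subspace sits relative to the null space of the stand-in matrix, which is the source of the throughput gain.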

IV Construction of Subspace-Aware Index Codes

It is well-known that the optimization problem in (8) is NP-hard. In order to solve (8), we make a practical assumption that the users can tolerate a decoding error of at most $\epsilon$ . That is,

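$$\Big\|{\bf D}_{j}\begin{bmatrix}{\bf S}_{j}{\bf x}\\ {\bf y}\end{bmatrix}-{\bf x}_{\mathcal{R}_{j}}\Big\|\leq\epsilon,\ \forall j.\tag{10}$$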

Note that the subspace-unaware case with USI is a special case of (8). Index codes over the real field for this case have been studied previously in the literature [7]. It is known that a subspace-unaware linear index code matrix can be obtained by solving a matrix completion problem [6, 7]. However, the optimization problem in (8) is more challenging than conventional matrix completion problems. This is because an indeterminate element in ${\bf A}$ affects multiple entries in the resultant $\widetilde{\bf R}{\bf T}$ matrix in (8), which is not the case in conventional matrix completion problems. In the next subsection, we consider the design of subspace-aware/unaware index codes with CSI/USI in a unified optimization framework.

IV-A Construction Algorithm for Index Codes

Let $\mathbf{Z}\triangleq[\mathbf{Z}_{1}^{T},\cdots,\mathbf{Z}_{U}^{T}]^{T}$ be a rank $r$ matrix and ${\bf Z}_{j}\in\mathbb{R}^{V_{j}\times D}$ . Now, the optimization problem can be formulated as

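$$\min_{\{{\bf Z}_{j},{\bf A}_{j}\}_{j=1}^{U}}\ \sum_{j=1}^{U}\big\|{\bf Z}_{j}-({\bf R}_{j}-{\bf A}_{j}{\bf S}_{j}){\bf T}\big\|_{F}^{2}=\min_{{\bf Z},\{{\bf A}_{j}\}_{j=1}^{U}}\ \big\|{\bf Z}-\widetilde{\bf R}{\bf T}\big\|_{F}^{2}.\tag{11}$$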

We solve the optimization problem in (11) for a range of values of $r$ and choose the minimum value of $r$ for which the optimization was feasible (i.e., all the constraints were satisfied) as the length of the index code ( $L^{*}$ ).

We factorize $\bf Z$ as $\mathbf{Z}=\mathbf{X}\mathbf{Y}$ , where $\mathbf{X}\in\mathbb{R}^{(\sum_{j}V_{j})\times r}$ , and $\mathbf{Y}\in\mathbb{R}^{r\times D}$ . The optimization problem in (11) is not convex in $\mathbf{X}$ , $\mathbf{Y}$ and $\mathbf{A}$ simultaneously; however, it is convex in $\mathbf{X}$ (or $\mathbf{Y}$ or $\mathbf{A}$ ) when the rest of the optimization variables are fixed. In fact, here, each of the sub-problems can be solved in a closed form. Note that, $\bf Z$ is a rank $r$ approximation of $\widetilde{\bf R}{\bf T}$ (with an error of $\epsilon$ , i.e., $\|{\bf Z}-\widetilde{\bf R}{\bf T}\|_{F}\leq\epsilon$ ; index codes over $\mathbb{R}$ enable us to obtain such a rank $r$ approximation). The steps in solving this optimization problem are listed in Algorithm 1. The alternating minimization method is guaranteed to converge to a locally optimum solution for a sufficiently large number of iterations [20].
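The alternating scheme described above can be sketched as follows. This is not the paper's Algorithm 1, whose listing is not reproduced here; it is a minimal illustration that assumes pseudo-inverse (least-squares) updates for each block, a random initialization, and a fixed iteration count. The function name `alt_min` and the side-information sets used in the example call ({2,3}, {1,3}, {2,4}, {1} for users 1 to 4) are assumptions made for illustration.

```python
import numpy as np

def alt_min(R_list, S_list, T, r, iters=200, seed=0):
    """Alternate closed-form least-squares updates of X, Y and the A_j blocks.

    Minimizes sum_j ||X_j Y - (R_j - A_j S_j) T||_F^2 with Z = X Y of rank <= r.
    Returns X, Y, the list of A_j, and the objective value after every sweep.
    """
    rng = np.random.default_rng(seed)
    sizes = [R.shape[0] for R in R_list]
    splits = np.cumsum(sizes)[:-1]
    A_list = [np.zeros((R.shape[0], S.shape[0])) for R, S in zip(R_list, S_list)]
    Y = rng.standard_normal((r, T.shape[1]))
    history = []
    for _ in range(iters):
        # Target matrix R~ T for the current side-information weights A_j.
        M = np.vstack([(R - A @ S) @ T for R, A, S in zip(R_list, A_list, S_list)])
        X = M @ np.linalg.pinv(Y)            # best X for fixed Y and A
        Y = np.linalg.pinv(X) @ M            # best Y for fixed X and A
        Z_blocks = np.split(X @ Y, splits, axis=0)
        # Best A_j for fixed Z: solve A_j (S_j T) ~= R_j T - Z_j in least squares.
        A_list = [(R @ T - Z) @ np.linalg.pinv(S @ T)
                  for R, S, Z in zip(R_list, S_list, Z_blocks)]
        M = np.vstack([(R - A @ S) @ T for R, A, S in zip(R_list, A_list, S_list)])
        history.append(np.linalg.norm(X @ Y - M) ** 2)
    return X, Y, A_list, history

# Four-user USI instance (subspace-unaware, so T = I), tried with r = 2.
E = np.eye(4)
R_list = [E[[j]] for j in range(4)]                    # user j requests x_j
S_list = [E[[1, 2]], E[[0, 2]], E[[1, 3]], E[[0]]]     # assumed side information
X, Y, A_list, history = alt_min(R_list, S_list, np.eye(4), r=2)
```

Each update exactly minimizes the same objective over one block, so the recorded objective cannot increase from sweep to sweep; whether it reaches a given $\epsilon$ for a chosen $r$ depends on the instance and the initialization.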

Let $\widetilde{\mathbf{Z}}$ be the matrix formed by choosing the $L^{*}$ linearly independent rows of ${\bf Z}$ . Now, we set ${\bf C}{\bf T}=\widetilde{\mathbf{Z}}$ . At every transmission instant, if the low-dimensional vector ${\bf w}$ is available at the DS, then the matrix ${\bf C}{\bf T}$ can be used for index coding to generate ${\bf y}={\bf C}{\bf T}{\bf w}$ , else the matrix ${\bf C}{\bf T}{\bf T}^{\dagger}$ is used (since ${\bf C}{\bf T}{\bf T}^{\dagger}{\bf x}={\bf C}{\bf T}{\bf T}^{\dagger}{\bf T}{\bf w}={\bf C}{\bf T}{\bf w}={\bf C}{\bf x}={\bf y}$ ).
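The two encoding options in the paragraph above give the same broadcast vector whenever ${\bf T}$ has full column rank. A minimal check follows, assuming random matrices in place of a designed code; `CT` is a hypothetical stand-in for the matrix ${\bf C}{\bf T}$.

```python
import numpy as np

rng = np.random.default_rng(3)
N, D, L = 6, 3, 2
T = rng.standard_normal((N, D))          # subspace basis, full column rank
CT = rng.standard_normal((L, D))         # stands in for the designed matrix C T

w = rng.standard_normal(D)
x = T @ w

y_from_w = CT @ w                        # DS knows the coefficients w
y_from_x = CT @ np.linalg.pinv(T) @ x    # DS only has x: apply C T T^+
```

The equality rests on `np.linalg.pinv(T) @ T` being the identity, which holds because $\operatorname{rk}({\bf T})=D$.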

IV-B Decoding Error Analysis

Theorem 2.

For an index code constructed using the proposed algorithm such that $\|{\bf Z}-\widetilde{\bf R}{\bf T}\|_{F}\leq\epsilon$ , the decoding error is bounded above by $\epsilon$ .

Proof. Let ${\bf R}\hat{{\bf x}}$ be the vector decoded at the receivers. Then, the decoding error is

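$$\|{\bf R}{\bf x}-{\bf R}\hat{{\bf x}}\|=\|{\bf R}{\bf T}{\bf w}-({\bf A}{\bf S}{\bf T}{\bf w}+{\bf B}{\bf C}{\bf T}{\bf w})\|,\tag{12}$$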

where the values of the matrices ${\bf A}$ and ${\bf C}{\bf T}\,(=\widetilde{\bf Z})$ are obtained from Algorithm 1. We choose ${\bf B}$ such that ${\bf Z}={\bf B}\widetilde{\bf Z}$ . This is possible due to the following reason. Without loss of generality, we can express ${\bf Z}$ as ${\bf Z}=[\widetilde{{\bf Z}}^{T},\,\bar{{\bf Z}}^{T}]^{T}$ , where $\bar{{\bf Z}}$ is the matrix of $\sum_{j}V_{j}-L$ linearly dependent rows of ${\bf Z}$ . Therefore, the rows of $\bar{{\bf Z}}$ are in the span of the rows of $\widetilde{{\bf Z}}$ , i.e., $\bar{{\bf Z}}={\bf G}\widetilde{{\bf Z}}$ for some matrix ${\bf G}\in\mathbb{R}^{(\sum_{j}V_{j}-L)\times L}$ . Hence, by choosing ${\bf B}$ as ${\bf B}=[{\bf I}_{L},\,{\bf G}^{T}]^{T}$ , we have ${\bf B}\widetilde{{\bf Z}}={\bf Z}$ . Further, without loss of generality, we assume $\|{\bf w}\|_{2}\leq 1$ .

Now, from (12), the decoding error can be bounded as

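$$\|{\bf R}{\bf x}-{\bf R}\hat{{\bf x}}\|\leq\|\widetilde{\bf R}{\bf T}-{\bf B}\widetilde{\bf Z}\|_{F}\,\|{\bf w}\|_{2}\leq\epsilon.\tag{13}$$ ∎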

Remark 1: The matrices $\widetilde{{\bf Z}}$ and $\bar{{\bf Z}}$ can be easily obtained from ${\bf Z}$ using one of the many commonly known techniques such as using QR decomposition, and ${\bf G}=\bar{{\bf Z}}\widetilde{{\bf Z}}^{\dagger}$ .

Remark 2: In the proposed decoding strategy, the users need not be aware of the subspace matrix ${\bf T}$ for decoding. Using the matrix ${\bf D}_{j}$ , each user can directly decode ${\bf x}_{\mathcal{R}_{j}}$ .
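Remark 1 can be sketched in a few lines. Instead of a QR decomposition, the sketch below uses a simple greedy rank test to pick independent rows, which is a swapped-in technique chosen for brevity; the rank-2 matrix `Z` and the helper name `split_rows` are hypothetical.

```python
import numpy as np

def split_rows(Z, tol=1e-9):
    """Greedily pick a maximal set of linearly independent rows of Z."""
    picked = []
    for i in range(Z.shape[0]):
        trial = picked + [i]
        if np.linalg.matrix_rank(Z[trial], tol=tol) == len(trial):
            picked = trial
    rest = [i for i in range(Z.shape[0]) if i not in picked]
    return picked, rest

rng = np.random.default_rng(4)
Z = rng.standard_normal((5, 2)) @ rng.standard_normal((2, 4))   # rank-2 example

picked, rest = split_rows(Z)
Z_tilde, Z_bar = Z[picked], Z[rest]
G = Z_bar @ np.linalg.pinv(Z_tilde)      # so that Z_bar = G @ Z_tilde

# B reproduces every row of Z from Z_tilde, keeping the original row order.
B = np.zeros((Z.shape[0], len(picked)))
B[picked] = np.eye(len(picked))
B[rest] = G
```

Keeping the original row order in `B` avoids the explicit row permutation implied by writing ${\bf Z}=[\widetilde{{\bf Z}}^{T},\,\bar{{\bf Z}}^{T}]^{T}$.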

V Numerical Results

Here, we present numerical results for the proposed index code construction algorithm and analyze its performance.

V-A Comparison

First, we consider the index coding problem with USI from [7] (the algorithm in [7] can solve only the subspace-unaware index coding problem with USI, i.e., the conventional matrix completion problem, which is a special case of the problem we consider in this paper), where $U=4$ , $\mathcal{R}_{i}=\{i\}$ for $i=1,2,3,4$ , and

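$${\bf S}_{1}=\begin{bmatrix}0&0\\ 1&0\\ 0&1\\ 0&0\end{bmatrix}^{T},\quad{\bf S}_{2}=\begin{bmatrix}1&0\\ 0&0\\ 0&1\\ 0&0\end{bmatrix}^{T},\quad{\bf S}_{3}=\begin{bmatrix}0&0\\ 1&0\\ 0&0\\ 0&1\end{bmatrix}^{T},\quad{\bf S}_{4}=\begin{bmatrix}1&0&0&0\end{bmatrix}.$$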

Using our proposed algorithm, the $\widetilde{\bf R}$ matrix obtained for $\epsilon=10^{-10}$ is

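$$\widetilde{\bf R}=\begin{bmatrix}1&-1.0962388&0&-0.8506375\\ -0.9122099&1&0.8499089&0\\ -1.0733032&1.1765966&1&0\\ 0&0&-1.0952999&1\end{bmatrix}^{T}.$$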

The rank of this matrix is $2$ , which is the optimal index code length for this problem as given in [7]. We obtain the index code for this problem by choosing the linearly independent rows in the above matrix. The index code thus obtained is

$${\bf C}=\begin{bmatrix}1&-0.8506375\\ -0.9122099&0\\ -1.0733032&0\\ 0&1\end{bmatrix}^{T}.$$

For example, when ${\bf x}=[1,1,-1,2]^{T}$ is the source-data, the reconstructed values at the users, from the designed index code were $[1,0.9999999,-1,2]$ ; this gives a decoding error of $\|{\bf x}-\hat{{\bf x}}\|=4.02\times 10^{-14}$ . For the same problem, consider the source-data to be low-dimensional with ${\bf T}=\begin{bmatrix}1&-2&1&1\\ 1&1&-1&2\end{bmatrix}^{T}$ . Now, ${\bf x}=[1,1,-1,2]^{T}={\bf T}[0,1]^{T}$ . For this linear subspace, we get a subspace-aware index code of length 1 given by the matrix $[0,-0.3936923,0.2644723,-0.1412844]$ , and a corresponding decoding error of $8.34\times 10^{-14}$ at the users.
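The two codes in this comparison can be checked with a short script. It is a sketch under stated assumptions: the side-information sets ({2,3}, {1,3}, {2,4}, {1} for users 1 to 4) and the arrangement of the length-2 code as two rows are inferred from the example, and each user applies a least-squares decoder built from its own observations, which need not be the decoder used by the authors.

```python
import numpy as np

E = np.eye(4)
# Assumed side information (zero-based rows of the identity):
# user 1 holds {x2, x3}, user 2 {x1, x3}, user 3 {x2, x4}, user 4 {x1}.
S_list = [E[[1, 2]], E[[0, 2]], E[[1, 3]], E[[0]]]

def worst_error(C, T, w):
    """Largest decoding error over the four users, each requesting x_j."""
    x = T @ w
    y = C @ x                                        # broadcast symbols
    errs = []
    for j, S in enumerate(S_list):
        G = np.vstack([S, C]) @ T                    # user j observes G @ w
        D_j = E[[j]] @ T @ np.linalg.pinv(G)         # least-squares decoder
        x_hat = D_j @ np.concatenate([S @ x, y])
        errs.append(abs(x_hat[0] - x[j]))
    return max(errs)

# Subspace-unaware code of length 2 (T = I), coefficients as reported.
C_unaware = np.array([[1.0, -0.9122099, -1.0733032, 0.0],
                      [-0.8506375, 0.0, 0.0, 1.0]])
err_unaware = worst_error(C_unaware, np.eye(4), np.array([1.0, 1.0, -1.0, 2.0]))

# Subspace-aware code of length 1 for x = T w with w = [0, 1].
T = np.array([[1.0, -2.0, 1.0, 1.0], [1.0, 1.0, -1.0, 2.0]]).T
C_aware = np.array([[0.0, -0.3936923, 0.2644723, -0.1412844]])
err_aware = worst_error(C_aware, T, np.array([0.0, 1.0]))
```

Under these assumptions both codes let every user recover its requested packet up to floating-point error, with one broadcast in the subspace-aware case against two in the subspace-unaware case.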

V-B Simulation results

We simulated a simple multicast video-streaming scenario with $N=20$ source-data packets and $U=20$ users requesting $|\mathcal{R}_{j}|=5$ data packets, each. We evaluated the index code length averaged over several instances for four different cases – namely, $(1)$ DS is subspace-unaware and the SI is uncoded, $(2)$ DS is subspace-unaware and the SI is coded, $(3)$ DS is subspace-aware and SI is uncoded, and $(4)$ DS is subspace-aware and the SI is coded. For fair comparison, we consider the same requirement matrix ${\bf R}$ for all the cases.

In Figure 3, we plot the average index code length obtained using our proposed algorithm for different subspace dimensions, fixing the SI length at each user to be $M_{j}=15$ . We see that when the source-data is low-dimensional, the average index code lengths obtained for the subspace-aware cases are significantly less than those of the subspace-unaware cases. The average index code lengths for the subspace-unaware cases are 7 (CSI) and 12.4 (USI), whereas in the subspace-aware cases, for $D<9$ , the average index code length is 1.1. Therefore, subspace-aware index codes reduce the transmissions required by about 91% for the USI case and by about 85% for the CSI case. (As the number of packets and users increases, the difference between the average index code length for the CSI case and that for the USI case decreases.) Also, for $9\leq D<20$ , we observe that the subspace-aware index codes consistently outperform the subspace-unaware index codes by a considerable margin.

Furthermore, from Fig. 4, we can see that even when the number of packets is $50$ or $100$ , the subspace-aware index codes outperform the subspace-unaware index codes. For example, when the subspace dimension is half the number of packets (i.e., $D=25$ when $N=50$ , and $D=50$ when $N=100$ ), subspace-aware index codes have code lengths that are $70\%$ smaller than those of subspace-unaware index codes for uncoded side information, and $81\%$ smaller for coded side information. Thus, we can observe that irrespective of the number of packets, the subspace-aware index codes provide significant throughput gains over the subspace-unaware index codes.

In Figure 5, we evaluate the performance of the proposed algorithm for varying SI lengths ( $M_{j}$ ) and fixing the subspace dimension at $D=15$ . As before, we can see that the subspace-aware index codes have significant throughput gains over the subspace-unaware index codes in both the USI and CSI cases. For instance, when $M_{j}=10$ , the subspace-aware cases (both USI and CSI) have an average index code length that is at least 30% smaller than that of the subspace-unaware cases.

VI Conclusion

In this paper, we studied a generalization of the index coding problem that exploits source-data’s structure to improve the system-throughput. We analytically characterized the length of the subspace-aware index codes and proposed an algorithm to obtain near optimal index codes. We showed that this approach significantly outperforms the conventional approaches when the source-data belongs to a low-dimensional subspace. Index coding for the case when the source-data belongs to a non-linear subspace or manifold is an interesting direction for future research. Further, network codes can also be constructed using the proposed algorithm, once the network coding problem is converted to an equivalent index coding problem [9].

Bibliography

[1] Y. Birk and T. Kol, “Coding on demand by an informed source (ISCOD) for efficient broadcast of different supplemental data to caching clients,” IEEE Trans. Inform. Theory, vol. 52, no. 6, pp. 2825-2830, Jun. 2006.
[2] Z. Bar-Yossef, Y. Birk, T. Jayram, and T. Kol, “Index coding with side information,” IEEE Symposium on Foundations of Computer Science (FOCS), pp. 197-206, 2006.
[3] M. Ji, A. M. Tulino, J. Llorca, and G. Caire, “Caching and coded multicasting: Multiple groupcast index coding,” IEEE GlobalSIP, pp. 881-885, Dec. 2014.
[4] V. Y. F. Tan, L. Balzano, and S. C. Draper, “Rank minimization over finite fields: Fundamental limits and coding-theoretic interpretations,” IEEE Trans. Inform. Theory, vol. 58, no. 4, pp. 2018-2039, Apr 2011.
[5] K. W. Shum, M. Dai, and C. W. Sung, “Broadcasting with coded side information,” in Proc. IEEE PIMRC, pp. 89-94, Sep. 2012.
[6] N. Lee, A. G. Dimakis, and R. W. Heath, “Index coding with coded side-information,” IEEE Commun. Letters, vol. 19, no. 3, pp. 319-322, 2015.
[7] X. Huang and S. El Rouayheb, “Index coding and network coding via rank minimization,” Proc. of IEEE Information Theory Workshop, pp. 14-18, 2015.
[8] Y. Shi and B. Mishra, “A Sparse and Low-Rank Optimization Framework for Index Coding via Riemannian Optimization,” arXiv:1604.04325, 2016.
