Coded Caching via Projective Geometry: A new low subpacketization scheme
Hari Hara Suthan Chittoor, Bhavana M, Prasad Krishnan

TL;DR
This paper introduces a coded caching scheme leveraging projective geometry that significantly reduces subpacketization levels at the expense of a higher rate, offering a practical trade-off for large networks.
Contribution
It presents a novel coded caching scheme using line graphs and projective geometries that achieves subexponential subpacketization, addressing the large subpacketization issue in prior schemes.
Findings
Achieves rate Θ(K / log_q K) with subexponential subpacketization.
Reduces subpacketization compared to earlier schemes.
Provides a new lower bound on rate based on subpacketization.
Abstract
Coded Caching is a promising solution to reduce the peak traffic in broadcast networks by prefetching the popular content close to end users and using coded transmissions. One of the chief issues of most coded caching schemes in literature is the issue of large , i.e., they require each file to be divided into a large number of subfiles. In this work, we present a coded caching scheme using line graphs of bipartite graphs in conjunction with projective geometries over finite fields. The presented scheme achieves a rate ( being the number of users, is some prime power) with subpacketization when cached fraction is upper bounded by a constant () for some positive integer ). Compared to earlier schemes, the presented scheme has a lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Coded Caching via Projective Geometry:
A new low subpacketization scheme
Hari Hara Suthan C, Bhavana M, Prasad Krishnan
Signal Processing and Communications Research Center,
International Institute of Information Technology, Hyderabad.
Email: {hari.hara@research., bhavana.mvn@research., prasad.krishnan@}iiit.ac.in
Abstract
Coded Caching is a promising solution to reduce the peak traffic in broadcast networks by prefetching the popular content close to end users and using coded transmissions. One of the chief issues of most coded caching schemes in literature is the issue of large subpacketization, i.e., they require each file to be divided into a large number of subfiles. In this work, we present a coded caching scheme using line graphs of bipartite graphs in conjunction with projective geometries over finite fields. The presented scheme achieves a rate ( being the number of users, is some prime power) with subexponential subpacketization when cached fraction is upper bounded by a constant () for some positive integer ). Compared to earlier schemes, the presented scheme has a lower subpacketization (albeit possessing a higher rate). We also present a new subpacketization dependent lower bound on the rate for caching schemes in which each subfile is cached in the same number of users. Compared to the previously known bounds, this bound seems to perform better for a range of parameters of the caching system.
I Introduction
The key performance challenges that next generation wireless networks (5G) face are low latency, high throughput and energy efficiency[1]. Content delivery networks have been estimated to carry of the global internet traffic by [2]. Coded caching was proposed recently in a landmark paper by Maddah-Ali and Niesen [3] and has emerged as an important tool to address major challenges of future communication networks. Since its inception coded caching has proved as an efficient tool to trade-off expensive bandwidth with abundantly available and cost-effective memory at the user/network nodes.
In [3], the setup consists of a single server with equi-popular files of same size (divided into subfiles each of the same size, where is known as the subpacketization parameter), and users(clients) each having a local memory called cache that can store subfiles. The centralized coded caching scheme of [3] works in two phases. In the caching phase (which occurs during off peak times) the cache of each client is populated with some fraction of each file in the server. In the delivery phase (which happens during peak traffic times), the clients demand one file each from the server, to satisfy which the server sends coded transmissions. The rate of such a coded caching scheme is defined as the ratio of the number of bits transmitted to the size of each file, which can be calculated as
[TABLE]
when each transmission is of the same size as the subfiles. The Ali-Niesen scheme in [3] achieves , where is the global caching gain, i.e., the number of users served by each transmission in the delivery scheme. This rate was shown to be optimal for uncoded cache placement [4]. Further, the subpacketization level used by the Ali-Niesen scheme to achieve this rate is . Note that as grows large, , (for constant , being the binary entropy). This means that the files have to be extremely large for even - clients, making the Ali-Niesen scheme impractical for applications.
Since then several new coded caching schemes with lower subpacketization have been constructed at the cost of increase in rate, or cache requirement, or the number of users [5, 6, 7, 8]. Among these, an important construction was reported in [6], via a combinatorial object that the authors defined, known as Placement Delivery Arrays (PDAs). The PDA constructed in [6] achieved a global caching gain of (one less than that of [3]), while improving the subpacketization by an exponential factor compared to [3]. However, in this construction, as well as in most others in literature, the subpacketization required for the caching schemes continues to be exponential in (for some positive integer ) to the best of our knowledge.
Recently, a line graph based approach to coded caching was introduced in [9]. Using this framework, a construction for a caching scheme was given via a projective geometry over a finite field. The scheme presented in [9] achieves a constant rate with subpacketization subexponential in for some prime power ). However the drawback of this scheme is that the uncached fraction of each file has to be large . We remedy this drawback (to some extent) in this work.
The contributions and organization of this paper are as follows. In Section II, we review the line graph based coded caching scheme proposed in [9], while refining it slightly for our purposes. In Section III, we propose a new lower bound for the optimal rate given parameters and , for the caching schemes in which each subfile is cached in the same number of users, which is a property satisfied by all known centralized caching designs in literature (to the best of our knowledge). Using some numerical examples, we see that this lower bound performs better (for a range of parameters) compared to the previously known bounds in [4, 7]. In Section IV, we present a new coded caching scheme using projective geometries over finite fields in the line graph framework of [9]. In Section V we give the asymptotic analysis of the scheme proposed. We show that the scheme achieves a rate ( being a prime power) for a constant cache requirement, which can be extremely small (, for some constant positive integer ). The subpacketization achieved is subexponential in , . We provide a table in Section V which compares the parameters of our scheme to that of [6].
Notations and Terminology: denotes the set of positive integers. We denote the set by where . We give the basic definitions in graph theory. The sets denote vertex set and edge set of a graph respectively, where . All graphs considered in this paper are undirected graphs with no self loops. The neighbourhood of a vertex is given as . The square of a graph is a graph having and an edge if and only if either or there exists some such that . The complement of a graph is denoted as . A set is called a clique of if every two distinct vertices in are adjacent to each other. A single vertex is also considered as a clique by definition. A clique cover of is a collection of disjoint cliques such that each vertex appears in precisely one clique. A bipartite graph is a graph whose vertices can be partitioned into two independent sets (called left and right vertices of ) such that edges exist only between left and right vertices. A bipartite graph is (left or right) regular if the degree of each vertex on (left or right) is same throughout (left or right) partition. A bipartite graph is bi-regular if it is both left regular and right regular. For more information on graph theory the reader is referred to [10].
II Review of Line Graph scheme in [9]
We now review the basic framework and some results of [9]. Consider a coded caching system consisting of a server with files . Let be any set such that . We shall use to indicate the set of users. Let be any set such that . The subfiles of a file are denoted by where and takes values in some Abelian group. Here we consider symmetric caching, i.e., for any , either a user caches or the user does not cache for any . Any symmetric caching scheme can be represented as an equivalent -left regular bipartite caching graph with left vertices being and right vertices being , and the uncached fraction . The uncached subfiles are identified by the edges of i.e, for an edge if and only if the subfiles are not present in the cache of user . This bipartite coded caching setup was given in [8]. In [9], a line graph based framework was proposed to study the coded caching problem. The line graph (or simply, ) of an undirected graph is a graph in which the vertex set is the edge set of , and two vertices of are adjacent if and only if they share a common vertex in . The caching scheme was captured via a line graph of a bipartite caching graph and the delivery scheme was obtained as a clique cover of complement of the square of line graph denoted by . The following lemma proved in [9] presents the conditions under which an arbitrary graph is a line graph of a left regular bipartite graph. This enables us to construct a line graph which corresponds to a coded caching scheme.
Lemma 1**.**
[9*]**
A graph containing vertices is the line graph of a -left-regular bipartite graph if and only if the following conditions are satisfied.*
- •
The vertices of can be partitioned into disjoint cliques containing vertices each. We denote these cliques by and call them as the user-cliques.
- •
Consider distinct For any vertex , there exists at most one vertex such that
- •
For any and any vertex , the set (containing and all adjacent vertices of except those in ), forms a clique. We refer to these cliques as the subfile-cliques.
- •
Let be the number of subfile-cliques in and the subfile-cliques be denoted as . Then the number of right vertices of is .
Any graph that satisfies the above conditions for some and is called as a caching line graph. Since there are subfile cliques, we can denote the subfile-cliques as . It also holds by the construction of that there is at most one vertex in the intersection of any given user-clique and a subfile-clique . Further, note that the subfile-cliques also partition the vertices of . Thus each vertex of lies precisely in one user-clique and one subfile-clique. Therefore the vertices of can be indexed using a subset of , i.e., . With this notation, we have and . Furthermore, it follows that .
Following [9], the delivery scheme follows according to a clique cover of . In order to find the cliques in (called transmission cliques), first we need to identify the structure of . It is easy to see that . In Lemma 2 we present the conditions under which an edge exist in . We will use this lemma in Section IV to identify such transmission cliques in the construction we give.
Lemma 2**.**
Let . The edge if and only if and .
Proof:
The If part of the lemma follows from the definition of . We prove the only if part here. Let . Suppose . Then by the construction of we have . Therefore which is a contradiction. Hence . Similarly . Suppose . Since . By the construction of we have and . By definition of , we have which is a contradiction. Hence . Similarly . ∎
In [9] a particular class of caching line graphs called -caching line graphs was considered. The advantage of these line graphs is that the corresponding caching scheme parameters are obtained naturally in a simple fashion. This is captured in the following definition and theorem. Our construction in Section IV is also based on such -caching line graphs.
Definition 1**.**
[9]** A caching line graph such that has a clique cover consisting of -sized disjoint subfile cliques and has a clique cover consisting of -sized disjoint cliques, for some , is called a -caching line graph.
Theorem 1**.**
[9*]**
Consider a -caching line graph . Then there is a coded caching scheme consisting of the caching scheme given by with (and thus ), and there is an associated transmission scheme based on the clique cover of having rate .*
III A new lower bound on the rate
In this section, we propose a lower bound on rate of the delivery scheme for symmetric caching schemes where each subfile is stored in equal number of users. Most known schemes in literature satisfy this property to best of our knowledge. From Section II, we know that any symmetric caching scheme can be represented by a left-regular bipartite caching graph where . As each subfile is not cached at equal number of users, the equivalent bipartite graph with edges will be right regular as well, with right degree being for every subfile in .
We first recall a generic lower bound given in [9] based on structure of which is used in Theorem 2 to arrive at our new bound. Let be the subgraph of induced by the vertices where and . Let Let be a subset of vertices of taken in some order. For , let be the set of right vertices (subfiles) in which are adjacent to Let be the infimum of all achievable rates for coded caching problem defined by . Then, from Theorem 2 of [9],
[TABLE]
We now obtain a lower bound for the case of symmetric caching schemes defined by biregular bipartite caching graphs. The proof for this bound is based on a similar ‘nested’ bound shown in [7].
Theorem 2**.**
Let be the infimum of all achievable rates for the coded caching problem defined by a bi-regular bipartite graph . Then
[TABLE]
Proof:
Every user vertex has degree in . Consider a user vertex in and call it as . So, by notations of (1), . Consider the graph induced by vertices of . Call it . Since is a bi-regular graph, degree of each subfile vertex is exactly . So, by pigeon-holing argument, it is not difficult to see that there exists a user with degree at least in . Consider such user vertex and call it as . Then . In general, for each , for the graph induced by vertices of , by pigeon-holing argument there exists a user with degree at least . Call such user vertex as . Then, Running over all , and using (1), we therefore get the bound in the theorem. ∎
For a number of parameters we now compare (in Table I) the above new bound on the number of transmissions (column 4 of Table I) in an optimal scheme, with the lower bound given in [7] (given in column 5, which holds for PDA based schemes), as well as the lower bound (column 6) based on the Ali-Niesen rate () as shown in [4]. The bound given in [7] used in Table I is as follows.
Theorem 3**.**
[7]**
It can be seen that for many of the parameters, our bound is better than those in [7],[4]. Further, the last column of Table I denotes the rate achieved by the scheme in Section IV in this work, for whichever parameters are applicable.
IV A new projective geometry based scheme
In this section we present a new coded caching scheme using projective geometries over finite fields. We first review some basic concepts.
IV-A Review of projective geometries over finite fields [11]
Let such that is a prime power. Consider a -dim (we use “dim” for dimensional) vector space over a finite field . Consider an equivalence relation on (where represents the zero vector) whose equivalence classes are -dim subspaces(without ) of . The set of these equivalence classes is called the -dim projective space over and is denoted by . For , let denote the set of all -dim subspaces of . It is known that (Chapter in [11]) is equal to the q-binomial coefficient , where The following result is known from [11].
Lemma 3** (Chapter 3 in [11]).**
Consider a -dim vector space . Let . Then the number of -dim subspaces intersecting a fixed -dim subspace in a fixed -dim subspace is .
We now proceed to construct a caching line graph using projective geometry.
IV-B A new caching line graph using projective geometry
Consider such that . Let be a fixed -dim subspace of the vector space .
Let
[TABLE]
We first initialize by its user-cliques. The user-cliques are indexed by -dim subspaces in . For each create the vertices corresponding to the user-clique indexed by as . Now, for each we construct the subfile clique of associated with as . By definition, these subfile cliques partition the set of vertices in (the union of all the user cliques). By invoking the notations from Section II (Lemma 1), we have (number of user-cliques), and subpacketization (the number of subfile cliques). We now find the values of , and the size of the subfile cliques and the user cliques.
Lemma 4**.**
[TABLE]
Proof:
is the number of -dim subspaces intersecting the fixed -dim subspace in the fixed -dim subspace . By Lemma 3, we have .
for any is the number of -dim subspaces intersecting the fixed -dim subspace in the fixed -dim subspace . By Lemma 3, we have .
We now obtain for any . To do this, we will first find the number (say ) of -dim subspaces intersecting the fixed -dim subspace in the fixed -dim space . By using Lemma 3, we thus have . Now, we find the number(say ) of such that for some . Then it follows that .
Now, to find , we first prove a few smaller claims.
Claim 1: The number of one dimensional spaces such that ( being a fixed -dimensional subspace, representing direct sum) is .
Proof of Claim 1: To see this, observe that to satisfy , we must have for some Thus there are choices for . However there are precisely vectors whose span is the same one-dimensional subspace . Hence we have that the number of one dimensional spaces such that is
*Claim 2: * Let be some fixed element in . The number of -sized sets (where are -dim subspaces) such that is
Proof of Claim 2: By Claim 1, the number of such that is As s can be independently chosen to get the corresponding s, we thus have that is
Claim 3: The number of -sized sets (where are -dim subspaces) such that (for some fixed ) is
Proof of Claim 3: Firstly, we note that there exists a set of -dim subspaces such that , if and only if there exists a (not necessarily unique) set of vectors such that and , for all . We call such a set as a generating set of the set .
Note that given a set of -dim subspaces , we can get a generating set by choosing any for each . Now, suppose the set is a generating set for , then so is , for any Thus, the number of such distinct generating sets for any given set of -dim subspaces is .
Now the number of ways to choose an ordered sets of vectors such that , for all , is . Thus, the number of (unordered) such generating sets is then . By arguments in the previous paragraph, it can be seen that these unordered generating sets can be partitioned into groups of , such that each such group includes precisely the set of all generating sets for a particular set of -dim spaces such that .
Thus the number we are looking for is precisely This proves Claim 3.
We now prove that the number of such that for some is To see this, note that for each such that , there exists precisely sets of -dim subspaces such that By Claim 3, the total number of sets of -dim subspaces such that is , and these can be partitioned into groups each of size , such that for each set in any particular group, the set is the same. Thus we have that
Finally, we see that the expression for matches the lemma statement, which proves the lemma.
∎
Note that by Lemma 4, we have the size of the subfile cliques of as (for any ). We now show that has a clique cover with -sized disjoint cliques for some . Therefore is in fact a -caching line graph, giving raise to the main result in this section which is Theorem 4.
IV-C Delivery Scheme from a clique cover of
We first describe a clique of and show that such equal-sized cliques partition . This will suffice to show the delivery scheme as per Theorem 1.
Let \mathbb{Y}=\Big{\{}\{V_{1},V_{2},\cdots,V_{m+2}\}:V_{i}\in\mathbb{V},\forall i\in[m+2] such that \sum\limits_{i=1}^{m+2}V_{i}\in PG_{q}(k-1,m+t)\Big{\}}. We now present a clique of size in .
Lemma 5**.**
Consider . Then is a clique in .
Proof:
First note that is well defined as (otherwise will not be a -dim space) and hence . Consider two distinct vertices . It is clear that and . By the construction of we have and . Therefore we have . By invoking Lemma 2, . Hence proved. ∎
Now we show that the cliques partition .
Lemma 6**.**
, where the union is a disjoint union.
Proof:
Consider such that . By definition of , we have . Now consider an arbitrary vertex . By the construction of , . Therefore lies in the clique, (defined as in Lemma 5). Hence proved. ∎
Finally we present our coded caching scheme using the caching line graph constructed above.
Theorem 4**.**
The caching line graph constructed above is a -caching line graph and defines a coded caching scheme with
,
,
, and .
Proof:
From Lemma 4 and the notations in Lemma 1, we get the expression of and Further we see that the subfile cliques partition the vertices of by definition and also for any (the size of each subfile clique). By Lemma 5 and Lemma 6, the size of the cliques of is and they partition the vertices. Hence is a -caching line graph.
Thus, we have by Theorem 1,
[TABLE]
[TABLE]
[TABLE]
[TABLE]
This completes the proof. ∎
V Asymptotic analysis of the proposed scheme
In this section, we analyse the behaviour of for the coded caching scheme proposed in Section IV as is lower bounded by a constant and . We show that , while . Towards this end, we first give some bounds on -binomial coefficients. These can be easily derived, however a proof is available in [9].
Lemma 7**.**
[9*]**
For non-negative integers , for being some prime power,*
[TABLE]
We now proceed to analyse the asymptotics of the scheme. Throughout our analysis we assume is constant. Consider,
[TABLE]
To lower bound by a constant, let , where is a constant. Note that as . We have . We analyse our scheme as grows large. By Lemma 7, we have . Hence we have
[TABLE]
The rate expression in Theorem 4 can be written as .
Therefore by using (2) we have
Consider, . Now by Taylor’s series expansion we have as . Therefore we have . Hence .
Consider, . Now if , by Taylor’s series expansion we have as . Therefore we have . If , then . Hence . Therefore .
We now obtain the asymptotics for subpacketization . By Lemma 7 and from the expression of in Theorem 4, we have
[TABLE]
Once again, by applying Lemma 7 to the above expression, we have, As , thus (by Lemma 7). Hence . Also by (2) we have, , which can be written as . Using Stirling’s approximation for as for large , and after some simplifications we see that .
Finally in Table II, we compare the scheme in Theorem 4 with the scheme in [6] for some choices of and (the global caching gain, i.e., , where is the rate achieved by the scheme). We label the parameters of our scheme in Theorem 4 as where . The parameters of the scheme presented in [6] are where . The first column lists according to the Theorem 4. The second column lists parameters of the scheme in [6]. Since we can not match the parameters from this work and [6] exactly, we choose approximately equal values. We see from the table that our scheme performs much better than [6] in terms of the subpacketization, but pays a price in terms of the rate.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Y. Fadlallah, A. M. Tulino, D. Barone, G. Vettigli, J. Llorca, and J. Gorce, “Coding for caching in 5g networks,” IEEE Communications Magazine , vol. 55, no. 2, pp. 106–113, February 2017.
- 2[2] “Cisco visual networking index: Forecast and trends, 2017-2022,” Available at : www.cisco.com .
- 3[3] M. A. Maddah-Ali and U. Niesen, “Fundamental limits of caching,” IEEE Transactions on Information Theory , vol. 60, no. 5, pp. 2856–2867, May 2014.
- 4[4] K. Wan, D. Tuninetti, and P. Piantanida, “On the optimality of uncoded cache placement,” in 2016 IEEE Information Theory Workshop (ITW) , Sep. 2016, pp. 161–165.
- 5[5] L. Tang and A. Ramamoorthy, “Coded caching schemes with reduced subpacketization from linear block codes,” IEEE Transactions on Information Theory , vol. 64, no. 4, pp. 3099–3120, April 2018.
- 6[6] Q. Yan, M. Cheng, X. Tang, and Q. Chen, “On the placement delivery array design for centralized coded caching scheme,” IEEE Transactions on Information Theory , vol. 63, no. 9, pp. 5821–5833, Sep. 2017.
- 7[7] M. Cheng, Q. Yan, X. Tang, and J. Jiang, “Coded caching schemes with low rate and subpacketizations,” ar Xiv preprint ar Xiv:1703.01548 , 2017.
- 8[8] Q. Yan, X. Tang, Q. Chen, and M. Cheng, “Placement delivery array design through strong edge coloring of bipartite graphs,” IEEE Communications Letters , vol. 22, no. 2, pp. 236–239, Feb 2018.
