A Low-Complexity Cache-Aided Multi-antenna Content Delivery Scheme
Junlin Zhao, Mohammad Mohammadi Amiri, and Deniz G\"und\"uz

TL;DR
This paper proposes a low-complexity cache-aided multi-antenna content delivery scheme that efficiently manages beamforming in multi-user systems, balancing performance and computational complexity.
Contribution
It introduces a time-sharing approach to reduce complexity in multi-antenna beamforming for cache-enabled networks, maintaining near-optimal performance.
Findings
Low-complexity scheme outperforms existing methods across SNR and rate regimes.
The proposed approach achieves minimal performance loss compared to full joint subfile transmission.
Numerical results confirm effectiveness in high SNR and limited spatial degrees of freedom scenarios.
Abstract
We study downlink beamforming in a single-cell network with a multi-antenna base station (BS) serving cache-enabled users. For a given common rate of the files in the system, we first formulate the minimum transmit power with beamforming at the BS as a non-convex optimization problem. This corresponds to a multiple multicast problem, to which a stationary solution can be efficiently obtained through successive convex approximation (SCA). It is observed that the complexity of the problem grows exponentially with the number of subfiles delivered to each user in each time slot, which itself grows exponentially with the number of users in the system. Therefore, we introduce a low-complexity alternative through time-sharing that limits the number of subfiles that can be received by a user in each time slot. It is shown through numerical simulations that, the reduced-complexity beamforming…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCaching and Content Delivery · Cooperative Communication and Network Coding · Advanced MIMO Systems Optimization
A Low-Complexity Cache-Aided Multi-Antenna Content Delivery Scheme
Junlin Zhao
Department of Electrical and Electronic Engineering, Imperial College London, London, SW7 2BT, UK
Emails: {j.zhao15, m.mohammadi-amiri15, d.gunduz}@imperial.ac.uk
Mohammad Mohammadi Amiri
Department of Electrical and Electronic Engineering, Imperial College London, London, SW7 2BT, UK
Emails: {j.zhao15, m.mohammadi-amiri15, d.gunduz}@imperial.ac.uk
Deniz Gündüz
Department of Electrical and Electronic Engineering, Imperial College London, London, SW7 2BT, UK
Emails: {j.zhao15, m.mohammadi-amiri15, d.gunduz}@imperial.ac.uk
Abstract
We study downlink beamforming in a single-cell network with a multi-antenna base station (BS) serving cache-enabled users. For a given common rate of the files in the system, we first formulate the minimum transmit power with beamforming at the BS as a non-convex optimization problem. This corresponds to a multiple multicast problem, to which a stationary solution can be efficiently obtained through successive convex approximation (SCA). It is observed that the complexity of the problem grows exponentially with the number of subfiles delivered to each user in each time slot, which itself grows exponentially with the number of users in the system. Therefore, we introduce a low-complexity alternative through time-sharing that limits the number of subfiles that can be received by a user in each time slot. It is shown through numerical simulations that, the reduced-complexity beamforming scheme has minimal performance gap compared to transmitting all the subfiles jointly, and outperforms the state-of-the-art low-complexity scheme at all SNR and rate values with sufficient spatial degrees of freedom, and in the high SNR/high rate regime when the number of spatial degrees of freedom is limited.00footnotetext: This work was partially supported by the European Research Council (ERC) through project BEACON (No. 725731), and by the European Union’s Horizon 2020 Research and Innovation Programme through project SCAVENGE (No. 675891).
I Introduction
Research efforts in efficient video delivery have intensified recently. A promising approach to enhance the performance of content delivery is to exploit cache memories across the network. Proactive content caching at user devices during off-peak periods can improve the spectral efficiency, and reduce the bandwidth requirements and the transmission delay, particularly for video-on-demand services [1].
The coded cache placement and delivery scheme proposed in [2] creates and exploits multicasting opportunities to users with arbitrary demands; thereby achieving a global caching gain. While [2] considered a noiseless broadcast channel, application of coded delivery in real cellular systems requires designing a multicasting strategy over a noisy broadcast channel jointly with the caching and delivery scheme [3, 4]. As an efficient physical layer transmission technique, multicast beamforming, where the multi-antenna base station (BS) multicasts distinct data streams to multiple user groups, has been extensively studied in cellular networks [5, 6]. Inspired by the multicasting feature of coded content delivery schemes [2], and the high efficiency of multi-antenna beamforming, there have been recent research efforts to incorporate multi-antenna beamforming techniques into cache-aided coded content delivery. In [3], coded delivery is employed along with zero-forcing to simultaneously exploit spatial multiplexing and caching gains. By treating the transmission of coded subfiles as a coordinated beamforming problem, improved spectral efficiency is achieved in [7] by optimizing the beamforming vectors. A reduced-complexity scheme is also presented in [7] by limiting the number of users served at any time.
In this paper, we consider the problem studied in [7]. Firstly, a general framework for cache-aided downlink beamforming is formulated, focusing on the minimum required transmit power under user quality-of-service (QoS) requirements. The resultant nonconvex optimization problem is tackled by successive convex approximation (SCA), which is guaranteed to converge to a stationary solution of the original nonconvex problem. As noted in [7], the transceiver design and computational complexities increase drastically with the number of coded messages each user decodes at any time slot. We propose a novel content delivery scheme which flexibly adjusts the number of coded messages each user decodes in each time slot. Unlike the scheme in [7], our proposed scheme does not limit the number of users served in each time slot, but directly limits the number of messages each user decodes. Our numerical results show that the proposed scheme can provide significant gains in terms of transmit power, particularly in the high rate/high signal-to-noise ratio (SNR) regime.
II System Model
We consider downlink transmission within a single cell, where a BS equipped with antennas serves single-antenna users, as illustrated in Fig. 1. A library of files , each distributed uniformly over set is available at the BS, where and represent the rate of each file and the blocklength, respectively. Each user is equipped with a cache that can store up to files, and the caching factor is defined as .
Contents are placed at users’ caches during off-peak periods, known as the placement phase, without any prior information on user requests. The caching function for user is denoted by , which maps the library to the cache contents at user , i.e., , . After the users reveal their demands , signal is transmitted by the BS, where we denote , . An average power constraint is imposed on each channel input . User receives
[TABLE]
where is the channel vector from the BS to the -th user, and is the additive white Gaussian noise at user with each entry independent and identically distributed according to , . We assume the channel state information (CSI) is perfectly known to the BS and the receivers. Hence, the encoding function is . After receiving , user reconstructs using its cache content , channel vector , and demand vector through function , i.e., , . The probability of error is defined as . An tuple is achievable if there exist a sequence of caching functions , encoding function , and decoding functions , such that as . For file rate and cache size , our main goal is to characterize
[TABLE]
III An Achievable Delivery Scheme
In this section, we present a multi-antenna transmission scheme with coded caching, following the placement and coded content generation proposed in [2].
III-A Placement and Delivery Schemes
We follow the cache placement scheme of [2]: For a caching factor , we represent -element subsets of by . File , , is divided equally into disjoint subfiles , each of bits. User , , caches subfile , if , . The cache content of user is then given by .
The delivery phase is performed once the demands are revealed. For any demand combination , we aim to deliver the coded message
[TABLE]
to all the users in set , for . Observe that, after receiving , each user can recover subfile having access to , .
We define as the set of all the messages, with each message represented by the set of users it is targeting, and is the subset of messages targeting user . We have and .
The following example will be used throughout the paper to explain the proposed scheme:
Example: Let . We have . Each file is split into disjoint subfiles of the same size, where we represent file , , as
[TABLE]
The cache content of user is , , which satisfies the cache capacity constraint. For a demand combination , all user demands can be fulfilled by delivering the following subfiles:
[TABLE]
III-B Multi-antenna Transmission Scheme
The delivery of the coded messages in set to their respective sets of receivers is a multi-antenna multi-message multicasting problem. Before introducing our low-complexity scheme in the next section, we present here a general transmission strategy based on message-splitting and time-division transmission. The messages in are transmitted over orthogonal time slots, the -th of which is of blocklength , , where . The transmitted signal at time slot is given as
[TABLE]
where is the unit power complex Gaussian signal of block length , modulated from the corresponding message in (3), intended for the users in set , transmitted in time slot , encoded by the beamforming vector .
The received signal at user in time slot is
[TABLE]
where is the complement of set in . Let denote the collection of all non-empty subsets of , with each element of denoted by . We denote as the subset of messages transmitted in time slot , i.e., if .
Note that each user may decode more than one message in each transmission slot. From the capacity region of the associated Gaussian multiple access channel, following conditions must be satisfied for successful decoding of all the intended messages at user , , at time slot :
[TABLE]
where is the rate of the message , and is the received signal-to-interference-plus-noise ratio (SINR) of message at user at time slot , given by
[TABLE]
for any , or equivalently, any . The rate of message is the sum of the rate of submessages , and must satisfy
[TABLE]
III-C Total Transmit Power Minimization
To characterize the minimum power consumption in (2) under the multi-antenna framework, a power minimization problem is formulated under the requirement of successfully delivering all the desired files to each user at the prescribed common rate. For any given transmission scheme represented by the rates of the subfiles transmitted in each time slot and the blocklength of each time slot , the minimum average required power can be obtained as
[TABLE]
where
[TABLE]
The optimization problem in (11) is nonconvex due to the constraints in (11b); hence, it is computationally intractable. In order to obtain an effective feasible solution of the problem (11), we leverage the SCA algorithm, which is known to converge to a stationary point of the original nonconvex problem [8]. The details of the implementation of SCA are omitted, and will be elaborated in the longer version.
IV Low-Complexity Delivery scheme
We propose a content delivery scheme with the flexibility to adjust the number of coded messages intended for each user at each time slot. Assuming a set of messages are transmitted in time slot , messages are transmitted to user , then we have constraints for user in time slot in problem (11). Computational complexity increases drastically with the number of constraints, rendering the numerical optimization problem practically infeasible. In addition, a multi-user detection scheme, e.g., successive interference cancellation (SIC), needs to be employed at the users, whose complexity also increases with .
A low complexity scheme is proposed in [7] by limiting the number of users to be served in each time slot, thereby indirectly reducing the number of coded messages to be decoded by each user. Instead of limiting the subsets of users to be served in each time slot, we propose to directly adjust the number of coded messages targeted to each user. We will show that this results in a more efficient delivery scheme than the one in [7].
In the setting of our Example, if we transmit all the messages in one time slot, i.e., , a total of coded subfiles are transmitted simultaneously, with each user decoding messages. Accordingly, in the optimization problem in (11) we will have constraints. To alleviate the computational complexity, the low complexity scheme in [7] splits each subfile into minifiles, and the coded messages are grouped to serve a subset of users in each of the time slots. Within each time slot, each user needs to decode 2 messages to recover the corresponding minifiles. Note that the power minimization problem for each time slot can be solved independently; therefore, we would need to solve 10 smaller optimization problems, each with constraints.
In contrast, we propose to serve as many users as needed at each time slot while keeping under a given threshold for each user . In our Example, we can satisfy all the user requests in only time slots, by setting nonzero rate targets for the messages in
[TABLE]
in time slots 1 and 2, respectively. Note that each user decodes only messages in time slot , the same as the delivery scheme in [7], requiring the same implementation complexity at each user; however, users are served in each time slot, which results in a significantly smaller number of time slots. Thus, we need to solve only two optimization problems at the BS, each with constraints.
In general, the number of constraints in the optimization problem in (11) increases exponentially with , which results in exponentially increasing number of constraints in the problem in each SCA iteration. Thus the computational complexity of the delivery scheme can be largely alleviated by choosing a small value, which also simplifies the multi-user detection algorithm.
In our proposed low-complexity scheme, we seek to divide set into disjoint subsets , with for , while keeping as small as possible. To this end, we propose a greedy algorithm to construct the disjoint sets for any value, as shown in Alg. 1. Specifically, ’s are generated in a sequential manner: to construct , given the initialized , , and the set of remaining messages for assignment, user(s) that decode the least number of messages, i.e., user(s) in set , are firstly identified, followed by checking whether there exists a message such that the condition for holds. If such can be found, then is added to and removed from , and is updated by replacing , . We initialize as the set of remaining messages in for checking with the condition, and set each time after a message is checked. If a qualified is not found until , i.e., adding any of the messages in to will violate the above condition, the process of constructing is completed, and we start constructing in the same manner. The whole procedure is completed when , i.e., all the messages have been assigned to a subset. For efficient wireless resource management, we allocate channel uses to transmit the messages in , for . Note that our proposed scheme covers the case of , where all the messages are sent simultaneously, if .
V Simulation Results
In this section, numerical results are presented to demonstrate the effectiveness of the proposed coded delivery scheme with multi-antenna beamforming. We consider a single-cell with radius 500m, and users uniformly randomly distributed in the cell. Channel vectors are written as , , where denotes an i.i.d. vector accounting for Rayleigh fading of unit power, and the path loss exponent is modeled as , with denoting the distance between the BS and the user (in kilometers). The noise variance is set to dBW for all the users. All simulation results are averaged over 1000 independent trials computed with MOSEK in CVX [9]. We note that in simulations of our proposed low-complexity scheme, the blocklength of each time slot is set proportional to the number of messages transmitted in that time slot, and better performance in terms of average transmit power is observed compared to setting equal blocklengths for all the time slots.
The scheme with time slot will be referred to as the full superposition (FS) scheme. FS has the best performance in terms of transmit power given enough number of spatial degrees of freedom, and serves as a baseline, but it also has the highest complexity. To compare our results with those in [7], same number of coded messages are transmitted to each user in each time slot for both schemes. We note here that the scheme in [7] can be improved by serving disjoint subsets of users simultaneously without increasing the complexity, but the improvement is only applicable when the size of user subset divides . Therefore, the scheme in [7] cannot handle certain settings such as the one in the running Example.
We first present the average transmit power as a function of the target rate in Fig. 2 for the setting in our Example, assuming that the BS is equipped with antennas. The scheme in [7] that satisfies is adopted for fair comparison, where users are served in each time slot. We observe that the proposed scheme provides significant savings in the transmit power compared to the one in [7] at all rates. The power savings increase with the target rate as a result of the increased superposition coding gain. Furthermore, the gap between the proposed scheme and FS is quite small, and remains almost constant with rate. At , the power loss of the scheme in [7] and ours compared to FS is about 12 dB and 1 dB, respectively. Hence, the proposed scheme provides significant reduction in the computational complexity without sacrificing the performance much.
Fig. 3 considers the average transmit power versus the rate target for files, users, , and antennas. It is interesting to see that when the target rate is low, the scheme in [7] slightly outperforms both the FS and the proposed schemes. Due to insufficient spatial degrees of freedom, both the FS and the proposed schemes fail to manage the interference between data streams. We conclude that this effect occurs only for low rates, as the benefit of superposition coding becomes more dominant at high rates. We note that the performance of our scheme coincides with that of the scheme in [7] for , but this does not always happen. For instance, when , the only option in [7] to keep the same level of complexity is to serve 3 users in each time slot.
We finally plot in Fig. 4 the power loss of the proposed scheme compared to FS as a function of . Assuming , , , we let take values from , where corresponds to the FS scheme in which all the coded messages are transmitted in one time slot. At the other extreme, when , the model boils down to the single-cell multigroup multicasting problem, which has the lowest computational and implementation complexity. In general, Fig. 4 can be considered as the trade-off curve between the performance and complexity for each target rate value, both of them increasing with .
VI Conclusions
We have proposed a low-complexity coded content delivery scheme for a multi-antenna BS serving cache-enabled users. By limiting the number of coded messages targeted at each user in each time slot, the proposed coded content delivery scheme provides the flexibility to adjust the computational complexity of the optimization problem and the receiver complexity. Compared with the FS scheme, in which all the coded messages are delivered simultaneously, our proposed delivery scheme achieves comparable performance with significantly lower complexity by delivering the coded messages in a time division fashion. It has also been shown that the proposed delivery scheme outperforms the one proposed in [7] for all values of SNR and rate with sufficient spatial degrees of freedom, while the improvement is limited to high data rate values when the BS does not have sufficiently many transmit antennas. When considering practical implementations, one must choose a suitable value of that yields an acceptable performance while keeping the complexity feasible.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] K. C. Almeroth and M. H. Ammar, “The use of multicast delivery to provide a scalable and interactive video-on-demand service,” IEEE J. Sel. Areas Commun. , vol. 14, no. 6, pp. 1110–1122, Aug. 1996.
- 2[2] M. A. Maddah-Ali and U. Niesen, “Fundamental limits of caching,” IEEE Trans. Inf. Theory , vol. 60, no. 5, pp. 2856–2867, May 2014.
- 3[3] S. P. Shariatpanahi, G. Caire, and B. H. Khalaj, “Physical-layer schemes for wireless coded caching,” Co RR , vol. abs/1711.05969, 2017. [Online]. Available: http://arxiv.org/abs/1711.05969
- 4[4] M. Mohammadi Amiri and D. Gündüz, “Caching and coded delivery over Gaussian broadcast channels for energy efficiency,” IEEE J. Sel. Areas Commun. , vol. 36, no. 8, pp. 1706–1720, Aug. 2018.
- 5[5] N. D. Sidiropoulos, T. N. Davidson, and Z.-Q. Luo, “Transmit beamforming for physical-layer multicasting,” IEEE Trans. Signal Process. , vol. 54, no. 6, pp. 2239–2251, Jun. 2006.
- 6[6] Z. Xiang, M. Tao, and X. Wang, “Coordinated multicast beamforming in multicell networks,” IEEE Trans. Wireless Commun. , vol. 12, no. 1, pp. 12–21, Jan. 2013.
- 7[7] A. Tölli, S. P. Shariatpanahi, J. Kaleva, and B. H. Khalaj, “Multi-antenna interference management for coded caching,” Co RR , vol. abs/1711.03364 v 3, 2018. [Online]. Available: http://arxiv.org/abs/1711.03364 v 3
- 8[8] G. Scutari, F. Facchinei, and L. Lampariello, “Parallel and distributed methods for constrained nonconvex optimization—Part I: Theory,” IEEE Trans. Signal Process. , vol. 65, no. 8, pp. 1929–1944, Apr. 2017.
