On the Performance of Mobility-Aware D2D Caching Networks

Sameh Hosny; Atilla Eryilmaz; Alhussein A. Abouzeid; Hesham El; Gamal

arXiv:1903.10071·cs.NI·March 26, 2019

On the Performance of Mobility-Aware D2D Caching Networks

Sameh Hosny, Atilla Eryilmaz, Alhussein A. Abouzeid, Hesham El, Gamal

PDF

TL;DR

This paper explores how mobility and demand statistics can be leveraged in D2D caching networks to optimize service costs, proposing centralized and decentralized caching schemes with game-theoretic analysis.

Contribution

It introduces a mobility-aware D2D caching framework with novel centralized and decentralized schemes, including a greedy algorithm and Stackelberg game analysis for user decision-making.

Findings

01

Greedy caching algorithm achieves near-optimal performance with polynomial complexity.

02

Mobility statistics significantly influence caching decisions and network performance.

03

Identified regimes where the Stackelberg equilibrium is non-unique, affecting caching strategies.

Abstract

The increase in demand for spectrum-based services forms a bottleneck in wireless networks. Device-to-Device (D2D) caching networks tackle this problem by exploiting user's behavior predictability and the possibility of sharing data between them to alleviate the network congestion. However, capturing mobility statistics allows Service Providers (SPs) to enhance their caching strategies. In this work, we introduce a mobility-aware D2D caching network where SP harnesses user demand and mobility statistics to minimize the incurred service cost through an optimal caching policy. We investigate two caching schemes: centralized and decentralized caching schemes. In the centralized caching scheme, SP makes the caching decision towards its cost minimization to increase its profit. However, the complexity of optimal caching policy grows exponentially with the number of users. Therefore, we…

Equations147

I_{n, t}^{m} = {1, 0, with probability p_{n, t}^{m}, with probability 1 - p_{n, t}^{m} .

I_{n, t}^{m} = {1, 0, with probability p_{n, t}^{m}, with probability 1 - p_{n, t}^{m} .

C^{R} = T \to \infty lim sup \frac{1}{T} t = 1 \sum T E [C (n = 1 \sum N m = 1 \sum M S_{m} p_{n, t}^{m})] .

C^{R} = T \to \infty lim sup \frac{1}{T} t = 1 \sum T E [C (n = 1 \sum N m = 1 \sum M S_{m} p_{n, t}^{m})] .

0 \leq x_{n}^{m} \leq S_{m}, \forall n, m

0 \leq x_{n}^{m} \leq S_{m}, \forall n, m

L_{t}^{P}

L_{t}^{P}

\displaystyle+\sum_{m=1}^{M}\biggl{(}S_{m}-\sum_{n=1}^{N}x_{n}^{m}\biggr{)}^{+}\sum_{n=1}^{N}p_{n,t}^{m}\underbrace{\sum_{l=1}^{L}\prod_{n=1}^{N}\theta_{n,t}^{l}}_{\text{all users are together}}

\displaystyle+\sum_{m=1}^{M}\sum_{n=1}^{N}\biggl{(}S_{m}-x_{n}^{m}\biggr{)}p_{n,t}^{m}\underbrace{\biggl{(}1-\sum_{l=1}^{L}\sum_{k=2}^{N}\sum_{a_{k}\in\mathcal{A}_{k}}\prod_{j\in a_{k}}\theta_{j,t}^{l}\prod_{i\notin a_{k}}\Bigl{(}1-\theta_{i,t}^{l}\Bigr{)}\biggr{)}}_{\text{every user is alone}}

\mathcal{A}_{n}=\Bigl{\{}a_{n}:=\bigl{(}k_{1},\cdots,k_{n}\bigr{)},k_{j}\in\bigl{\{}1,2,\cdots,N\bigr{\}}\hskip 2.84526pt\forall j\Bigr{\}}

\mathcal{A}_{n}=\Bigl{\{}a_{n}:=\bigl{(}k_{1},\cdots,k_{n}\bigr{)},k_{j}\in\bigl{\{}1,2,\cdots,N\bigr{\}}\hskip 2.84526pt\forall j\Bigr{\}}

C^{\mathcal{P}}=\limsup_{T\to\infty}\frac{1}{T}\sum_{t=1}^{T}\mathbb{E}\biggl{[}C\Bigl{(}L_{t}^{\mathcal{P}}\Bigr{)}\biggr{]}+r\sum_{n=1}^{N}\sum_{m=1}^{M}x_{n}^{m}

C^{\mathcal{P}}=\limsup_{T\to\infty}\frac{1}{T}\sum_{t=1}^{T}\mathbb{E}\biggl{[}C\Bigl{(}L_{t}^{\mathcal{P}}\Bigr{)}\biggr{]}+r\sum_{n=1}^{N}\sum_{m=1}^{M}x_{n}^{m}

min C^{P}

min C^{P}

s.t. (\ref E q : C o n s t) .

L^{P}

L^{P}

\displaystyle+\sum_{m=1}^{M}\biggl{(}1-\sum_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}\sum\limits_{n=1}^{2}\Bigl{(}S_{m}-x_{n}^{m}\Bigr{)}p_{n}^{m}

\displaystyle\min\hskip 14.22636ptL^{\mathcal{P}}+r\sum_{m=1}^{M}\Bigl{(}x_{1}^{m}+x_{2}^{m}\Bigr{)}

\displaystyle\min\hskip 14.22636ptL^{\mathcal{P}}+r\sum_{m=1}^{M}\Bigl{(}x_{1}^{m}+x_{2}^{m}\Bigr{)}

s.t. 0 \leq x_{n}^{m} \leq S_{m}, \forall n, m

\displaystyle L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\Bigl{(}x_{1}^{m}p_{2}^{m}+x_{2}^{m}p_{1}^{m}\Bigr{)}\sum_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}}_{\text{sharing gain}}

\displaystyle L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\Bigl{(}x_{1}^{m}p_{2}^{m}+x_{2}^{m}p_{1}^{m}\Bigr{)}\sum_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}}_{\text{sharing gain}}

\displaystyle L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}\Bigl{(}S_{m}-x_{n}^{m}\Bigr{)}p_{n}^{m}\sum_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}}_{\text{sharing gain}}

\displaystyle L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{2}\Bigl{(}S_{m}-x_{n}^{m}\Bigr{)}p_{n}^{m}\sum_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}}_{\text{sharing gain}}

r

r

0

r

r

ρ_{i}

min L^{P} + r m = 1 \sum M n = 1 \sum 3 x_{n}^{m} s.t. 0 \leq x_{n}^{m} \leq S_{m}, \forall n, m

min L^{P} + r m = 1 \sum M n = 1 \sum 3 x_{n}^{m} s.t. 0 \leq x_{n}^{m} \leq S_{m}, \forall n, m

\begin{split}L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{i\neq j}\left(\Bigl{(}x_{i}^{m}p_{j}^{m}+x_{j}^{m}p_{i}^{m}\Bigr{)}\sum_{l=1}^{L}\theta_{i}^{l}\theta_{j}^{l}\right)}_{\text{sharing gain}}\end{split}

\begin{split}L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{i\neq j}\left(\Bigl{(}x_{i}^{m}p_{j}^{m}+x_{j}^{m}p_{i}^{m}\Bigr{)}\sum_{l=1}^{L}\theta_{i}^{l}\theta_{j}^{l}\right)}_{\text{sharing gain}}\end{split}

\begin{split}L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}\Bigl{(}S_{m}-x_{n}^{m}\Bigr{)}p_{n}^{m}v_{n}}_{\text{sharing gain}}\end{split}

\begin{split}L^{\mathcal{P}}=\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}S_{m}p_{n}^{m}}_{\text{reactive load}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}x_{n}^{m}p_{n}^{m}}_{\text{caching gain}}-\underbrace{\sum_{m=1}^{M}\sum_{n=1}^{3}\Bigl{(}S_{m}-x_{n}^{m}\Bigr{)}p_{n}^{m}v_{n}}_{\text{sharing gain}}\end{split}

v_{i}=\sum_{l=1}^{L}\biggl{(}\theta_{i}^{l}\theta_{j}^{l}+\theta_{i}^{l}\theta_{k}^{l}\Bigl{(}1-\theta_{j}^{l}\Bigl{)}\biggr{)},i\neq j\neq k

v_{i}=\sum_{l=1}^{L}\biggl{(}\theta_{i}^{l}\theta_{j}^{l}+\theta_{i}^{l}\theta_{k}^{l}\Bigl{(}1-\theta_{j}^{l}\Bigl{)}\biggr{)},i\neq j\neq k

\displaystyle r<r_{1}=\min\limits_{i=1,2,3}\frac{1}{T}\sum\limits_{t=1}^{T}p_{i,t}^{m}\Bigl{(}1-v_{i,t}\Bigr{)},0\leq r_{1}\leq 1

\displaystyle r<r_{1}=\min\limits_{i=1,2,3}\frac{1}{T}\sum\limits_{t=1}^{T}p_{i,t}^{m}\Bigl{(}1-v_{i,t}\Bigr{)},0\leq r_{1}\leq 1

\displaystyle r>r_{2}=\max\limits_{i=1,2,3}\left\{\max\limits_{k\neq j\neq i}\left\{\frac{1}{T}\sum\limits_{t=1}^{T}\left(p_{j,t}^{m}\Bigl{(}1-\sum\limits_{l=1}^{L}\theta_{i,t}^{l}\theta_{j,t}^{l}\Bigr{)}+p_{k,t}^{m}\sum\limits_{l=1}^{L}\theta_{j,t}^{l}\theta_{k,t}^{l}\Bigl{(}1-\theta_{i,t}^{l}\Bigr{)}\right)\right\}\right\}

\displaystyle r>r_{2}=\max\limits_{i=1,2,3}\left\{\max\limits_{k\neq j\neq i}\left\{\frac{1}{T}\sum\limits_{t=1}^{T}\left(p_{j,t}^{m}\Bigl{(}1-\sum\limits_{l=1}^{L}\theta_{i,t}^{l}\theta_{j,t}^{l}\Bigr{)}+p_{k,t}^{m}\sum\limits_{l=1}^{L}\theta_{j,t}^{l}\theta_{k,t}^{l}\Bigl{(}1-\theta_{i,t}^{l}\Bigr{)}\right)\right\}\right\}

r >

r >

k_{1} = i = 1, 2, 3 arg max \frac{1}{T} t = 1 \sum T p_{i, t}^{m} + j \neq = i \sum (p_{j, t}^{m} l = 1 \sum L θ_{i, t}^{l} θ_{j, t}^{l})

k_{1} = i = 1, 2, 3 arg max \frac{1}{T} t = 1 \sum T p_{i, t}^{m} + j \neq = i \sum (p_{j, t}^{m} l = 1 \sum L θ_{i, t}^{l} θ_{j, t}^{l})

(k_{1}, k_{2}) = i, j = 1, 2, 3 i \neq = j arg max \frac{1}{T} t = 1 \sum T p_{i, t}^{m} + p_{j, t}^{m} + k \neq = i, j \sum p_{k, t}^{m} v_{k, t}

(k_{1}, k_{2}) = i, j = 1, 2, 3 i \neq = j arg max \frac{1}{T} t = 1 \sum T p_{i, t}^{m} + p_{j, t}^{m} + k \neq = i, j \sum p_{k, t}^{m} v_{k, t}

(1 N) + (2 N) + \dots + (N - 2 N) + (N - 1 N) = k = 0 \sum N (k N) - (N N) - (0 N) = 2^{N} - 2

(1 N) + (2 N) + \dots + (N - 2 N) + (N - 1 N) = k = 0 \sum N (k N) - (N N) - (0 N) = 2^{N} - 2

s_{i}=\frac{1}{T}\sum\limits_{t=1}^{T}\left(p_{i,t}^{m}+\sum\limits_{j\neq i}\biggl{(}p_{j,t}^{m}\sum\limits_{l=1}^{L}\theta_{i,t}^{l}\theta_{j,t}^{l}\biggr{)}\right),\forall i

s_{i}=\frac{1}{T}\sum\limits_{t=1}^{T}\left(p_{i,t}^{m}+\sum\limits_{j\neq i}\biggl{(}p_{j,t}^{m}\sum\limits_{l=1}^{L}\theta_{i,t}^{l}\theta_{j,t}^{l}\biggr{)}\right),\forall i

\displaystyle s_{ij}=\frac{1}{T}\sum\limits_{t=1}^{T}\Biggl{(}p_{i,t}^{m}+p_{j,t}^{m}+\sum\limits_{k\neq i,j}\biggl{(}p_{k,t}^{m}\sum\limits_{l=1}^{L}\Bigl{(}\theta_{i,t}^{l}\theta_{k,t}^{l}+\theta_{j,t}^{l}\theta_{k,t}^{l}-\theta_{i,t}^{l}\theta_{j,t}^{l}\theta_{k,t}^{l}\Bigr{)}\biggr{)}\Biggr{)},\forall i,j

\displaystyle s_{ij}=\frac{1}{T}\sum\limits_{t=1}^{T}\Biggl{(}p_{i,t}^{m}+p_{j,t}^{m}+\sum\limits_{k\neq i,j}\biggl{(}p_{k,t}^{m}\sum\limits_{l=1}^{L}\Bigl{(}\theta_{i,t}^{l}\theta_{k,t}^{l}+\theta_{j,t}^{l}\theta_{k,t}^{l}-\theta_{i,t}^{l}\theta_{j,t}^{l}\theta_{k,t}^{l}\Bigr{)}\biggr{)}\Biggr{)},\forall i,j

\displaystyle r_{N-1}=\frac{1}{T}\sum\limits_{t=1}^{T}\Biggl{(}p_{k_{2},t}^{m}\Bigl{(}1-\sum\limits_{l=1}^{L}\theta_{k_{1},t}^{l}\theta_{k_{2},t}^{l}\Bigr{)}+\sum\limits_{n\neq k_{1},k_{2}}p_{n,t}^{m}\sum\limits_{l=1}^{L}\theta_{k_{2},t}^{l}\theta_{n,t}^{l}\Bigl{(}1-\theta_{k_{1},t}^{l}\Bigr{)}\Biggr{)}

\displaystyle r_{N-1}=\frac{1}{T}\sum\limits_{t=1}^{T}\Biggl{(}p_{k_{2},t}^{m}\Bigl{(}1-\sum\limits_{l=1}^{L}\theta_{k_{1},t}^{l}\theta_{k_{2},t}^{l}\Bigr{)}+\sum\limits_{n\neq k_{1},k_{2}}p_{n,t}^{m}\sum\limits_{l=1}^{L}\theta_{k_{2},t}^{l}\theta_{n,t}^{l}\Bigl{(}1-\theta_{k_{1},t}^{l}\Bigr{)}\Biggr{)}

s_{i}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On the Performance of Mobility-Aware

D2D Caching Networks

Sameh Hosny, Atilla Eryilmaz, Alhussein A. Abouzeid and Hesham El Gamal

Abstract

The increase in demand for spectrum-based services forms a bottleneck in wireless networks. Device-to-Device (D2D) caching networks tackle this problem by exploiting users behavior predictability and the possibility of sharing data between them to alleviate the network congestion. Usually, network congestion occurs at certain times of the day and in some popular locations. Consequently, the information about user demand alone is not enough. Capturing mobility statistics allows Service Providers (SPs) to enhance their caching strategies. In this work, we introduce a mobility-aware D2D caching network where an SP harnesses users demand and mobility statistics to minimize the incurred service cost through an optimal caching policy. We investigate two caching schemes: a centralized caching scheme and a decentralized caching scheme. In the centralized caching scheme, the SP makes the caching decision towards its cost minimization to increase its profit. However, the complexity of the optimal caching policy grows exponentially with the number of users. Therefore, we discuss a greedy caching algorithm which has a polynomial order complexity. We also use this greedy algorithm to establish upper and lower bounds on the proactive service gain achieved by the optimal caching policy. In the decentralized caching scheme, users take over and make their caching decisions, in a distributed fashion affected by the SP pricing policy, towards their payment minimization. We formulated the tension between the SP and users as a Stackelberg game. Best response analysis was used to identify a subgame perfect Nash equilibrium (SPNE) between users. The optimal solution of the proposed model was found to depend on the SP reward preference, which affects the assigned memory in users devices. We found some regimes for the reward value where the SPNE was non-unique. A fair allocation caching policy was adopted to choose one of these SPNEs. To understand the impact of user behavior, we investigated some special cases to explore how users mobility statistics affect their caching decision. The obtained results in this work allow us to enhance our previously studied content trading model [1] to form a complete vision of mobile content trading. Based on the results obtained in this work, we plan to formulate a mobility-aware content trading marketplace. We expect to achieve more gains by exploiting the users mobility statistics when they are allowed to trade their proactive downloads.

I Introduction

The growth in data traffic represents a crucial problem in mobile networks. More than half a billion mobile devices were added in 2015 causing a 74 $\%$ growth in global mobile data traffic. Nevertheless, an eightfold increase in this traffic is expected between 2015 and 2020. Moreover, three-fourths of the world’s mobile data traffic will be video by 2020 [2]. This increase in demand for spectrum-based services and devices has led network SPs to experience a major demand and supply mismatch during the whole day [3]. This demand disparity is ultimately tied to user behavioral pattern. However, most people follow certain daily routines and hence their behavior is highly predictable [4],[5]. Interestingly, the time-varying user activities, that are ultimately contributing to this mismatch, can be exploited to solve this demand disparity.

The concept of proactive resource allocation for wireless networks was established to control the supplied services to best match the demand patterns [6]. The predictability of user behavior is exploited to balance the wireless traffic over time, and significantly reduce the bandwidth required to achieve a given blocking/outage probability. Device-to-device (D2D) communication has been proposed in [7] as a promising technology that can relief the wireless networks congestion. A pair of end-users, moving within a close proximity to each other, establish a D2D link that can be operated in the unlicensed spectrum band, such as the Industrial, Scientific, and Medical (ISM) radio bands. These D2D links when used as a traffic offloading approach introduces very little or no monetary cost for the end-users.

A tutorial overview of some recent results on base station assisted D2D wireless networks with caching for video delivery was presented in [8]. Some competing conventional schemes and a recently developed scheme based on caching at the user devices was also introduced. Throughput-outage scaling laws of such schemes were discussed. It was shown that, in realistic conditions, the D2D caching scheme largely outperforms all other competing schemes both in terms of per-user throughput and in terms of outage probability. A D2D caching network under arbitrary demand was considered in [9]. It was shown that if each node in the network can reach in a single hop all other nodes, then the proposed scheme achieves almost the same throughput of [10]. Moreover, if concurrent short range transmissions can co-exist in a spatial reuse scheme, then the throughput has the same scaling law of the reuse-only case [11, 12] or the coded-only case [10]. Although previous models utilized the D2D communication to alleviate network congestion, they considered a grid network formed by a set of nodes placed on a regular grid on the unit square and user’s mobility was not captured in this work.

The authors in [13] considered the model of [14] and showed that the per-user throughput can increase dramatically when nodes are mobile rather than fixed. This improvement was obtained under several idealistic assumptions. They assumed complete mixing of nodes trajectories in the network and random mobility pattern was not considered. They also assumed that data contents are delay tolerant and stated that their ideas were not very relevant to real-time applications. Caching data contents in users devices helps us to overcome the delay constraint. Furthermore, a practical mobility model is required to represent a more realistic behavior of the users. There are many mobility models in the literature which try to capture user behavior [15]. In this work, we focus on the individual user mobility based on a probabilistic random walk and defer group mobility for our future work.

We consider a D2D caching network where the SP is aware of the user demand and mobility. We consider the results presented here as a forward step towards a mobile content marketplace. The obtained results will allow us to enhance our content trading model presented in [1]. Our aim is to show that exploiting the information about user mobility helps the SP to optimize its caching strategy and address the network congestion problem in an intelligent manner. Moreover, users can achieve more gains when they consider their mobility statistics and the locations where they can meet other users in the network. We investigate two caching schemes: a centralized caching scheme and a decentralized caching scheme. In the centralized caching scheme, the SP makes the caching decision towards its cost minimization to increase its profit. In the decentralized caching scheme, users take over and make their caching decisions, in a distributed fashion affected by the SP pricing policy, towards their payment minimization. Our main contributions are:

We introduce an optimal centralized caching policy that allows SP to enhance its caching decisions based on the user demand and mobility statistics. 2. 2.

The complexity of the optimal centralized caching policy grows exponentially with the number of users. Therefore, We introduce a sub-optimal policy based on a greedy algorithm that has a polynomial order complexity. 3. 3.

Using the proposed greedy algorithm, we establish upper and lower bounds on the gain achieved by the optimal policy for the proactive service cost. 4. 4.

We investigated how the SP chooses an optimal reward to incentives users to participate in the proposed centralized caching policy. 5. 5.

We extend our work by considering a decentralized caching policy. The tension between the SP and users is modeled as a Stackelberg game. Best response analysis was used to identify a subgame perfect Nash equilibrium (SPNE) between users. 6. 6.

The optimal solution of the proposed model was found to depend on the SP reward preference, which affects the assigned memory in users devices. We found some regimes for the reward value where the SPNE was non-unique. A fair allocation caching policy was adopted to choose one of these SPNEs. 7. 7.

We studied the relation between the users assigned memory and the reward they receive from the SP. This part studies the tension between the SP and users to choose an appropriate memory size for the decentralized caching policy. 8. 8.

To understand the impact of user mobility, we considered some special cases when users have similar behavior. We studied the effect of these special cases on the centralized and decentralized caching policies. 9. 9.

The obtained results in this work allow us to enhance our content trading model presented in [16] to form a complete vision about mobile content marketplace.

The rest of this paper is organized as follows. In Section II, we lay out the system setup and define the characteristics of its main components. We study the performance of the centralized caching scheme in Section III. In Section IV, we study the decentralized caching scheme. The paper is concluded in Section V.

II System Model

We consider a wireless network consisting of a set of $N$ users $\mathcal{N}=\{1,2,\cdots,N\}$ and a single Service Provider (SP) who supplies $M$ data items $\mathcal{M}=\{1,2,\cdots,M\}$ upon demand. Each data item $m\in\mathcal{M}$ has a size $S_{m}>0$ which may be a movie (as in YouTube and Netflix), a sound track (as in Panadora), a social network update (as in Facebook and Twitter), a news update (as in CNN and Fox News), etc. Each user may request any of these data items in a random fashion. We consider a time-slotted system where SP divides the duration of interest (e.g. a day) into $T$ time slots. We assume that the duration of each slot is the time taken for a user to completely consume the requested data item and hence each time slot is in the order of minutes or possibly hours. At the beginning of each time slot, SP collects the demand of all users and supplies them with the requested data items.

II-A User Demand Model

We assume that SP can track, learn and predict user behavior over time and hence constructs a demand profile for every user $n$ denoted by $\mathbf{\Pi}_{n}=\left(\mathbf{p}_{n,t}\right)_{t}$ . For any time slot $t$ , $\mathbf{p}_{n,t}=\left(p_{n,t}^{m}\right)_{m}$ where $p_{n,t}^{m}$ is the probability that user $n$ requests item $m$ in time slot $t$ . The demand of user $n$ in time slot $t$ is captured by a random variable $\mathbb{I}_{n,t}^{m}$ where

[TABLE]

We assume that at any time slot $t$ , $\mathbb{I}_{n,t}^{m}$ is independent of $\mathbb{I}_{n,t+1}^{m},\forall n,m$ . We aslso assume that for any $n\neq k$ , $\mathbb{I}_{n,t}^{m}$ is independent of $\mathbb{I}_{k,t}^{m},\forall m,t$ . Furthermore, the demand profile of each user follows a cyclo-stationary pattern that repeats itself in a period of $T$ time slots. That is, we can write $\mathbf{p}_{n,t}=\mathbf{p}_{n,t+kT}$ for any non-negative integer $k$ . As an example, the $T$ -slot period can be interpreted as a single day through which the activity of each user varies each hour, but occurs with the same statistics every day. SP relates these time slots with the actual day time based on users demand statistics to recognize the time slots where it experiences low demand (off-peak times) and those where a high demand occurs (peak times).

II-B User Mobility Model

We assume that SP is interested in $L$ popular locations $\mathcal{L}=\{1,2,\cdots,L\}$ like airports, schools, shopping malls, stadiums or governmental buildings where high demand can be related with mobility of users. Moreover, SP can track, learn and predict the mobility of each user over time and hence constructs a mobility profile for every user $n$ denoted by $\mathbf{\Theta}_{n}=\left(\theta_{n,t}^{l}\right)_{t}^{l}$ where $\theta_{n,t}^{l}$ is the probability that user $n$ will be present at location $l$ in time slot $t$ where $\sum_{l=1}^{L}\theta_{n,t}^{l}=1\hskip 2.84526pt\forall n,t$ . We represent user’s mobility by a modified probabilistic version of the random walk mobility model which is based on a discrete-time Markov chain model [17]. We assume that users stay in the same location within a time slot and may move to another location at the beginning of each time slot. Let $\lambda_{n,t}^{l,k}$ be the transition probability that user $n$ moves from location $l$ to location $k$ in time slot $t$ where $\sum_{k=1}^{L}\lambda_{n,t}^{l,k}=1\hskip 2.84526pt\forall n,t,l$ . These transition probabilities may change from one time slot to another to capture the mobility of each user. However, the probability of being at a certain location in a time slot $t$ depends on the location in the previous time slot $t-1$ only, i.e. $\theta_{n,t}^{l}=\sum_{k=1}^{L}\theta_{n,t-1}^{k}\lambda_{n,t}^{k,l}$ where $\theta_{n,1}^{l}=\lambda_{n,1}^{l,l}$ .

Figure 1 (a) shows the state transition diagram of user $n$ in time slot $t$ for $L=3$ locations, like home (H), campus (C) and downtown (D). We assume that each user randomly takes a trajectory everyday starting from one location and moving to other locations. However, we assume that the mobility profile of each user follows a cyclo-stationary pattern that repeats itself in a period of $T$ time slots. For example, everyday user starts from his home, visits some frequent locations throughout the day and then returns back home at the end of the day as shown in Figure 1 (b). SP exploits this mobility to enhance its caching strategy and hence achieves more gain by reducing the incurred service cost.

II-C Proactive Service Scheme

SP tries to smooth out the network load by caching some of these data items at the network edge and exploits users mobility statistics to enhance its caching decision. We assume that one-hop device-to-device (D2D) communication is allowed and can be used to transfer data items between users. A fixed data rate link between all users is assumed. We also consider a non-fading channel between all users where an appropriate network protocol is applied to avoid multiple access interference. In the small timescale, data transmission follows an orthogonal multiple access scheme, hence inter-node interference effect is ignored in our large timescale model. For example, at any location $l$ , SP predicts that a certain item $m$ will experience a high request in time slot $t$ . It also predicts which users will be possibly present at that location in this time slot. This data item can be cached at these users and they can transfer it to other users in their vicinity. Therefore, some of the network load will be shifted to the D2D communication which alleviates the network congestion and yields a reduction in the incurred service cost.

Users occupy part of their device memory for caching these data items and consume some of their batteries to transfer it through the D2D communication. We capture the cost of caching each byte by a parameter $r>0$ . This parameter can be viewed as a rent cost for caching this data. We can also view it as a reward that incentives users to participate in this model and save some of their payments by getting it as a discount in their monthly bills. For simplicity, we assume that users always have enough battery level to transfer cached data items to other users in the network and that they always allow SP to cache data in their devices. This reward promotes users to raise their memory size to be able to cache more data. When $N$ is sufficiently large, we can assume that each user has enough memory space to cache assigned data items since SP distributes cached items over all available users.

III Centralized Caching Scheme

In the centralized caching mode, SP makes the caching decision to push some data items in users devices. SP leverages the information about users demand and mobility statistics to make these decisions towards its cost minimization. Users get reward by participating in this model and find their request either in their local cache or at other users in the same vicinity. Therefore, users can also save some of their payments. To evaluate the system performance, we compare the incurred service cost of the proposed model with the cost of the flat pricing scenario. The definition of the cost function is first defined and then the problem is stated. We introduce an optimal centralized caching policy and resolve its complexity issue through a suboptimal caching policy. We also shed light on the impact of users mobility on the proposed caching policy.

III-A Problem Statement

To supply requested data items, SP incurs a certain service cost due to the resources consumed at each time slot. We denote by $C(L_{t})$ the SP cost for serving a total demand $L_{t}\geq 0$ in time slot $t$ . We also assume that the cost function $C:\mathbb{R}^{+}\rightarrow\mathbb{R}^{+}$ is convex and non-decreasing. We consider a reactive network as a baseline scenario where users’ requests are served upon arrival (in contrast to proactively predicting the demand requests). In this case, the time-averaged expected cost of all users is given by:

[TABLE]

where, the superscript $\mathcal{R}$ indicates reactive operation. In the proposed model, we assume that SP is aware of the demand and mobility profiles of all users over $T$ time slots. SP caches an amount $x_{n}^{m}$ of data item $m$ at user $n$ for a future possible request. Each user $n$ transfers this data to other users through the D2D communication in any time slot $t$ when it is requested. For simplicity, we assume that sharing the cached data in users devices happen for free. In particular, users are not announcing any selling prices and they don’t pay for getting their request from other users. Therefore, when a user requests a certain content, his request will be first served from the cached data in the other users devices who are located around him. If this data content was not cached, the request will be served through the network resources. SP replaces the data cached in users devices when it is expired at the end of the day (i.e. at the end of time slot $T$ ). In particular, SP caches data at the beginning of the day and lets users share it throughout the rest of the day. The cached amount of data item $m$ at each user cannot exceed its size, i.e.

[TABLE]

Hence, under this proactive model, the total network load in time slot $t$ is given by:

[TABLE]

where the superscript $\mathcal{P}$ indicates proactive operation and $A_{n}$ is the set of all possible combinations of $n$ indices. i.e.

[TABLE]

where $|\mathcal{A}_{n}|=\binom{N}{n}$ . We assume that users share the cached data items when they meet each other and get the remaining portion from SP. The total network load in (3) captures all cases when some of the users meet each other, all users meet together or each user is moving alone. Consequently, the corresponding time-averaged expected cost under the proactive operation is given by:

[TABLE]

which captures SP’s cost for serving a proactive demand $L_{t}^{\mathcal{P}}$ and the corresponding cost for caching process. Note that instead of having a cost factor ( $\beta$ ), as in the previous chapters, we are modeling the SP cost in terms of serving the peak load for a normalized cost factor and caching some data items for a reward factor ( $r$ ).

The SP gain is the difference between the reactive cost and the proactive cost under the proposed model which can be denoted by $\bigtriangleup C=C^{\mathcal{R}}-C^{\mathcal{P}}$ . Users save some of their payments by finding the requested data items in their local cache or in the cache of their neighbors. The SP objective is to achieve a positive gain (i.e. $\bigtriangleup C>0$ ) by finding an optimal caching policy $\{x_{n}^{m*}\}_{n,m}$ , which minimizes the time-averaged expected cost, while serving the requested data items on time to all users. The problem is defined as:

[TABLE]

The optimization problem in (5) depends mainly on the cost function $C$ which may be linear, quadratic or a polynomial of higher order. The exact solution of (5) for non-linear cost functions can be obtained using convex optimization techniques. However, this case does not provide clear insights on the effect of user’s mobility. Nevertheless, finding an optimal caching policy will be non-tractable. Instead, we focus here on a linear cost function to reveal some insights and to find an optimal caching policy, which allows SP to achieve a minimum service cost. The complexity of this optimal policy grows exponentially with the number of users $N$ . We overcome this point by introducing a suboptimal policy based on a greedy algorithm which has a polynomial-order complexity. We use the sub-optimal policy to find upper and lower bounds for the optimal policy.

III-B Optimal Centralized Caching Policy Analysis

In this section we introduce an optimal caching policy which achieves a minimum service cost for the proposed model. For a linear cost function, considering all possible cases of the $\bigl{(}.\bigr{)}^{+}$ terms in (3), we wind up with a set of linear programs and the optimal solution is obtained from the one which leads to a minimum cost. We start by considering two simple cases for $N=2,3$ . We use these simple cases to generalize the optimal solution of this centralized caching policy.

III-B1 Case Study ( $N=2$ )

For simplicity, we start by the case when $T=1$ and then extend it for any value of $T$ . In this case, the suffix $t$ can be dropped and the expected load (3) will be:

[TABLE]

And the optimization problem will be:

[TABLE]

The problem decomposes to $M$ sub-problems and we have two sub-cases: either $x_{1}^{m}+x_{2}^{m}<S_{m}$ , which leads to a linear program (LP), where:

[TABLE]

or $x_{1}^{m}+x_{2}^{m}\geq S_{m}$ , which leads to another LP, where:

[TABLE]

Note that the first term in (8) and (9) represents the reactive load of the network, the second term represents the caching gain achieved by caching $x_{1}^{m}$ and $x_{2}^{m}$ at users 1 and 2 respectively, while the last term represents the sharing gain attained when each user transfers his proactive download to the other. The feasibility regions of these LPs are shown in Figure 2.

[TABLE]

For the case of caching twice, it is optimal to cache at users $k_{1}$ and $k_{2}$ if and only if:

We compare the gain achieved by the optimal and greedy policies. For example, in the case of caching once both policies achieve the same gain and we have:

[TABLE]

In the case of caching twice, the proactive service gain of the optimal caching policy is:

[TABLE]

This gain depends mainly on the selection of users $i$ and $j$ . Since level- $(1)$ ranking can not guarantee that these users are the same users as in the greedy policy, this gain is larger than or equal to the gain achieved by the greedy caching policy. The same approach applies to show a similar result for all other cases. ∎

Moreover, the greedy algorithm allows us to establish an upper bound for the optimal proactive service gain. Level- $(1)$ ranking defined in (15) generates $N$ items representing the gain achieved by caching data content $m$ once at one of the users. Adding these gains up provides us with an upper bound for the gain achieved by the optimal caching policy in all cases. For example, the first item is this ranked list is an upper bound for the case of caching once in the optimal policy. The sum of the first two items is an upper bound for the case of caching twice in the optimal policy and so on. This result is stated in the following theorem.

Theorem 2.

Under demand and mobility profiles of $N$ -users and for $T\geq 1$ , the optimal proactive service gain $\bigtriangleup C\left(\mathbf{\Pi}_{n},\mathbf{\Theta}_{n}\right)$ of (5) achieved by Algorithm (1) satisfies:

[TABLE]

where, $\bigtriangleup C_{U}\left(\mathbf{\Pi}_{n},\mathbf{\Theta}_{n}\right)$ is the gain achieved by adding up gains defined in (15).

Proof.

We show this result by comparing the gain achieved by the optimal caching policy with the again achieved by the greedy caching policy by adding up items of level- $(1)$ ranking list (15). For the case of caching once we have:

[TABLE]

which is the same value as in (22). For the case of caching twice, we have:

[TABLE]

Comparing (26) with (23) we see that:

[TABLE]

where $|\mathcal{A}_{k}^{n}|=\binom{N-1}{k-1}$ . The expected payment in (36) captures all the cases when user $n$ meets some users, when he meets all other users or when he is alone. He also pays $r^{{}^{\prime}}$ for caching an amount $x_{n}^{m}$ of each content $m$ . The time-averaged expected payment of user $n$ under the proposed model is given by:

[TABLE]

User’s gain is the difference between the reactive payment and the proactive payment, under the proposed model, which is denoted by $\bigtriangleup\mu_{n}=\mu_{n}^{\mathcal{R}}-\mu_{n}^{\mathcal{P}}$ . Users save some of their payment by finding the requested data items in their local cache or with others user in their neighborhood. User’s objective is to achieve a positive gain (i.e. $\bigtriangleup\mu_{n}>0$ ) by finding an optimal caching policy $\{x_{n}^{m*}\}_{m}$ which minimizes his time-averaged expected payment. The cached amount of data item $m$ at each user cannot exceed its size as mentioned in (2). Therefore, the problem is defined as

[TABLE]

IV-B Optimal Decentralized Caching Policy Analysis

In this section, we introduce an optimal decentralized caching policy which achieves a minimum payment for users. We can see from (36) that the objective function in (38) for user $n$ depends on the decision of the other users. Therefore, we start by the assumption that each user has a complete and perfect information about others and then discuss the sufficient statistics required to find his optimal decision. Moreover, without considering a memory constraint, we can decompose the problem in (38) to $M$ sub-problems and solve it for each content $m$ , separately. We will introduce an optimal decentralized caching policy without considering any memory constraint. We discuss the effect of the memory constraint and how to choose an optimal memory size in Section IV-D. To illustrate the idea of our analysis, we start by considering two simple cases for $N=2,3$ and then use them to generalize the solution.

IV-B1 Case Study ( $N=2$ )

For simplicity, we start by $T=1$ , and then extend it to any value of $T$ . In this case, the suffix $t$ can be dropped and the expected payment of user $1$ will be

[TABLE]

Note that the optimal decision of user $1$ depends on the decision of user $2$ . The problem decomposes to $M$ sub-problems and we have two sub-cases: either $x_{1}^{m}+x_{2}^{m}<S_{m}$ leading to a linear program (LP), where:

[TABLE]

and his optimization problem will be

[TABLE]

or $x_{1}^{m}+x_{2}^{m}\geq S_{m}$ which leads to another LP, where:

[TABLE]

and his optimization problem will be

[TABLE]

The first term in 40 and 42 represents the reactive payment, the second term represents the payment corresponding to caching these data contents. The last term represents the saving in payment achieved by sharing the proactive download of user $2$ . This saving gain depends on the meeting probability between user $1$ and $2$ . We use the Best Response (BR) analysis to find the Sub-game Perfect Nash Equilibrium (SPNE) between them, where

[TABLE]

User $1$ will consider all possible decisions of user $2$ and then make the decision that minimizes his payment for each case. Considering the two sub-cases when $x_{1}^{m}+x_{2}^{m}<S_{m}$ and $x_{1}^{m}+x_{2}^{m}\geq S_{m}$ , we can plot the payment of user $1$ versus $x_{2}^{m}$ as shown in Figure 10. We can draw a similar curve for the payment of user $2$ as function of $x_{1}^{m}$ . Based on these payment functions, we can come up with the best response shown in Figure 11. We can see that users best response depends on the comparison between the caching cost $r^{{}^{\prime}}$ and their interest and mobility statistics. In particular, if $r^{{}^{\prime}}<p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , user $1$ caches this content regardless of what user $2$ does, since its price is very low. If $r^{{}^{\prime}}>p_{1}^{m}$ , user $1$ will not have any incentive to cache this content, since its price is high. When $r^{{}^{\prime}}$ lies between $p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ and $p_{1}^{m}$ , user $1$ prefers to share the payment with user $2$ , i.e. if user $2$ caches an amount $x$ from this content, user $1$ opts to cache an amount $S_{m}-x$ . In particular, when $r^{{}^{\prime}}$ lies in this region, partial caching is an optimal solution.

Now, without loss of generality, we can assume that $p_{1}^{m}>p_{2}^{m}$ and hence we have $p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}>p_{2}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ . We can consider two sub-cases, either $p_{2}^{m}<p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ or $p_{2}^{m}\geq p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , which leads to the solutions shown in Figures 12 and 13, respectively. In Figure 12, we can see that when $r^{{}^{\prime}}<p_{2}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , both users will have enough incentive to cache the content, since its price is very low. When $r^{{}^{\prime}}>p_{1}^{m}$ , both users will opt not to cache, since the price is high. When $p_{2}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}\leq r^{{}^{\prime}}<p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , user $1$ caches this content and user $2$ takes it from him. When $p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}\leq r^{{}^{\prime}}<p_{1}^{m}$ , user $1$ prefers to share the payment with user $2$ . But since $r^{{}^{\prime}}>p_{2}^{m}$ , user $2$ will not have any incentive to participate in caching this content. Therefore, user $1$ will cache the whole content alone. This sub-case does not have any ambiguity and there exits a unique SPNE between both users.

In Figure 13, when $r^{{}^{\prime}}$ is very small, such that it is less than $p_{2}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ and $p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , both users cache this content. When $p_{2}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}\leq r^{{}^{\prime}}<p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}$ , user $1$ still has an incentive to cache this content, and hence user $2$ will depend on him and opt not to cache. When $p_{1}^{m}\biggl{(}1-\sum\limits_{l=1}^{L}\theta_{1}^{l}\theta_{2}^{l}\biggr{)}\leq r^{{}^{\prime}}<p_{2}^{m}$ , partial caching will be an optimal solution. So if user $2$ caches an amount $x$ , user $1$ completes it by caching $S_{m}-x$ . Actually, any value $0\leq x\leq S_{m}$ leads to a Nash equilibrium. This means that we have a non-unique Nash equilibrium in this region. Therefore, it is important to find another dynamic to choose one of these equilibria, as discussed in Section IV-C.

IV-B2 Case Study ( $N=3$ )

We start by $T=1$ , and then we can extend the result for any value of $T$ . The suffix $t$ can be dropped and the expected payment of user $1$ can be written as:

[TABLE]

Following the same best response analysis, user $1$ determines his best response based on the decision of users $2$ and $3$ . Therefore, $\mathcal{B}_{1}(x_{2}^{m},x_{3}^{m})$ is the caching decision which achieves minimum payment for the corresponding values of $x_{2}^{m}$ and $x_{3}^{m}$ . The optimal solution of user $1$ is shown in Figure 14. Basically, each user $n$ decides whether he will be caching the content alone, sharing the payment with others, or discarding it at all based on the relation between $r^{{}^{\prime}}$ , $p_{n}^{m}$ and $p_{n}^{m}\bigl{(}1-v_{n}\bigr{)}$ .

The optimal solution of all users depends on the relation between their interest. For example, suppose $p_{1}^{m}>p_{2}^{m}>p_{3}^{m}$ and $p_{1}^{m}\Bigl{(}1-v_{1}\bigr{)}>p_{2}^{m}\Bigl{(}1-v_{2}\bigr{)}>p_{3}^{m}\Bigl{(}1-v_{3}\bigr{)}$ . The optimal solution will be as shown in Figure 15. When $r^{{}^{\prime}}$ is small enough, all users cache the content. There are some other regions of $r^{{}^{\prime}}$ where partial caching is an optimal solution. We notice that the non-unique equilibrium region expanded because partial caching may occur between user $1$ and $2$ , $2$ and $3$ or $1,2$ and $3$ . We emphasis here that the optimal solution depends on the relation between $p_{1}^{m},p_{2}^{m}$ and $p_{3}^{m}$ and the relation between $v_{1},v_{2}$ and $v_{3}$ . The solution shown in Figure 15 considers one example but there are some other cases. However, the same idea applies to find the optimal solution in each regime of $r^{{}^{\prime}}$ .

IV-B3 Optimal Policy for $N$ -users

Now, from the previous cases, we can infer the optimal decentralized caching policy for a general number of users as shown in Figure 16, where $\hat{p}_{n}^{m}=\frac{1}{T}\sum_{t=1}^{T}p_{n,t}^{m}$ , $\tilde{p}_{n}^{m}=\frac{1}{T}\sum_{t=1}^{T}p_{n,t}^{m}\Bigl{(}1-v_{n,t}\Bigr{)}$ and $v_{n}$ is as defined in (18), $\forall n\in\mathcal{N}$ . Each user compares the caching cost $r^{{}^{\prime}}$ with his interest and mobility statistics $\hat{p}_{n}^{m}$ and $\tilde{p}_{n}^{m}$ to determine whether he is caching the whole content, sharing the cost with others, or discarding it at all. Since there are non-unique equilibrium for the partial caching regime, we are not able to show uniqueness of the SPNE. The following theorem states the existence of the SPNE.

Theorem 3.

For a game of $N$ users, there exists a Subgame Perfect Nash Equilibrium (SPNE) between users.

Proof.

The existence follows from Debreu, Glicksberg and Fan (DGF) theorem since:

•

$x_{n}^{m}\in[0,S_{m}],\forall n\in\mathcal{N}$ are compact and convex.

•

$\mu_{n}^{\mathcal{P}}$ are continuous over $[0,S_{m}],\forall n\in\mathcal{N}$ .

•

$\mu_{n}^{\mathcal{P}}$ are concave by its linearity in $x_{1}^{m},x_{2}^{m},\cdots,x_{n}^{m}$ (for each sub-case separately).

Optimality of the solution was shown by the best response analysis discussed before. ∎

IV-C Fair Caching Allocation

When the caching cost $r^{{}^{\prime}}$ lies in the regime where partial caching is an optimal solution, there exits a non-unique Nash equilibrium. Another dynamic need to be added to the game that allows the users to choose one of these equilibria [18, 19]. A Nash equilibrium is considered payoff dominant if it is Pareto superior to all other Nash equilibria in the game. Unfortunately, it is not clear if any of these equilibira has this feature. For example in the case of $N=2$ , we can see that if user $2$ caches an amount $x$ of content $m$ and user $1$ completes it by caching an amount $S_{m}-x$ , the payment of user $1$ , corresponding to this content, will be

[TABLE]

which is a decreasing function in $x$ . In particular, any increase in $x$ is preferable to user $1$ . On the contrary, the payment of user $2$ , corresponding to this content, will be

[TABLE]

which is an increasing function in $x$ . Therefore, user $2$ will try to reduce $x$ as much as possible. This means that the tension between both users will not lead them to a payoff dominant NE.

A Nash equilibrium is considered risk dominant if it has the largest basin of attraction (i.e. is less risky). In particular, the more uncertainty players have about the actions of the other player(s), the more likely they will choose the strategy corresponding to it. Each user evaluates the risk corresponding to each NE, given that he doesn’t know the reaction of the other users, and chooses the one with the least risk value (e.g. smallest expected payment). Unfortunately, this approach does not necessarily lead us to one of the Nash equilibria. Figure 17 depicts the result obtained for an example of $N=2,T=1$ , where $p_{1}^{m}=0.8,p_{2}^{m}=0.6$ and their meeting probability is $0.5$ . We can see that there are some regimes of $r^{{}^{\prime}}$ where $x_{1}^{m}+x_{2}^{m}$ exceeds $S_{m}$ . In particular, for $0.4\leq r^{{}^{\prime}}\leq 0.5$ , the risk dominance solution does not lead to a Nash equilibrium. The corresponding payments are shown in Figure 17 (b). User $1$ pays more than user $2$ since he is caching more. This also means that user $1$ is more affected by the risk dominance policy.

Preplay communication is another way to coordinate between users. Users agree before playing the game on a certain strategy when $r^{{}^{\prime}}$ lies in the partial caching regime. For example, they may agree on caching amounts proportional to their interests. This coordination may also be imposed by the SP who sets this rule for all users before playing the game. We know that users will pick one of the Nash equilibria since it allows them to minimize their payment. We adopt a fair allocation strategy for this case which is defined as follows.

Definition 2.

For the game of $N$ -users, if $r^{{}^{\prime}}$ lies in the region where they need to share the caching cost, then the fair equilibrium is a NE which satisfies: $x_{n}^{m}=\frac{S_{m}\hat{p}_{n}^{m}}{\sum_{k=1}^{N}\hat{p}_{k}^{m}},\forall k\in\mathcal{N}$ .

Notice that this fair allocation is one of the equilibria. So, if users agree on this strategy before playing the game, none of them will have any incentive to deviate unilaterally. Figure 18 (a) depicts the fair allocation solution for the example mentioned above. Notice that, for $0.4\leq r^{{}^{\prime}}\leq 0.6$ , each user caches an amount proportional to his interest. The corresponding payments are shown in Figure 18 (b). Comparing the results obtained from the fair allocation policy with the risk dominance results, we see that users payments are reduced. The fair allocation policy is one of the pre-play communication policies; however, we can find some other coordination approaches between users. For example, users can make a caching decision such that their corresponding payments are proportional to their interest. We summarize the optimal decentralized caching policy in Algorithm 3.

IV-D Choosing Optimal Memory Size

In the previous section, we introduced the solution of the decentralized caching scheme based on the reward value $r$ (recall that $r^{{}^{\prime}}=1-r$ ). Since, SP pays this reward back to all users, it will always try to reduce this amount as much as possible. But at the same time, this reward creates an incentive for users to participate in this model. We assume that each user has an isolated memory of size $Z_{n}$ . For simplicity, we also assume that all users have the same memory size. Hence, the SP finds an aggregate memory of size $Z=NZ_{n}$ . The SP has a reward preference to assign a certain reward $r$ corresponding to the assigned memory $Z$ . We consider a linear relationship between $r$ and $Z$ . In particular, we assume that $r(Z)=1-\gamma Z$ , for some $\gamma>0$ . This means that the SP gives users more reward when they assign smaller memory and reduces the reward when they assign larger memory. This relation stops the users from increasing their memory and caching everything. At the same time, when the SP needs more memory, it can reduce the reward to push users towards increasing their memory size.

Now, considering this memory constraint, we can rewrite (38) as follows:

[TABLE]

where $L_{n}^{P}$ is the peak load generated by user $n$ . In particular, from (36), we see that $L_{n}^{\mathcal{P}}=\mu_{n}^{\mathcal{P}}-r^{{}^{\prime}}\sum_{m=1}^{M}x_{n}^{m}$ . Converting this problem to an unconstrained problem, we get

[TABLE]

where $r^{{}^{\prime}}$ is the Lagrangian multiplier associated with the constraint $\sum_{m=1}^{M}x_{n}^{m}\leq Z_{n}$ . Notice that the first term in (47) is the problem we solved in Section IV-B without having this memory constraint. Each user solves this optimization problem for all possible values of $Z_{n}$ . The optimal choice of $Z_{n}$ depends on the SP reward preference.

Plotting the Lagrangian multiplier $r^{{}^{\prime}}$ for the solution of (47) versus $Z_{n}$ we get the curve shown in Figure 19. The optimal solution is determined by the intersection point between the SP reward preference and the users reward preference. We can see that $r^{{}^{\prime}}$ takes the values of $\hat{p}_{1}^{m}$ or $\tilde{p}_{1}^{m}$ . At the intersection point, and under the fair allocation scheme discussed in Section IV-C, the optimal solution will be at $Z_{n}^{*}=\frac{S_{m}\hat{p}_{1}^{m}}{\sum_{n=1}^{N}\hat{p}_{n}^{m}}$ . Considering all users, we will have the result shown in Figure 20. The optimal solution $(Z^{*},r^{{}^{\prime}*})$ is at the intersection point between the SP reward preference and the users reward preference.

Note that each sub-region corresponds to one of the solutions shown in Figure 16. Therefore, the intersection point correspond to one of these solutions, where we may have some users are caching the content while others are sharing the cost of caching that content once between them. The best case scenario happens when the intersection leads to a solution where the content is cached once between all users. This means that each user caches a small portion of this data content, based on the relation between his interest and the aggregate interest of all users. This also yields a lower memory consumption as the number of users increases.

IV-E Impact of User Mobility

Users mobility statistics affect the optimal solution of the decentralized caching policy. The optimal decision of each user depends on his meeting probabilities with other users. The user who is meeting others with a higher probability will have more potential for partial caching. In particular, when his meeting probabilities increase, the value of $\tilde{p}_{n}^{m}$ decreases and the region of partial caching increases. To see this, let us consider a special case when users have similar mobility patterns. Further, we consider the case when users visit all locations with the same probability and hence have the same meeting probability. In particular, consider the case when $\theta_{1,t}^{l}=\theta_{2,t}^{l}=\cdots=\theta_{N,t}^{l}=\theta_{t}^{l}=\frac{1}{L},\forall t\in\{1,2,\cdots,T\}$ . Therefore, we have

[TABLE]

When $L\rightarrow\infty$ , we have $v_{n,t}\rightarrow 0,\forall n,t$ . This case is typically similar to the proactive caching model discussed in [16]. Since we are assuming here that $\alpha_{m}=1,\forall m$ , each user will cache the content when his interest exceeds the caching cost, regardless of the other users decision. Note that $r^{{}^{\prime}}$ is similar to $\frac{y_{o}}{y_{p}}$ in the proactive caching model, since we assume that $y_{p}=1$ . For example, the solution of $N=2$ will be as shown in Figure 21. When $r^{{}^{\prime}}$ is smaller than $\hat{p}_{1}^{m},\hat{p}_{2}^{m}$ both users cache this content. When it exceeds $\hat{p}_{1}^{m}$ or $\hat{p}_{2}^{m}$ , the corresponding user opts to avoid caching.

Now suppose that users are moving together such that their meeting probabilities are very close to 1. This is similar to the content trading model. The difference is that all users are setting their selling price to 0, i.e. they are sharing their proactive downloads for free. For example, the optimal solution for $N=2$ shown in Figure 13 will be modified as shown in Figure 22. Since, users are sharing their proactive downloads for free, partial caching will be an optimal solution, instead of having one user caching the content and selling it to all other users.

V Conclusion

We considered a mobility-aware D2D caching network where caching decision is taken based on the users demand and mobility statistics. Two caching schemes, centralized and decentralized, were considered. We started by considering a centralized D2D caching network, where the SP is pushing data items in users devices and pays them a reward for participation. The SP aim was to minimize its incurred service cost by harnessing user’s demand and mobility statistics. An optimal caching policy was introduced that allows the SP to enhance its caching decisions. The complexity of the optimal caching policy was found to grow exponentially with the number of users. Therefore, we introduced a greedy caching policy that has a polynomial order complexity. The proposed greedy algorithm was used to establish upper and lower bounds on the gain achieved by the optimal caching policy. The optimal solution of the proposed model was found to depend on users reward preference which affects the assigned memory in their devices. Our vision was completed by considering a decentralized D2D caching network, where users make the caching decision based on the SP reward. We introduced an optimal caching policy that allows users to minimize their expected payment. We formulated the tension between the SP and users as a Stackelberg game. Best response analysis was used to identify a subgame perfect Nash equilibrium between users. The optimal solution of the proposed model was found to depend on the SP reward preference, which affects the assigned memory in users devices. We found some regimes for the reward value where the SPNE was non-unique. A fair allocation caching policy was adopted to choose one of these SPNEs.

To understand the impact of user behavior, we considered some special cases when users have similar behavior. If users have identical behavior, i.e. they have similar interest and mobility statistics, they receive a similar amount of caching. Moreover, when the number of popular locations grows large, their meeting probability approaches zero and the amount of caching depends on their interest only. We showed that if users have similar interest and different mobility statistics, caching the same content at all of them happens only if their meeting probability is zero. In this case, the amount of data cached at each user depends on their interest only. Users who are meeting each other with a probability of $1$ , split the content caching between them and caching the data item once is optimal. Fair allocation plays an important role here, to pick one of the SPNEs. If users have similar and uniform mobility patterns, i.e. they visit all popular locations with the same probability, the complexity of the centralized optimal caching policy was significantly reduced. We used this special case to show how the mobility-aware model simplifies the proactive caching and the content trading models. Our objective from this part was to explore how users mobility statistics affect the caching decision.

The results of this work extend our understanding for users behavior in D2D caching networks and allow us to add mobility dynamics to our content trading model discussed in [16]. However, there are many aspects that need more investigations. Capturing group mobility in D2D caching networks helps SPs to leverage more statistics about users. This problem has its own importance in studying the correlation between users and how to exploit it to enhance the network performance. We considered mainly the economics point of view in the previous work and discussed cost minimization in the individual mobility model. There are some other metrics that can be used to evaluate the proposed models from different angles, like outage probability and achievable throughput. Further, scaling behavior of such networks is a major point in this direction. We need to investigate the performance of the network when it expands to a larger number of users or data items.

We also need to study the cooperative and distributed caching in social-aware D2D caching networks which is another dimension to capture the correlation between users. Inspired by the main results and insights from the group mobility direction, we can extend it to grasp another parameter which affects the D2D caching networks. The carrier should be able to harness the statistics about relations between users to optimize the cached data items. This should be another thrust towards cost minimization. The SP can also exploit this aspect to shape users demand and consequently maximizes its profit. Over and above, users gain from their relationships with others in many ways. By reducing the service cost, the SP will have more potential to offer lower prices to users, as a way to shape their demand. There will be a higher possibility to find the requested data items among users in the same vicinity.

Bibliography19

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] F. Alotaibi, S. Hosny, H. E. Gamal, and A. Eryilmaz, “A game theoretic approach to content trading in proactive wireless networks,” in 2015 IEEE International Symposium on Information Theory (ISIT) , June 2015, pp. 2216–2220.
2[2] C. V. N. Index, “Cisco visual networking index: Global mobile data traffic forecast update, 2015–2020 white paper,” Tech. rep. Cisco, 2016. url: http://www. cisco. com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/mobile-white-paper-c 11-520862. html (visited on 03/26/2016)(cit. on p. 6), Tech. Rep., 2016.
3[3] FCC, “Spectrum policy task force report, fcc 02-155,” 2002.
4[4] C. Song, Z. Qu, N. Blumm, and A.-L. Barabási, “Limits of predictability in human mobility,” Science , vol. 327, no. 5968, pp. 1018–1021, 2010. [Online]. Available: http://www.sciencemag.org/content/327/5968/1018.abstract
5[5] K. Farrahi and D. Gatica-Perez, “Discovering human routines from cell phone data with topic models,” in Wearable Computers, 2008. ISWC 2008. 12th IEEE International Symposium on . IEEE, 2008, pp. 29–32.
6[6] J. Tadrous, A. Eryilmaz, and H. El Gamal, “Proactive resource allocation: Harnessing the diversity and multicast gains,” Information Theory, IEEE Transactions on , vol. 59, no. 8, pp. 4833–4854, Aug 2013.
7[7] C.-H. Yu, K. Doppler, C. B. Ribeiro, and O. Tirkkonen, “Resource sharing optimization for device-to-device communication underlaying cellular networks,” Wireless Communications, IEEE Transactions on , vol. 10, no. 8, pp. 2752–2763, 2011.
8[8] G. C. M. Ji and A. F. Molisch, “Wireless device-to-device caching networks: Basic principles and system performance,” IEEE Journal on Selected Areas in Communications , vol. 34, no. 1, pp. 176–189, Jan 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On the Performance of Mobility-Aware

Abstract

I Introduction

II System Model

II-A User Demand Model

II-B User Mobility Model

II-C Proactive Service Scheme

III Centralized Caching Scheme

III-A Problem Statement

III-B Optimal Centralized Caching Policy Analysis

III-B1 Case Study (N=2N=2N=2)

Proposition 1**.**

Proof.

III-B2 Case Study (N=3N=3N=3)

Proposition 2**.**

Proof.

III-B3 Optimal Policy for NNN-users

Remark 1**.**

Remark 2**.**

Proposition 3**.**

Proof.

III-C Greedy Centralized Caching Policy

Proposition 4**.**

Proof.

III-D Upper and Lower Bounds Analysis

Theorem 1**.**

Proof.

Theorem 2**.**

Proof.

III-E Choosing Optimal Reward

III-F Impact of User Mobility

Definition 1**.**

IV Decentralized Caching Scheme

IV-A Problem Statement

IV-B Optimal Decentralized Caching Policy Analysis

IV-B1 Case Study (N=2N=2N=2)

IV-B2 Case Study (N=3N=3N=3)

IV-B3 Optimal Policy for NNN-users

Theorem 3**.**

Proof.

IV-C Fair Caching Allocation

Definition 2**.**

IV-D Choosing Optimal Memory Size

IV-E Impact of User Mobility

V Conclusion

III-B1 Case Study ( $N=2$ )

Proposition 1.

III-B2 Case Study ( $N=3$ )

Proposition 2.

III-B3 Optimal Policy for $N$ -users

Remark 1.

Remark 2.

Proposition 3.

Proposition 4.

Theorem 1.

Theorem 2.

Definition 1.

IV-B1 Case Study ( $N=2$ )

IV-B2 Case Study ( $N=3$ )

IV-B3 Optimal Policy for $N$ -users

Theorem 3.

Definition 2.