Vulnerability of D2D Communications from Interconnected Social Networks
Tianyi Pan, Md Abdul Alim, Xiang Li, My T. Thai

TL;DR
This paper investigates how rumors in online social networks can negatively impact device-to-device communications, reducing network throughput by up to 40%, and proposes methods to identify critical nodes that cause such vulnerabilities.
Contribution
It introduces the problem of identifying critical rumor-spreading nodes in OSNs that threaten D2D communication performance and provides an efficient solution for this detection.
Findings
Up to 40% reduction in network throughput due to rumors.
Proposed an efficient method for critical node detection.
Validated vulnerability through extensive simulations.
Abstract
In this paper, we study how rumors in Online Social Networks (OSNs) may impact the performance of device-to-device (D2D) communication. As D2D is a new technology, people may choose not to use it when believed in rumors of its negative impacts. Thus, the cellular network with underlaying D2D is vulnerable to OSNs as rumors in OSNs may decrement the throughput of the cellular network in popular content delivery scenarios. To analyze the vulnerability, we introduce the problem of finding the most critical nodes in the OSN such that the throughput of a content delivery scenario is minimized when a rumor starts from those nodes. We then propose an efficient solution to the critical nodes detection problem. The severity of such vulnerability is supported by extensive experiments in various simulation settings, from which we observe up to reduction in network throughput.
| Notation | Description |
|---|---|
| Set of mobile devices | |
| , the cellular network | |
| with all cellular/D2D links | |
| , the D2D network | |
| Interference set of edge | |
| Capacity of edge | |
| Data content requesters in cellular/D2D mode | |
| Set of outgoing and incoming links of node | |
| , the modified cellular network | |
| The set of all cellular links in | |
| The set of all D2D links in | |
| , | |
| the set of removable edges in | |
| Flow rate on edge | |
| , the throughput of the network, where | |
| denotes the BS and is a virtual sink node | |
| Binary variable indicates whether | |
| attack edge () or not () | |
| Bandwidth allocated to cellular transmission | |
| Bandwidth allocated to D2D transmission | |
| Budget of the adversary |
| Notation | Description |
|---|---|
| , the social network | |
| , | |
| criticality of all users | |
| Expected sum of criticality over all the | |
| nodes influenced by | |
| The optimal seed set | |
| The collection of all RR sets. | |
| # of hyperedges incident to nodes in | |
| among | |
| Estimator for . | |
| Precision parameters of Alg. 3 | |
| Stopping criterion for generating RR | |
| sets, | |
| Seed set outputted by Alg. 3 |
| Notation | Description |
|---|---|
| Cell dimension | x |
| BS position | Lower left corner |
| Network bandwidth | MHz |
| Channel Model | Multipath Rayleigh fading |
| Path Loss Exponent | |
| Noise spectral density | dBm/Hz |
| BS transmit power | W |
| D2D transmit power | W |
| D2D distance | m |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpportunistic and Delay-Tolerant Networks · Caching and Content Delivery · Cooperative Communication and Network Coding
Vulnerability of D2D Communications from Interconnected Social Networks
Tianyi Pan
University of Florida, Gainesville, FL, USA {tianyi,alim,xixiang,mythai}@cise.ufl.edu
Md Abdul Alim
University of Florida, Gainesville, FL, USA {tianyi,alim,xixiang,mythai}@cise.ufl.edu
Xiang Li
University of Florida, Gainesville, FL, USA {tianyi,alim,xixiang,mythai}@cise.ufl.edu
My T. Thai
University of Florida, Gainesville, FL, USA {tianyi,alim,xixiang,mythai}@cise.ufl.edu
In this paper, we study how rumors in Online Social Networks (OSNs) may impact the performance of device-to-device (D2D) communication. As D2D is a new technology, people may choose not to use it when believed in rumors of its negative impacts. Thus, the cellular network with underlaying D2D is vulnerable to OSNs as rumors in OSNs may decrement the throughput of the cellular network in popular content delivery scenarios. To analyze the vulnerability, we introduce the problem of finding the most critical nodes in the OSN such that the throughput of a content delivery scenario is minimized when a rumor starts from those nodes. We then propose an efficient solution to the critical nodes detection problem. The severity of such vulnerability is supported by extensive experiments in various simulation settings, from which we observe up to reduction in network throughput.
1 Introduction
D2D communication is a promising approach to cope with the rapidly increasing demand of mobile data [1], in which user equipments (UEs) directly communicate with each other while bypassing the cellular network’s base stations (BSs). For the cellular network, utilizing D2D communication can therefore offload its mobile data traffic and significantly boost the overall performance [2, 3, 4, 5].
However, as a new technology, D2D communication will be likely to face doubts from various perspectives, such as efficiency, safety, etc. When users lacking the knowledge of D2D, exaggerated disadvantages of D2D can also be generated and spread as rumors, either randomly by some normal users, or intentionally by malicious individuals. With OSN as the medium, rumors can propagate conveniently and affect a large portion of the users in the network [6, 7, 8] and cause the users to opt-out of D2D. When a requester chooses not to use D2D, its request has to be served by a cellular link; when a relay device disables D2D, some requesters in its proximity may not be served via D2D anymore. Both situations can potentially increase the burden of the BS and the overall throughput can degrade. Therefore, the cellular network is vulnerable to the rumors spreading in its interconnected OSN. An example is depicted in Fig. 1.
To assess the vulnerability of the cellular network from an interdependent OSN, we propose the TMIN (Throughput Minimization in INterdependent D2D/OSN) problem that asks for a set of nodes in the OSN, such that when a rumor starts from those nodes, the throughput in the cellular network for a popular content delivery scenario is minimized. In order to consider the vulnerability in the worst case, we assume the rumor starts early and all OSN users believed in the rumor will disable D2D on their devices before the content delivery starts. The throughput of the content delivery scenario in the cellular network is then calculated based on the (possibly) reduced number of D2D devices. The main challenge of TMIN is that the data transmission problem in the cellular network and the information propagation problem in the OSN must be jointly considered in order to accurately depict the impact or rumors to D2D communication. With high complexity from both problems, it is difficult to optimally solve TMIN.
To cope with the challenges, we propose Rumor the Critical Framework (RCF) that connects the two networks by first evaluating the criticality of UEs in content delivery and mapping the values to the OSN, and then efficiently solve the Influence Maximization (IM) problem in the OSN, considering node criticality.
In summary, our contributions are as follows:
- •
We analyze the vulnerability of a cellular network with underlaying D2D from an interdependent OSN, by solving the problem TMIN of finding the most critical nodes in the OSN to the throughput in the cellular network.
- •
We propose the algorithm RCF that can efficiently make near optimal decisions for TMIN.
- •
We experimentally evaluate the vulnerability in various simulation settings with realistic cellular network parameters and real-world OSN data.
Related Work. Recently, the use of D2D communication to improve the spectrum efficiency and system capacity has received much interest [2, 9, 11, 12, 13, 14, 3, 15, 16, 5, 4, 17, 18, 19, 20, 21, 22, 23, 24, 25]. Towards this direction, previous works studied resource allocation [3, 19, 20, 22, 21], interference management [11, 13, 12, 15], power control [18, 9, 14] and base station scheduling [25]. In order to construct stable D2D transmission links, the social aspect can be utilized. Social trust and social reciprocity have been leveraged in [23] to enhance cooperative D2D communication based on a coalitional game. In [24], encounter history of the devices as well as their social relations are exploited in order to deploy a reliable D2D communication mechanism. Social communities [26] were also utilized [27] to facilitate finding stable D2D links. However, none of the existing works can capture how the information diffusion phenomenon in OSNs affects different performance metrics of D2D communication underlaid cellular networks.
In OSNs, the NP-Complete IM problem [28] is widely studied. Since the seminal work by Kempe et al. [28], a plethora of works for IM have been proposed [29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, nguyen2016stop, kuhnle2017scalable, li2017approximate]. The state-of-the-art works [37, 38, 39, 40, nguyen2016stop, li2017approximate] follow the RIS approach proposed in [37]. They guarantee the approximation ratio with a time complexity near linear to the network size. The existing works usually focus on the OSNs, without considering how the information can influence other networks, like D2D networks, that are interconnected with the OSNs.
Organization. The rest of the paper is organized as follows. In Sect. 2, we describe the cellular network, the OSN, their dynamics, interactions and vulnerability. We also formally define the TMIN problem. We propose our solution RCF to TMIN in Sect. 3. The experiment results are illustrated in Sect. 4. We conclude the paper in Sect. 5.
2 System Model
In this section, we present the system model of interest and introduce the two components, the cellular network and the OSN respectively. Then we formally define the TMIN problem.
2.1 The Cellular Network
We study a cellular system as illustrated in Fig. 2. It has a single BS and a set of UEs. We denote the cellular network as , where and denotes all cellular/D2D links. A subset of UEs enables D2D. Denote as the network induced in by . A link exists if and only if UEs are within D2D communication range. A subset of UEs request the same content. where and . As a UE in disables D2D, its request can only be served by cellular links. Instead, a UE in can have its request fulfilled either by cellular or D2D links. The set of UEs does not request the data, but are able to be relay devices for UEs in . For a node , denote as the set of incoming/outgoing links of , respectively.
2.1.1 Cellular Resources
In this paper, we consider the resource sharing model discussed in [3, 5, 25]. In the model, the D2D and cellular links use disjoint portion of the licensed band. Denote the total bandwidth of the BS as . If a resource allocation scheme allocates of the band to cellular links, the bandwidth for D2D links is then . Therefore, interference only exists among D2D links. Additionally, there is no further division in the frequency domain, so that cellular/D2D links will use the full bandwidth /.
2.1.2 Data Rates
Based on the resource sharing model, a cellular link receives no interference from D2D links and uses the full bandwidth. We can express its data rate under distance-dependent path loss and multipath Rayleigh fading as in (1), where is the Signal to Noise Ratio (SNR) for .
[TABLE]
In , is the transmit power of the BS; is the distance between ; is the path loss exponent; is the fading component and is the power of the receiver noise, which is assumed to be the additive white Gaussian noise.
For a D2D link , we must include interference from other D2D links when calculating data rate . Denote as the set of D2D links that transmit at the same time with , we can calculate using (3).
[TABLE]
is the Signal to Interference and Noise Ratio (SINR) and are the transmit powers for UEs respectively.
The model can be extended to handle multiples BSs by adding intercell interference to the denominator part of (2) and (4). We model device mobility (when necessary) as multiple snapshots of static cellular networks for popular content delivery scenarios, since such scenarios usually happen at locations where devices have limited mobility (stadium, shopping mall) or the mobility follows certain routine (office, school).
2.2 The Online Social Network
We abstract the OSN to be a weighted directed graph with a node set and a directed link set , where a node represents a user. A link exists if and only if node follows node in the OSN. Also, each is associated with a weight for information propagation.
2.2.1 Information Propagation
To characterize how rumor propagates in the OSN, we will focus on the Independent Cascading (IC) model [28] in this paper. However, our results can be easily extended to the Linear Threshold (LT) model.
In the IC Model, initially no nodes believe the rumor, we term this as the unactivated status. Given a seed set , the rumor propagates in rounds. In round [math], only the nodes are activated (believed in the rumor). In round , all nodes activated at round will try to activate their neighbors. An activated node will remain activated. It has probability to activate each unactivated neighbor at the next round and it cannot activate any neighbors afterwards. The process stops when no more nodes can be activated. Denote as the expected number of nodes activated by the rumor with as the seed set, where the expectation is taken over all . We call as the influence spread of seed set . When a problem considers differentiated gain of influencing the nodes, we denote the expected gain over all influenced nodes by as .
2.2.2 Interconnection between OSN and the Cellular Network
The owners of the UEs in the cellular network may be users in an OSN, as illustrated in Figure 1. As our focus is on the impact of OSNs to D2D communication, we construct a link only if a UE has a corresponding user . The collection of all such edges is denoted as . We assume that each UE can be related to at most one OSN user and each OSN user owns at most one UE. Otherwise, dummy nodes can be used to recover the one-to-one correspondence. If and , when believed in the rumor, UE will no longer be in and cannot be included in any D2D links.
Definition 1** (TMIN).**
Consider a cellular system with BS , set of devices , set of content requesters and the cellular network . Also, consider the OSN and the correspondence among devices in and users in depicted by the edge set . The information propagation is described using the IC model with probability for each . TMIN asks for a seed set in with size at most to minimize the throughput in .
Solving TMIN reveals the top- nodes in the OSN that are critical to the cellular network. Also, comparing the throughput in a scenario that is not impacted by rumors and the one solved by TMIN helps characterizing the degree of damage that rumors can cast to the cellular network.
3 Solution to TMIN
In this section, we describe our solution RCF to TMIN by first presenting its overview and then introduce its two subroutines in detail.
3.1 The Overview
Intuitively, the goal for TMIN is to find the most critical nodes in the OSN as seed nodes for the rumor, in order to impact the cellular network throughput the most. However, the OSN alone contains no information about the cellular network. Therefore, we must utilize the cellular network to provide information to the OSN and guide its critical nodes selection. In our solution framework RCF, we first calculate the criticality of all UEs in the cellular network using Alg. 2, based on the solutions for a throughput minimization problem defined in the cellular network. Then, we cast the criticality values of the devices to their corresponding users in the OSN and introduce an efficient targeted IM algorithm (Alg. 3) to find the top- critical nodes in the OSN. The framework RCF is described in Alg. 1, while the two subroutines, Alg. 2 and Alg. 3 are discussed in Sect. 3.2 and 3.3, respectively.
3.2 Criticality Evaluation Scheme
As the goal of the cellular network is to maximize its throughput, the criticality of the devices must be related to their contribution in throughput reduction. To characterize criticality, we start from the problem of finding the top- critical nodes in the cellular network: those whose switching from D2D to cellular mode can minimize the maximum network throughput. With a fixed , we are able to obtain the top- critical nodes, yet with this piece of information alone we can only assign criticality value for the top- nodes and [math] for all the remaining. Such criticality values can be misleading as the values may change drastically with other values. Therefore, it is necessary to consider the top- critical nodes for various values of and integrate the pieces of information, in order to have a complete view of how critical the devices are.
In the following, we first propose an approach to find the top- critical nodes in the cellular network, via solving a bi-level mixed integer linear program (MILP). Then, based on the solution of the MILP, we discuss Alg. 2, Node Criticality Evaluation (NCE), to determine node criticality.
3.2.1 Find the Critical Nodes in the Cellular Network
Under the system model discussed in 2.1, the cellular network can be modeled as a flow network. The capacity and flow of a cellular/D2D link is its maximum data rate and actual data rate, respectively. However, special care is required in the construction, for linearizing the calculation of data rate, and modeling devices switching from cellular to D2D mode, which will be discussed respectively as follows.
Data Rate Estimation. In practice, calculating data rate often requires nonlinear formulas considering interference management and resource allocation. However, adding nonlinear constraints to a formulation can greatly increase its complexity. To avoid this, we describe an approach to estimate data rate. As the main purpose of studying TMIN is to demonstrate how cellular network throughput can be impacted by rumors in OSNs, the estimated data rates suffice.
To estimate data rates, we first discuss the collision management for D2D and cellular links. Links sharing the same band cannot transmit concurrently if they are in the same access domain, in order to avoid collision. We denote such links as an Interference Set. The interference set for a link , , can be defined as where is a tunable parameter and is the maximum D2D communication distance. Notice that the link itself is also contained in . With the interference sets for all links, we can obtain the set of links that are allowed to transmit at the same time, , given one active link . is constructed as follows. First we set . Then we iteratively select a link into if it causes the largest drop in data rate among links that are not in the interference sets of links in . Assume each device has a fixed transmission power, the SINR for each D2D link can therefore be obtained using (4). Based on (3), the data rate is proportional to the bandwidth, which is a variable. For notation convenience, we calculate using (3), assuming unit bandwidth. Therefore, the maximum data rate of link can be expressed by , recall that is the bandwidth assigned to D2D communications. Similarly, we can use (1), (2) to obtain the maximum data rate for a cellular link , which is . The actual data rate is modeled as a linear variable upper bounded by the maximum data rate.
Network Modification To create a flow network with a single source and a single sink, we introduce additional components, including a global sink node as well as the links and . A link among those has infinite capacity and an empty interference set. Next, we model nodes switching from D2D mode to cellular mode. Two types of devices may switch their modes. The first type is the relay devices . When such devices turn to cellular mode, we can remove them from the network as they neither request any data nor contribute to D2D transmission. The second type is the receiver devices . When such devices switch their modes, they still request data from the BS. So we must remove all D2D links associated with those devices, but keep their cellular links. To model a switch as a link removal, we split each node into two nodes . The incoming D2D links are connected to and all outgoing links are connected to . Denote as the set of all D2D links. For the cellular link , we replace it with if and if . Let as the set of all cellular links. Nodes are connected by link , which has an empty interference set and infinite capacity. Let . Denote the modified graph as where
[TABLE]
and
[TABLE]
We can easily verify that switching a node from D2D to cellular in is equivalent to removing link in .
With the data rate model and the modified network , we formulate the problem as a bi-level MILP .
[TABLE]
The objectives (5) (outer stage) and (8) (inner stage) guarantees the problem to be a minimization of the maximum throughput . Constraint (6) restrict the solution to be top- critical nodes. The binary variable reaches if link is removed. Therefore, at most links in , i.e. at most devices in can be removed. Constraint (9) is the flow balance constraint. The traffic received by a device must be equal to what it transmits. Collision management in the cellular network are considered in (10) and (11), by which we ensure that only one link among all links in an interference set can transmit at any point of time. Also, since the traffic must be non-negative, (10) and (11) upperbound the traffic on any link by its data rate. Constraint (12) model the case that no flow can be assigned to a removed edge. Notice that we omit all links with infinite capacity for the capacity constraints. Constraint (13) limits the total bandwidth being used by cellular and D2D communications.
As cannot be solved directly due to its bi-level structure, we reformulate it as a single-level MILP by dualization and linearization techniques. The solution of is attainable via existing MILP solvers.
Dualization of the Inner Stage. By dualization, we can transform the inner stage to a equivalent minimization problem, so that the original minimax bi-level formulation is equivalent to a minimization problem in only one stage. Denote as the dual variables corresponds to constraints (9)-(13), respectively. The reformulated program after dualization is denoted as .
[TABLE]
Linearization. The original dual objective contains a quadratic term , which largely increases complexity. To achieve a linear formulation, we substitute with and add constraints (18) to (20). The equivalence after the step is proved in Lemma 1.
Lemma 1**.**
Proof.
First, we claim that the range of any is . Based on the objective, both and should be minimized. As the only constraint that potentially requests an value be higher than [math] is inequality (25), the value of can be written as for . Based on constraints (21) to (24), the largest value for any is 1, otherwise the value will be unnecessarily large and the solution is not optimal. Therefore, for all . By constraints (18) to (20), when , ; when , (as ). As is binary, we have in all cases. ∎
3.2.2 The NCE Algorithm
By the discussion at the beginning of Sect. 3.2, we can use the solutions of to support criticality evaluation as in Alg. 2. The program is solved under a set of budgets. Denote the solutions as ,…,, we define the criticality of node as
[TABLE]
The definition of is aligned with the concept of being critical. A device that appears in more solutions has higher criticality than another device that contributes to less solutions. Intuitively, all users with no devices in the considered cellular network have [math] criticality.
We assume the availability of the full knowledge of the cellular network in this paper. In reality, a BS covering a business region may observe some frequently reappearing location patterns or mobility traces of the UEs in the form of snapshots, or static networks. Therefore, if an adversary wants to exploit the vulnerability of the interdependent cellular and social networks, s/he can calculate the criticality of the devices using Alg. 2 in all snapshots. However, how an adversary attacks the networks in reality is out of the scope of this paper, as our aim is to understand the vulnerability in the worst case.
3.3 Targeted IM Algorithm
With Alg. 2, we can quantitatively evaluate the criticality of all nodes in the OSN. Due to the way the criticality values are assigned, when more nodes with large criticality values in the OSN are influenced, we can expect a more severe throughput reduction in the cellular network. Therefore, the problem of finding the top- critical nodes in the OSN can be interpreted as the targeted-IM problem of finding seed nodes to maximize the total criticality of all influenced nodes. In the following, we first propose an efficient Targeted-IM algorithm based on the reverse influence sampling (RIS) technique and then prove its guaranteed approximation ratio and near-linear running time.
Brief Review of the RIS Technique. RIS was first proposed in [37], which casts light on efficient IM algorithms. It first samples Reverse Reachable (RR) sets and then apply a greedy maximum coverage (MC) algorithm to obtain the seed set. An RR set consists of the set of nodes that can reach an origin node in a sample graph . With enough RR sets, one can estimate the influence each seed set can have to the whole network, by relating the seed set and the RR sets to a coverage instance, in which an RR set corresponds to an element and a node in the graph corresponds to a set. An element is covered by a set if and only if the node exists in the RR set. Approximately, the influence spread of a seed set is positively correlated to the number of covered RR sets over the total number of RR sets, greedily solving the MC can output a seed set with near-optimal influence spread. However, to establish an accurate estimation of influence spread of each seed set without exert too much burden to computation, the number of RR sets generated must be carefully selected.
In our scenario, as the nodes are of different criticality values, the probability of starting a random RR set from node is , proportional to its criticality values. Denote as the number of RR sets in covered by seed set .
To limit the number of RR sets, we set exponential check points during RR set generation. At each check point, we greedily solve the weighted MC problem [41] and examine if exceeds a threshold, where is the solution by the greedy MC. If so, we stop RR set generation and output as the seed set. The Targeted-IM algorithm is presented in Alg. 3. With an improved threshold, the time complexity of Targeted-IM is a constant smaller than the state-of-art target IM algorithm BCT [40].
In the following, we present the approximation ratio and time complexity of Alg. 3. For conciseness, the proof for Theorem 1 is placed in appendix.
Theorem 1**.**
With , , the seed set calculated by Alg. 3 satisfies with probability at least .
Theorem 2**.**
The expected running time of Alg. 3 is .
The proof is a straightforward extension of the one in [40], and is omitted.
4 Experiments
In this section, we evaluate how cellular network performance can be impacted when the critical nodes in the OSN propagates the rumor. In the following, we first discuss the setup for the experiments in Section 4.1. In Section 4.2, we measure the vulnerability of the cellular network. Then we consider how the actions of the users and the BS can reduce the vulnerability in Section 4.3 and 4.4, respectively.
4.1 Setup
Cellular Network. As the real data sets are limited due to our requirements, we choose to use synthetic data for the cellular network. In a square m by m area, we generate data requesters at random locations. Further, we select of them to be in D2D mode and the other remain in cellular mode. To enable D2D communication, we also generate D2D relay devices at random locations within the area. All the wireless parameters are summarized in Table 3.
OSN. To model information propagation in the OSN, we use real-world Facebook network topology from [42]. The Facebook data consists of 4039 users and 88234 social network interactions between the users [43]. The influence between a pair of users was assigned randomly from a uniform distribution.
Interdependency. For the interconnection between the two networks, we consider two scenarios, namely a stadium scenario and a shopping mall scenario. In the stadium scenario, the owners of the devices are not very likely to be socially connected. In the shopping mall scenario, however, as the shoppers normally resides closer than the game goers, the owners of the devices are more likely to be connected in the OSN, compared with the stadium scenario.
We use the following procedure to create the interconnections. For each D2D node , we construct link between and a uniformly randomly chosen user from the OSN. Once is constructed, with probability , we uniformly randomly choose a neighbor of and a neighbor of and construct link . With probability , we construct link for a random neighbor of and a random neighbor of , when is constructed. In order to differentiate the different level of social connections among users in the cell area, we apply higher values for the shopping mall scenario. In our experiments we have set and for the stadium scenario, and and for the shopping mall scenario.
Algorithms. To illustrate the efficacy of our proposed method RCF, especially the criticalness evaluation scheme, we compare RCF with two approaches. The first approach uses degree centrality as criticalness values and runs the same targeted IM algorithm embedded in RCF to obtain the critical nodes in the OSN. The second approach, as a baseline, randomly assigns criticalness values to guide critical nodes selection in the OSN.
4.2 Vulnerability of the Cellular Network
In this section, we study how vulnerable the cellular network is to rumors in the OSN, in terms of throughput reduction. Also, we demonstrate that social connectivity of the nodes in the cellular network can impact the vulnerability. Additionally, we explore another dimension of vulnerability: how much bandwidth is necessary to bring the throughput back to the original value.
We first observe how the throughput can be impacted with varying seed set size of spreading rumors in the OSN. In Figure 3b, for both scenarios, the throughput of the cellular network experiences substantial decreases after the rumor propagation. The decrements are higher with increasing number of seeds (i.e. critical nodes). With RCF, the damage to throughput is severer than other approaches, which means that RCF can better reveal the critical nodes. The gap between RCF and the other methods increases significantly as more seed nodes can be selected. In Figure 3a, the throughput drops when critical nodes outputted by RCF are used as seeds of rumor. It is worth noticing that even the random approach can lead to noticeable decrease in throughput. Therefore, the interdependent cellular/social networks is vulnerable not only to rumors that are intentionally spread, but also to those that are randomly originated.
Next, we examine how social connectivity impacts the throughput. In theory, in a network with higher level of social connections, rumors can cause larger influence than a network with less social connections, with the same seed set size. Therefore, the cellular network can be more vulnerable when the interdependent OSN is more densely connected.
The result depicted in Figure 6 supports the analysis. At each seed set size, we calculate the relative throughput decrease for different scenarios. Denote the throughput after rumor propagation with seed nodes as , we define .
As can be seen from Figure 6, for the shopping mall scenario increases faster in most of the cases, which demonstrates that the social connections actually aid the rumor propagation, as well as increase the vulnerability. The slow increase in for the shopping mall scenario after seed set size can be interpreted as a saturation case, that all D2D users are already influenced and the throughput cannot be further decreased. It explains why of the stadium scenario surpasses that of the shopping mall scenario after seed set size .
Then, we show in Figure 6 the additional bandwidth cost requirement that arises from increasing the throughput to the level before rumor propagation. Consistent with our previous findings, the critical nodes found by RCF result in the largest bandwidth requirement and thus exerts the highest pressure to the BS. After the rumor propagation, the extra bandwidth required can be up to Hz. When a BS runs out of its own bandwidth and fails to supply the additional bandwidth, the throughput will be decreased drastically and the data requesters can experience a low data rate, which is not preferable. However, supply the extra requirement can be costly and thus a heavy burden to the BSs. This phenomenon emphasizes how vulnerable the cellular network can be, from another direction.
4.3 Reduce Vulnerability from the OSN
In this section, we study how users’ actions can help in reducing the vulnerability of the cellular network. In OSNs, not all users are equally sensitive to rumors of D2D communication. Users having larger exposure to related knowledge are likely to identify the rumors and remain uninfluenced. So, they are not likely to opt out using D2D nor spread the rumor. In contrast, the users with limited knowledge to D2D are susceptible to rumors and are much more likely to opt out using D2D.
To quantitatively describe the users’ knowledge to D2D, we use the term ”User Awareness” (UA) which ranges from to with means most aware of D2D related rumors. UA is in effect in the IC model in the following way. Denote the awareness for user as . In the original IC model, node can influence node with probability . With UA, the probability is modified to . Therefore, the most aware users () can never be influenced. Clearly, when users become more aware of the rumors, the rumor propagation will be limited and the vulnerability of the cellular network to the rumors will be reduced.
In this experiment, we fix the number of seed nodes at and vary UA values between [math] and in the stadium scenario. As the results for the shopping mall scenario have the similar trend, we choose not to include them for conciseness. Notice that we set the UA values equally for all nodes. Such a case can describe the situation in which the general public is educated about D2D and the individuals have similar awareness. Figure 6 clearly illustrates the trend that when the user are more aware, the cellular network is less vulnerable in terms of throughput. For the degree based and random methods, the throughput recovered linearly with respect to the increase of UA. However, the seed nodes selected by RCF are ”resistant” to UA. When the UA value is , the throughput after the RCF based rumor propagation only increased by compared with the case without UA. This experiment illustrates that the cellular network can still be vulnerable in the worst case, even if users start to be aware of the rumors.
To have a thorough understanding of UA, we now present the network throughput achieved by RCF for different values of user awareness values under various seed set size. Figure 8 is the heat map representing the impact of user awareness and the number of critical nodes on the network throughput. The throughput is depicted by the RGB colors. As the throughput value gets higher, the color becomes lighter in the heat map. From Figure 7a, we can see that the color is the lightest, i.e. the throughput is the highest, in the top left corner where the number of critical nodes is the least and the users are the most aware.
Sample horizontal and vertical stripes are taken in the heat map and plotted in Figure 7b. The throughputs are shown for four different user awareness levels (1.0, 0.4, 0.2, 0.0). As users become more prone to rumors, which is captured by smaller user awareness values, the network throughput decreases for fixed seed set size. On the other hand, when the number of seed users increases, the throughput decreases for a given user awareness.
4.4 Reduce Vulnerability from the Cellular Network
In Section 4.3, we demonstrate that the vulnerability can be reduced when the users can identify rumors about D2D and thus limit rumor propagation in the OSN. However, improve awareness for the general public may take time. In the meantime, the BS is responsible to ensure the usual network operation and continue to offer the best service. In this section, we discuss how the vulnerability can be reduced from the cellular network. To this end, the BS can identify the critical users to the D2D communication and incentivize them to stay using D2D. Using the same criticalness evaluation scheme, the BS can rank the devices based on their criticalness values. Once the top-ranked devices are identified, the BS offers those devices incentive to keep them in the cellular network. We now investigate how network throughput will change when the critical users are retained.
Figure 8 expresses the scenario where BS has a budget to incentivize the devices in the cellular network. In this experiment, we again use the stadium scenario only for conciseness and fix the number of critical nodes to . We measure the performance of the incentive based approach in terms of throughput gain when the BS has budget . Denote as the throughput calculated by RCF in the situation where the number of critical nodes is and the BS has budget of . Define , we can observe that the more budget the BS has, the larger throughput gain it achieves. Interestingly, this critical user retention scheme is much more effective against RCF compared with the other two approaches for critical nodes identification. The throughput gain is over with BS budget of . This result suggests that incentives can be effective for reducing the vulnerability of the cellular network.
5 Conclusion
In this paper, we investigated the vulnerability of D2D communication from interdependent OSN. To evaluate the vulnerability, we proposed the problem TMIN to find the critical nodes in the OSN. Then we developed a solution framework RCF to TMIN and discussed how it can be extended to consider general flow/diffusion networks. By experimental evaluation, we quantitatively demonstrated how vulnerable the cellular networks could be when rumors can propagate in the interdependent OSN. Also, we illustrated two possible approaches to reduce the vulnerability.
Appendix A Proofs
Proof for Theorem 1.
Proof.
We introduce the following two lemmas for the proof.
Lemma 2**.**
[39]** For any ,
[TABLE]
Lemma 3**.**
[39]** If the number of RR sets is at least
[TABLE]
where
[TABLE]
The seed set outputted by Alg. 3 satisfies
[TABLE]
with probability at least .
Notice that Lemma 3 is not in its original form, as we substitute with .
Next, we show that when Alg. 3 stops, the event that the total number of RR sets generated is less than can happen with probability at most .
Here we consider Bernoulli random variables where is a random RR set. Due to the BSA sampling algorithm [40], the mean of is .
[TABLE]
Combining Lemma 3 and the result above, by union bound, we obtain the desired result for Theorem 1. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Cisco Visual Networking Index Cisco. Global mobile data traffic forecast update, 2015–2020. white paper , 2016.
- 2[2] Klaus Doppler, Mika Rinne, Carl Wijting, Cássio B Ribeiro, and Klaus Hugl. Device-to-device communication as an underlay to lte-advanced networks. Communications Magazine, IEEE , 47(12):42–49, 2009.
- 3[3] Chia-Hao Yu, Klaus Doppler, Cassio B Ribeiro, and Olav Tirkkonen. Resource sharing optimization for device-to-device communication underlaying cellular networks. Wireless Communications, IEEE Transactions on , 10(8):2752–2763, 2011.
- 4[4] Gábor Fodor, Erik Dahlman, Gunnar Mildh, Stefan Parkvall, Norbert Reider, György Miklós, and Zoltán Turányi. Design aspects of network assisted device-to-device communications. Communications Magazine, IEEE , 50(3):170–177, 2012.
- 5[5] Lei Lei, Zhangdui Zhong, Chuang Lin, and Xuemin Shen. Operator controlled device-to-device communications in lte-advanced networks. IEEE Wireless Communications , 19(3):96, 2012.
- 6[6] Nam P Nguyen, Ying Xuan, and My T Thai. A novel method for worm containment on dynamic social networks. In Military Communications Conference, 2010-MILCOM 2010 , pages 2180–2185. IEEE, 2010.
- 7[7] Nam P Nguyen, Guanhua Yan, My T Thai, and Stephan Eidenbenz. Containment of misinformation spread in online social networks. In Proceedings of the 4th Annual ACM Web Science Conference , pages 213–222. ACM, 2012.
- 8[8] Lidan Fan, Zaixin Lu, Weili Wu, Bhavani Thuraisingham, Huan Ma, and Yuanjun Bi. Least cost rumor blocking in social networks. In Distributed Computing Systems (ICDCS), 2013 IEEE 33rd International Conference on , pages 540–549. IEEE, 2013.
