SoReC: A Social-Relation Based Centrality Measure in Mobile Social Networks
Bowen Li, Zhenxiang Gao, Xu Shan, Weihua Zhou, Emilio Ferrara

TL;DR
This paper introduces SoReC, a social-relation based centrality measure for mobile social networks, leveraging direct and indirect social relations to identify influential users in dynamic environments.
Contribution
It proposes a novel centrality measure using social network information to evaluate user influence in MSNs, considering both direct and indirect social relations.
Findings
SoReC effectively identifies influential users in real-world MSNs.
The measure outperforms existing centrality metrics in dynamic settings.
Simulations demonstrate accurate prediction of future influential users.
Abstract
Mobile Social Networks (MSNs) have been evolving and enabling various fields in recent years. Recent advances in mobile edge computing, caching, and device-to-device communications, can have significant impacts on 5G systems. In those settings, identifying central users is crucial. It can provide important insights into designing and deploying diverse services and applications. However, it is challenging to evaluate the centrality of nodes in MSNs with dynamic environments. In this paper, we propose a Social-Relation based Centrality (SoReC) measure, in which social network information is used to quantify the influence of each user in MSNs. We first introduce a new metric to estimate direct social relations among users via direct contacts, and then extend the metric to explore indirect social relations among users bridging by the third parties. Based on direct and indirect social…
| Cases | a | b | c | d | e | f |
| LocalCom | 0.483 | 0.485 | 0.493 | 0.493 | 0.493 | 0.493 |
| TCCB | 2.400 | 2.418 | 2.510 | 2.510 | 2.510 | 2.510 |
| SRS metric | 0.199 | 0.487 | 0.494 | 0.716 | 0.167 | 0.493 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOpportunistic and Delay-Tolerant Networks · Caching and Content Delivery · Complex Network Analysis Techniques
SoReC: A Social-Relation Based Centrality Measure in Mobile Social Networks
Bowen Li12, Zhenxiang Gao3, Xu Shan1, Weihua Zhou14, Emilio Ferrara 3
E-mail: [email protected], [email protected], [email protected], [email protected], [email protected]
1Institute of Information Engineering, Chinese Academy of Sciences, Beijing 100093, China
2School of Cyber Security, University of Chinese Academy of Sciences, Beijing 100049, China
3Information Sciences Institute, University of Southern California, CA 90292, USA
4Corresponding author
Abstract
Mobile Social Networks (MSNs) have been evolving and enabling various fields in recent years. Recent advances in mobile edge computing, caching, and device-to-device communications, can have significant impacts on 5G systems. In those settings, identifying central users is crucial. It can provide important insights into designing and deploying diverse services and applications. However, it is challenging to evaluate the centrality of nodes in MSNs with dynamic environments. In this paper, we propose a Social-Relation based Centrality (SoReC) measure, in which social network information is used to quantify the influence of each user in MSNs. We first introduce a new metric to estimate direct social relations among users via direct contacts, and then extend the metric to explore indirect social relations among users bridging to their neighbors. Based on direct and indirect social relations, we detect the influence spheres of users and quantify their influence in the networks. Simulations on real-world networks show that the proposed measure can perform well in identifying future influential users in MSNs.
Index Terms:
Centrality, social relations, mobile social networks, real-world techno-social systems
I Introduction
The advances in popularity of wireless networks and mobile devices bring unprecedented prosperity to Mobile Social Networks (MSNs) [1, 2]. Millions of mobile users can directly connect, interact, and share content with each other via their smart devices, which become one of the most important paradigms in the 5G system [3]. In this paradigm, centrality evaluation is a key research issue [4, 5, 6, 7, 8]. It is helpful to identify the influential users in the networks, as this provides important insights into the design and deployment of diverse applications and services in various settings, such as mobile edge computing, content-centric networks, device-to-device communication, etc. Most research [4, 5] on centrality evaluation in MSNs is based on the static-network assumption. The topologies of the networks are supposed to remain the same over time, e.g., a link between two users exists if they had interacted within the observation period, and doesn’t exist otherwise. In other words, any temporal information is essentially disregarded. However, MSNs consisting of mobile devices carried by humans are essentially dynamic environments, i.e., link vary over time, often significantly. Thus, even quite effective centrality measures for static networks, e.g., degree, closeness, node and edge betweenness [9, 10], and PageRank [11], are not ideal for dynamic MSNs. In order to evaluate the centrality of users in such challenging network environments, some researchers [6, 12] built a time-ordered model according to human mobility patterns and tried to quantify the influence of each user by capturing the spatial and temporal characteristics of the networks. While many researchers have studied that mechanism as a means of centrality evaluation in MSNs, the effects of the social nature of MSNs have generally been ignored when considering how to predict the centrality of users over time. Further studies [13, 14] show that spatial and temporal actions (mobility) of humans are not chaotic but are strongly impacted by social relations — these social relations, in turn, have stable long-term characteristics. Thus, social relations need to be taken into consideration when evaluating centrality. In addition, apart from the direct relations, indirect relations also need to be considered, because even two nodes with no direct relation can still have a strong influence on each other as long as the two have some mutual friend(s) — indirect connections can drive social influence. The importance of developing accurate centrality measures in MSNs is further enhanced by the fact that such centrality measures are also often exploited in downstream tasks such as community detection and recommendation systems [15, 16].
In this paper, we investigate the centrality evaluation from the perspective of social relations and propose a centrality measure to identify influential users in dynamic MSNs. First, direct and indirect social relations are studied. A new metric is proposed to estimate direct social relations among contact users by mining their contact patterns. We also give a brief (mathematical and experimental) proof of the metric validity. Apart from the direct social relations, indirect social relations are studied to estimate the relations among users bridging to their neighbors. Combing direct and indirect social relations, we propose a Social-Relation based Centrality (SoReC) measure to quantify the centrality of users in dynamic MSNs. Extensive simulations on real-world mobility networks show that the SoReC measure can well identify future influential users in MSNs. The rest of the paper is organized as follows: In Section II, we briefly depict the mobile social network model. In Section III, we detail our framework for centrality evaluation. In Section IV, we conduct the performance evaluation and discuss the results. Finally, we conclude the paper along with insights into future directions in Section V.
II Network Model
Consider a mobile social network, which consists of mobile devices. Each mobile device can directly communicate with others over short-range radio frequencies when they are within the direct transmission range of each other. For each time slot , the transient MSN is static and denoted as an undirected unweighted graph , where is a set of nodes representing all mobile devices in the network at the time slot , , , and is a set of edges representing the interaction states among the mobile devices at the time slot , , where denotes the physical distance between nodes and . When is not less than a special distance (the maximum wireless transmission distance), the direct interaction between them can occur, thereby an edge forms, otherwise not. If , we say and are adjacent. We assume the time during which the network is observed is finite, from until ; Without loss of generality, we set and . The dynamic MSN in the time interval is expressed as a time-ordered network .
III Framework of Centrality Evaluation
This section details our framework of centrality evaluation, which comprises of two parts: (i) the social relations analysis, and (ii) the centrality quantification. In the former, direct and indirect social relations are explored, respectively. In the latter, the concept of influence spheres is introduced first and then a centrality measure is proposed.
III-A Analysis of Direct Social Relations
The transient nature of the connectivity among nodes, which enables messages to travel over the MSNs, yields challenges in detecting the interaction probabilities among nodes. Since direct interactions among nodes only occur when nodes come into the wireless transmission range of each other, direct social relations arising from physical proximity (contact) need to be analyzed to evaluate interaction intensity (influence strength).
Previous studies have proposed diverse metrics to extract the intensity of direct social relations, such as encounter frequency (EF), total contact duration (TCD), and average separation period (ASP) [7, 8]. But all those metrics have some inadequacies in reflecting the interaction intensity arising from the contacts. For example, consider the six contact patterns in Fig. 1, where the shaded boxes represent the contacts’ duration. Comparing case (a) with case (b), we notice that but . Hence, the contact pattern (b) captures a stronger interaction. In cases (b) and (c), but . Since frequent encounters bring more interactions, pattern (c) is preferable.
Among the previous metrics, the metric cannot differentiate between case (a) and case (b), and the cannot differentiate between case (b) and case (c). Although can assign correct link weights in cases (a), (b) and (c), it fails in other cases. For example, consider case (c) and case (d). If , cannot differentiate between them but case (d) is preferable due to the longer uninterrupted contact duration. Similarly, if , cannot differentiate between case (d) and case (e), even though case (d) offers better content transmission opportunities. Meanwhile, the variance of the contact time is also a factor reflecting the irregularity in the relationship, but all the three metrics are unable to reflect it. Such as for case (c) and case (f), if , then , , . Since a more stable encounter duration captures a more stable content transmission, the relationship in case (c) is preferable to interaction opportunity.
To find a metric that reflects the direct social relation more accurately, we propose a new metric, Social-Relation Stability (SRS), by taking into consideration the aforementioned three factors: frequency, duration, and regularity. The idea is as follows: calculate the contribution of each encounter by the sine function and add them up. We denote as the contact patterns between nodes and during , (), where is the duration of the -th encounter. Hence the SRS metric is defined as
[TABLE]
where . Note that, since the sine function is monotonically increasing and concave in the partial interval, the value of SRS is positively correlated with the frequency, the longevity, and the regularity of interactions. Next, we will mathematically prove the validity of the SRS metric (in Lemma 1-3) and discuss the range of metric values (in Corollary 1).
Lemma 1
The SRS value is positively correlated with contact frequency.
Proof:
Without loss of generality, we assume the total contact duration is constant and contacts’ duration are regular. Substituting into (1), we can get
[TABLE]
Due to the regular contacts’ duration, i.e., , (2) can be rewritten as
[TABLE]
where denotes the total contact duration, . Since when (, when ), increases with the increasing of . Thus, the SRS value is positively correlated with contact frequency. ∎
Lemma 2
The SRS value is positively correlated with contact duration.
Proof:
Similarly, we assume the contact frequency is constant and contacts’ duration are regular. Due to the regular contacts’ duration, the SRS metric can be also derived as (3). Since within and the encounter frequency is a constant, increases with the increasing of . Thus, the SRS value is positively correlated with contact duration. ∎
Lemma 3
The SRS value is positively correlated with contact regularity.
Proof:
For each , , i.e., , so is concave. According to the Jensen Inequality, we can obtain
[TABLE]
where is the positive weights.
Let the weights are all equal and denote as the average contact duration, , then (4) become
[TABLE]
Thus, for two encounter patterns with the same frequency and duration, the more regular one gets a larger SRS value. ∎
Corollary 1
The range space of the SRS value is .
According to the lemma 1-3, the SRS metric gets the maximum value when is close to and each duration is almost the same and infinitesimal, and gets the minimum value when no encounter occurs. The following proof is about .
Proof:
The SRS metric gets the maximum when the following conditions hold:
- •
;
- •
;
- •
.
[TABLE]
Thus . ∎
It is immediate to prove . To illustrate the efficacy of the proposed SRS metric, we utilize the metric to evaluate the direct social relations of cases in Fig. 1 and compare with existing methods, LocalCom [7] and TCCB [8]. The experimental results in Table I show that LocalCom and TCCB, which are based on , fail to differentiate among cases (c)-(f), while our metric can accurately indicate which case supplies more interaction opportunities as argued earlier.
III-B Analysis of Indirect Social Relations
According to the SRS metric, the direct influence between each pair of adjacent nodes can be evaluated. However, apart from direct influence, indirect influence among nodes may come from indirect interactions via neighboring nodes. This type of indirect influence also plays a significant role in centrality evaluation, especially in the absence of strong direct relations among nodes. To further explore indirect influence, an indirect SRS (in-SRS) metric is proposed to reflect the indirect social relations among nodes. Considering a pair of nodes, and , we say indirect influence exists between them if there is a set of nodes , such that an indirect interaction between and can be bridged through those nodes. Here, the in-SRS value is defined as the probability of influencing through all possible indirect interactions. Thus, the in-SRS metric between and is expressed as
[TABLE]
where denotes the indirect influence of the -th indirect interaction, and is defined as the product of all intermediate direct social relations. Formally,
[TABLE]
III-C Evaluation of Node Centrality
By evaluating the direct and indirect social relations, we can derive the influence range of each node and the influence strengths of it on the nodes within its influence range. We construct the influence sphere of each node by the set of nodes having the influence on it (including the direct and indirect). Formally, denotes the influence sphere of node , where is the friends set of ,
[TABLE]
and , , is a set of influence strengths between and its friends,
[TABLE]
Since each influence sphere contains all possible influence members of a node, the task of quantifying the node centrality in the whole networks shifts to quantifying the influence of nodes in their influence spheres. Hence, this paper proposes a Social-Relation based Centrality (SoReC) measure to quantify the centrality of users on the basis of Entropy theory.
The entropy notion is introduced in thermodynamics and has been widely used in information science and statistical physics to describe the probability distribution of a given system. In this paper, we employ entropy to evaluate the distribution of influence strengths in the influence spheres. Consider a influence sphere of , . The influence probability of node on node is expressed as
[TABLE]
Thus, the influence entropy of node is defined as
[TABLE]
Based on Entropy theory, the node with wider influence range and uniform influence probability has higher influence entropy. However, the influence entropy slights the weights of social relations, which reflect the actual influence among nodes. Thus we add the weight information into centrality measure and update the measure as
[TABLE]
IV Simulation
Next, we employ the Susceptible-Infected-Recovered (SIR) model [17] to simulate the spreading process on MSNs and carry the simulations on the real-world mobility networks.
IV-A Experimental Settings
IV-A1 Dataset
The simulations in this work are based on two widely used real-world datasets: (i) MIT Reality Mining Data (Reality) [14], and (ii) UCSD Wireless Topology Discovery Trace (WTD) [18]. In Reality, Bluetooth data are recorded by 97 smartphones deployed on students and staff at MIT over 246 days. In WTD, WiFi data are recorded by 275 PDAs carried by freshmen students at UCSD over 11 weeks. The details about the datasets are illustrated in Table II. We extract the contact records from the partial data of the two datasets for our simulations. Each processed records includes the start and end time of each encounter and the IDs of the nodes in contact.
IV-A2 SIR Model
We use the SIR model to simulate spreading processes on networks and test the influence of every node. In SIR, every node is initialized to be the susceptible state, and they may convert to the infected state with probability when contacting an infected node. In addition, the infected nodes may recover over time, and recovered nodes will not be infected. During the simulation period, for a given initial infected node, the number of infected and recovered nodes (influence range), and the average time of infection (influence speed) are recorded and used as its actual influence ability.
IV-A3 Evaluation Metrics
Pearson correlation coefficient is used to test whether the influence range (or the influence speed) correlates with nodes centrality values under different situations, which can be expressed as
[TABLE]
where is the ranking list by different centrality measures and is the ranking list by the actual influence ability.
IV-B Experimental Results
Firstly, we evaluate the prediction ability of the proposed measure. In this part, the former portion of the dataset is used as the contact history data for the centrality quantification. The remaining portion is used as the test data for the actual influence ability test. In addition, we also evaluate the performance of traditional measures, including Betweenness, Closeness, [9], and PageRank [11], as baseline comparison.
Figs. 2a, 2b, 2d, 2e illustrate the correlation between the actual influence strength (range and speed) and the predicted value of node centrality under different networks. The performance of each measure varies in different networks. In Reality, the Closeness measure performs better than the Betweenness, but the opposite is true in WTD. But overall, the four measures can well predict the centrality of nodes in the future and the performance decreases with the increasing period between prediction and test. By contrast, the correlation coefficients of our measure are systematically the largest, which means that the SoReC measure can quantify the centrality of nodes more accurately in dynamic MSNs. In addition, we detail the ability of our metric in the centrality evaluation, shown in Fig. 2d, 2f. Since the effects turned out to be similar on influence range and speed, we only show the results on the influence range. In Fig. 2d, 2f, the X-axis is the rank of nodes and Y-axis is the average actual influence range of top-L nodes ranked by different centrality measures. Notice that the benchmark curve is based on the actual rank, i.e., the benchmark list is ranked by the actual influence range. We can observe that our curves are closest to benchmark curve under both datasets, and the advantage of our method is most marked in the head of the distributions. The results illustrate that our measure has an advantage in the centrality evaluation over other methods, especially in the identification of influential users.
V Conclusions
In this paper, we captured social relations to study links among users, and on this basis, proposed SoReC to identify influential users in MSNs. Through theoretical derivations and experimental verification, the SoReC measure we proposed is proved to able to accurately quantify the centrality of nodes in MSNs. In addition, the SoReC measure performs better than traditional measures in terms of centrality prediction. Despite the promising results, our model still requires a knowledge of the global network topology. In the future, we will attempt to identify the influential users in a distributed fashion, or by relying on a mix of global and local information [16]. We will then leverage our framework to redesign proofs-of-concept of some popular services and applications in MSNs.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Vastardis and K. Yang, “Mobile social networks: Architectures, social properties, and key research challenges,” IEEE Communications Surveys & Tutorials , vol. 15, no. 3, pp. 1355–1371, 2013.
- 2[2] Z. Gao, Y. Shi, S. Chen, and Q. Li, “Exploiting social relationship for opportunistic routing in mobile social networks,” IEICE Transactions on Communications , vol. 98, no. 10, pp. 2040–2048, 2015.
- 3[3] X. Wang, Y. Zhang, V. C. M. Leung, N. Guizani, and T. Jiang, “D 2d big data: Content deliveries over wireless device-to-device sharing in large-scale mobile networks,” IEEE Wireless Communications , vol. 25, no. 1, pp. 32–38, 2018.
- 4[4] W. K. Chai, D. He, I. Psaras, and G. Pavlou, “Cache “less for more” in information-centric networks (extended version) ☆ ☆☆,” Computer Communications , vol. 36, no. 7, pp. 758–770, 2013.
- 5[5] M. V. Barbera, A. C. Viana, M. D. D. Amorim, and J. Stefa, “Data offloading in social mobile networks through vip delegation,” Ad Hoc Networks , vol. 19, pp. 92–110, 2014.
- 6[6] H. Kim and R. Anderson, “Temporal node centrality in complex networks,” Phys.rev.e , vol. 85, no. 2 Pt 2, p. 026107, 2012.
- 7[7] F. Li and J. Wu, “Localcom: A community-based epidemic forwarding scheme in disruption-tolerant networks,” in IEEE Communications Society Conference on Sensor, Mesh and Ad Hoc Communications and Networks , 2009, pp. 574–582.
- 8[8] H. Zhou, V. Leung, C. Zhu, and S. Xu, “Predicting temporal social contact patterns for data forwarding in opportunistic mobile networks,” IEEE Transactions on Vehicular Technology , no. 99, p. 1, 2017.
