Towards more effective consumer steering via network analysis

Jacopo Arpetti; Antonio Iovanella

arXiv:1903.11469·cs.SI·December 3, 2019

Towards more effective consumer steering via network analysis

Jacopo Arpetti, Antonio Iovanella

PDF

TL;DR

This paper introduces a network analysis approach to improve consumer steering by measuring the information available within user networks, demonstrating how platform performance can be enhanced through connected profile data.

Contribution

It proposes the Network Information Patrimony measure to quantify the information in user networks and shows its effectiveness in enhancing platform steering capabilities.

Findings

01

Networks with richer connections improve information estimation.

02

Connected profiles provide a competitive advantage in consumer targeting.

03

Empirical evidence from Amazon data supports the measure's relevance.

Abstract

Increased data gathering capacity, together with the spread of data analytics techniques, has prompted an unprecedented concentration of information related to the individuals' preferences in the hands of a few gatekeepers. In the present paper, we show how platforms' performances still appear astonishing in relation to some unexplored data and networks properties, capable to enhance the platforms' capacity to implement steering practices by means of an increased ability to estimate individuals' preferences. To this end, we rely on network science whose analytical tools allow data representations capable of highlighting relationships between subjects and/or items, extracting a great amount of information. We therefore propose a measure called Network Information Patrimony, considering the amount of information available within the system and we look into how platforms could exploit data…

Tables1

Table 1. Table 1: Statistics for the Amazon instances

Network	$n$	$m$	$δ$	$⟨ d ⟩$	$⟨ d^{2} ⟩$	$σ^{2}$	$r$	$N I P_{N}$
amazon0302	262,111	1,234,877	$1.79 \cdot 10^{- 5}$	9.422	123.823	35.03	0.003	14.141
amazon0312	400,727	3,200,440	$1.99 \cdot 10^{- 5}$	15.973	505.859	250.71	-0.044	32.670
amazon0505	410,236	3,356,824	$1.99 \cdot 10^{- 5}$	16.365	533.438	265.61	-0.043	33.595
amazon0601	403,394	3,387,388	$2.08 \cdot 10^{- 5}$	16.794	542.731	260.68	-0.043	33.317

Equations20

k_{nn} = \frac{⟨ d ^{2} ⟩}{⟨ d ⟩}

k_{nn} = \frac{⟨ d ^{2} ⟩}{⟨ d ⟩}

k_{nn, i} = \frac{1}{d _{i}} j \sum d_{j} a_{ij}

k_{nn, i} = \frac{1}{d _{i}} j \sum d_{j} a_{ij}

k_{nn} (k) = \frac{\sum _{i : d_{i} = d} k _{nn, i}}{\sum _{i : d_{i} = d} 1}

k_{nn} (k) = \frac{\sum _{i : d_{i} = d} k _{nn, i}}{\sum _{i : d_{i} = d} 1}

I P_{i} = \frac{d _{i}}{2 m}

I P_{i} = \frac{d _{i}}{2 m}

N I P_{i} = I P_{i} + j \in N (i) \sum I P_{j} = \frac{d _{i}}{2 m} + j \in N (i) \sum I P_{j} = \frac{d _{i}}{2 m} + \frac{1}{2 m} j \in N (i) \sum d_{j}

N I P_{i} = I P_{i} + j \in N (i) \sum I P_{j} = \frac{d _{i}}{2 m} + j \in N (i) \sum I P_{j} = \frac{d _{i}}{2 m} + \frac{1}{2 m} j \in N (i) \sum d_{j}

N I P_{i} ≃ \frac{d _{i}}{2 m} + \frac{d _{i}}{2 m} k_{nn} = \frac{d _{i}}{2 m} (1 + k_{nn})

N I P_{i} ≃ \frac{d _{i}}{2 m} + \frac{d _{i}}{2 m} k_{nn} = \frac{d _{i}}{2 m} (1 + k_{nn})

NIP_{i}=\frac{d_{i}}{2m}\bigg{(}1+\frac{\langle d^{2}\rangle}{\langle d\rangle}\bigg{)}

NIP_{i}=\frac{d_{i}}{2m}\bigg{(}1+\frac{\langle d^{2}\rangle}{\langle d\rangle}\bigg{)}

NIP_{N}=\bigg{(}1+\frac{\langle d^{2}\rangle}{\langle d\rangle}\bigg{)}

NIP_{N}=\bigg{(}1+\frac{\langle d^{2}\rangle}{\langle d\rangle}\bigg{)}

N I P_{i} ≃ \frac{d _{i}}{2 m} + \frac{d _{i}}{2 m} k_{nn} = \frac{d _{i}}{2 m} (1 + k_{nn, i})

N I P_{i} ≃ \frac{d _{i}}{2 m} + \frac{d _{i}}{2 m} k_{nn} = \frac{d _{i}}{2 m} (1 + k_{nn, i})

NIP_{i}(d)\simeq\frac{d_{i}}{2m}+\frac{d_{i}}{2m}k_{nn}(d_{i})=\frac{d_{i}}{2m}\bigg{(}1+k_{nn}(d_{i})\bigg{)}

NIP_{i}(d)\simeq\frac{d_{i}}{2m}+\frac{d_{i}}{2m}k_{nn}(d_{i})=\frac{d_{i}}{2m}\bigg{(}1+k_{nn}(d_{i})\bigg{)}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

∎

11institutetext: Jacopo Arpetti, corresponding author (http://orcid.org/0000-0002-3448-1055),

Antonio Iovanella, (http://orcid.org/0000-0001-8147-3747) 22institutetext: Department of Enterprise Engineering, University of Rome Tor Vergata,

Via del Politecnico, 1 - 00133, Rome, Italy

Tel.: +39-06-72597788

22email: (jacopo.arpetti, antonio.iovanella)@uniroma2.it

Towards more effective consumer steering via network analysis

Jacopo Arpetti

Antonio Iovanella

(Received: date / Accepted: date)

Abstract

Increased data gathering capacity, together with the spread of data analytics techniques, has prompterd an unprecedented concentration of information related to the individuals’ preferences in the hands of a few gatekeepers. In the present paper, we show how platforms’ performances still appear astonishing in relation to some unexplored data and networks properties, capable to enhance the platforms’ capacity to implement steering practices by means of an increased ability to estimate individuals’ preferences. To this end, we rely on network science whose analytical tools allow data representations capable of highlighting relationships between subjects and/or items, extracting a great amount of information. We therefore propose a measure called Network Information Patrimony, considering the amount of information available within the system and we look into how platforms could exploit data stemming from connected profiles within a network, with a view to obtaining competitive advantages. Our measure takes into account the quality of the connections among nodes as the one of a hypothetical user in relation to its neighbourhood, detecting how users with a good neighbourhood – hence of a superior connections set – obtain better information. We tested our measures on Amazons’ instances, obtaining evidence which confirm the relevance of information extracted from nodes’ neighbourhood in order to steer targeted users.

Keywords:

Data value Network-driven economy Steering Networks theory Nearest neighbour degree

pacs:

D83 D85 L11

1 Introduction

Among the revolutions featuring the last century, the most important one, which marked every economic sector, impacting on every aspect of our daily life, is the Web revolution. Nowadays, starting with its 2.0 version, the Web has made individuals no longer mere users of information available on the internet, but rather – with the advent of internet platforms and mobile apps offering access to a wide array of services such as search engines, maps, and music or video on demand (Council of Economic Advisers - CEA, 2015) – true data forgers, often of a significantly personal nature (Jentzsch, 2017). In this context, data sharing increases to a global reach (Acquisti et al., 2016), blurring the distinction between the digital and the physical world, between online and offline.

Thanks to the increased data gathering capacity and the spreading deployment of data analytics techniques, a huge amount of information related to individuals’ preferences – of which no trace was previously left in the real world (Pagallo, 2014) – is today in the hands of a few gatekeepers. This allows them to potentially influence and steer individuals’ purchase choices, as well as to charge – in case they are e-commerce platforms – different prices to different customers (pp. 445, Acquisti et al., 2016; Council of Economic Advisers - CEA, 2015; Kshetri, 2014), who are clustered via different techniques.

As a matter of fact, the more data platforms manage to gather, the more effective their attempt will be to shift from a third price discrimination degree (different prices are charged to different socio-demographic groups) to a perfect discrimination, that is the condition in which it is possible to set different prices for different individuals or to nudge the same consumers towards the purchase of a certain good o service based on their price sensitivity (Shiller, 2014; Krämer and Kalka, 2017; Regner and Riener, 2017). It happen indeed to be addressed by product recommendations based on one’s past online purchases, to be targeted by real-time advertisements reflecting past browsing behaviour, or to be the recipient of tailored search results stemming from individual queries (Levin, 2011). As a matter of fact, platforms collect data, the more they can move towards perfect discrimination. The proof of the progressively enhanced platforms’ capacity to implement discriminatory practices lies with their increased ability to estimate individuals’ preferences (Lü et al., 2012; Lu et al., 2015).

The aim of the present paper is to look into whether, by using network representations and exploiting network properties, platforms could use data acquired from customers to design even more efficient price discrimination practices, or to better nudge and steer consumers towards their purchases. It appears indeed that, by letting information flow throughout the entire network, and thus taking into account the preferences of nodes which are closer to the targeted one (i.e. its alters), it would be possible for platforms to capture a wider amount of information compared to a situation where individuals are divided into clusters as it currently generally happens. As a matter of fact, considering the whole network and not just its subsets (thought clustering techniques, thereby items and/or subjects are divided into clusters, being considered as a portion of the entire universe) it is reasonable to think that the correspondingly greater amount of information, collected in the entire universe, could lead to better profiling results.

The theoretical proposal offered in the present paper is mainly based on the concept of network value111In the light of the existing literature, the value of a network can be inferred from to Metcalfe’s Law (Gilder, 1993) which defines the network value as proportional to the square of its size. Some other laws have been proposed: Sarnoff’s and the Reed’s laws (Reed, 1999) and the Odlyzko’s one (Briscoe et al., 2006). However, despite its simplicity and some of its limits (Swann, 2002; Briscoe et al., 2006), Metcalfe’s law remains a reliable tool (Madureira et al., 2013; Van Hove, 2016), which has been used, for example, in order to figure out Facebook’s network value (Metcalfe, 2013; Zhang et al., 2015)., which has been used in order to estimate the information available within the system, as well as the way how platforms exploit data stemming from connected profiles in a network.

The stemming of information availability from the network connection patterns, is taken into consideration starting from the observation that a single node is not only affected by its alters (i.e. its neighbours) but also by the alters of its alters (Cerqueti et al., 2018a). We therefore make use of some related network measures, and in so doing we provide several brand-new measures, shifting from the concept of value to the concept of information patrimony, both at the node and network levels.

The aim of the present paper is not to provide a methodology aimed to set different prices for different users, but rather to shed some light on how network systems could potentially provide a greater amount of information than expected.

The paper is organised as follows: Section 2 offers an overview of the evolution of profiling techniques. Section 3 provides a review of network properties and their impact on profiling techniques. Section 4 is devoted to the outline of certain relevant notations about network theory. Section 5 introduces and discusses our proposal and the related interpretation. Section 6 shows some explanatory simulations. The final section offers some conclusive remarks and proposes directions for future research.

2 The evolution of clustering-based profiling techniques and related limitations

Since the mid-90s e-commerce platforms begun exploiting input provided by customers in order to generate a list of items to be recommended to them (Kamishima and Akaho, 2011). A technical way through which platforms improve their own algorithms in order to identify individual preferences is represented by Collaborative Filtering techniques ( $CF$ )222A collaborative filtering algorithm implements a recommender system (Lü et al., 2012) in order to offer goods or services to a user, starting from the analysis of his/her behaviours (and those of similar users), ending up with advising what the same consumer might find useful, on the basis of the expressed preferences (Resnick and Varian, 1997; Sarwar et al., 2001; Breese et al., 2013), of the choices made by similar consumers, which allows to cluster them (Xue et al., 2005). Relying on similar form of users’ profiling techniques (such as Collaborative Filtering methods) may allow indeed an efficient matching of people and relevant purchase opportunities, (Levin, 2011), although it could also bring about some distortions and disequilibria, up to market failures (Gertz, 2002). For a different take on the functioning of digital markets, see Fuller (2019).. The aim of platforms is indeed to determine the preferences of individuals not only on the basis of data produced by some targeted ones, but also by using information derived by users related to him by the same purchase choices and preferences. In this regard, already back in 2010, in a special issue of The Economist, reference was made to the fact that Amazon and Netflix were using a statistical technique called collaborative filtering in order to «make recommendations to users based on what other users like» (The Economist, 2010). Algorithms aggregate items from similar customers, then eliminate items that a user has already purchased, recommending the remaining items to the same subject333Amazon’s profiling techniques – their so called item-to-item collaborative filtering – focus is though not on grouping similar customers on the basis of their individually analysed behaviours, but rather on clustering them via the correlation of similarities between the items that they have chosen: on the basis of a user’s purchased and/or rated items, the algorithm attempts to find out similarities with items not chosen yet, then it aggregate them in order to come up with purchase recommendations (Linden et al., 2003)..

By profiling consumers through their digital footprints444Collaborative filtering offers consumption prediction based on the customers’ purchase history (Linden et al., 2003)., firms can predict individuals’ demand as well as their price point sensitivity, ultimately altering the balance of power in their price and value negotiations (Gertz, 2002). Furthermore, sorting out the recommendation puzzle – deciding what to suggest to whom – can allow platforms to implement incisive forms of price discrimination555In this respect, it is worth recalling that the economic literature identifies three types of price discrimination: first-degree price discrimination, which occurs when a seller is a position to charge different prices for each buyer (personalised pricing); second-degree price discrimination, occurring when a consumer self-selects himself by choosing a specific package – whose per-unit price is dependent on the amount purchased – that best fits his needs (think about the Netflix package that each individual is free to choose according to his specific needs); third-degree price discrimination arising when sellers propose different prices to different socio-demographic groups (Cabral, 2000; Acquisti, 2008; Arpetti, 2018)..

One of the most effective ways to find out which item should be offered in relation to the user’s needs is grounded on algorithms trained to group or classify together similar objects or similar individuals via collaborative filtering. The goal of algorithms used by platforms, is essentially to lead the decisions made by users, suggesting them a certain good instead of another666Such goods could be perfect substitutes and an algorithm could decide to show only a part of them, proposing to the consumer the only goods that reflects the Willingness To Pay (WTP) identified for that profiled user. in relation to their price sensitivity.

Clustering and classification algorithms (such as those used by CF, like $k$ -Means Clustering and $k$ -Nearest Neighbours algorithm, $k$ -NN777The two most well-known classification and clustering algorithms – used by CF – are, respectively, $k$ -nearest neighbour algorithm ( $k$ -nn) (not to be confused with the “Nearest Neighbour Degree”, $k_{nn}$ , which origins from the network science field and which we will use later in this paper) and $k$ -mean clustering (Sarwar et al., 2001; Mobasher et al., 2001). While the $k$ -nn algorithm proposes a classification measuring the distance between similar items or users in order to suggest which of them are closer (in terms of the already purchased item or in terms of users brought together by similar purchase histories) from the targeted user, $k$ -mean algorithms are designed to cluster items (and so individuals on the basis of their choices and purchase histories when they are “user-item”), suggesting to the targeted subject those items surrounding those already purchased by him/her, thus defining the center of the temporary cluster and suggesting the closest nodes (Paterek, 2007; Katarya and Verma, 2016).) show though several limitations. As a matter of fact, even if their computational complexity is known to be polynomial in the size of the number of points, such algorithms suffer from major unsolved problems (Xu and Wunsch, 2005; Firdaus and Uddin, 2015). These latter are mainly related to sparsity of data, high redundancy, inherent noise, sensitivity to the outliers, clusters dimensional heterogeneity; furthermore, there is no metric to assess the quality of the results (Sarwar et al., 2001; Nguyen et al., 2007). In addition to the aforementioned scalabilty and spartisy, recommendation system suffer from the so-called “cold-start" problem (Nguyen et al., 2007; Castillejo et al., 2012; Konstas et al., 2009; Lam and Goeksel, 2010).

Although some authors have tried to sort out such matters via combining data form social network websites (ie information gathered via their explicit social network) and recommendation systems (Konstas et al., 2009; Liu and Lee, 2010), the aforementioned problems still remain unsolved (Castillejo et al., 2012).

3 Network properties applied to profiling tecniques

As shown by Google search engine, networks’ properties might be crucial in resolving recommendation systems limitations (Page et al., 1999; Castillejo et al., 2012). Indeed, platforms may rely on an alternative data representation – that is, the network representation of the relationships between nodes representing items offered by the platforms and/or users – based on the network science (NS) paradigm (Barabási, 2013). As matter of fact, a formal network representation (i.e. a mathematical representation) allows to organise users or objects as connected via links that make explicit predefined relationships. It is therefore possible to use network science tools and, in particular, those inherited from Social Network Analysis888Is it worth to specify that the term social network has a double meaning: it may refer both to a social structure where actors interact networked by means of a web platform and to the scientific discipline of measuring actor interaction on a network as a mathematical object. (e.g. Scott and Carrington, 2011) to build a scheme describing preferences’ patterns featuring connections between items and/or users. According to the literature, such representation has become the preferential tool used to study complex systems (Wang and Chen, 2003).

The use of Network Science may improve the quality of recommendations made to consumers via the combination of metadata stemming from the users’ activities – hence referred to the targeted node – with those gathered from the nodes’ neighbourhood and by exploit via networks’ properties (see paragraph 4).

In this regard, we consider that an underlying network expressing the item-user relationship is always potentially available to platforms, since these latter can rely on two types of networks: an explicit one (used by Facebook and any other social networks whereby network are built and inter-connected) and an implicit one, where the established connections among users are build on their interests or, in general, on their online behaviour (Zhou et al., 2014).

Thanks to network science it is indeed possible to understand the relations between items999In order, of course, to suggest them to the platform’s users., not by measuring the distance ( $k$ -NN) between individuals or goods and/or services within a cluster, but by considering the totality of connections between nodes (which may represent both items and users) in the entire network (e.g. see Birke (2013) and Lu et al. (2015) and references therein). The user-user CF method, for example, implies that it is possible profile individuals by inferring the data they ceded so to put them in a group of subjects with similar characteristics, in order to proceed to $k$ -NN measurements and find out what the nearest neighbours are. In the case of CF, clusters are created in order to reflect the platform’s needs on the background of the users’ (or items’) characteristics: applied to consumption forecasts, both behaviour predictions and item suggestions are “bent” by the use of Big Data, which allow the platform to modify the structure of the network through a continuous remodelling of the relevant clusters. Conversely, NS techniques allow the network structure to remain rigid, while data flow in a unique cluster (the entire network) and information stems from the measurement of connections in an immutable structure.

Network analysis and its properties – along with the data analysis made by any other recommendations system, although with a different computational effort – can be used by platforms in order to manage some of the factors influencing the purchase decisions by individuals plunged in the internet context101010i.e. behavioural biases to which consumers are generally subject., as well as a range of other elements that come into play as regards such choices. In such sense, due to the high level of information asymmetry (Akerlof, 1970; Tsai et al., 2011; Mavlanova et al., 2012) associated to the deep consumers’ unawareness of how digital footprints remain on the web111111i.e. purchase and internet history, GPS data stemmed from mobile devices, etc., networks properties can be used to perform profiling and grouping practices, to the point that a seller could decide to offer a certain item instead of another, hiding the existence of perfect substitute goods, nudging individuals to the purchase of a more expensive but equivalent item on the basis of a connections analysis (hence an analysis of the connections between the goods purchased by the individual and those connected to them) aimed at estimating the users’ price sensitivity (Mikians et al., 2012; Mattioli, 2012)121212In this regard, the UK Competition and Market Authority stated that: Firms could also seek to discriminate between customers using competitive variables other than price. […] The collection of consumer data may enable firms to make judgements about the lowest level of quality needed by consumers/groups of similar consumers. This may enable a firm to engage in quality discrimination where quality differences are not reflected in the prices of goods or services. Firms may do this by restricting the products that are displayed to consumers or by varying the order in which products are listed on their website to display relatively poorer or better quality products first depending on the information they collect about consumers (Competition and Markets Authority - CMA, 2015, pp. 93)..

When asked how e-commerce websites could implement price “personalisation” in relation to each consumer so to squeezing their surplus, Hannak et al. (2014, pp. 306) answered that two practices had been conceived by those players: «price discrimination – customising prices for some users – […] (and) price steering – changing the order of search results to highlight specific products».

Network science may ensure a more effective implementation of these practices because evaluations made by the platforms on what item to show or not, would be carried out in relation to the whole universe of items available to all users and not only to those addressed – at a given time – to a certain group of individuals. In this context it is therefore appropriate investigating how the use of NS could lead to an evolution of profiling practices grounded on the collection of habits and preferences, in the light the unprecedented capacity of online platforms to collect digital footprints left by individuals (Fuller, 2018; Acquisti, 2008), even considering constraints such as clustering limitation. In such sense, due to the presence of some constraints, it is improper to assert – contrary to what claimed by a wide part of the literature (Council of Economic Advisers - CEA, 2015) – that we are assisting to a shift from third-degree price discrimination to totally personalised prices just because of the increased availability of behavioural data. There are indeed at least three obstacles – in addition to the fact that platforms do not implement similar strategies due to reputation motivation as reported in Council of Economic Advisers - CEA (2015) – standing between perfect price discrimination and the forms of discrimination that platforms currently implement (Ezrachi and Stucke, 2016b, pp. 96-99): Lack of data131313A platform should be able to estimate each consumers’ reservation price, variable that is not directly observable but just inferred from. Moreover, not all platforms avail of the means to constantly observe every move made online by each consumer, while none of them can estimate their WTP for each item. As stated by Ezrachi and Stucke in “Virtual Competition”:One impediment to perfect discrimination is insufficient data. Although the algorithm has a lot more personal data that brick-and-mortar retailer of twenty years ago, the algorithm still has insufficient data for any particular customer: the customer may never have bought the item before; and the customer’s behavior may never have signaled how much or she is willing to spend to accurately predict an individual’s reservation price would require sufficient data to identify and measure each of many variables that affect the reservation price. (Ezrachi and Stucke, 2016b, pp. 96-97).; Irrationality141414In order to implement a perfect price discrimination practice, it would be necessary to predict how individuals are subjected to biases and heuristics, that can change their preferences and affect their reservation prices: an algorithm whose predictions are based on preexisting reservation prices cannot ignore how such elements could change (Arrow, 1958; Kahneman and Tversky, 1986; Simon, 1955, 1990).; Sufficient sample size151515An algorithm must have an adequate amount of data so that a perfect price discrimination practice can be performed. To this purpose, it is presumable that algorithms avail of the necessary data sources to infer information, hence to make predictions on individuals’ purchase preferences related to products routinely bought. However, as pointed out by Ezrachi and Stucke, the platforms do not have sufficient data about scattered purchases: when a purchase is not cyclically performed, it is more complex to elaborate forecasts on individual behaviour because of the absence of a sufficient amount of trial-and-errordata up to the point to detect every variable needed to figure up the reservation price of each individual for a corresponding good (think, for instance, about the rate at which individuals buy a PC screen (Ezrachi and Stucke, 2016b, pp. 99)).. In the light of such obstacles, platforms must put in place practical solutions so that preferences can be detected and reserve prices – which are not directly deducible – approximated (Shiller, 2015) (Ezrachi and Stucke, 2016b, pp. 96)161616In order for this to happen, segmentation strategies and individuals’ clustering practices are put in place to include each individual in a specific group of consumers who share similar preferences and price sensitivity, assuming that grouping persons with analogous tastes and purchase behaviours in small clusters allows a better approximation of their reservation prices (Ezrachi and Stucke, 2016a). Starting from the purchase behaviour patterns charactering each group to which individuals have been allocated to, platforms predict consumers’ desires and “their next moves”, prompting further consumption through the suggestion of specific products or services that have proven to be popular among the choices made by individuals. Such items, identified through the behaviours of consumers clustered in homogenous groups (pooled e.g. by preferences, behavioural patterns, purchase history and price reservations) are selected based on the corresponding utility levels express by individuals or, when items are bundled (or complementary), via the utility level related to one of them, influencing the utility of the other (Zhao et al., 2016).

What is then the role of NS in the light of the platforms’ profiling practices carried out in order to detect users’ preferences and to nudge them to consumption? Although the White House report on Big Data and differential pricing states that steering practices do not make use of information concerning potential buyers at the individual level, and this is the reason why those practices should not be considered any longer as prevailing171717Despite the fact that, the same report further explains how more complex it is to infer information about Internet users while having access to their data (such as the user’s IP address, or the kind of operating system) but without being able to define and their willingness to pay. (Council of Economic Advisers - CEA, 2015), that does not mean that steering practices181818To be understood as the possibility to decide what to show to individuals and guiding their choices in this sense. are not still used (see Mikians et al. (2012)).

Although the primary goal of the present contribution is not the exploration of how the increased observation of the individuals’ purchase choices and expressed preferences (hence the greater amount of data which platforms possess on them) could lead to more performing steering, nudging and price discrimination practices, we show how social network science has became crucial in predicting consumers consumption patterns in the light of networks features which can be better understood through some measures that we propose below.

4 Methodological prerequisites

The next sections are devoted to the conceptualisation of our proposal and we therefore provide further below some preliminary notations, for the convenience of the reader.

The classical mathematical abstraction of a network is a graph $G=(V,E)$ , where $V$ is the set of $n$ nodes (or vertices) and $E$ is the set of $m$ links (or edges) outlining the relationships among the nodes. For instance, taking into account the network of users in an online marketplace, nodes will be made up of individuals linked among them by the purchased items. Conversely, should, for the same online marketplace, be considered the network of items, nodes will represent goods or services linked together because bundled or, in an alternative setting, purchased by the same user. In this regard, Figure 1 provides a basic example of a users’ graph relating to a generic online marketplace with $6$ nodes corresponding to as many individuals, linked among them by $8$ identical purchase choices (namely the purchase of the same goods or services) represented by as many links. In such setting, information spans from node to node, within the network, via the links’ pattern (Bakshy et al., 2012; Galati et al., 2019).

It is worth mentioning that, according to the literature, both the word “network” and “graph” can be used interchangeably. While the first is generally associated to real systems, such as the WWW or networks of individuals, “graph” is mainly used to analyse the mathematical representation of a network (pag. 45, Barabási, 2016).

We refer to a node by an index $i$ , meaning that we allow a one-to-one correspondence between an index in $\{1,\ldots,N\}$ and a node in $V$ . The set $E$ can be conceptualised through the adjacency matrix $A=(a_{ij})_{i,j=1,\ldots,N}$ , whose generic element $a_{ij}$ is equivalent to $1$ – if the link between $i$ and $j$ exists – or, otherwise, to [math].

In this paper, we examine undirected and unweighted networks, thus $a_{ij}=a_{ji}$ , for each $i,j=1,\ldots,N$ . In other words, considering the previous graph in Figure 1, the index represents the identification number of a generic user, while the adjacency matrix is a table used to gather the relations between all purchase choices expressed by users in a given moment, identifying a formal relationships between them.

In graphs, a usual measure is given by the number of relationships owned by a user. In the case of Figure 1, user $1$ has three relationships, namely with users $2,3$ and $4$ . For a generic user $i$ , such relations are expressed as a measure called degree, with symbol $d_{i}$ , while the nodes linked to node $i$ are called the neighbour of $i$ (in symbols, $N(i)$ ). It is clear, that $d_{i}$ is an integer being zero when the user has no established relations.

All the degrees can be gathered in a vector, whose elements are ordered in a non-decreasing mode. This vector, called degree sequence with symbol $D_{G}$ , is – in our example – $D_{G}=\{4,3,3,2,2,2\}$ . On the basis of such sequence, it is possible to compute the traditional average value $\langle d\rangle$ , as well as the average of the squared values $\langle d^{2}\rangle$ , that are – in our example – respectively, $\langle d\rangle=2.67,$ and $\langle d^{2}\rangle=7.67$ .

In order to study networked systems properties, it is possible to exploit a method allowing the creation of a random network by using a given degree sequence. To this end, one of the most common and used tool is the configuration model (Newman, 2018). This latter, defined as a generalised random network model whit a given degree sequence, is build up by means of a simple algorithm (e.g. pag. 139, Barabási, 2016).

Configuration model can also be generated considering degree sequences stemming from a given distribution. Thanks to such feature, it is possible to generate specific networks taking into account, e.g, Poisson’s or Power-Law distributions, which are the widely used degree configurations in network science.

The purpose of the configuration model is that of leveraging some of its known properties in order to obtain general results suitable for building a new theoretical framework191919The aim of the present paper it is not to analyse the above mentioned algorithm (e.g. see Hakimi, 1962) neither to investigate the mathematical properties allowing the graph creation from a given sequence (Erdős and Gallai, 1960)..

Considering the configuration model properties, we therefore suppose that a platform may generate its own network (thus, the corresponding graph) starting from a database containing a given number of users (therefore, the length of the degree sequence) and their relative purchase choices (hence, the elements within the degree sequence).

In the light of such elements, we suppose that an enterprise, and in particular a platform, can always build up its own network considering consumers’ data as well as data related to items offered to them. In this regard, a network can be: (i) explicitly available, when the enterprise delivers multi-users products or services (e.g., Facebook, LinkedIn, Twitter and MySpace); (ii) not explicitly available, since the firm delivers single user products or services (e.g. Amazon, Last.FM, Outbrain and Color)202020In such case the network can be generated exploiting analytical methodologies (Zhou et al., 2014)..

Among the properties offered by the configuration model, we used – for our purposes – that of the average degree of a neighbour, represented with symbol $k_{nn}$ . For a generic node of the network, it holds that (Catanzaro et al., 2005):

[TABLE]

The average degree of a neighbour is widely used to study dependencies between neighbours nodes’ degrees in a network, affirming that, considering a node $i$ , $k_{nn}$ is independent from the node’s degree but exclusively dependent on the global network’s characteristics $\langle d\rangle$ and $\langle d^{2}\rangle$ . Besides being easily computable, such measures provide an overall view of the network, making $k_{nn}$ a simple and powerful global measure of it.

Generally, $k_{nn}$ turns out to be greater than the average degree of a node. As a matter fact, considering ${\langle d^{2}\rangle}/{\langle d\rangle}-{\langle d\rangle}=({\langle d^{2}\rangle}-{\langle d\rangle}^{2})/{\langle d\rangle}=\sigma^{2}/{\langle d\rangle}>0$ , since: (i) the variance is non-negative unless the network shows the same degree for all its nodes; (ii) $\langle d\rangle$ is greater than zero unless all the nodes have zero degree. This holds that ${\langle d^{2}\rangle}/{\langle d\rangle}>{\langle d\rangle}$ , showing a result known as the friendship paradox, according to which «your friends have more friends than you do» (Feld, 1991). Such bias is due to an over-representation of high-degree nodes in comparison with low-degree ones during the calculation.

Please note that Formula (1) refers to the general case of uncorrelated degree sequences (Barabási, 2016; Newman, 2018). Networks generally show a degree-degree correlation which can display a positive degree correlation in the case of social networks or a negative one, for instance, in the case of networks related to technological and innovation research projects. In such context, the average degree of a neighbour of node $i$ is computed as (Pastor-Satorras et al., 2001):

[TABLE]

A further measure can be expressed considering sets of nodes for identical degree. In this case, the average degree of a neighbour for nodes with degree equal to $k$ , is defined as:

[TABLE]

Considering the example shown in Figure 1, Equation 1 brings to $k_{nn}=2.9$ , Equation 2 brings to $k_{nn,1}=3.0$ , $k_{nn,2}=3.0$ , $k_{nn,3}=3.5$ , $k_{nn,4}=2.5$ , $k_{nn,5}=2.5$ and $k_{nn,6}=3.0$ , while Equation 3 to $k_{nn}(2)=3.0$ for the set of nodes of degree 2, $k_{nn}(3)=3.0$ for the set of nodes of degree 3 and $k_{nn}(4)=2.5$ for the set of nodes of degree 4.

$k_{nn}(k)$ nature highlights two distinctive structural network properties with reference to the presence of degree correlation. If $k_{nn}(k)$ is an increasing function of $k$ , such setting identifies assortative mixing, showing that high-degree nodes are preferentially connected to high-degree nodes, while the low ones to low ones. On the other hand, a decreasing trend of $k_{nn}(k)$ identifies disassortative mixing showing nodes with high connections having neighbours with lower ones.

5 Neighbours’ Information patrimony

In this section, we introduce our definition of Neighbours’ Information Patrimony ( $NIP$ ).

To this purpose, we start from the concept of “network value” which has been mostly defined on the basis of Metcalfe’s law. Comparing the weight of a single node with the entire network, our definition allows the estimation of such node’s “network share” by using information related to those nodes to which it is connected. Such approach intends to build on the fact that, generally, each node possesses data (also called metadata, e.g. see Peel et al., 2017) through which the enterprise can infer collective information about clients (e.g. geographical localisations, preferences, gender, etc.) to be used, for instance, to implement price discrimination practices.

In this regard, we propose to consider the network’s neighbourhood of a node as a proxy of the similarities among clients, as network clustering techniques do (Cerqueti et al., 2018a). In other words, links between nodes attest a certain level of preferences and/or similar tastes, thus making it possible to extend the chain of relationship from the node’s neighbours to the “neighbours of its neighbours” (the “alters of the alters”), shifting the network share of a node from its singularity to its linked environment and, then, to the whole collective patterns of interactions.

Our model builds on the assumption that, if an enterprise intends to gather information about users with similar preferences, it may avoid burdensome techniques (such as cluster analysis or density algorithm), thanks to network properties. In the light of such properties, also knowing a single node-related information, it is possible to acquire data on the neighbour nodes, hence about users with similar characteristics.

We start from the Metcalfe’s law which affirms that the value $\mathcal{V}$ of a network $G$ is proportional to the square of the nodes’ number, i.e. $\mathcal{V}\propto n^{2}$ (Gilder, 1993). Stemming from the ethernet network context, the Metcalfe’s law definition, considers all nodes as mutually linked212121Therefore, $n$ nodes are so connected to other $n-1$ nodes, thus the value is proportional to $n^{2}$ . In general terms, networks are not densely connected, hence estimating the value of a network as $n(n-1)$ seems to be unrealistic.

In our setting, considering how available information to the enterprise (or the platform) is given by the amounts of connections set present in a network, it is reasonable to think that all users are not mutually connected, nor are items, or users’ preferences in terms of goods or services. By means of the well known “handshaking lemma” (e.g. Bollobás, 2013, p. 4), we obtain that the sum of all network’s degrees is equal to twice the numbers of links, i.e. $\sum_{i\in V}d_{i}=2m$ . With this setting, we can assume that the object shared in the network is the information patrimony $IP$ , thus gathering all the comments so far we obtain the value of $IP$ for the generic node $i$ as:

[TABLE]

It is worth noting that we are reducing the market extension to its real dimension and in case of a complete network $m=n(n-1)/2$ , thus the denominator of Equation (4) becomes again proportional to $n^{2}$ , as postulated by the original definition of Metcalfe’s law.

Summarising, we are tailoring Metcalfe Law’s with a view to estimating the current value of a network, such as $\mathcal{V}\propto 2m$ and the value $\mathcal{V}\propto n^{2}$ remain still valid for a complete network.

5.1 Uncorrelated networks

It is now possibile to introduce the collective effects of alters and we therefore define the Neighbours’ Information Patrimony $NIP_{i}$ of node $i$ as the sum of its information patrimony with the information patrimonies of all its alters, i.e. the nodes $j$ in its neighbourhood $N(i)$ . In formula:

[TABLE]

In Equation (5) we approximate the information patrimony of $N(i)$ as $\sum_{j\in N(i)}d_{j}\simeq d_{i}\cdot k_{nn}$ , that is $d_{i}$ time the average degree of a neighbour:

[TABLE]

Finally, using Equation (1) for $k_{nn}$ we obtain:

[TABLE]

In Equation (7) we state that the neighbour information patrimony of a node $i$ is given by the value of its information patrimony, multiplied for a value which captures the level of connectedness given as an overall measure of the whole network. This represents a peculiar measure concerning the entire network $N$ , defining what the information is worth within it. We call such measure $NIP_{N}$ :

[TABLE]

Since ${\langle d^{2}\rangle}/{\langle d\rangle}>0$ and, excluding network with all nodes with the same degree, we have that $NIP_{N}>1$ .

Please also note that in a complete network – where all nodes are connected to one another – the number of links equals $n(n-1)/2$ , and for every node we obtain that $IP_{i}=1/n$ , while $NIP_{i}=1$ . Such results mean that if each node of a network is mutually connected to the others, then it possesses the whole network information patrimony222222In fact, in this case each node has degree $d_{i}=n-1$ , then $IP_{i}=d_{i}/2m=(n-1)/n(n-1)=1/n$ , while $\langle d^{2}\rangle=(n-1)^{2}$ and $\langle d\rangle=n-1$ thus $NIP_{i}=(1/n)(1+(n-1)^{2}/(n-1))=1$ ..

5.2 Correlated networks

As a general case, taking into consideration equation (5) we approximate the network sharing of $N(i)$ as $\sum_{j\in N(i)}d_{j}\simeq d_{i}\cdot k_{nn,i}$ , that is equal to $d_{i}$ time the average degree of a neighbour of node $i$ . Thus, the Network Information Patrimony is:

[TABLE]

When we move to sets of nodes for identical degree, we have to consider Equation (3) instead of Equation (1) and Equation (5) which becomes:

[TABLE]

5.3 Discussion and implication of the network information patrimony

In the previous section we introduced three new measures for a given network; one referred to uncorrelated ones (Eq. 7) and two to correlated networks (Eq. 9 and Eq. 10). In both cases, we started from the concept of the market share of a node $i$ , taking into consideration the average degree of neighbours as a proxy capable to capture the meso-scale connectiveness effects.

Considering a network, the degree correlation is a phenomenon capable to detect the tendency between nodes with similar degree to associate with each other. In a social networks environment, such tendency is called homophily or assortative mixing (Newman, 2018). A network exhibits assortative mixing if nodes with a high number of connections tend to link with large degree nodes; on the contrary, we are in presence of disassortative mixing when nodes with high connections have neighbours with lower ones. This latter is a frequent feature in many technological networks (Newman, 2002; D’Agostino et al., 2012).

Network assortativity is computed through the Pearson correlation coefficient $r$ between two nodes connected by a link with positive $r$ with a value in the range $0\leq r\leq 1$ ; while in case of network disassortativity, $r$ is negative with a value between $−1\leq r<0$ (Newman, 2003).

When a network is not correlated232323For instance, when neutral with respect to degree correlation, it does not present assortative mixing ( $r=0$ ), the value of $IP_{N}$ is just a multiplier for the market share of a node $i$ , which is identical for each node. Although such element constitutes a unique measure of a network, the fact that it highlights the connectiveness’ degree is of little interest, since we are considering the centrality degree242424The centralisation degree of a node is merely its degree, i.e. the number of links connected to it. In Social Network Analysis literature it is emphasised as a measure since is meaningful in many circumstances. For an extensive discussion on this measure and many other centrality measures please refer to Scott and Carrington (2011) or Newman (2018). where each value is multiplied by a constant.

As soon as a network exhibits a correlation degree, both $NIP_{i}$ and $NIP_{i}(d)$ become meaningful measures. While the former captures the relationship between the degree of node $i$ and the average degree of its neighbours ( $NIP_{i}$ is therefore a measure which magnifies – or shrinks – the information patrimony of a node $i$ on the basis of its neighbours’ quality), the latter considers the average of all nodes’ neighbours degree $d$ (it therefore measures the network effect of those nodes with a degree $d$ ). $NIP_{i}$ should be applied in order to detect network’s effect of node $i$ , while $NIP_{i}(d)$ would capture the aggregated network effect of the $d$ degree nodes.

Since $NIP_{i}(d)$ is the average value of all $NIP_{i}$ of $d$ degree nodes, it could be of interest to explore the scattering around such average value. In such sense, a node with a $NIP_{i}$ below the average suggests that its neighbours’ "value" (to be considered as the amount of information that can be mined from its neighborhood) has a limited effect on its network effect. On the contrary, if $NIP_{i}$ is above the average value, it means that the considered node benefits from the "value" of the neighbours surrounding it. It is indeed admissible, as we will show in the next section, that a node $i$ with degree $d$ can assume values of $NIP_{i}$ greater than any nodes of degree $d+1$ (or, under particular circumstances, even greater).

Such observations lead us to conclude that $NIP_{i}$ and $NIP_{i}(d)$ are second-order measures (Cerqueti et al., 2018b; Rotundo and D’Arcangelis, 2014), being strongly affected by the presence of peculiar arrangements in the neighbourhood. This represents the main difference with the classical degree centrality measure, that is a first-order measure because it is independent of the network topology.

Finally, note that the sum of all the market share $IP_{i}$ trivially sum to $1$ , while the sum of $NIP_{i}$ or $NIP_{i}(d)$ brings to a sum greater than $1$ since we are considering the neighbourhood effects on each node.

6 Examples

We herein propose two brief examples. While the first one is based on a synthetic network, the second one considers some real instances based on the Customers Who Bought This Item Also Bought feature of the Amazon website (Leskovec et al., 2007).

Let us consider the network as in Figure 2, which is composed of three groups of nodes (e.g. users or products) linked in order to simulate three different users’ agglomeration (e.g. a communities), in turn mutually connected among them through links. With $n=18$ and $m=37$ , we can easily compute the network information patrimony of nodes by means of Equations 9 and 10, as shown in the right panel of Figure 2252525For the sake of completeness, the computational complexity of our measures depend from the calculation of $k_{nn}$ , which correspond to a product of a square matrix of size $n$ for a vector of size $n$ . It is well known that this calculation is polynomial in the size of the matrix, i.e., it is an $O(n^{2})$ ..

One of the interesting results deducible from the observation of the blue curve in Figure 2, which is the $NIP_{i}(d)$ , is that: for a fixed degree, the scattering of the different values of $NIP_{i}$ highlights nodes under-performing with respect to $NIP_{i}(d)$ ; while other over-perform it (e.g., for degree 3, node R is close to $NIP_{i}(3)$ , while node Q is under-performing and H is over-performing). Note also that node H over-performs node O which has degree 4, meaning that the network patrimony derived from the node H neighbourhood has a greater "value" compared to the network patrimony of node O, and this beyond its degree.

This is a general phenomenon which brings us to formulate the following hypothesis: since nodes with higher network patrimony are connected with nodes with high degree, they benefit from a greater amount of information. In such sense, nodes with $NIP_{i}$ higher than expected lead us to conclude that they should be basically considered of higher interest for a recommendation system, compared to nodes with $NIP_{i}$ lower than expected. In other words, $NIP$ permits to discriminate users on the basis of a collective effect based on their neighbourhood.

In a second example, we consider the set of instances drawn from the Amazon web site in four different moments in $2003$ . Instances are built hypothesising that a product (node) $i$ is co-purchased with product (node) $j$ , then the graph contains a link from $i$ to $j$ .

Table 1 summarises basic metrics of such instances262626The data processing, the network analysis and all simulations are conducted using the software R (Team et al., 2013) with the igraph package (Csardi et al., 2006)., the density272727The density of a network is defined as the number of its actual links divided by the number of potential links which is $n(n-1)/2$ . It can assume value from [math] (empty network) to $1$ (complete network). and the value for $\langle d\rangle$ , $\langle d^{2}\rangle$ , variance, assortativity and $NIP_{N}$ . Since the networks are almost neutral (i.e. $r$ is close to [math]), the calculation of $NIP_{N}$ is correct and it can be noted how it boosts the magnitude of the information patrimony of each node, even if the network contains very few links, since the density of all instances is of the order of $10^{-5}$ .

Figure 3 shows that amazon0601 network instance has a degree distribution shape close to a lognormal, indicating the presence of hub nodes. In Figure 4 we display the distribution of $NIP_{i}$ and how such values positively correlate with the node degree. Consequently, the majority of nodes with low degree values displays a low network information patrimony. In spite of this, we note a quite strong scattering of the values of $NIP_{i}$ , as shown in the inset of Figure 4, with some nodes displaying a $NIP_{i}$ higher than expected, and others displaying a lower one.

It is also possible to observe that within a class of nodes grouped by the same degree there are some showing different $NIPs$ and – more surprisingly – by comparing those classes, it is possible to detect how nodes with a lower degree can have a higher $NIP_{i}$ than those with a higher degree. In such sense, the red point in the inset in Figure 4 (which corresponds to a node with degree $d=44$ and $NIP_{i}=192.61$ ) outperforms in terms of network patrimony, showing a $NIP_{i}$ greater than those of nodes with twice its degree.

7 Discussions and conclusions

Building on network science’s properties, networks representations can provide significant competitive advantages to those companies which are currently using data without being aware of the gains potentially stemming from an exploitation of such asset. To explain how companies could benefit from the opportunity to extract much more information than it is reasonable to imagine, we used network science and its formal structure moving from a local vision grounded on clustering and classification analysis, to a global one. Network science properties are indeed useful in order to explain complex systems dynamics, as well as to understand how information flows in a network and how much of it can be extrapolated taking into account a targeted node with respect to its neighbourhood. We therefore exploited networks properties, in order to understand how a platform may use data in order to boost its predictive capabilities in terms of individuals’ purchasing capacity and potentially consumers’ steering practices. Such advantage, exploited by platforms relying on a network, can be estimated by measuring the set of their connections as a whole – that means the links between consumers and/or items within the examined network. In this regard, availing of information on the individual consumer, network platforms can benefit from some of the properties deriving from the network configuration, extracting data on a single node (hence, on a consumer or on an item), but also capturing data stemming from its connections’ patterns.

To this end, we proposed an additional measure – further to the one related to the degree of a node – taking into account the quality of nodes’ connections and consequently of a hypothetical user in relation to its neighbourhood. We therefore exploited a well-known network science concept in order to establish a new analytical quality measure. We measured how users who might enjoy of benefits deriving from a good neighbourhood – hence of a superior connections set – may also obtain better information. In order to test the validity of our measure, we prove it on four Amazons’ benchmark instances Amazons’, providing some interesting insights for one of them in particular.

As a matter of fact, the definition of such measure could allow platforms to implement a twofold strategy. Detecting whether a node is over-performing or under-performing compared to the average degree level of its neighbours, it is possible – focusing on over-performing nodes – to offer them more precise recommendations thanks to to a more accurate inference based on their neighbourhood quality. In turns, in relation to under-performing nodes, it will be possible to exploit the recommendation systems in order to offer them goods or services so that such nodes come into contact with higher quality ones, and prompting higher quality recommendations in the future.

The limitations of such approach are mainly connected to the lack – in the present literature – of benchmark instances for empirical tests capable to demonstrate the previous mentioned hypotheses. However, preliminary results obtained provide a positive answer to our research question, and the emerged data properties, together with their envisioned impact in terms of the relevant economic phenomena, points to the suitability of conducting further research on the matter.

Bibliography83

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Acquisti (2008) Acquisti A (2008) Identity management, privacy, and price discrimination. IEEE Security & Privacy 6(2):46–50
2Acquisti et al. (2016) Acquisti A, Taylor C, Wagman L (2016) The Economics of Privacy. Journal of Economic Literature 54(2):442–492
3Akerlof (1970) Akerlof GA (1970) The Market for ”Lemons” Quality Uncertainty and the Market Mechanism. The Quarterly Journal of Economics 84(3):488–500
4Arpetti (2018) Arpetti J (2018) Economia della privacy: una rassegna della letteratura ( in italian) . Rivista di diritto dei media 2:267–297
5Arrow (1958) Arrow KJ (1958) Utilities, attitudes, choices: A review note. Econometrica: Journal of the Econometric Society pp 1–23
6Bakshy et al. (2012) Bakshy E, Rosenn I, Marlow C, Adamic L (2012) The role of social networks in information diffusion. In: Proceedings of the 21st international conference on World Wide Web, ACM, pp 519–528
7Barabási (2013) Barabási AL (2013) Network science. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences 371(1987):20120375
8Barabási (2016) Barabási AL (2016) Network science. Cambridge university press