Matching Media Contents with User Profiles by means of the Dempster-Shafer Theory
Luigi Troiano, Irene D\'iaz, Ciro Gaglione

TL;DR
This paper proposes a model based on Dempster-Shafer Theory of Evidence to match media content with user profiles, aiming to improve personalized content delivery in the media industry.
Contribution
It introduces a novel reference model utilizing Dempster-Shafer Theory for matching media content with user profiles, with potential applications in personalized media services.
Findings
Demonstrated the model with a toy example
Outlined potential applications in media personalization
Highlighted properties of the proposed model
Abstract
The media industry is increasingly personalizing the offering of contents in attempt to better target the audience. This requires to analyze the relationships that goes established between users and content they enjoy, looking at one side to the content characteristics and on the other to the user profile, in order to find the best match between the two. In this paper we suggest to build that relationship using the Dempster-Shafer's Theory of Evidence, proposing a reference model and illustrating its properties by means of a toy example. Finally we suggest possible applications of the model for tasks that are common in the modern media industry.
| … | … | ||||||||
| 1 | |||||||||
| 2 | |||||||||
| 3 | |||||||||
| m | |||||||||
| Age | 30s | 30s | 20s | 40s | ||||||
| Gender | M | F | M | M | ||||||
| Location | IT | IT | SP | IT | ||||||
| \cdashline5-9 | Interests | Movies Books | Sport | Books | Music Sport | |||||
| Director | Year | Stars | Genre | 1 | 2 | 3 | 4 | |||
| Boyle | 1996 | Ewan McGregor, Ewen Bremner | Drama | 0 | ||||||
| Levinson | 1996 | Robert De Niro, Kevin Bacon, Brad Pitt | Crime, Drama, Thriller | 1 | ||||||
| Scorsese | 2015 | Robert De Niro, Leonardo DiCaprio, Brad Pitt | Short, Comedy | 2 | ||||||
| Scorsese | 1990 | Robert De Niro, Ray Liotta, Joe Pesci | Biography, Crime, Drama | 3 | ||||||
| Boyle | 2000 | Leonardo DiCaprio | Adventure, Drama, Romance | 4 | ||||||
| Howard | 1995 | Tom Hanks, Kevin Bacon | Adventure, Drama, History | 5 | ||||||
| Zemeckis | 1994 | Tom Hanks | Comedy, Drama | 6 | ||||||
| Zemeckis | 1985 | Michael J. Fox, Christopher Lloyd | Adventure, Sci-Fi | 7 | ||||||
| Edwards | 2016 | Felicity Jones, Diego Luna | Adventure, Sci-Fi | 8 | ||||||
| Scott | 2015 | Matt Damon | Adventure, Drama, Sci-Fi | 9 | ||||||
| Director | Year | Actors | Genre |
| Boyle | 1996 | Ewan McGregor | Crime |
| Levinson | 2015 | Ewen Bremner | Drama |
| Scorsese | 1990 | Ray Liotta | Thriller |
| Howard | 2000 | Robert De Niro | Short |
| Zemeckis | 1995 | Kevin Bacon | Comedy |
| Edwards | 1994 | Brad Pitt | Biography |
| Scott | 1985 | Leonardo DiCaprio | Adventure |
| 2016 | Joe Pesci | Romance | |
| Ray Liotta | History | ||
| Tom Hanks | Sci-Fi | ||
| Michael J. Fox | |||
| Christopher Lloyd | |||
| Felicity Jones | |||
| Diego Luna | |||
| Matt Damon |
| Age | Gender | Location | Interests |
| 20s | M | IT | Books |
| 30s | F | SP | Movies |
| 40s | SP | Sport | |
| Music |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimedia Communication and Technology
Matching Media Contents with User Profiles
by means of the Dempster-Shafer Theory
Luigi Troiano
University of Sannio
Department of Engineering
Benevento, Italy
Email: [email protected]
Irene Díaz
Oviedo University
Computer Science Department
Gijón, Spain
Email: [email protected]
Ciro Gaglione DISCLAIMER. This article was prepared or accomplished by Ciro Gaglione in his personal capacity. The opinions expressed in this paper are the authors’ own and do not reflect the view of Sky Italia. Sky Italia
Interactive Tv Lab
Milan, Italy
Email: [email protected]
Abstract
The media industry is increasingly personalizing the offering of contents in attempt to better target the audience. This requires to analyze the relationships that goes established between users and content they enjoy, looking at one side to the content characteristics and on the other to the user profile, in order to find the best match between the two. In this paper we suggest to build that relationship using the Dempster-Shafer’s Theory of Evidence, proposing a reference model and illustrating its properties by means of a toy example. Finally we suggest possible applications of the model for tasks that are common in the modern media industry.
I Introduction
Digital technologies are radically changing the way of performing business in media industry, with new possibilities of tailoring the catalog so that everybody has the chance of enjoying contents that best fit his/her interests, often on demand, at the time that is most appropriate for each user. Such a change is requiring to reformulate the way of building the content offering. Data collected from customers regarding their profile and preferences become central, so models able to interpret and to reason about data.
These models aims to discover and exploit the relationship that stands between users and media contents they enjoy. Here the problem is not to ask directly the user what are his/her interests and preferences, but to infer them by looking at those contents they access and to the feedback they provide about them. The ultimate goal is to learn a model from data able to link user to the vast catalog of contents made available by a large media company.
Looking at past interactions is useful to help users to discover contents that they would appreciate as valuable part of the product they paid for. This means to improve the customer retention and foster their upgrade towards more profitable products. The benefits coming from the implementation and use of these models go beyond existing contents and customers. They also help to propose new contents to existing customers, and on the other way to support new customers in discovering existing contents. Soon, new contents and new customers become part of the model, enriching the dataset of new entities, along a self-growing process. Predictiveness of models make them also suitable to support the acquisition of new contents and customers.
These models are at the core logic of recommender systems (RS), that obtained large attention once Netflix showed potentiality of algorithms in developing and supporting their streaming platform [1]. Recommender systems gained large application because of the e-commerce diffusion. They are generally grouped in different types, including Content-based recommenders [2], Collaborative recommenders [3], Demographic recommenders [4], and Hybrid recommenders [5].
The purpose of a recommender system is to provide a suggestion, regarding available alternatives, by scoring and ranking them according to the user preferences. In order to accomplish its task, a recommender system requires information regarding the user profile and habits with respect to the different alternatives that can be proposed to him. This information can be acquired explicitly by asking the users to rate items or implicitly by monitoring users’ behavior (booked hotels or heard songs). RS can also use other kinds of information as demographic features (e.g, age, gender) or social information. The research related to RS has been focused on movies, music and books [6], being music recommendations the most studied topic, although later it has been applied to other e-commerce domains [7].
Similar to RS, we need data about user likings regarding catalog items such as movies, series and shows. Such information can be gathered by asking the user to rate the items, e.g., by using stars or likes, or implicitly by monitoring the customer behavior, e.g., which item enjoyed fully an which partially, how often they accessed the content description, etc. In addition we need other information regarding demographics such age, gender, family members, job, etc. The objective is to relate user profiles to content descriptors. Different techniques have been experimented in order to discover and exploit this relationship. Most of them take the form of information fusion.
Following the idea explored by [8], and more concretely the model developed in [9], we aim to build a relationship model based on the Dempster-Shafer’s Theory of Evidence (D-S theory) [10, 11] and to use it to make inference regarding the relationship between users and contents. The reminder of this paper is organized as follows: Section II provides some preliminaries regarding D-S Theory; Section III describes the model; Section IV outlines some examples of application; Section V draws conclusions and future directions.
II Preliminaries
The Dempster-Shafer theory, also known as the Theory of Evidence [10, 11], is used as basis for the preference model presented in [9]. In D-S theory, basic probabilities are allocated to subsets, instead of elements, according to the following definitions.
Definition 1**.**
A function over a set is called a basic probability assignment if
[TABLE]
Definition 2**.**
Let be a set, then is a focal element if . In addition, represents the set of focal elements induced by .
Definition 3**.**
Let be a basic probability assignment function over a set . The Belief of induced by is defined as follows
[TABLE]
Definition 4**.**
Let be a basic probability assignment function over a set . The Plausibility of induced by is defined as follows
[TABLE]
The relationship between Plausibility and Belief is given by the following equation:
[TABLE]
where is the complement of to .
When the probability basic assignments are given by different sources, it is possible to combine them. The first and most common combination method is known as the Dempster’s rule, that is defined as follows:
Definition 5**.**
Let and be two basic probability assignments, the joint basic probability assignment is computed as
[TABLE]
where
[TABLE]
is a measure of conflict between the two basic probability assignment sets. In addition, it is assumed .
Belief and Plausibility are monotonic functions with respect to inclusion. This means that if we consider the lattice of subsets, as shown in Fig. 1, Belief and Plausibility will increase from bottom () to top (). In particular Belief and Plausibility will be kept constant as far as we move to nodes that do not a probability mass assigned to them. As consequence of this property, we can identify regions of connected nodes, each assuming a specific value of Belief or Plausibility, as illustrated by Fig. 2.
In this example, focal elements are , and with the associated basic probability assignments , and (assuming ). This leads to identify 8 groups in the lattice, each with Belief and Plausibility depending from a focal subset of . Fig. 2 outlines these regions for both Belief and Plausibility. we can observe how all portions of lattice associated to a given value of Belief or Plausibility are connected.
If we sort the Belief (or Plausibility) values in ascending order, we get a sequence of levels, each grouping the nodes into those that are below the level and over the level. For instance, if we assume
[TABLE]
we get the situation depicted by Fig. 3 with respect to Plausibility. The following definitions enable the concept of classes of equivalence among the subsets with respect to Belief or Plausibility and to identify those elements that are most representative of the class.
Definition 6** (Core).**
Given a subset , the set of focal elements included in , core of , is defined as
[TABLE]
Definition 7** (Support).**
Given a subset and the set of focal elements (even partially in ), support of , is defined as
[TABLE]
For instance, according to the example in Fig. 1 , we have and . It is straightforward that , for all . The core and support represent the basis for computing respectively the Belief and the Plausibility of . The core and the support are able to group the subsets of into classes of equivalence as the following definition states.
Definition 8** ( and Equivalence).**
Two sets and are said to be -equivalent if and only if . A -equivalence class is defined as the collection
[TABLE]
In addition, and are -equivalent if and only if . The -equivalence class obtained from this relation. is defined as
[TABLE]
Fig. 4(a) provides an example of -equivalence class assuming as core . Fig. 4(b) shows the -equivalence class for the support .
As an immediate consequence, if and are -equivalent, then , while if they are -equivalent, .
and equivalence classes perform a partitioning of . Thus, each subset can belong only to one equivalence class. Grouping subsets in and equivalence classes allows (i) to explore the lattice by moving across classes, instead of exploring the whole item subset space, and (ii) to choose a representative of each class, so that the list of recommended items is shorter. For instance, we might be interested in using the smallest subset within a -equivalence class.
As representative of a equivalence class we can assume the smallest subset. We call this set minimal. For instance, for the class , the core is and the minimal is . It is possible to prove that each equivalence class as one single minimal. Conversely, for equivalence classes we assume as representative the largest subset, that we call maximal. Similarly to equivalence classes, it is possible to prove that any equivalence class has one single maximal. For example, the class , whose support is , as as maximal.
III Model
In the context of our interest we assume as the set of items belonging to the content catalog, while as the set of users.
Both sets are projected on two feature spaces, respectively made of and dimensions. The first is referred to the set of characteristics describing the items in , , while the second to the user profiling . Both spaces are discrete, so that each and can assume a finite number of values.
The relationship between items and users is expressed by a choice matrix, as that shown in Tab. I. The choice matrix is places side by side to the item characteristics matrix (left side) and to the profile matrix (top).
In general, data points and are multi-valued, meaning that they are represented by sets of values. For instance if is representing the movie cast, is represented by the list of actors that are featuring in the movie . Similarly, if is ”interests”, will list what the user is interested in. In other cases they are single-valued, such as in the case of characteristics such as ”director” and ”year” or in the case of profiling features such as ”age” or ”location”. An example of this matrix is given in Tab.III.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] C. A. Gomez-Uribe and N. Hunt, “The netflix recommender system: Algorithms, business value, and innovation,” ACM Trans. Manage. Inf. Syst. , vol. 6, no. 4, pp. 13:1–13:19, Dec. 2015. doi: 10.1145/2843948. [Online]. Available: http://doi.acm.org/10.1145/2843948
- 2[2] J. Salter and N. Antonopoulos, “Cinemascreen recommender agent: Combining collaborative and content-based filtering,” IEEE Intelligent Systems , vol. 21, no. 1, pp. 35–41, Jan. 2006. doi: 10.1109/MIS.2006.4. [Online]. Available: http://dx.doi.org/10.1109/MIS.2006.4
- 3[3] L. Candillier, F. Meyer, and M. Boullé, “Comparing state-of-the-art collaborative filtering systems,” pp. 548–562, 2007.
- 4[4] M. J. Pazzani, “A framework for collaborative, content-based and demographic filtering,” Artif. Intell. Rev. , vol. 13, no. 5-6, pp. 393–408, Dec. 1999. doi: 10.1023/A:1006544522159. [Online]. Available: http://dx.doi.org/10.1023/A:1006544522159
- 5[5] R. Burke, “Knowledge-based recommender systems,” in Encyclopedia of Library and Information Science, vol. 69 , A. Kent, Ed. Taylor and Francis, 2000, pp. 180–201.
- 6[6] J. Bobadilla, F. Ortega, A. Hernando, and A. Gutiérrez, “Recommender systems survey,” Knowledge-Based Systems , vol. 46, no. 0, pp. 109 – 132, 2013. doi: 10.1016/j.knosys.2013.03.012. [Online]. Available: http://www.sciencedirect.com/science/article/pii/S 0950705113001044
- 7[7] J. J. Castro-Schez, R. Miguel, D. Vallejo, and L. M. López-López, “A highly adaptive recommender system based on fuzzy logic for {B 2C} e-commerce portals,” Expert Systems with Applications , vol. 38, no. 3, pp. 2441 – 2454, 2011. doi: 10.1016/j.eswa.2010.08.033
- 8[8] K. Zhang and H. Li, “Fusion-based recommender system,” in Information Fusion (FUSION), 2010 13th Conference on , July 2010. doi: 10.1109/ICIF.2010.5712091 pp. 1–7.
