Do You Like What I Like? Similarity Estimation in Proximity-based Mobile   Social Networks

Felix Beierle

arXiv:1805.07651·cs.SI·June 10, 2019

Do You Like What I Like? Similarity Estimation in Proximity-based Mobile Social Networks

Felix Beierle

PDF

TL;DR

This paper presents a space-efficient, privacy-preserving method for estimating similarity between users in proximity-based mobile social networks using probabilistic data structures, demonstrated with musical taste data.

Contribution

It introduces a novel approach using Counting Bloom Filters and Count-Min Sketches for fast, accurate similarity estimation without third-party servers.

Findings

01

Single exchange of probabilistic data structures estimates similarity accurately.

02

A Counting Bloom Filter with 128 length and one hash function suffices.

03

Method is generalizable to other multiset similarity estimations.

Abstract

While existing social networking services tend to connect people who know each other, people show a desire to also connect to yet unknown people in physical proximity. Existing research shows that people tend to connect to similar people. Utilizing technology in order to stimulate human interaction between strangers, we consider the scenario of two strangers meeting. On the example of similarity in musical taste, we develop a solution for the problem of similarity estimation in proximity-based mobile social networks. We show that a single exchange of a probabilistic data structure between two devices can closely estimate the similarity of two users - without the need to contact a third-party server.We introduce metrics for fast and space-efficient approximation of the Dice coefficient of two multisets - based on the comparison of two Counting Bloom Filters or two Count-Min Sketches. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.