Identifying User Profiles Via User Footprints
Yasamin Kowsari

TL;DR
This paper presents an empirical study on user profile identification across social networks, using machine learning techniques to classify user attributes with high accuracy based on Reddit data.
Contribution
It introduces a novel scheme combining multiple features and machine learning models for user profile mapping across social networks, achieving over 89% accuracy.
Findings
Support vector machines, Random Forests, and deep belief networks effectively classify user attributes.
Collected a diverse dataset of 5000 Reddit user samples.
Achieved classification accuracy higher than 89%.
Abstract
User identification has been a major field of research in privacy and security topics. Users might utilize multiple Online Social Networks (OSNs) to access a variety of text, videos, and links, and connect to their friends. Identifying user profiles corresponding to multiple virtual activities of users across social networks is significant for the development of related fields, such as network security, user behavior patterns analysis, and user recommendation systems. In addition, predicting personal attributes based on public content is a challenging topic. In this work, we perform an empirical study and proposed a scheme with considerable performance. In this work, we investigate Reddit, a famous social network for questioning and answering. By considering available personal and non-personal attributes, we discuss our main findings based on mapping the different features such as user…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpam and Phishing Detection · Internet Traffic Analysis and Secure E-voting · Network Security and Intrusion Detection
