Weak Links in LinkedIn: Enhancing Fake Profile Detection in the Age of LLMs
Apoorva Gulati, Rajesh Kumar, Vinti Agarwal, Aditya Sharma

TL;DR
This paper evaluates the vulnerability of LinkedIn fake profile detectors to LLM-generated profiles and proposes GPT-assisted adversarial training to improve detection robustness without increasing false rejections.
Contribution
It introduces GPT-assisted adversarial training and demonstrates its effectiveness in significantly reducing false accept rates for GPT-generated fake profiles.
Findings
Existing detectors fail to identify GPT-generated profiles (False Accept Rate: 42-52%)
Adversarial training reduces false accept rate to 1-7%
Detectors trained on combined embeddings are most robust
Abstract
Large Language Models (LLMs) have made it easier to create realistic fake profiles on platforms like LinkedIn. This poses a significant risk for text-based fake profile detectors. In this study, we evaluate the robustness of existing detectors against LLM-generated profiles. While highly effective in detecting manually created fake profiles (False Accept Rate: 6-7%), the existing detectors fail to identify GPT-generated profiles (False Accept Rate: 42-52%). We propose GPT-assisted adversarial training as a countermeasure, restoring the False Accept Rate to between 1-7% without impacting the False Reject Rates (0.5-2%). Ablation studies revealed that detectors trained on combined numerical and textual embeddings exhibit the highest robustness, followed by those using numerical-only embeddings, and lastly those using textual-only embeddings. Complementary analysis on the ability of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLibrary Science and Information Systems · Authorship Attribution and Profiling · Academic integrity and plagiarism
