Understanding Data Collection, Brokerage, and Spam in the Lead Marketing Ecosystem
Yash Vekaria, Nurullah Demir, Konrad Kollnig, Zubair Shafiq

TL;DR
This study empirically investigates privacy and spam risks in the lead marketing ecosystem, revealing deceptive practices, extensive data sharing, and aggressive marketing tactics involving sensitive personal information.
Contribution
First comprehensive measurement framework analyzing data flows, brokerage practices, and spam in lead marketing, highlighting privacy violations and operational tactics.
Findings
Sensitive health data shared with over 70 third parties
Deceptive brokerage selling data to unvetted buyers
Over 8,000 telemarketing calls received from lead submissions
Abstract
The lead marketing ecosystem enables collection, sale, and use of personal data submitted via web forms to deliver personalized quotes in high-value verticals such as insurance. Despite its scale and sensitivity of the collected data, this ecosystem remains largely unexplored by the research community. We present the first empirical study of privacy and spam risks in lead marketing, developing an end-to-end measurement framework to trace data flows from data collection to consumer contact. Our setup instruments over 100 health-related lead-generation websites and monitors 200 controlled phone numbers and email addresses to understand downstream marketing practices. We observe sharing of highly personal and sensitive health information to more than 70 distinct third parties on these lead generation websites. By purchasing our own and other organic leads from three major lead platforms,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
