Identifying Terms and Conditions Important to Consumers using Crowdsourcing
Xingyu Liu, Annabel Sun, Jason I. Hong

TL;DR
This paper develops a crowdsourcing-based workflow to identify and rank terms in online terms and conditions that consumers find important, revealing insights into consumer priorities and improving awareness.
Contribution
It introduces an open-definition crowdsourcing method with pairwise comparisons and rank modeling to assess consumer importance of T&C statements, a novel approach compared to prior category-based methods.
Findings
Consumers prioritize policies related to after-sales and money.
Hard-to-understand statements are deemed more important by consumers.
Machine learning models can identify important T&C clauses with over 92% accuracy.
Abstract
Terms and conditions (T&Cs) are pervasive on the web and often contain important information for consumers, but are rarely read. Previous research has explored methods to surface alarming privacy policies using manual labelers, natural language processing, and deep learning techniques. However, this prior work used pre-determined categories for annotations, and did not investigate what consumers really deem as important from their perspective. In this paper, we instead combine crowdsourcing with an open definition of "what is important" in T&Cs. We present a workflow consisting of pairwise comparisons, agreement validation, and Bradley-Terry rank modeling, to effectively establish rankings of T&C statements from non-expert crowdworkers on this open definition, and further analyzed consumers' preferences. We applied this workflow to 1,551 T&C statements from 27 e-commerce websites,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy, Security, and Data Protection · Mobile Crowdsensing and Crowdsourcing · Hate Speech and Cyberbullying Detection
