SocialQuotes: Learning Contextual Roles of Social Media Quotes on the Web
John Palowitch, Hamidreza Alvari, Mehran Kazemi, Tanvir Amin, Filip, Radlinski

TL;DR
This paper introduces SocialQuotes, a framework for automatically identifying the roles of social media quotes embedded in web pages, leveraging language models and a new dataset to enhance social media retrieval and analysis.
Contribution
It presents a novel language modeling approach and a large annotated dataset for classifying social media quote roles within web content, advancing cross-platform social media understanding.
Findings
Reasonable performance with modern LLMs in role classification
Revealed cross-domain and cross-platform role distributions
Provided explainability through page content ablations
Abstract
Web authors frequently embed social media to support and enrich their content, creating the potential to derive web-based, cross-platform social media representations that can enable more effective social media retrieval systems and richer scientific analyses. As step toward such capabilities, we introduce a novel language modeling framework that enables automatic annotation of roles that social media entities play in their embedded web context. Using related communication theory, we liken social media embeddings to quotes, formalize the page context as structured natural language signals, and identify a taxonomy of roles for quotes within the page context. We release SocialQuotes, a new data set built from the Common Crawl of over 32 million social quotes, 8.3k of them with crowdsourced quote annotations. Using SocialQuotes and the accompanying annotations, we provide a role…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImpact of Technology on Adolescents · Wikis in Education and Collaboration · Knowledge Management and Sharing
MethodsSparse Evolutionary Training
