It's All About The Cards: Sharing on Social Media Probably Encouraged HTML Metadata Growth
Shawn M. Jones, Valentina Neblitt-Jones, Michele C. Weigle and, Martin Klein, Michael L. Nelson

TL;DR
This study analyzes the evolution of HTML metadata in news articles from 1998 to 2016, highlighting the rapid adoption of social card metadata around 2010 driven by social media sharing needs.
Contribution
It provides a comprehensive longitudinal analysis of metadata usage in news articles, emphasizing the dominant role of social card metadata in social media sharing.
Findings
Social card metadata adoption reached over 95% by 2016.
Metadata usage surged starting in 2010, driven by social media sharing.
Social cards outpaced other metadata standards like Schema.org and Dublin Core.
Abstract
In a perfect world, all articles consistently contain sufficient metadata to describe the resource. We know this is not the reality, so we are motivated to investigate the evolution of the metadata that is present when authors and publishers supply their own. Because applying metadata takes time, we recognize that each news article author has a limited metadata budget with which to spend their time and effort. How are they spending this budget? What are the top metadata categories in use? How did they grow over time? What purpose do they serve? We also recognize that not all metadata fields are used equally. What is the growth of individual fields over time? Which fields experienced the fastest adoption? In this paper, we review 227,726 HTML news articles from 29 outlets captured by the Internet Archive between 1998 and 2016. Upon reviewing the metadata fields in each article, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Data Quality and Management · Scientific Computing and Data Management
