Replaying Archived Twitter: When your bird is broken, will it bring you down?
Kritika Garg, Himarsha R. Jayanetti, Sawood Alam, Michele C. Weigle, Michael L. Nelson

TL;DR
This paper investigates the challenges and limitations of archiving Twitter content, especially after UI changes and account suspensions, highlighting potential data loss and inaccuracies in web archives.
Contribution
It documents the impact of Twitter's UI change on archiving practices and analyzes the potential loss of information and temporal inconsistencies in archived Twitter pages.
Findings
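Most web archives were unable to archive the new Twitter UI that became mandatory in June 2020; their archived Twitter pages display Twitter's "Something went wrong" error instead.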
Archived data may lack evidence of labels or misinformation tags on tweets.
Temporal violations can cause archived pages to show non-existent or outdated content.
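To make the last finding concrete, here is a minimal sketch of how a temporal mismatch in a replayed page might be flagged. It is an illustration only, not the authors' method: the function names, the one-hour tolerance, and the example URLs are all hypothetical. It assumes each archived capture carries a 14-digit timestamp (YYYYMMDDhhmmss), as is common in Wayback-style archives.

```python
from datetime import datetime, timedelta


def parse_ts(ts: str) -> datetime:
    """Parse a 14-digit archive timestamp (YYYYMMDDhhmmss)."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S")


def find_temporal_violations(root_ts, resource_ts, tolerance=timedelta(hours=1)):
    """Return embedded resources captured too far from the root page.

    root_ts:     capture timestamp of the archived page itself
    resource_ts: mapping of embedded resource URL -> capture timestamp
    tolerance:   hypothetical threshold; the right value depends on the study
    """
    root = parse_ts(root_ts)
    violations = {}
    for url, ts in resource_ts.items():
        delta = parse_ts(ts) - root  # positive: resource captured after the page
        if abs(delta) > tolerance:
            violations[url] = delta
    return violations


# Hypothetical replay: a page captured on January 8, 2021, whose tweet data
# was captured weeks later and stitched into the same replayed page.
resources = {
    "https://example.org/page.css": "20210108120500",
    "https://example.org/tweets.json": "20210301000000",
}
for url, delta in find_temporal_violations("20210108120000", resources).items():
    print(url, "differs from the page capture by", delta)
```

A replay assembled from captures that are far apart in time can present a page state that was never served as a whole on the live web, which is why the gap between capture times is worth checking before an archived page is treated as evidence.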
Abstract
Historians and researchers trust web archives to preserve social media content that no longer exists on the live web. However, what we see on the live web and how it is replayed in the archive are not always the same. In this paper, we document and analyze the problems in archiving Twitter ever since Twitter forced the use of its new UI in June 2020. Most web archives were unable to archive the new UI, resulting in archived Twitter pages displaying Twitter's "Something went wrong" error. The challenges in archiving the new UI forced web archives to continue using the old UI. To analyze the potential loss of information in web archival data due to this change, we used the personal Twitter account of the 45th President of the United States, @realDonaldTrump, which was suspended by Twitter on January 8, 2021. Trump's account was heavily labeled by Twitter for spreading misinformation,…
Taxonomy
Topics: Web Data Mining and Analysis · Internet Traffic Analysis and Secure E-voting
