Discovering the Traces of Disinformation on Instagram in the Internet Archive
Haley Bragg, Michele Weigle

TL;DR
This study investigates the accessibility and quality of archived Instagram disinformation content, revealing that most archived pages are incomplete or redirects, with replayability decreasing over recent years, especially for anti-vaccine accounts.
Contribution
It provides a detailed analysis of the limitations of web archives in capturing Instagram disinformation content, highlighting challenges in studying deleted or removed posts.
Findings
96.13% of mementos redirect to login page
Only 27.16% of remaining mementos have complete post images
Replayability of archived content is decreasing over time, notably in 2021-2022
Abstract
Disinformation, which is fabricated, misleading content spread with the intent to deceive others, is accumulating substantial engagements and reaching a vast audience on Instagram. However, the temporary nature of the platform and the security guidelines that remove malicious content make studying this disinformation a challenge. The only way to access removed content and banned accounts that are no longer on the live web is by searching the web archives. In this study, we set out to quantify the replayability and quality of past captures of Instagram account pages, specifically focusing on a group of anti-vaxx content creators known as the Disinformation Dozen. We found that the number of mementos listed for these users' account pages on the Internet Archive's Wayback Machine can be misleading, because a majority of the mementos are actually redirections to the Instagram login page,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Spam and Phishing Detection · Misinformation and Its Impacts
