On the Change in Archivability of Websites Over Time
Mat Kelly, Justin F. Brunelle, Michele C. Weigle, Michael L. Nelson

TL;DR
This paper investigates how the archivability of web pages changes over time due to evolving web technologies, highlighting the impact of dynamic content and scripting on preservation efforts.
Contribution
It demonstrates that archivability can be inferred from page types and accessibility, providing a framework to improve web content publishing for better preservation.
Findings
Archivability varies with web technology evolution.
Dynamic content affects archiving success.
Guidelines for publishing web pages to enhance longevity.
Abstract
As web technologies evolve, web archivists work to keep up so that our digital history is preserved. Recent advances in web technologies have introduced client-side executed scripts that load data without a referential identifier or that require user interaction (e.g., content loading when the page has scrolled). These advances have made automating methods for capturing web pages more difficult. Because of the evolving schemes of publishing web pages along with the progressive capability of web preservation tools, the archivability of pages on the web has varied over time. In this paper we show that the archivability of a web page can be deduced from the type of page being archived, which aligns with that page's accessibility in respect to dynamic content. We show concrete examples of when these technologies were introduced by referencing mementos of pages that have persisted through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
