A Framework for Aggregating Private and Public Web Archives
Mat Kelly, Michael L. Nelson, Michele C. Weigle

TL;DR
This paper presents a novel framework that enhances the aggregation of private, personal, and public Web archives by extending existing standards and involving users in the negotiation process, improving the completeness and privacy of Web history.
Contribution
It introduces a new framework with extended Memento standards and user negotiation mechanisms for better aggregation of diverse Web archives.
Findings
Extended Memento syntax for private Web archives
User-involved negotiation process for archive aggregation
Improved accuracy in Web history representation
Abstract
Personal and private Web archives are proliferating due to the increase in the tools to create them and the realization that Internet Archive and other public Web archives are unable to capture personalized (e.g., Facebook) and private (e.g., banking) Web pages. We introduce a framework to mitigate issues of aggregation in private, personal, and public Web archives without compromising potential sensitive information contained in private captures. We amend Memento syntax and semantics to allow TimeMap enrichment to account for additional attributes to be expressed inclusive of the requirements for dereferencing private Web archive captures. We provide a method to involve the user further in the negotiation of archival captures in dimensions beyond time. We introduce a model for archival querying precedence and short-circuiting, as needed when aggregating private and personal Web archive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
