Building and Querying Semantic Layers for Web Archives (Extended Version)
Pavlos Fafalios, Helge Holzmann, Vaibhav Kasturia, Wolfgang Nejdl

TL;DR
This paper introduces a semantic layer framework for web archives using RDF/S models, enabling advanced querying and integration, demonstrated through experiments on different archive types.
Contribution
It proposes a novel RDF/S-based distributed framework for building semantic profiles of web archives, enhancing their accessibility and exploitable potential.
Findings
Semantic layers improve query capabilities over traditional keyword systems.
The framework supports describing metadata and annotating web archive contents with semantic information.
Experimental results show enhanced information retrieval from web archives.
Abstract
Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem and propose an RDF/S model and a distributed framework for building semantic profiles ("layers") that describe semantic information about the contents of web archives. A semantic layer allows describing metadata information about the archived documents, annotating them with useful semantic information (like entities, concepts and events), and publishing all this data on the Web as Linked Data. Such structured repositories offer advanced query and integration capabilities, and make web archives…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
