Practical Semantic Analysis of Web Sites and Documents
Thierry Despeyroux (INRIA Rocquencourt / INRIA Sophia Antipolis)

TL;DR
This paper introduces a method for semantic analysis of web sites using a specification language inspired by Natural Semantics, enabling consistency checking and error detection in large XML documents to improve site quality.
Contribution
It proposes a novel specification language for semantic constraints in web sites, inspired by programming language semantics, and demonstrates its application for site verification and quality assessment.
Findings
Effective detection of inconsistencies in XML documents
Generation of verification tools from semantic specifications
Application to real-world scientific report data
Abstract
As Web sites are now ordinary products, it is necessary to explicit the notion of quality of a Web site. The quality of a site may be linked to the easiness of accessibility and also to other criteria such as the fact that the site is up to date and coherent. This last quality is difficult to insure because sites may be updated very frequently, may have many authors, may be partially generated and in this context proof-reading is very difficult. The same piece of information may be found in different occurrences, but also in data or meta-data, leading to the need for consistency checking. In this paper we make a parallel between programs and Web sites. We present some examples of semantic constraints that one would like to specify (constraints between the meaning of categories and sub-categories in a thematic directory, consistency between the organization chart and the rest of the site…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Advanced Database Systems and Queries · Web Data Mining and Analysis
