The WebStand Project
Benjamin Nguyen (PRISM), Fran\c{c}ois-Xavier Dudouet (LASP, IRISES),, Dario Colazzo (LRI), Antoine Vion (LEST), Ioana Manolescu (INRIA Saclay - Ile, de France), Pierre Senellart

TL;DR
The WebStand project develops a customizable XML-based platform for web data analysis, focusing on mailing lists and social network analysis to support sociological research on web communities and their temporal dynamics.
Contribution
It introduces a novel XML warehouse platform tailored for web data acquisition, transformation, and sociological analysis, emphasizing temporal aspects.
Findings
Analyzed the W3C standardization process and social network of standard setters.
Developed a flexible system for web data storage and analysis.
Facilitated sociological studies of online social groups.
Abstract
In this paper we present the state of advancement of the French ANR WebStand project. The objective of this project is to construct a customizable XML based warehouse platform to acquire, transform, analyze, store, query and export data from the web, in particular mailing lists, with the final intension of using this data to perform sociological studies focused on social groups of World Wide Web, with a specific emphasis on the temporal aspects of this data. We are currently using this system to analyze the standardization process of the W3C, through its social network of standard setters.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis
