LAGOON: An Analysis Tool for Open Source Communities
Sourya Dey, Walt Woods

TL;DR
LAGOON is an open source platform that visualizes and analyzes open source communities using spatiotemporal graphs to identify malicious actors and understand community dynamics.
Contribution
The paper introduces a modular, extensible platform that integrates multiple data sources and machine learning for analyzing OSS communities, focusing on security and sociotechnical insights.
Findings
Supports ingestion from multiple data sources
Enables visualization of community interactions
Facilitates identification of bad actors
Abstract
This paper presents LAGOON -- an open source platform for understanding the complex ecosystems of Open Source Software (OSS) communities. The platform currently uses spatiotemporal graphs to store and investigate the artifacts produced by these communities, and to help analysts identify bad actors who might compromise an OSS project's security. LAGOON ingests artifacts from several common sources, including source code repositories, issue trackers, mailing lists, and content scraped from project websites. Ingestion uses a modular architecture, which supports incremental updates from data sources and provides a generic identity fusion process that can recognize the same community members across disparate accounts. A user interface is provided for visualization and exploration of an OSS project's complete sociotechnical graph. Scripts are provided for applying machine…
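To make the idea of identity fusion concrete, the following is a minimal sketch of how accounts from different sources might be merged into one community member. It is not LAGOON's actual algorithm or schema: the `Account` record, the `normalize` helper, and the rule "same normalized email or same normalized name means same person" are all assumptions chosen for illustration, and the grouping uses a plain union-find structure.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Account:
    """Hypothetical account record; not LAGOON's actual schema."""
    source: str  # e.g. "git", "mailing_list", "issue_tracker"
    name: str
    email: str


def normalize(value: str) -> str:
    """Lowercase and collapse whitespace so trivial variants compare equal."""
    return " ".join(value.lower().split())


def fuse_identities(accounts):
    """Group accounts that share a normalized email or a normalized name.

    Assumed heuristic for illustration only. Matching is transitive:
    if A matches B and B matches C, all three land in one group.
    """
    parent = list(range(len(accounts)))

    def find(i):
        # Walk to the root, halving the path along the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    def union(i, j):
        ri, rj = find(i), find(j)
        if ri != rj:
            # Keep the smaller index as root for deterministic grouping.
            parent[max(ri, rj)] = min(ri, rj)

    first_seen = {}  # (field, normalized value) -> index of first account
    for idx, acct in enumerate(accounts):
        keys = (("email", normalize(acct.email)), ("name", normalize(acct.name)))
        for key in keys:
            if not key[1]:
                continue  # empty fields carry no identity evidence
            if key in first_seen:
                union(idx, first_seen[key])
            else:
                first_seen[key] = idx

    groups = defaultdict(list)
    for idx, acct in enumerate(accounts):
        groups[find(idx)].append(acct)
    return list(groups.values())


# Made-up example data.
accounts = [
    Account("git", "Ada Lovelace", "ada@example.org"),
    Account("mailing_list", "A. Lovelace", "ada@example.org"),
    Account("issue_tracker", "ada lovelace", "ada@users.example.com"),
    Account("git", "Grace Hopper", "grace@example.org"),
]
fused = fuse_identities(accounts)
for group in fused:
    print(sorted(a.source for a in group))
```

In this made-up data, the first two accounts are linked by a shared email and the third is linked to the first by a shared name, so the three are fused into one identity. Exact name matching is a crude rule that can merge distinct people with the same name; a real system would likely weigh several signals before fusing.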
Taxonomy
Topics: Software Engineering Research · Computational Physics and Python Applications · Complex Network Analysis Techniques
