SHARI -- An Integration of Tools to Visualize the Story of the Day
Shawn M. Jones, Alexander C. Nwala, Martin Klein, Michele C. Weigle,, Michael L. Nelson

TL;DR
SHARI is a comprehensive system that integrates multiple tools and web archives to analyze, identify, and visualize the most significant news story of a specific day, enhancing historical news understanding.
Contribution
This paper introduces SHARI, a novel integration of existing tools for news story analysis and visualization using web archives, which was not previously combined in this manner.
Findings
Successfully clusters news articles into stories using StoryGraph.
Effectively stores and analyzes URLs with Hypercane and ArchiveNow.
Creates visualizations of daily news stories with Raintale.
Abstract
Tools such as Google News and Flipboard exist to convey daily news, but what about the past? In this paper, we describe how to combine several existing tools with web archive holdings to perform news analysis and visualization of the "biggest story" for a given date. StoryGraph clusters news articles together to identify a common news story. Hypercane leverages ArchiveNow to store URLs produced by StoryGraph in web archives. Hypercane analyzes these URLs to identify the most common terms, entities, and highest quality images for social media storytelling. Raintale then uses the output of these tools to produce a visualization of the news story for a given day. We name this process SHARI (StoryGraph Hypercane ArchiveNow Raintale Integration).
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWeb Data Mining and Analysis · Video Analysis and Summarization · Algorithms and Data Compression
