Efficient and Compact Spreadsheet Formula Graphs
Dixin Tang, Fanchao Chen, Christopher De Leon, Tana Wattanawaroon,, Jeaseok Yun, Srinivasan Seshadri, Aditya G. Parameswaran

TL;DR
TACO is a framework that compresses spreadsheet formula graphs by exploiting tabular locality, enabling faster querying and maintenance, which significantly improves interactivity in spreadsheet systems.
Contribution
We introduce TACO, a novel compression framework for formula graphs that leverages tabular locality patterns to enhance efficiency and support incremental updates.
Findings
TACO reduces formula graph sizes significantly.
Querying speedups of up to 34,972x over baseline.
Effective incremental maintenance during updates.
Abstract
Spreadsheets are one of the most popular data analysis tools, wherein users can express computation as formulae alongside data. The ensuing dependencies are tracked as formula graphs. Efficiently querying and maintaining these formula graphs is critical for interactivity across multiple settings. Unfortunately, formula graphs are often large and complex such that querying and maintaining them is time-consuming, reducing interactivity. We propose TACO, a framework for efficiently compressing formula graphs, thereby reducing the time for querying and maintenance. The efficiency of TACO stems from a key spreadsheet property: tabular locality, which means that cells close to each other are likely to have similar formula structures. We leverage four such tabular locality-based patterns and develop algorithms for compressing formula graphs using these patterns, directly querying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Database Systems and Queries · Advanced Data Storage Technologies
