Characterizing Scalability Issues in Spreadsheet Software using Online Forums
Kelly Mack, John Lee, Kevin Chang, Karrie Karahalios, Aditya Parameswaran

TL;DR
This study analyzes online forum data to identify and characterize scalability issues faced by users of spreadsheet software, especially Excel, when handling large datasets, providing insights for future software design improvements.
Contribution
It demonstrates the effectiveness of using online forum data to understand user challenges with spreadsheet scalability, offering a scalable alternative to traditional usability studies.
Findings
Users face significant challenges with large datasets in spreadsheets.
Common issues include errors and performance problems.
Insights inform design of next-generation spreadsheet tools.
Abstract
In traditional usability studies, researchers talk to users of tools to understand their needs and challenges. Insights gained via such interviews offer context, detail, and background. Due to costs in time and money, we are beginning to see a new form of tool interrogation that prioritizes scale, cost, and breadth by utilizing existing data from online forums. In this case study, we set out to apply this method of using online forum data to a specific issue: challenges that users face with Excel spreadsheets. Spreadsheets are a versatile and powerful processing tool if used properly. However, with versatility and power come errors, from both users and the software, which make using spreadsheets less effective. By scraping posts from the website Reddit, we collected a dataset of questions and complaints about Excel. Specifically, we explored and characterized the issues users were…
