Automated Early Leaderboard Generation From Comparative Tables
Mayank Singh, Rajdeep Sarkar, Atharva Vyas, Pawan Goyal, Animesh, Mukherjee, Soumen Chakrabarti

TL;DR
This paper introduces an automated system that constructs and maintains leaderboards by extracting performance data from scientific papers' tables, enabling faster and more reliable comparison of techniques across multiple fields.
Contribution
The authors present a novel graph-based approach to automatically generate leaderboards from paper tables, handling noisy data and uncertainties in performance comparisons.
Findings
System accurately reproduces manually curated leaderboards.
Produces reliable rankings across 27 computer science areas.
Mitigates noise through structured data extraction and ranking algorithms.
Abstract
A leaderboard is a tabular presentation of performance scores of the best competing techniques that address a specific scientific problem. Manually maintained leaderboards take time to emerge, which induces a latency in performance discovery and meaningful comparison. This can delay dissemination of best practices to non-experts and practitioners. Regarding papers as proxies for techniques, we present a new system to automatically discover and maintain leaderboards in the form of partial orders between papers, based on performance reported therein. In principle, a leaderboard depends on the task, data set, other experimental settings, and the choice of performance metrics. Often there are also tradeoffs between different metrics. Thus, leaderboard discovery is not just a matter of accurately extracting performance numbers and comparing them. In fact, the levels of noise and uncertainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Software System Performance and Reliability · Software Engineering Research
