Asymptotic Experiments with Data Structures: Bipartite Graph Matchings and Covers
Eason Li, Franc Brglez

TL;DR
This paper presents asymptotic performance experiments on bipartite graph algorithms across Java, R, and C++, revealing surprising performance differences and highlighting R's efficiency in certain large-scale problems.
Contribution
It provides empirical insights into the asymptotic performance of data structures and algorithms in different programming languages on large bipartite graph problems.
Findings
R outperforms Java in movie list management.
R's Ford-Fulkerson implementation is faster on large maximum matching instances.
R's stochastic greedy algorithm surpasses C++ solvers on large set cover problems.
Abstract
We consider instances of bipartite graphs and a number of asymptotic performance experiments in three projects: (1) top movie lists, given databases of movies and viewers, (2) maximum matchings, and (3) minimum set covers. Experiments are designed to measure the asymptotic runtime performance of abstract data types (ADTs) in three programming languages: Java, R, and C++. The outcomes of these experiments may be surprising. In project (1), the best ADT in R consistently outperforms all ADTs in public domain Java libraries, including the library from Google. The largest movie list has titles. In project (2), the Ford-Fulkerson algorithm implementation in R significantly outperforms Java. The hardest instance has 88452 rows and 729 columns. In project (3), a stochastic version of a greedy algorithm in R can significantly outperform a state-of-the-art stochastic solver in C++ on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Data Mining Algorithms and Applications · Advanced Database Systems and Queries
