Integrated Benchmarking and Design for Reproducible and Accessible Evaluation of Robotic Agents
Jacopo Tani, Andrea F. Daniele, Gianmarco Bernasconi, Amaury, Camus, Aleksandar Petrov, Anthony Courchesne, Bhairav Mehta and, Rohit Suri, Tomasz Zaluska, Matthew R. Walter, Emilio Frazzoli and, Liam Paull, Andrea Censi

TL;DR
This paper introduces a comprehensive framework for reproducible robotics research that integrates development and benchmarking, exemplified by the DUCKIENet system and the Duckietown Autolab, ensuring low variance and high reproducibility.
Contribution
It presents a novel integrated approach combining development and benchmarking in robotics, with a standardized, accessible infrastructure to enhance reproducibility.
Findings
Low variance in experimental results across different hardware and labs
Validated the repeatability of experiments using the infrastructure
Demonstrated the system's effectiveness in ensuring reproducibility
Abstract
As robotics matures and increases in complexity, it is more necessary than ever that robot autonomy research be reproducible. Compared to other sciences, there are specific challenges to benchmarking autonomy, such as the complexity of the software stacks, the variability of the hardware and the reliance on data-driven techniques, amongst others. In this paper, we describe a new concept for reproducible robotics research that integrates development and benchmarking, so that reproducibility is obtained "by design" from the beginning of the research/development processes. We first provide the overall conceptual objectives to achieve this goal and then a concrete instance that we have built: the DUCKIENet. One of the central components of this setup is the Duckietown Autolab, a remotely accessible standardized setup that is itself also relatively low-cost and reproducible. When evaluating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Modular Robots and Swarm Intelligence · IoT and Edge/Fog Computing
