Reproducibility, energy efficiency and performance of pseudorandom number generators in machine learning: a comparative study of python, numpy, tensorflow, and pytorch implementations
Benjamin Antunes, David R.C Hill

TL;DR
This study compares the statistical quality, reproducibility, energy efficiency, and performance of PRNGs across Python, NumPy, TensorFlow, and PyTorch, revealing close performance to C implementations but challenges in reproducibility.
Contribution
It provides a comprehensive comparison of PRNG implementations in popular ML frameworks, highlighting their efficiency and reproducibility issues relative to C standards.
Findings
ML implementations match C in speed, sometimes outperforming.
Energy consumption in ML frameworks is only slightly higher than C.
Statistical quality is comparable, but numerical reproducibility remains challenging.
Abstract
Pseudo-Random Number Generators (PRNGs) have become ubiquitous in machine learning technologies because they are interesting for numerous methods. The field of machine learning holds the potential for substantial advancements across various domains, as exemplified by recent breakthroughs in Large Language Models (LLMs). However, despite the growing interest, persistent concerns include issues related to reproducibility and energy consumption. Reproducibility is crucial for robust scientific inquiry and explainability, while energy efficiency underscores the imperative to conserve finite global resources. This study delves into the investigation of whether the leading Pseudo-Random Number Generators (PRNGs) employed in machine learning languages, libraries, and frameworks uphold statistical quality and numerical reproducibility when compared to the original C implementation of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
MethodsSeventeen Ways to Call Uphold Helpline Full Guide USA 24 Hour Assistance
