OpenTensor: Reproducing Faster Matrix Multiplication Discovering   Algorithms

Yiwen Sun; Wenye Li

arXiv:2405.20748·cs.AI·June 4, 2024

OpenTensor: Reproducing Faster Matrix Multiplication Discovering Algorithms

Yiwen Sun, Wenye Li

PDF

Open Access 1 Repo

TL;DR

OpenTensor is a framework that reproduces and improves upon AlphaTensor's matrix multiplication algorithms using Deep Reinforcement Learning, making the process more transparent and accessible.

Contribution

We provide a cleaned-up, clarified, and improved implementation of AlphaTensor, enabling easier reproduction and further development of efficient matrix multiplication algorithms.

Findings

01

OpenTensor successfully reproduces AlphaTensor's algorithms.

02

It discovers new efficient matrix multiplication algorithms.

03

The framework improves reproducibility and understanding of the original methods.

Abstract

OpenTensor is a reproduction of AlphaTensor, which discovered a new algorithm that outperforms the state-of-the-art methods for matrix multiplication by Deep Reinforcement Learning (DRL). While AlphaTensor provides a promising framework for solving scientific problems, it is really hard to reproduce due to the massive tricks and lack of source codes. In this paper, we clean up the algorithm pipeline, clarify the technical details, and make some improvements to the training process. Computational results show that OpenTensor can successfully find efficient matrix multiplication algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yiwenai/opentensor
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Computational Physics and Python Applications