Loading paper
FalconGEMM: Surpassing Hardware Peaks with Lower-Complexity Matrix Multiplication | Tomesphere