Loading paper
Computation vs. Communication Scaling for Future Transformers on Future Hardware | Tomesphere