Loading paper
PHOTON: Hierarchical Autoregressive Modeling for Lightspeed and Memory-Efficient Language Generation | Tomesphere