Loading paper
Attention Meets Reachability: Structural Equivalence and Efficiency in Grammar-Constrained LLM Decoding | Tomesphere