Budgeted Attention Allocation: Cost-Conditioned Compute Control for Efficient Transformers