Bottlenecked Transformers: Periodic KV Cache Consolidation for Generalised Reasoning