When Does Content-Based Routing Work? Representation Requirements for Selective Attention in Hybrid Sequence Models
Abhinaba Basu

TL;DR
This paper investigates the conditions under which content-based routing in hybrid sequence models is effective, emphasizing the necessity of pairwise token comparison and bidirectional context for high-precision routing.
Contribution
The study systematically maps the landscape of routing mechanisms, demonstrating that high-precision routing requires pairwise token comparison and bidirectional representations, and introduces efficient methods achieving near-perfect routing.
Findings
High-precision routing relies on pairwise token comparison.
Bidirectional context significantly improves routing accuracy.
Efficient methods like bidirectional Mamba with rank-1 projection achieve 99.7% routing accuracy.
Abstract
We identify a routing paradox in hybrid sequence models: content-based routing - deciding which tokens deserve expensive attention - requires pairwise computation, and this requirement is inescapable. Through 20+ controlled experiments across three tasks, multiple scales (200K to 1.4B parameters), and 15+ routing mechanisms, we map the routing landscape exhaustively. Every system that achieves high routing precision does so through pairwise token comparison. Every mechanism that avoids pairwise computation fails: recurrent models (Mamba-1.4B: 29%), memory banks (12%), bandits (0.7-3.6%), contrastive pretraining (1.6%), and 12 other approaches all cluster at 1-29%. Routing needs two ingredients: (1) per-token representations with bidirectional context and (2) pairwise token comparison. Bidirectional Mamba (O(n)) + pairwise comparison achieves 99.5%; replacing the full pairwise router…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
