Loading paper
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding | Tomesphere