Balancing Coverage and Draft Latency in Vocabulary Trimming for Faster Speculative Decoding