Loading paper
Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters | Tomesphere