Loading paper
Optimized Multi-Token Joint Decoding with Auxiliary Model for LLM Inference | Tomesphere