Loading paper
On multi-token prediction for efficient LLM inference | Tomesphere