Amphista: Bi-directional Multi-head Decoding for Accelerating LLM Inference