Loading paper
FlexDraft: Flexible Speculative Decoding via Attention Tuning and Bonus-Guided Calibration | Tomesphere