Loading paper
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding | Tomesphere