Loading paper
Efficiency Unleashed: Inference Acceleration for LLM-based Recommender Systems with Speculative Decoding | Tomesphere