Loading paper
EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization | Tomesphere