Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference