Loading paper
Fast On-device LLM Inference with NPUs | Tomesphere