Cambricon-LLM: A Chiplet-Based Hybrid Architecture for On-Device Inference of 70B LLM