Loading paper
QMC: Efficient SLM Edge Inference via Outlier-Aware Quantization and Emergent Memories Co-Design | Tomesphere