Sparse or Dense? A Mechanistic Estimation of Computation Density in Transformer-based LLMs