Loading paper
Automated Interpretability Metrics Do Not Distinguish Trained and Random Transformers | Tomesphere