Loading paper
Sharp feature-learning transitions and Bayes-optimal neural scaling laws in extensive-width networks | Tomesphere