Beyond neural scaling laws: beating power law scaling via data pruning