FLOP-Efficient Training: Early Stopping Based on Test-Time Compute Awareness