Loading paper
Chasing the Public Score: User Pressure and Evaluation Exploitation in Coding Agent Workflows | Tomesphere