Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge
Antonia Ebner, Christoph Bartmann, Sonja Topf, Sohvi Luukkonen, Johannes Schimunek, G\"unter Klambauer

TL;DR
This paper introduces a reproducible leaderboard for the Tox21 challenge dataset to objectively assess progress in AI-based toxicity prediction over the past decade.
Contribution
It provides a standardized, accessible benchmark with original data and baseline models, enabling consistent comparison of methods in drug toxicity prediction.
Findings
Original Tox21 winner, DeepTox, remains competitive.
Self-normalizing neural networks from 2017 still perform well.
No clear evidence of substantial progress over the past decade.
Abstract
Deep learning's rise since the early 2010s has transformed fields like computer vision and natural language processing and strongly influenced biomedical research. For drug discovery specifically, a key inflection - akin to vision's "ImageNet moment" - arrived in 2015, when deep neural networks surpassed traditional approaches on the Tox21 Data Challenge. This milestone accelerated the adoption of deep learning across the pharmaceutical industry, and today most major companies have integrated these methods into their research pipelines. After the Tox21 Challenge concluded, its dataset was included in several established benchmarks, such as MoleculeNet and the Open Graph Benchmark. However, during these integrations, the dataset was altered and labels were imputed or manufactured, resulting in a loss of comparability across studies. Consequently, the extent to which bioactivity and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Cell Image Analysis Techniques · Biomedical Text Mining and Ontologies
