Benchmarking the Discovery Engine

Jack Foxabbott; Arush Tagade; Andrew Cusick; Robbie McCorkell; Leo McKee-Reid; Jugal Patel; Jamie Rumbelow; Jessica Rumbelow; Zohreh Shams

arXiv:2507.00964·cs.LG·July 2, 2025

Benchmarking the Discovery Engine

Jack Foxabbott, Arush Tagade, Andrew Cusick, Robbie McCorkell, Leo McKee-Reid, Jugal Patel, Jamie Rumbelow, Jessica Rumbelow, Zohreh Shams

PDF

Open Access

TL;DR

This paper introduces the Discovery Engine, an automated system that combines machine learning and interpretability to enhance scientific discovery across various fields, outperforming previous methods in predictive accuracy and insight generation.

Contribution

The paper presents the Discovery Engine as a new benchmark tool that improves predictive performance and interpretability in scientific data analysis across multiple disciplines.

Findings

01

Matches or exceeds prior predictive performance

02

Provides deeper, more actionable insights

03

Demonstrates potential as a new standard for scientific modeling

Abstract

The Discovery Engine is a general purpose automated system for scientific discovery, which combines machine learning with state-of-the-art ML interpretability to enable rapid and robust scientific insight across diverse datasets. In this paper, we benchmark the Discovery Engine against five recent peer-reviewed scientific publications applying machine learning across medicine, materials science, social science, and environmental science. In each case, the Discovery Engine matches or exceeds prior predictive performance while also generating deeper, more actionable insights through rich interpretability artefacts. These results demonstrate its potential as a new standard for automated, interpretable scientific modelling that enables complex knowledge discovery from data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Scientific Computing and Data Management