Force-Aware Neural Tangent Kernels for Scalable and Robust Active Learning of MLIPs
Eszter Varga-Umbrich, Zachary Weller-Davies, Paul Duckworth, Jules Tilly, Olivier Peltre, Shikha Surana

TL;DR
This paper introduces a scalable, force-aware neural tangent kernel framework for active learning of MLIPs, enabling efficient screening of large candidate pools and robustness to biased distributions.
Contribution
It develops a novel force-aware NTK and a scalable acquisition framework that significantly improves active learning efficiency and robustness for MLIPs.
Findings
Achieves screening of ~200k structures within hours.
Force NTK yields lowest MAE and RMSE on OC20 dataset.
Remains robust under candidate-pool shift scenarios.
Abstract
Active learning for machine-learning interatomic potentials (MLIPs) must address several challenges to be practical: scaling to large candidate pools, leveraging energy-force supervision, and maintaining robustness when candidate pools are biased relative to the target distribution. In this work, we jointly address these challenges. We first introduce a linearly scaling acquisition framework based on chunked feature-space posterior-variance shortlisting. By avoiding materialisation of the candidate and train set kernels, this approach enables screening of ~200k structures within hours and applies broadly to acquisition strategies that score candidates based on molecular similarity metrics. We then extend the Neural Tangent Kernel (NTK) to a force-aware setting via mixed parameter-coordinate derivatives, yielding a force NTK and a joint energy-force NTK that provide natural similarity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
