Genetic programming-based learning of carbon interatomic potential for materials discovery
Andrew Eldridge, Alejandro Rodriguez, Ming Hu, Jianjun Hu

TL;DR
This paper introduces a novel genetic programming approach to automatically discover interatomic potential functions for carbon materials, achieving accurate energy predictions with low computational cost.
Contribution
It presents a new strongly typed parallel genetic programming method combined with multi-objective optimization for potential function discovery, improving automation and accuracy.
Findings
Predicted energies within ±7.70 eV of DFT data
Generalized well across multiple carbon structures
Open-source code available for community use
Abstract
Efficient and accurate interatomic potential functions are critical to computational study of materials while searching for structures with desired properties. Traditionally, potential functions or energy landscapes are designed by experts based on theoretical or heuristic knowledge. Here, we propose a new approach to leverage strongly typed parallel genetic programming (GP) for potential function discovery. We use a multi-objective evolutionary algorithm with NSGA-III selection to optimize individual age, fitness, and complexity through symbolic regression. With a DFT dataset of 863 unique carbon allotrope configurations drawn from 858 carbon structures, the generated potentials are able to predict total energies within eV at low computational cost while generalizing well across multiple carbon structures. Our code is open source and available at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Protein Structure and Dynamics
