Graph Data Modeling: Molecules, Proteins, & Chemical Processes
Jos\'e Manuel Barraza-Chavez, Rana A. Barghout, Ricardo Almada-Monter, Adrian Jinich, Radhakrishnan Mahadevan, Benjamin Sanchez-Lengeling

TL;DR
This paper introduces graph data modeling in chemistry, explaining how graphs represent molecules and proteins, and discusses the application of graph neural networks for chemical discovery.
Contribution
It provides a comprehensive overview of graph design, prediction tasks, and machine learning applications in chemical sciences, bridging chemistry and graph-based machine learning.
Findings
Graphs effectively model chemical structures and processes.
Graph neural networks enhance prediction accuracy in chemical tasks.
The paper serves as a primer for applying graph methods in chemical discovery.
Abstract
Graphs are central to the chemical sciences, providing a natural language to describe molecules, proteins, reactions, and industrial processes. They capture interactions and structures that underpin materials, biology, and medicine. This primer, Graph Data Modeling: Molecules, Proteins, & Chemical Processes, introduces graphs as mathematical objects in chemistry and shows how learning algorithms (particularly graph neural networks) can operate on them. We outline the foundations of graph design, key prediction tasks, representative examples across chemical sciences, and the role of machine learning in graph-based modeling. Together, these concepts prepare readers to apply graph methods to the next generation of chemical discovery.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
