Generative Chemical Transformer: Neural Machine Learning of Molecular Geometric Structures from Chemical Language via Attention
Hyunseung Kim, Jonggeol Na, Won Bo Lee

TL;DR
The paper introduces a neural network model called Generative Chemical Transformer (GCT) that leverages attention mechanisms to generate realistic molecular structures from chemical language, aiding inverse molecular design.
Contribution
It presents a novel attention-based neural network that understands chemical language deeply, enabling the generation of molecules satisfying multiple properties for material discovery.
Findings
GCT generates chemically valid and linguistically consistent molecular strings.
Generated molecules meet multiple target properties simultaneously.
Attention mechanism improves understanding of molecular structures beyond chemical language limitations.
Abstract
Discovering new materials better suited to specific purposes is an important issue in improving the quality of human life. Here, a neural network that creates molecules that meet some desired conditions based on a deep understanding of chemical language is proposed (Generative Chemical Transformer, GCT). The attention mechanism in GCT allows a deeper understanding of molecular structures beyond the limitations of chemical language itself which cause semantic discontinuity by paying attention to characters sparsely. It is investigated that the significance of language models for inverse molecular design problems by quantitatively evaluating the quality of the generated molecules. GCT generates highly realistic chemical strings that satisfy both chemical and linguistic grammar rules. Molecules parsed from generated strings simultaneously satisfy the multiple target properties and vary for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Graph Neural Networks
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Adam · Dropout · Label Smoothing · Multi-Head Attention
