Proposing Novel Extrapolative Compounds by Nested Variational Autoencoders
Yoshihiro Osakabe, Akinori Asahara

TL;DR
This paper introduces a nested variational autoencoder framework for materials informatics that efficiently generates high-performance compounds using limited experimental data, significantly reducing experimental efforts.
Contribution
It proposes a novel nested VAE model with a specialized loss function to improve compound generation beyond training data ranges with limited data.
Findings
Effective in generating high-performance compounds
Reduces experimental efforts to one-quarter of conventional methods
Validates approach with chemical industry partner
Abstract
Materials informatics (MI), which uses artificial intelligence and data analysis techniques to improve the efficiency of materials development, is attracting increasing interest from industry. One of its main applications is the rapid development of new high-performance compounds. Recently, several deep generative models have been proposed to suggest candidate compounds that are expected to satisfy the desired performance. However, they usually have the problem of requiring a large amount of experimental datasets for training to achieve sufficient accuracy. In actual cases, it is often possible to accumulate only about 1000 experimental data at most. Therefore, the authors proposed a deep generative model with nested two variational autoencoders (VAEs). The outer VAE learns the structural features of compounds using large-scale public data, while the inner VAE learns the relationship…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods
MethodsTest
