When does deep learning fail and how to tackle it? A critical analysis on polymer sequence-property surrogate models
Himanshu, Tarak K Patra

TL;DR
This paper critically analyzes the limitations of deep learning in polymer property prediction and introduces new algorithms for optimal model selection and minimal data utilization, enhancing the development of universal polymer surrogate models.
Contribution
It proposes a layer-by-layer expansion method for neural network topology selection and a pipeline to map polymer sequences to a continuous latent space for minimal data use.
Findings
Layer expansion helps identify optimal neural network structures.
Mapping sequences to latent space reduces data requirements.
Strategies are demonstrated on three polymer property prediction cases.
Abstract
Deep learning models are gaining popularity and potency in predicting polymer properties. These models can be built using pre-existing data and are useful for the rapid prediction of polymer properties. However, the performance of a deep learning model is intricately connected to its topology and the volume of training data. There is no facile protocol available to select a deep learning architecture, and there is a lack of a large volume of homogeneous sequence-property data of polymers. These two factors are the primary bottleneck for the efficient development of deep learning models. Here we assess the severity of these factors and propose new algorithms to address them. We show that a linear layer-by-layer expansion of a neural network can help in identifying the best neural network topology for a given problem. Moreover, we map the discrete sequence space of a polymer to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Advanced Polymer Synthesis and Characterization
