Accelerating the inference of string generation-based chemical reaction   models for industrial applications

Mikhail Andronov; Natalia Andronova; Michael Wand; J\"urgen; Schmidhuber; Djork-Arn\'e Clevert

arXiv:2407.09685·cs.LG·July 18, 2024·1 cites

Accelerating the inference of string generation-based chemical reaction models for industrial applications

Mikhail Andronov, Natalia Andronova, Michael Wand, J\"urgen, Schmidhuber, Djork-Arn\'e Clevert

PDF

Open Access

TL;DR

This paper introduces a speculative decoding method to accelerate autoregressive SMILES-based chemical reaction models, achieving over three times faster inference without sacrificing accuracy, thereby enhancing industrial applicability.

Contribution

The authors propose a novel speculative decoding approach that significantly speeds up SMILES-to-SMILES translation models for chemical reactions, maintaining high accuracy.

Findings

01

Over 3X faster inference in reaction prediction

02

No loss in model accuracy

03

Applicable to molecular transformer models

Abstract

Template-free SMILES-to-SMILES translation models for reaction prediction and single-step retrosynthesis are of interest for industrial applications in computer-aided synthesis planning systems due to their state-of-the-art accuracy. However, they suffer from slow inference speed. We present a method to accelerate inference in autoregressive SMILES generators through speculative decoding by copying query string subsequences into target strings in the right places. We apply our method to the molecular transformer implemented in Pytorch Lightning and achieve over 3X faster inference in reaction prediction and single-step retrosynthesis, with no loss in accuracy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDNA and Biological Computing · Machine Learning in Materials Science · Advanced Text Analysis Techniques