Powerful, transferable representations for molecules through intelligent task selection in deep multitask networks
Clyde Fare, Lukas Turcani, Edward O. Pyzer-Knapp

TL;DR
This paper introduces a method combining multi-task and transfer learning with task similarity measures to generate powerful, transferable molecular representations that are less biased, cost-effective, and effective in low-data scenarios.
Contribution
It proposes a task selection strategy based on pairwise task affinity to improve deep multitask learning for molecular representations.
Findings
Deep representations outperform traditional cheminformatics fingerprints.
Task similarity-based selection reduces bias and improves transferability.
Method is effective on real-world, low-data datasets.
Abstract
Chemical representations derived from deep learning are emerging as a powerful tool in areas such as drug discovery and materials innovation. Currently, this methodology has three major limitations - the cost of representation generation, risk of inherited bias, and the requirement for large amounts of data. We propose the use of multi-task learning in tandem with transfer learning to address these limitations directly. In order to avoid introducing unknown bias into multi-task learning through the task selection itself, we calculate task similarity through pairwise task affinity, and use this measure to programmatically select tasks. We test this methodology on several real-world data sets to demonstrate its potential for execution in complex and low-data environments. Finally, we utilise the task similarity to further probe the expressiveness of the learned representation through a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
