All You Need Is Synthetic Task Augmentation
Guillaume Godin

TL;DR
This paper introduces a synthetic task augmentation method that jointly trains a Graph Transformer on real and synthetic tasks, significantly improving molecular property prediction performance without pretraining.
Contribution
The study presents a novel approach of using synthetic tasks derived from XGBoost models to enhance neural network training in molecular property prediction.
Findings
Consistent performance improvements across 19 tasks.
Outperforms single-task XGBoost in 16 of 19 cases.
Effective without feature injection or extensive pretraining.
Abstract
Injecting rule-based models like Random Forests into differentiable neural network frameworks remains an open challenge in machine learning. Recent advancements have demonstrated that pretrained models can generate efficient molecular embeddings. However, these approaches often require extensive pretraining and additional techniques, such as incorporating posterior probabilities, to boost performance. In our study, we propose a novel strategy that jointly trains a single Graph Transformer neural network on both sparse multitask molecular property experimental targets and synthetic targets derived from XGBoost models trained on Osmordred molecular descriptors. These synthetic tasks serve as independent auxiliary tasks. Our results show consistent and significant performance improvement across all 19 molecular property prediction tasks. For 16 out of 19 targets, the multitask Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTeleoperation and Haptic Systems
MethodsAttention Is All You Need · Laplacian EigenMap · Linear Layer · Laplacian Positional Encodings · Multi-Head Attention · Dense Connections · Graph Transformer · Dropout · Layer Normalization · Byte Pair Encoding
