ControllableGPT: A Ground-Up Designed Controllable GPT for Molecule Optimization
Xuefeng Liu, Songhao Jiang, Bo Li, Rick Stevens

TL;DR
ControllableGPT is a novel unified language model designed for precise, controllable molecule optimization, integrating multiple training approaches to improve drug design tasks with demonstrated superior performance.
Contribution
It introduces a ground-up design combining MLM, CLM, and seq2seq into a controllable GPT framework tailored for drug optimization.
Findings
Outperforms baseline models on viral and cancer drug optimization benchmarks.
Enables precise control over sequence expansion, reduction, and mutation.
Demonstrates effective controllability and optimization in molecular tasks.
Abstract
Large Language Models (LLMs) employ three popular training approaches: Masked Language Models (MLM), Causal Language Models (CLM), and Sequence-to-Sequence Models (seq2seq). However, each approach has its strengths and limitations, and faces challenges in addressing specific tasks that require controllable and bidirectional generation, such as drug optimization. To address this challenge, inspired by the biological processes of growth and evolution, which involve the expansion, shrinking, and mutation of sequences, we introduce ControllableGPT. This initiative represents the first effort to combine the advantages of MLM, CLM, and seq2seq into a single unified, controllable GPT framework. It enables the precise management of specific locations and ranges within a sequence, allowing for expansion, reduction, or mutation over chosen or random lengths, while maintaining the integrity of any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Tanh Activation · Multi-Head Attention · Adam · Sigmoid Activation · Softmax · Dropout
