Bridging Text and Molecule: A Survey on Multimodal Frameworks for Molecule
Yi Xiao, Xiangxin Zhou, Qiang Liu, Liang Wang

TL;DR
This survey reviews recent multimodal frameworks combining text and molecular data, emphasizing their architectures, pre-training tasks, and applications in drug discovery, highlighting current challenges and future directions.
Contribution
It is the first comprehensive survey on multimodal models for molecules, covering architectures, pre-training, large language models, and applications in drug discovery.
Findings
Categorizes current text-molecule alignment models
Highlights the use of large language models in molecular tasks
Discusses future research directions and limitations
Abstract
Artificial intelligence has demonstrated immense potential in scientific research. Within molecular science, it is revolutionizing the traditional computer-aided paradigm, ushering in a new era of deep learning. With recent progress in multimodal learning and natural language processing, an emerging trend has targeted at building multimodal frameworks to jointly model molecules with textual domain knowledge. In this paper, we present the first systematic survey on multimodal frameworks for molecules research. Specifically,we begin with the development of molecular deep learning and point out the necessity to involve textual modality. Next, we focus on recent advances in text-molecule alignment methods, categorizing current models into two groups based on their architectures and listing relevant pre-training tasks. Furthermore, we delves into the utilization of large language models and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
MethodsFocus
