WideDTA: prediction of drug-target binding affinity
Hakime \"Ozt\"urk, Elif Ozkirimli, Arzucan \"Ozg\"ur

TL;DR
WideDTA is a deep learning model that predicts drug-target binding affinity using textual sequence data, outperforming previous models and highlighting the importance of protein motifs and domains.
Contribution
The paper introduces WideDTA, a novel text-based deep learning model that improves binding affinity prediction by utilizing multiple sequence information sources.
Findings
WideDTA outperforms DeepDTA on the KIBA dataset.
Protein motifs and domains alone can achieve similar performance to full sequences.
Word-based sequence representation is a promising alternative to character-based methods.
Abstract
Motivation: Prediction of the interaction affinity between proteins and compounds is a major challenge in the drug discovery process. WideDTA is a deep-learning based prediction model that employs chemical and biological textual sequence information to predict binding affinity. Results: WideDTA uses four text-based information sources, namely the protein sequence, ligand SMILES, protein domains and motifs, and maximum common substructure words to predict binding affinity. WideDTA outperformed one of the state of the art deep learning methods for drug-target binding affinity prediction, DeepDTA on the KIBA dataset with a statistical significance. This indicates that the word-based sequence representation adapted by WideDTA is a promising alternative to the character-based sequence representation approach in deep learning models for binding affinity prediction, such as the one used in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Bioinformatics
