WideDTA: prediction of drug-target binding affinity

Hakime \"Ozt\"urk; Elif Ozkirimli; Arzucan \"Ozg\"ur

arXiv:1902.04166·q-bio.QM·February 13, 2019·154 cites

WideDTA: prediction of drug-target binding affinity

Hakime \"Ozt\"urk, Elif Ozkirimli, Arzucan \"Ozg\"ur

PDF

Open Access 1 Repo

TL;DR

WideDTA is a deep learning model that predicts drug-target binding affinity using textual sequence data, outperforming previous models and highlighting the importance of protein motifs and domains.

Contribution

The paper introduces WideDTA, a novel text-based deep learning model that improves binding affinity prediction by utilizing multiple sequence information sources.

Findings

01

WideDTA outperforms DeepDTA on the KIBA dataset.

02

Protein motifs and domains alone can achieve similar performance to full sequences.

03

Word-based sequence representation is a promising alternative to character-based methods.

Abstract

Motivation: Prediction of the interaction affinity between proteins and compounds is a major challenge in the drug discovery process. WideDTA is a deep-learning based prediction model that employs chemical and biological textual sequence information to predict binding affinity. Results: WideDTA uses four text-based information sources, namely the protein sequence, ligand SMILES, protein domains and motifs, and maximum common substructure words to predict binding affinity. WideDTA outperformed one of the state of the art deep learning methods for drug-target binding affinity prediction, DeepDTA on the KIBA dataset with a statistical significance. This indicates that the word-based sequence representation adapted by WideDTA is a promising alternative to the character-based sequence representation approach in deep learning models for binding affinity prediction, such as the one used in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkmztrk/DeepDTA
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Protein Structure and Dynamics · Machine Learning in Bioinformatics