# Machine Learning Modeling for ABC Transporter Efflux and Inhibition: Data Curation, Model Development, and New Compound Interaction Predictions

**Authors:** Nada J. Daood, Sean R. Carey, Elena Chung, Tong Wang, Anna Kreutz, Mounika Girireddy, Suman Chakravarti, Nicole C. Kleinstreuer, Jacqueline B. Tiley, Lauren M. Aleksunes, Hao Zhu

PMC · DOI: 10.1021/acs.molpharmaceut.5c01065 · Molecular Pharmaceutics · 2025-10-20

## TL;DR

This study curates a large dataset of ABC transporter interactions and develops machine learning models to predict substrate binding and inhibition, which can help estimate drug exposure in the brain.

## Contribution

The study presents a large, manually curated dataset and high-performance machine learning models for predicting ABC transporter interactions.

## Key findings

- Eight datasets were created with around 8800 unique chemicals related to four ABC transporters.
- Machine learning models achieved average CCRs of 0.764 for substrate binding and 0.839 for inhibition.
- Compounds predicted as P-gp and BCRP substrates were more likely to have low brain exposure.

## Abstract

In recent years, multiple computational studies have
used machine
learning models to predict substrate binding and inhibition of ATP-binding
cassette (ABC) transporters. However, many of these studies relied
on relatively small training sets with limited applicability. In this
study, we manually curated over 24,000 bioactivity records (i.e.,
inhibition, binding affinity, permeability) for the ABC transporters
P-gp, BCRP, MRP1, and MRP2 from more than 900 literature sources in
ChEMBL, with additional data from PubChem and Metrabase. This effort
yielded eight data sets, comprising around 8800 unique chemicals with
one or more substrate binding or inhibition activities for these four
efflux transporters. Quantitative structure–activity relationship
(QSAR) models were developed for each of the eight data sets using
combinations of four machine learning algorithms and three sets of
chemical descriptors. The resulting models demonstrated excellent
performance by 5-fold cross-validation, achieving an average correct
classification rate (CCR) of 0.764 for the substrate binding models
and 0.839 for the inhibition models. Models were validated with additional
compounds from DrugBank that were known substrates or inhibitors.
We further analyzed how model predictions for efflux transporter activity
could estimate exposure of the brain to xenobiotics. Notably, compounds
predicted as P-gp and BCRP substrates were twice or more likely to
have low brain exposure compared to compounds with high brain exposure.
This study provides a large and curated drug transporter binding and
inhibition database for computational modeling. Applicable models
based on this large database for predicting transporter substrate
binding and inhibition can be used to evaluate more complex drug bioactivities,
such as exposure of protected tissues to chemicals.

## Linked entities

- **Proteins:** PGP (phosphoglycolate phosphatase), ABCG2 (ATP binding cassette subfamily G member 2 (JR blood group)), CD9 (CD9 molecule), ABCC2 (ATP binding cassette subfamily C member 2)

## Full-text entities

- **Genes:** ABCC1 (ATP binding cassette subfamily C member 1 (ABCC1 blood group)) [NCBI Gene 4363] {aka ABC29, ABCC, DFNA77, GS-X, MRP, MRP1}, PGP (phosphoglycolate phosphatase) [NCBI Gene 283871] {aka AUM, G3PP, PGPase}, ABCC2 (ATP binding cassette subfamily C member 2) [NCBI Gene 1244] {aka ABC30, CMOAT, DJS, MRP2, cMRP}, ABCG2 (ATP binding cassette subfamily G member 2 (JR blood group)) [NCBI Gene 9429] {aka ABC15, ABCP, BCRP, BMDP, CD338, CDw338}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12587445/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12587445/full.md

## References

97 references — full list in the complete paper: https://tomesphere.com/paper/PMC12587445/full.md

---
Source: https://tomesphere.com/paper/PMC12587445