# OPLE: Drug Discovery Platform Combining 2D Similarity with AI to Predict Off-Target Liabilities

**Authors:** Sarah E. Biehn, Juerg Lehmann, Christoph Mueller, Fabien Tillier, Carleton R. Sage

PMC · DOI: 10.3390/ph19020228 · 2026-01-28

## TL;DR

OPLE is a drug discovery platform that uses 2D molecular similarity and AI to predict off-target effects, helping identify problematic drug candidates early.

## Contribution

OPLE combines 2D similarity and machine learning to predict off-target liabilities, improving early-stage drug screening.

## Key findings

- OPLE models achieved over 80% recall for predicting active molecules against SafetyScreen targets.
- Combining similarity and ML predictions using belief theory improved activity prediction accuracy.
- OPLE helps reduce false negatives, saving resources in drug discovery.

## Abstract

Background/Objectives: An impediment to successful drug discovery is the potential for off-target liabilities to eliminate otherwise promising candidates. As the drug discovery process is time-consuming and expensive, the use of artificial intelligence (AI) methods such as machine learning (ML) has drastically increased. It is invaluable to generate models that can quickly differentiate between successful and unsuccessful small-molecule drug candidates. Previous efforts established that molecular similarity could be used with other metrics to inform predictions of potential activity against a protein target. Similar methods were pursued here to combine similarity and machine learning for a collection of models called OPLE. Methods: Models were trained with proprietary and publicly available data to predict the likelihood of a given compound to be active against targets present in existing experimental SafetyScreen panels 18 and 44. Two-dimensional (2D) Tanimoto similarity from extended-connectivity fingerprints (ECFPs) and trained ML models were combined to obtain predictions. Results: Using all training data, a relationship between similarity and activity was established by fitting a probability assignment curve. Calibrated ML label assignment likelihoods were joined with the predictions from ECFP Tanimoto similarity to known active compounds using the belief theory formula, which maintains that activity prediction increases when both pieces of evidence support it. When assessing the performance of OPLE models for SafetyScreen 18 and 44 targets with external data from ChEMBL, more than 80% of the models had recall values greater than 0.8. This indicated favorable predictive ability to identify active molecules while limiting false negative predictions. Conclusions: Predicting and experimentally verifying safety liabilities is insightful at every stage of small-molecule drug discovery. This early detection tool can help project teams save resources that could be better deployed on series with no predicted or measured off-target liabilities.

## Full-text entities

- **Genes:** KCNH2 (potassium voltage-gated channel subfamily H member 2) [NCBI Gene 3757] {aka ERG-1, ERG1, H-ERG, HERG, HERG1, Kv11.1}
- **Diseases:** heart failure (MESH:D006333), cardiac valvulopathy (MESH:D006331), OPLE (MESH:C563602), pulmonary arterial hypertension (MESH:D000081029), weight loss (MESH:D015431), injury to (MESH:D014947)
- **Chemicals:** hydrogen (MESH:D006859), fenfluramine (MESH:D005277), BECFP (-), EMERALD (MESH:C550088)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** OPLE — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_B2J8)

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943774/full.md

---
Source: https://tomesphere.com/paper/PMC12943774