PAIR-Former: Budgeted Relational Multi-Instance Learning for Functional miRNA Target Prediction

Jiaqi Yin; Baiming Chen; Jia Fei; Mingjun Yang

arXiv:2602.00465 · cs.LG · May 11, 2026

TL;DR

PAIR-Former introduces a budgeted relational multi-instance learning framework for miRNA target prediction, balancing relational modeling accuracy with computational efficiency.

Contribution

The paper formalizes Budgeted Relational Multi-Instance Learning (BR-MIL), establishes theoretical foundations for it, and proposes PAIR-Former, a scalable method that outperforms baselines on biological and non-biological datasets.

Findings

1. Achieves state-of-the-art F1 scores on the miRAW and deepTargetPro datasets.

2. Scales to large datasets with 420K pairs, outperforming naive approaches.

3. Demonstrates applicability of BR-MIL beyond biological sequence modeling.

Abstract

Functional miRNA–mRNA targeting is a large-bag prediction problem where each transcript yields a heavy-tailed pool of candidate target sites (CTSs), yet only a pair-level label is observed. Prior methods use max-pooling over individual CTS scores, ignoring relational patterns among sites, but modeling these patterns is critical for accuracy. The challenge is that naive relational aggregation incurs $O(n^{2})$ cost, prohibitive when $n$ reaches thousands, yet a cheap scan alone discards the very interactions that drive functional repression. We formalize this tension as Budgeted Relational Multi-Instance Learning (BR-MIL), a new MIL problem where the compute budget $K$ is a first-class constraint such that at most $K$ instances per bag may receive expensive encoding and relational processing. We establish theoretical foundations for BR-MIL, proving that both approximation…
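The budget constraint described in the abstract can be illustrated with a minimal sketch. This is not PAIR-Former's architecture, which the abstract does not specify. It is a toy pipeline, under stated assumptions, with the same shape: a cheap per-site scan over all $n$ candidate sites, selection of at most $K$ of them, and a quadratic-cost relational step restricted to that subset. Every name here (`select_top_k`, `relational_score`, `budgeted_bag_score`, `encode`) is hypothetical, top-K selection by cheap score is an assumed selection rule, and the softmax self-attention is a stand-in for whatever relational module the paper actually uses.

```python
import math
import random


def max_pool_score(cheap_scores):
    """Baseline named in the abstract: the bag score is the best single-site score."""
    return max(cheap_scores)


def select_top_k(cheap_scores, k):
    """Indices of the k highest cheap scores (assumed selection rule)."""
    order = sorted(range(len(cheap_scores)),
                   key=lambda i: cheap_scores[i], reverse=True)
    return order[:k]


def relational_score(features):
    """Toy relational aggregation: softmax self-attention, then mean pooling.

    The cost is quadratic in len(features), so it is only affordable
    on the budgeted subset, not on the full bag.
    """
    dim = len(features[0])
    pooled = []
    for query in features:
        logits = [sum(a * b for a, b in zip(query, key)) for key in features]
        top = max(logits)
        weights = [math.exp(l - top) for l in logits]
        total = sum(weights)
        attended = [sum(w * f[d] for w, f in zip(weights, features)) / total
                    for d in range(dim)]
        pooled.append(sum(attended) / dim)
    return sum(pooled) / len(pooled)


def budgeted_bag_score(cheap_scores, expensive_encode, k):
    """Scan all sites cheaply, then encode and relate at most k of them."""
    selected = select_top_k(cheap_scores, k)
    features = [expensive_encode(i) for i in selected]
    return relational_score(features), selected


# Hypothetical bag: n candidate sites with random cheap scores.
rng = random.Random(0)
n, k = 2000, 16
cheap = [rng.random() for _ in range(n)]
encoder_calls = []


def encode(i):
    """Stand-in for an expensive encoder; records how often it is called."""
    encoder_calls.append(i)
    return [cheap[i], cheap[i] ** 2]


score, selected = budgeted_bag_score(cheap, encode, k)
```

In this sketch the expensive encoder runs $K$ times and the pairwise step touches $K^{2}$ pairs, compared with $n^{2}$ pairs for relational aggregation over the whole bag. The sketch assumes a non-empty bag and $K \ge 1$.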
