PAIR-Former: Budgeted Relational Multi-Instance Learning for Functional miRNA Target Prediction
Jiaqi Yin, Baiming Chen, Jia Fei, Mingjun Yang

TL;DR
PAIR-Former introduces a budgeted relational multi-instance learning framework for miRNA target prediction, balancing relational modeling accuracy with computational efficiency.
Contribution
It formalizes BR-MIL, providing theoretical insights and proposing PAIR-Former, a scalable method that outperforms baselines on biological and non-biological datasets.
Findings
Achieves state-of-the-art F1 scores on miRAW and deepTargetPro datasets.
Scales to large datasets with 420K pairs, outperforming naive approaches.
Demonstrates applicability of BR-MIL beyond biological sequence modeling.
Abstract
Functional miRNA--mRNA targeting is a large-bag prediction problem where each transcript yields a heavy-tailed pool of candidate target sites (CTSs), yet only a pair-level label is observed. Prior methods use max-pooling over individual CTS scores, ignoring relational patterns among sites, but modeling these patterns is critical for accuracy. The challenge is that naive relational aggregation incurs cost, prohibitive when reaches thousands, yet a cheap scan alone discards the very interactions that drive functional repression. We formalize this tension as \emph{Budgeted Relational Multi-Instance Learning (BR-MIL)}, a new MIL problem where the compute budget is a first-class constraint such that at most instances per bag may receive expensive encoding and relational processing. We establish theoretical foundations for BR-MIL, proving that both approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
