$\textit{BlockFormer}$ : Transformer-based inference from interaction maps
Elo\"ise Touron, Pedro L. C. Rodrigues, Julyan Arbel, Nelle Varoquaux, Michael Arbel

TL;DR
This paper introduces BlockFormer, a transformer-based method for inferring centromere positions from interaction maps like Hi-C data, effectively handling variability in data and demonstrating accurate localization across species.
Contribution
The work presents a novel transformer architecture and a synthetic data generator for robust inference of genomic features from interaction maps.
Findings
Accurately localizes centromeres across multiple species.
Handles variability in entity number and size in interaction maps.
Uses synthetic data for effective training.
Abstract
Inference from interaction maps, such as centromere identification from genome-wide chromosome conformation capture techniques -- notably Hi-C -- can be formulated as a generic inverse problem: infer a set of parameters given a map summarizing pairwise interactions between entities through blocks of variable numbers and sizes. In this work, we introduce a data-driven approach that leverages shared structure between these maps, such as global alignment between localized patterns, while handling the variability in number and size of entities arising in real-world data. Our approach relies on a transformer architecture capable of handling such variability and a custom simulator to generate abundant, yet computationally cheap synthetic data for training. Applied to the problem of centromere localization, the method accurately recovers their genomic positions across a wide range of species…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
