Set Transformer Architectures and Synthetic Data Generation for Flow-Guided Nanoscale Localization

Mika Leo Hube; Filip Lemic; Ethungshan Shitiri; Gerard Calvo Bartra; Sergi Abadal; Xavier Costa P\'erez

arXiv:2508.16200·cs.ET·August 25, 2025

Set Transformer Architectures and Synthetic Data Generation for Flow-Guided Nanoscale Localization

Mika Leo Hube, Filip Lemic, Ethungshan Shitiri, Gerard Calvo Bartra, Sergi Abadal, Xavier Costa P\'erez

PDF

TL;DR

This paper introduces Set Transformer architectures combined with synthetic data generation to improve flow-guided nanoscale localization, enhancing adaptability and robustness in medical diagnostics.

Contribution

It proposes a permutation-invariant Set Transformer model for FGL and integrates deep generative models for synthetic data augmentation, addressing variability and data scarcity issues.

Findings

01

Set Transformer achieves comparable accuracy to GNNs.

02

Synthetic data improves model robustness under data scarcity.

03

Permutation-invariant models enhance generalization to anatomical variability.

Abstract

Flow-guided Localization (FGL) enables the identification of spatial regions within the human body that contain an event of diagnostic interest. FGL does that by leveraging the passive movement of energy-constrained nanodevices circulating through the bloodstream. Existing FGL solutions rely on graph models with fixed topologies or handcrafted features, which limit their adaptability to anatomical variability and hinder scalability. In this work, we explore the use of Set Transformer architectures to address these limitations. Our formulation treats nanodevices' circulation time reports as unordered sets, enabling permutation-invariant, variable-length input processing without relying on spatial priors. To improve robustness under data scarcity and class imbalance, we integrate synthetic data generation via deep generative models, including CGAN, WGAN, WGAN-GP, and CVAE. These models…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.