Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models
Francesco Giuliari, Gianluca Scarpellini, Stuart James, Yiming Wang,, Alessio Del Bue

TL;DR
Positional Diffusion introduces a diffusion probabilistic model with graph neural networks to effectively solve various ordering tasks, outperforming existing methods on puzzles and matching state-of-the-art on sentence ordering and visual storytelling.
Contribution
The paper presents a novel diffusion-based framework for positional reasoning, integrating graph neural networks to handle unordered sets and improve ordering accuracy.
Findings
Outperforms existing puzzle-solving methods by up to 18%
Achieves competitive results on sentence ordering datasets
Performs on par with state-of-the-art in visual storytelling
Abstract
Positional reasoning is the process of ordering unsorted parts contained in a set into a consistent structure. We present Positional Diffusion, a plug-and-play graph formulation with Diffusion Probabilistic Models to address positional reasoning. We use the forward process to map elements' positions in a set to random positions in a continuous space. Positional Diffusion learns to reverse the noising process and recover the original positions through an Attention-based Graph Neural Network. We conduct extensive experiments with benchmark datasets including two puzzle datasets, three sentence ordering datasets, and one visual storytelling dataset, demonstrating that our method outperforms long-lasting research on puzzle solving with up to +18% compared to the second-best deep learning method, and performs on par against the state-of-the-art methods on sentence ordering and visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Human Pose and Action Recognition
MethodsGraph Neural Network · Diffusion
