Learning Set-equivariant Functions with SWARM Mappings
Roland Vollgraf

TL;DR
This paper introduces SWARM mappings, a neural network architecture that efficiently learns set-equivariant functions, achieving state-of-the-art results by mimicking natural swarm behaviors and outperforming existing methods like Set Transformer.
Contribution
The paper presents a novel SWARM mapping architecture based on gated recurrent networks for learning set-equivariant functions, demonstrating superior performance over existing models.
Findings
SWARM mappings achieve state-of-the-art results.
Models based on SWARM layers outperform Set Transformer.
The approach effectively learns permutation-invariant functions.
Abstract
In this work we propose a new neural network architecture that efficiently implements and learns general purpose set-equivariant functions. Such a function f maps a set of entities x = {x1, . . . , xn} from one domain to a set of same cardinality y = f (x) = {y1, . . . , yn} in another domain regardless of the ordering of the entities. The architecture is based on a gated recurrent network which is iteratively applied to all entities individually and at the same time syncs with the progression of the whole population. In reminiscence to this pattern, which can be frequently observed in nature, we call our approach SWARM mapping. Set-equivariant and generally permutation invariant functions are important building blocks for many state of the art machine learning approaches. Even in applications where the permutation invariance is not of primary interest, as to be seen in the recent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Graph Neural Networks · Machine Learning in Healthcare
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Set Transformer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam
