Fast, Expressive SE$(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space
Erik J Bekkers, Sharvaree Vadgama, Rob D Hesselink, Putri A van der, Linden, David W Romero

TL;DR
This paper introduces a novel group convolutional network for 3D point cloud processing that leverages weight sharing based on homogeneous space attributes, achieving state-of-the-art accuracy and efficiency.
Contribution
It formalizes weight sharing in equivariant networks using homogeneous space theory and develops an efficient $SE(3)$-equivariant network with superior performance.
Findings
Achieves state-of-the-art results on multiple benchmarks.
Demonstrates improved computational efficiency over existing methods.
Effectively models directional information in 3D data.
Abstract
Based on the theory of homogeneous spaces we derive geometrically optimal edge attributes to be used within the flexible message-passing framework. We formalize the notion of weight sharing in convolutional networks as the sharing of message functions over point-pairs that should be treated equally. We define equivalence classes of point-pairs that are identical up to a transformation in the group and derive attributes that uniquely identify these classes. Weight sharing is then obtained by conditioning message functions on these attributes. As an application of the theory, we develop an efficient equivariant group convolutional network for processing 3D point clouds. The theory of homogeneous spaces tells us how to do group convolutions with feature maps over the homogeneous space of positions , position and orientations , and the group …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Automated Road and Building Extraction · Face recognition and analysis
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion
