Fast, Expressive SE$(n)$ Equivariant Networks through Weight-Sharing in Position-Orientation Space

Erik J Bekkers; Sharvaree Vadgama; Rob D Hesselink; Putri A van der Linden; David W Romero

arXiv:2310.02970·cs.LG·March 18, 2024·5 cites

TL;DR

This paper introduces a novel group convolutional network for 3D point cloud processing that leverages weight sharing based on homogeneous space attributes, achieving state-of-the-art accuracy and efficiency.

Contribution

It formalizes weight sharing in equivariant networks using homogeneous space theory and develops an efficient $SE(3)$-equivariant network with superior performance.

Findings

01. Achieves state-of-the-art results on multiple benchmarks.

02. Demonstrates improved computational efficiency over existing methods.

03. Effectively models directional information in 3D data.

Abstract

Based on the theory of homogeneous spaces we derive geometrically optimal edge attributes to be used within the flexible message-passing framework. We formalize the notion of weight sharing in convolutional networks as the sharing of message functions over point-pairs that should be treated equally. We define equivalence classes of point-pairs that are identical up to a transformation in the group and derive attributes that uniquely identify these classes. Weight sharing is then obtained by conditioning message functions on these attributes. As an application of the theory, we develop an efficient equivariant group convolutional network for processing 3D point clouds. The theory of homogeneous spaces tells us how to do group convolutions with feature maps over the homogeneous space of positions $\mathbb{R}^{3}$, positions and orientations $\mathbb{R}^{3} \times S^{2}$, and the group $SE(3)$ …
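The idea of attributes that identify equivalence classes of point-pairs can be illustrated with a small numerical check. The sketch below is not taken from the abstract, which is truncated before it states any formula. It assumes, for points $(p, o)$ in $\mathbb{R}^{3} \times S^{2}$, the three pair attributes commonly used for this space: the displacement along $o_i$, the displacement perpendicular to $o_i$, and the angle between $o_i$ and $o_j$. The function names (`pair_attributes`, `act`) and the sample values are hypothetical. The check confirms only that these attributes are unchanged when both points are moved by the same rotation and translation, which is the property a message function conditioned on them would inherit.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def norm(a):
    return math.sqrt(dot(a, a))

def pair_attributes(p_i, o_i, p_j, o_j):
    """Assumed invariants of a pair of (position, unit orientation) points."""
    d = tuple(b - a for a, b in zip(p_i, p_j))   # displacement p_j - p_i
    along = dot(o_i, d)                          # component along o_i
    perp = norm(tuple(dk - along * ok for dk, ok in zip(d, o_i)))
    cos_angle = max(-1.0, min(1.0, dot(o_i, o_j)))  # clamp for acos
    return (along, perp, math.acos(cos_angle))

def rotate(v, axis, theta):
    """Rodrigues' rotation of v about a unit axis by angle theta."""
    c, s = math.cos(theta), math.sin(theta)
    kxv, kv = cross(axis, v), dot(axis, v)
    return tuple(v[k] * c + kxv[k] * s + axis[k] * kv * (1.0 - c)
                 for k in range(3))

def act(axis, theta, t, p, o):
    """Rigid motion: positions rotate and translate, orientations only rotate."""
    p_new = tuple(a + b for a, b in zip(rotate(p, axis, theta), t))
    return p_new, rotate(o, axis, theta)

# Hypothetical point pair and rigid motion, chosen only for illustration.
p_i, o_i = (0.0, 0.0, 0.0), (0.0, 0.0, 1.0)
p_j, o_j = (1.0, 2.0, 3.0), (1.0, 0.0, 0.0)
s = 1.0 / math.sqrt(3.0)
axis, theta, t = (s, s, s), 0.7, (0.5, -2.0, 1.0)

before = pair_attributes(p_i, o_i, p_j, o_j)
q_i, r_i = act(axis, theta, t, p_i, o_i)
q_j, r_j = act(axis, theta, t, p_j, o_j)
after = pair_attributes(q_i, r_i, q_j, r_j)

max_diff = max(abs(a - b) for a, b in zip(before, after))
print("attributes before:", before)
print("attributes after: ", after)
print("invariant up to rounding:", max_diff < 1e-9)
```

In a network, a tuple like this would be fed to the message function in place of raw coordinates, so that all point-pairs in the same equivalence class share the same weights.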

Code & Models

Repositories

ebekkers/ponita (PyTorch, official)

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Automated Road and Building Extraction · Face recognition and analysis

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Diffusion