VNT-Net: Rotational Invariant Vector Neuron Transformers
Hedi Zisling, Andrei Sharf

TL;DR
VNT-Net is a novel neural network combining vector neurons and transformers to achieve rotational invariance in 3D point cloud processing, improving accuracy and efficiency over existing methods.
Contribution
The paper introduces VNT-Net, a new architecture that integrates vector neurons with self-attention to ensure rotational invariance in 3D point cloud analysis.
Findings
Achieves higher accuracy than state-of-the-art methods.
Requires fewer hyperparameters and less training.
Effectively handles 3D objects in arbitrary orientations.
Abstract
Learning 3D point sets with rotational invariance is an important and challenging problem in machine learning. Through rotational invariant architectures, 3D point cloud neural networks are relieved from requiring a canonical global pose and from exhaustive data augmentation with all possible rotations. In this work, we introduce a rotational invariant neural network by combining recently introduced vector neurons with self-attention layers to build a point cloud vector neuron transformer network (VNT-Net). Vector neurons are known for their simplicity and versatility in representing SO(3) actions and are thereby incorporated in common neural operations. Similarly, Transformer architectures have gained popularity and recently were shown successful for images by applying directly on sequences of image patches and achieving superior performance and convergence. In order to benefit from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · 3D Shape Modeling and Analysis · Advanced Measurement and Metrology Techniques
MethodsAttention Is All You Need · Linear Layer · Softmax · Dense Connections · Position-Wise Feed-Forward Layer · Adam · Absolute Position Encodings · Byte Pair Encoding · Residual Connection · Label Smoothing
