CliffordNet: All You Need is Geometric Algebra
Zhongping Ji

TL;DR
CliffordNet introduces a vision backbone based on Geometric Algebra that unifies feature and structural interactions, achieving high accuracy with fewer parameters and challenging traditional modular architectures.
Contribution
The paper proposes CliffordNet, a novel vision model grounded in Geometric Algebra, replacing separate modules with a unified algebraic interaction mechanism, leading to efficient and dense feature representations.
Findings
Achieves 77.82% accuracy on CIFAR-100 with 1.4M parameters
Sets a new SOTA for tiny models at 79.05% accuracy
Model's geometric interaction makes FFNs redundant
Abstract
Modern computer vision architectures, from CNNs to Transformers, predominantly rely on the stacking of heuristic modules: spatial mixers (Attention/Conv) followed by channel mixers (FFNs). In this work, we challenge this paradigm by returning to mathematical first principles. We propose the Clifford Algebra Network (CAN), also referred to as CliffordNet, a vision backbone grounded purely in Geometric Algebra. Instead of engineering separate modules for mixing and memory, we derive a unified interaction mechanism based on the Clifford Geometric Product (). This operation ensures algebraic completeness regarding the Geometric Product by simultaneously capturing feature coherence (via the generalized inner product) and structural variation (via the exterior wedge product). Implemented via an efficient sparse rolling mechanism with strict linear complexity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgebraic and Geometric Analysis · Machine Learning in Materials Science · 3D Shape Modeling and Analysis
