Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)
Yuhang He

TL;DR
This paper introduces XShapeEnc, a training-free, invertible encoding method for 2D geometric shapes that captures shape and pose efficiently, enabling advanced shape-aware tasks.
Contribution
The work presents a novel, general-purpose, training-free encoding strategy for 2D shapes using harmonic pose fields and Zernike bases, with properties like invertibility and frequency richness.
Findings
XShapeEnc is theoretically valid and efficient.
It demonstrates high discriminability across shape tasks.
Extensive experiments validate its applicability and effectiveness.
Abstract
Positional encoding has become the de facto standard for grounding deep neural networks on discrete point-wise positions, and it has achieved remarkable success in tasks where the input can be represented as a one-dimensional sequence. However, extending this concept to 2D spatial geometric shapes demands carefully designed encoding strategies that account not only for shape geometry and pose, but also for compatibility with neural network learning. In this work, we address these challenges by introducing a training-free, general-purpose encoding strategy, dubbed XShapeEnc, that encodes an arbitrary spatially grounded 2D geometric shape into a compact representation exhibiting five favorable properties, including invertibility, adaptivity, and frequency richness. Specifically, a 2D spatially grounded geometric shape is decomposed into its normalized geometry within the unit disk and its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
