DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation

Mohamed Abdelsamad; Michael Ulrich; Bin Yang; Miao Zhang; Yakov Miron; Abhinav Valada

arXiv:2512.11465·cs.CV·December 15, 2025

DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation

Mohamed Abdelsamad, Michael Ulrich, Bin Yang, Miao Zhang, Yakov Miron, Abhinav Valada

PDF

Open Access 1 Video

TL;DR

This paper introduces DOS, a self-supervised learning framework for 3D point cloud representations that uses observable softmaps and Zipfian prototypes to improve semantic understanding without annotations.

Contribution

The paper proposes DOS, a novel SSL method employing observable softmaps and Zipfian prototypes, addressing challenges of unbalanced semantics and information leakage in 3D point cloud learning.

Findings

01

Outperforms state-of-the-art on multiple 3D benchmarks

02

Effective in semantic segmentation and object detection

03

No extra data or annotations needed

Abstract

Recent advances in self-supervised learning (SSL) have shown tremendous potential for learning 3D point cloud representations without human annotations. However, SSL for 3D point clouds still faces critical challenges due to irregular geometry, shortcut-prone reconstruction, and unbalanced semantics distribution. In this work, we propose DOS (Distilling Observable Softmaps), a novel SSL framework that self-distills semantic relevance softmaps only at observable (unmasked) points. This strategy prevents information leakage from masked regions and provides richer supervision than discrete token-to-prototype assignments. To address the challenge of unbalanced semantics in an unsupervised setting, we introduce Zipfian prototypes and incorporate them using a modified Sinkhorn-Knopp algorithm, Zipf-Sinkhorn, which enforces a power-law prior over prototype usage and modulates the sharpness of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

DOS: Distilling Observable Softmaps of Zipfian Prototypes for Self-Supervised Point Representation· underline

Taxonomy

Topics3D Shape Modeling and Analysis · Robot Manipulation and Learning · Robotics and Sensor-Based Localization