Pointy - A Lightweight Transformer for Point Cloud Foundation Models

Konrad Szafer; Marek Kraft; Dominik Belter

arXiv:2603.10963·cs.CV·April 21, 2026

Pointy - A Lightweight Transformer for Point Cloud Foundation Models

Konrad Szafer, Marek Kraft, Dominik Belter

PDF

1 Repo

TL;DR

Pointy introduces a lightweight transformer architecture for point cloud data that achieves competitive results with less training data and complexity, emphasizing the importance of architecture and training setup.

Contribution

The paper presents a novel, efficient transformer-based point cloud model trained on fewer samples that outperforms larger models, with a comprehensive benchmarking framework.

Findings

01

Our model outperforms larger models trained on more data.

02

Simple backbones can achieve competitive results.

03

Standardized benchmarks highlight architecture benefits.

Abstract

Foundation models for point cloud data have recently grown in capability, often leveraging extensive representation learning from language or vision. In this work, we take a more controlled approach by introducing a lightweight transformer-based point cloud architecture. In contrast to the heavy reliance on cross-modal supervision, our model is trained only on 39k point clouds - yet it outperforms several larger foundation models trained on over 200k training samples. Interestingly, our method approaches state-of-the-art results from models that have seen over a million point clouds, images, and text samples, demonstrating the value of a carefully curated training setup and architecture. To ensure rigorous evaluation, we conduct a comprehensive replication study that standardizes the training regime and benchmarks across multiple point cloud architectures. This unified experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KonradSzafer/Pointy
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.