A Prediction-as-Perception Framework for 3D Object Detection

Song Zhang; Haoyu Chen; Ruibo Wang

arXiv:2603.12599·cs.CV·March 16, 2026

A Prediction-as-Perception Framework for 3D Object Detection

Song Zhang, Haoyu Chen, Ruibo Wang

PDF

Open Access

TL;DR

This paper introduces a biomimetic Prediction-As-Perception framework for 3D object detection that improves accuracy and speed by integrating prediction and perception modules, inspired by how humans observe moving objects.

Contribution

The paper proposes a novel Prediction-As-Perception framework that enhances 3D object detection by combining prediction and perception modules in a biomimetic architecture.

Findings

01

Improves target tracking accuracy by 10%.

02

Increases inference speed by 15%.

03

Reduces computational resource consumption.

Abstract

Humans combine prediction and perception to observe the world. When faced with rapidly moving birds or insects, we can only perceive them clearly by predicting their next position and focusing our gaze there. Inspired by this, this paper proposes the Prediction-As-Perception (PAP) framework, integrating a prediction-perception architecture into 3D object perception tasks to enhance the model's perceptual accuracy. The PAP framework consists of two main modules: prediction and perception, primarily utilizing continuous frame information as input. Firstly, the prediction module forecasts the potential future positions of ego vehicles and surrounding traffic participants based on the perception results of the current frame. These predicted positions are then passed as queries to the perception module of the subsequent frame. The perceived results are iteratively fed back into the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Face Recognition and Perception