3D Motion Perception of Binocular Vision Target with PID-CNN
Jiazhao Shi, Pan Pan, Haotian Shi

TL;DR
This paper presents a PID-CNN model for real-time 3D motion perception of binocular vision targets, demonstrating high accuracy and efficiency in simulated environments with potential for further improvements.
Contribution
Introduces a novel 17-layer PID convolutional neural network for 3D motion perception, integrating PID principles and feature reuse techniques for enhanced performance.
Findings
Prediction accuracy close to input resolution limits
Effective feature reuse via concatenation and pooling
Potential for improved efficiency with high-dimensional convolution
Abstract
This article trained a network for perceiving three-dimensional motion information of binocular vision target, which can provide real-time three-dimensional coordinate, velocity, and acceleration, and has a basic spatiotemporal perception capability. Understood the ability of neural networks to fit nonlinear problems from the perspective of PID. Considered a single-layer neural network as using a second-order difference equation and a nonlinearity to describe a local problem. Multilayer networks gradually transform the raw representation to the desired representation through multiple such combinations. Analysed some reference principles for designing neural networks. Designed a relatively small PID convolutional neural network, with a total of 17 layers and 413 thousand parameters. Implemented a simple but practical feature reuse method by concatenation and pooling. The network was…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAstronomical Observations and Instrumentation · Image and Video Stabilization · Neural Networks and Applications
