TL;DR
This paper introduces FFNet, a monocular pedestrian orientation estimation model that incorporates 2D and 3D dimensions through feedforward links, improving accuracy and interpretability in autonomous driving scenarios.
Contribution
The paper proposes a novel monocular orientation estimation model that integrates pedestrian dimensions via feedforward links, enhancing performance and interpretability.
Findings
At least 1.72% AOS improvement over state-of-the-art models.
Competitive results on KITTI dataset.
Enhanced model interpretability through logical feedforward connections.
Abstract
Accurate pedestrian orientation estimation of autonomous driving helps the ego vehicle obtain the intentions of pedestrians in the related environment, which are the base of safety measures such as collision avoidance and prewarning. However, because of relatively small sizes and high-level deformation of pedestrians, common pedestrian orientation estimation models fail to extract sufficient and comprehensive information from them, thus having their performance restricted, especially monocular ones which fail to obtain depth information of objects and related environment. In this paper, a novel monocular pedestrian orientation estimation model, called FFNet, is proposed. Apart from camera captures, the model adds the 2D and 3D dimensions of pedestrians as two other inputs according to the logic relationship between orientation and them. The 2D and 3D dimensions of pedestrians are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
