Monocular Pedestrian Orientation Estimation Based on Deep 2D-3D Feedforward

Chenchen Zhao; Yeqiang Qian; Ming Yang

arXiv:1909.10970·cs.CV·December 22, 2020

TL;DR

This paper introduces FFNet, a monocular pedestrian orientation estimation model that feeds the 2D and 3D dimensions of pedestrians into the orientation estimator through feedforward links, improving accuracy and interpretability in autonomous driving scenarios.

Contribution

The paper proposes a novel monocular orientation estimation model that integrates pedestrian dimensions via feedforward links, enhancing performance and interpretability.

Findings

01

At least 1.72% improvement in AOS (average orientation similarity) over state-of-the-art models.

02

Competitive results on KITTI dataset.

03

Enhanced model interpretability through logical feedforward connections.
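
For readers unfamiliar with the metric in the first finding: AOS (average orientation similarity) is the KITTI benchmark's orientation metric, and its core per-detection term is (1 + cos Δθ) / 2, where Δθ is the difference between the predicted and ground-truth orientation. The snippet below is a minimal sketch of that term only. The function name `orientation_similarity` is illustrative, and the full KITTI metric additionally averages over recall levels and penalizes unmatched detections, which is omitted here.

```python
import math


def orientation_similarity(pred_angles, gt_angles):
    """Mean of (1 + cos(pred - gt)) / 2 over already-matched detections.

    Angles are in radians. Returns a value in [0, 1]: 1 for a perfect
    match, 0 for a prediction pointing in the opposite direction.
    This is a simplified illustration, not the full KITTI AOS.
    """
    if len(pred_angles) != len(gt_angles):
        raise ValueError("pred_angles and gt_angles must have equal length")
    if not pred_angles:
        return 0.0
    total = sum(
        (1.0 + math.cos(p - g)) / 2.0 for p, g in zip(pred_angles, gt_angles)
    )
    return total / len(pred_angles)
```

Because the term depends only on the cosine of the angular difference, a prediction that is off by a full turn (2π) scores the same as an exact match.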

Abstract

Accurate pedestrian orientation estimation in autonomous driving helps the ego vehicle infer the intentions of nearby pedestrians, which form the basis of safety measures such as collision avoidance and prewarning. However, because pedestrians are relatively small and highly deformable, common pedestrian orientation estimation models fail to extract sufficient and comprehensive information from them, which restricts their performance. This is especially true of monocular models, which cannot obtain depth information about objects and the surrounding environment. In this paper, a novel monocular pedestrian orientation estimation model, called FFNet, is proposed. Apart from camera captures, the model takes the 2D and 3D dimensions of pedestrians as two additional inputs, according to the logical relationship between them and orientation. The 2D and 3D dimensions of pedestrians are…
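
The abstract describes supplying pedestrian 2D and 3D dimensions to the model alongside the camera image. As a rough illustration of that idea only (not FFNet's actual architecture, which the truncated abstract does not specify), the sketch below concatenates a hypothetical image feature vector with 2D box dimensions and 3D object dimensions and applies a single linear layer that regresses (sin θ, cos θ). All names, shapes, and the linear head are assumptions made for illustration.

```python
import numpy as np


def estimate_orientation(image_features, dims_2d, dims_3d, weights, bias):
    """Toy orientation head over fused inputs (illustrative assumption).

    image_features: 1-D array of image-derived features (hypothetical).
    dims_2d: 1-D array, e.g. (box width, box height) in pixels.
    dims_3d: 1-D array, e.g. (height, width, length) in meters.
    weights: array of shape (2, n_inputs); bias: array of shape (2,).
    Returns the orientation angle in radians, in [-pi, pi].
    """
    # Fuse the auxiliary dimension inputs with the image features.
    fused = np.concatenate([image_features, dims_2d, dims_3d])
    # Regress (sin, cos) rather than the raw angle to avoid the
    # discontinuity at +/- pi.
    sin_cos = weights @ fused + bias
    return float(np.arctan2(sin_cos[0], sin_cos[1]))
```

Regressing the sine and cosine and recovering the angle with `arctan2` is a common design choice for angular targets; whether FFNet does the same is not stated in the text above.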

Code & Models

Repositories

zcc31415926/FFNet (TensorFlow)

Taxonomy

Methods: Interpretability