Exploiting latent representation of sparse semantic layers for improved short-term motion prediction with Capsule Networks

Albert Dulian; John C. Murray

arXiv:2103.01644·cs.CV·March 29, 2021

TL;DR

This paper introduces a novel approach using Capsule Networks to improve short-term motion prediction in autonomous vehicles by leveraging hierarchical semantic layers from HD maps, achieving better accuracy with a smaller model.

Contribution

It presents a new application of Capsule Networks for encoding hierarchical spatial features from sparse semantic map layers in motion prediction.

Findings

01. Significant improvement over recent methods in deterministic prediction.

02. Reduces network size while maintaining high accuracy.

03. Effective hierarchical feature representation with CapsNets.

Abstract

As urban environments manifest high levels of complexity, it is of vital importance that safety systems embedded within autonomous vehicles (AVs) are able to accurately anticipate the short-term future motion of nearby agents. This problem can be further understood as generating a sequence of coordinates describing the future motion of the tracked agent. Various proposed approaches demonstrate significant benefits of using a rasterised top-down image of the road, in combination with Convolutional Neural Networks (CNNs), for extraction of relevant features that define the road structure (e.g. driveable areas, lanes, walkways). In contrast, this paper explores the use of Capsule Networks (CapsNets) in the context of learning a hierarchical representation of sparse semantic layers corresponding to small regions of the High-Definition (HD) map. Each region of the map is dismantled into separate…
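
To make the phrase "sparse semantic layers" more concrete, here is a minimal sketch of splitting a small map region into one binary mask per road-structure class. It is an illustration only, not the authors' pipeline: the grid encoding, the class ids, and the helper names `split_into_layers` and `fill_ratio` are all assumptions, since this excerpt does not describe how the paper actually encodes its map regions.

```python
# Hypothetical class ids for illustration; the paper's actual layer set
# and encoding are not given in this excerpt.
LAYER_NAMES = {1: "driveable_area", 2: "lane", 3: "walkway"}


def split_into_layers(region):
    """Split a 2D grid of class ids into one binary mask per semantic layer."""
    return {
        name: [[1 if cell == class_id else 0 for cell in row] for row in region]
        for class_id, name in LAYER_NAMES.items()
    }


def fill_ratio(mask):
    """Fraction of non-zero cells in a mask; a low value means a sparse layer."""
    cells = [cell for row in mask for cell in row]
    return sum(cells) / len(cells)


# Toy 4x4 map region: 0 = empty, other ids as in LAYER_NAMES.
region = [
    [3, 1, 1, 0],
    [3, 1, 2, 0],
    [3, 1, 2, 0],
    [3, 1, 1, 0],
]

layers = split_into_layers(region)
for name, mask in layers.items():
    print(name, fill_ratio(mask))
```

Each resulting mask is mostly zeros, which is the sense in which per-class layers of a small map region are sparse. In a setup like the one the abstract outlines, such layers would be the input from which a hierarchical representation is learned.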
