End-to-End Tracking and Semantic Segmentation Using Recurrent Neural Networks
Peter Ondruska, Julie Dequaire, Dominic Zeng Wang, Ingmar Posner

TL;DR
This paper introduces an end-to-end recurrent neural network framework for real-time tracking and semantic classification of objects in complex environments, leveraging unsupervised learning and transfer of knowledge.
Contribution
The work presents a novel RNN architecture trained with unsupervised deep tracking, enabling effective environment understanding with minimal labeled data in real-world robotics.
Findings
Outperforms state-of-the-art model-free tracking methods
Requires less labeled data for semantic classification
Demonstrates effective real-world application at a busy road junction
Abstract
In this work we present a novel end-to-end framework for tracking and classifying a robot's surroundings in complex, dynamic and only partially observable real-world environments. The approach deploys a recurrent neural network to filter an input stream of raw laser measurements in order to directly infer object locations, along with their identity in both visible and occluded areas. To achieve this we first train the network using unsupervised Deep Tracking, a recently proposed theoretical framework for end-to-end space occupancy prediction. We show that by learning to track on a large amount of unsupervised data, the network creates a rich internal representation of its environment which we in turn exploit through the principle of inductive transfer of knowledge to perform the task of it's semantic classification. As a result, we show that only a small amount of labelled data suffices…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Time Series Analysis and Forecasting · Video Analysis and Summarization
