RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems
Burhan A. Mudassar, Sho Ko, Maojingjing Li, Priyabrata Saha, Saibal, Mukhopadhyay

TL;DR
This paper introduces RADNet, a deep neural network designed to improve perception robustness in moving autonomous systems by addressing camera motion artifacts in action detection tasks.
Contribution
We propose a novel action detection pipeline that aligns actor features, combines global and local scene features, and is robust to camera motion effects, validated on a new dataset.
Findings
4.1% increase in frame mAP on MOVE dataset
17% increase in video mAP on MOVE dataset
Effective handling of high camera motion in action detection
Abstract
Interactive autonomous applications require robustness of the perception engine to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel ranking method to rank videos based on the degree of global camera motion. For the high ranking camera videos we show that the accuracy of action detection is decreased. We propose an action detection pipeline that is robust to the camera motion effect and verify it empirically. Specifically, we do actor feature alignment across frames and couple global scene features with local actor-specific features. We do feature alignment using a novel formulation of the Spatio-temporal Sampling Network (STSN) but with multi-scale offset prediction and refinement using a pyramid structure. We also propose a novel input dependent weighted averaging strategy for fusing local and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging
