RADNet: A Deep Neural Network Model for Robust Perception in Moving   Autonomous Systems

Burhan A. Mudassar; Sho Ko; Maojingjing Li; Priyabrata Saha; Saibal; Mukhopadhyay

arXiv:2205.00364·cs.CV·May 3, 2022·1 cites

RADNet: A Deep Neural Network Model for Robust Perception in Moving Autonomous Systems

Burhan A. Mudassar, Sho Ko, Maojingjing Li, Priyabrata Saha, Saibal, Mukhopadhyay

PDF

Open Access

TL;DR

This paper introduces RADNet, a deep neural network designed to improve perception robustness in moving autonomous systems by addressing camera motion artifacts in action detection tasks.

Contribution

We propose a novel action detection pipeline that aligns actor features, combines global and local scene features, and is robust to camera motion effects, validated on a new dataset.

Findings

01

4.1% increase in frame mAP on MOVE dataset

02

17% increase in video mAP on MOVE dataset

03

Effective handling of high camera motion in action detection

Abstract

Interactive autonomous applications require robustness of the perception engine to artifacts in unconstrained videos. In this paper, we examine the effect of camera motion on the task of action detection. We develop a novel ranking method to rank videos based on the degree of global camera motion. For the high ranking camera videos we show that the accuracy of action detection is decreased. We propose an action detection pipeline that is robust to the camera motion effect and verify it empirically. Specifically, we do actor feature alignment across frames and couple global scene features with local actor-specific features. We do feature alignment using a novel formulation of the Spatio-temporal Sampling Network (STSN) but with multi-scale offset prediction and refinement using a pyramid structure. We also propose a novel input dependent weighted averaging strategy for fusing local and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Vision and Imaging