RALACs: Action Recognition in Autonomous Vehicles using Interaction Encoding and Optical Flow
Eddy Zhou, Alex Zhuang, Alikasim Budhwani, Owen Leather, Rowan, Dempster, Quanquan Li, Mohammad Al-Sharman, Derek Rayside, and William Melek

TL;DR
This paper introduces RALACs, a novel two-stage action recognition system for autonomous vehicles that leverages interaction encoding and optical flow to improve scene understanding and agent activity detection.
Contribution
It presents a new action recognition framework tailored for road scenes, incorporating attention-based encoding, ROI adaptation, and optical flow fusion, bridging the gap with human action recognition.
Findings
Outperforms baseline on ICCV2021 Road Challenge dataset
Demonstrates effectiveness on real vehicle platform
Enhances situational awareness in autonomous driving
Abstract
When applied to autonomous vehicle (AV) settings, action recognition can enhance an environment model's situational awareness. This is especially prevalent in scenarios where traditional geometric descriptions and heuristics in AVs are insufficient. However, action recognition has traditionally been studied for humans, and its limited adaptability to noisy, un-clipped, un-pampered, raw RGB data has limited its application in other fields. To push for the advancement and adoption of action recognition into AVs, this work proposes a novel two-stage action recognition system, termed RALACs. RALACs formulates the problem of action recognition for road scenes, and bridges the gap between it and the established field of human action recognition. This work shows how attention layers can be useful for encoding the relations across agents, and stresses how such a scheme can be class-agnostic.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
