Semantically Guided Representation Learning For Action Anticipation
Anxhelo Diko, Danilo Avola, Bardh Prenkaj, Federico Fontana, Luigi, Cinque

TL;DR
This paper introduces S-GEAR, a novel framework for action anticipation that learns semantically interconnected action representations using prototypes and language models, leading to improved accuracy on multiple benchmarks.
Contribution
S-GEAR is the first approach to incorporate semantic interconnectivity into visual action representations for anticipation tasks.
Findings
Achieved +3.5, +2.7, +3.5 accuracy improvements on three benchmarks.
Transferred geometric action associations from language to visual prototypes.
Demonstrated the importance of semantic interconnectivity in action anticipation.
Abstract
Action anticipation is the task of forecasting future activity from a partially observed sequence of events. However, this task is exposed to intrinsic future uncertainty and the difficulty of reasoning upon interconnected actions. Unlike previous works that focus on extrapolating better visual and temporal information, we concentrate on learning action representations that are aware of their semantic interconnectivity based on prototypical action patterns and contextual co-occurrences. To this end, we propose the novel Semantically Guided Representation Learning (S-GEAR) framework. S-GEAR learns visual action prototypes and leverages language models to structure their relationship, inducing semanticity. To gather insights on S-GEAR's effectiveness, we test it on four action anticipation benchmarks, obtaining improved results compared to previous works: +3.5, +2.7, and +3.5 absolute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Online Learning and Analytics · Anomaly Detection Techniques and Applications
MethodsAttentive Walk-Aggregating Graph Neural Network · Focus
