Batch-Based Activity Recognition from Egocentric Photo-Streams
Alejandro Cartas, Mariella Dimiccoli, Petia Radeva

TL;DR
This paper introduces a batch-driven deep learning approach using LSTM units for activity recognition from low-frame-rate egocentric photo-streams, effectively capturing temporal features without relying on event boundaries.
Contribution
It proposes two novel batch-processing architectures with overlapping and non-overlapping batches to improve temporal modeling in low-frame-rate activity recognition.
Findings
Effective in capturing temporal evolution of features
Does not require event boundary annotations
Validated on a public dataset with positive results
Abstract
Activity recognition from long unstructured egocentric photo-streams has several applications in assistive technology such as health monitoring and frailty detection, just to name a few. However, one of its main technical challenges is to deal with the low frame rate of wearable photo-cameras, which causes abrupt appearance changes between consecutive frames. In consequence, important discriminatory low-level features from motion such as optical flow cannot be estimated. In this paper, we present a batch-driven approach for training a deep learning architecture that strongly rely on Long short-term units to tackle this problem. We propose two different implementations of the same approach that process a photo-stream sequence using batches of fixed size with the goal of capturing the temporal evolution of high-level features. The main difference between these implementations is that one…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
