SegCodeNet: Color-Coded Segmentation Masks for Activity Detection from Wearable Cameras
Asif Shahriyar Sushmit, Partho Ghosh, Md.Abrar Istiak, Nayeeb Rashid,, Ahsan Habib Akash, Taufiq Hasan

TL;DR
This paper introduces SegCodeNet, a two-stream neural network that uses color-coded segmentation masks and attention mechanisms to improve activity detection accuracy in first-person videos, especially at lower resolutions.
Contribution
The work presents a novel two-stream network with semantic segmentation and attention modules, outperforming existing methods in activity detection from wearable camera videos.
Findings
Achieves up to 14.366% higher F1 score over single-stream models.
Significant performance gains at lower resolutions, up to 26% in F1 score.
Outperforms state-of-the-art I3D method by 4.529% in F1 score.
Abstract
Activity detection from first-person videos (FPV) captured using a wearable camera is an active research field with potential applications in many sectors, including healthcare, law enforcement, and rehabilitation. State-of-the-art methods use optical flow-based hybrid techniques that rely on features derived from the motion of objects from consecutive frames. In this work, we developed a two-stream network, the \emph{SegCodeNet}, that uses a network branch containing video-streams with color-coded semantic segmentation masks of relevant objects in addition to the original RGB video-stream. We also include a stream-wise attention gating that prioritizes between the two streams and a frame-wise attention module that prioritizes the video frames that contain relevant features. Experiments are conducted on an FPV dataset containing activity classes in office environments. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Anomaly Detection Techniques and Applications
