PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living

Fraser Robinson; Souren Pashangpour; Matthew Lisondra; Goldie Nejat

arXiv:2602.00131·cs.CV·February 3, 2026

PovNet+: A Deep Learning Architecture for Socially Assistive Robots to Learn and Assist with Multiple Activities of Daily Living

Fraser Robinson, Souren Pashangpour, Matthew Lisondra, Goldie Nejat

PDF

Open Access

TL;DR

POVNet+ is a novel deep learning architecture enabling socially assistive robots to recognize multiple activities of daily living using multimodal data, thereby improving their ability to assist users proactively in real-world scenarios.

Contribution

This paper introduces POVNet+, the first multimodal deep learning model for multi-activity recognition in socially assistive robots, incorporating ADL and motion embeddings for better detection of known, new, and atypical activities.

Findings

01

POVNet+ outperforms state-of-the-art activity recognition methods in accuracy.

02

The architecture successfully identifies seen, unseen, and atypical ADLs in real environments.

03

Robots using POVNet+ can proactively initiate appropriate assistive behaviors.

Abstract

A significant barrier to the long-term deployment of autonomous socially assistive robots is their inability to both perceive and assist with multiple activities of daily living (ADLs). In this paper, we present the first multimodal deep learning architecture, POVNet+, for multi-activity recognition for socially assistive robots to proactively initiate assistive behaviors. Our novel architecture introduces the use of both ADL and motion embedding spaces to uniquely distinguish between a known ADL being performed, a new unseen ADL, or a known ADL being performed atypically in order to assist people in real scenarios. Furthermore, we apply a novel user state estimation method to the motion embedding space to recognize new ADLs while monitoring user performance. This ADL perception information is used to proactively initiate robot assistive interactions. Comparison experiments with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Multimodal Machine Learning Applications · Gaze Tracking and Assistive Technology