# Semantic–Physical Sensor Fusion for Safe Physical Human–Robot Interaction in Dual-Arm Rehabilitation

**Authors:** Disha Zhu, Xuefeng Wang, Shaomei Shang

PMC · DOI: 10.3390/s26051510 · Sensors (Basel, Switzerland) · 2026-02-27

## TL;DR

This paper introduces a sensor fusion framework for safe human-robot interaction in rehabilitation, combining physical and semantic data with an edge-deployed AI model.

## Contribution

A novel multimodal sensor-fusion framework with an edge-deployed LLM for real-time safety control in physical human-robot interaction.

## Key findings

- A virtual sensing method achieved 18.5% normalized mean absolute error in estimating joint torques.
- The framework detected safety-critical events with low emergency misdetection rate and 223 ms end-to-end latency.
- Real-world experiments showed effective responses to impacts, instability, and visual occlusions.

## Abstract

A safe physical human–robot interaction (pHRI) in rehabilitation requires reliable perception and low-latency decision making under heterogeneous and unreliable sensor inputs. This paper presents a multimodal sensor-fusion-based safety framework that integrates physical state estimation, semantic information fusion, and an edge-deployed large language model (LLM) for real-time pHRI safety control. A dynamics-based virtual sensing method is introduced to estimate internal joint torques from external force–torque measurements, achieving a normalized mean absolute error of 18.5% in real-world experiments. An asynchronous semantic state pool with a time-to-live mechanism is designed to fuse visual, force, posture, and human semantic cues while maintaining robustness to sensor delays and dropouts. Based on structured multimodal tokens, an instruction-tuned edge LLM outputs discrete safety decisions that are further mapped to continuous compliant control parameters. The framework is trained using a hybrid dataset consisting of limited real-world samples and LLM-augmented synthetic data, and evaluated on unseen real and mixed-condition scenarios. Experimental results show reliable detection of safety-critical events with a low emergency misdetection rate, while maintaining an end-to-end decision latency of approximately 223 ms on edge hardware. Real-world experiments on a rehabilitation robot demonstrate effective responses to impacts, user instability, and visual occlusions, indicating the practical applicability of the proposed approach for real-time pHRI safety monitoring.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12986929/full.md

## Figures

15 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12986929/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12986929/full.md

---
Source: https://tomesphere.com/paper/PMC12986929