Physic-HM: Restoring Physical Generative Logic in Multimodal Anomaly Detection via Hierarchical Modulation

Xiao Liu; Junchen Jin; Yanjie Zhao; Zhixuan Xing

arXiv:2512.21650·cs.LG·January 21, 2026

Physic-HM: Restoring Physical Generative Logic in Multimodal Anomaly Detection via Hierarchical Modulation

Xiao Liu, Junchen Jin, Yanjie Zhao, Zhixuan Xing

PDF

Open Access

TL;DR

Physic-HM introduces a hierarchical, physics-informed multimodal anomaly detection framework that models process-to-result dependency, improving detection accuracy in complex manufacturing scenarios like robotic welding.

Contribution

The paper presents Physic-HM, a novel framework that explicitly incorporates physical generative logic and sensor guidance to enhance multimodal anomaly detection.

Findings

01

Achieves state-of-the-art I-AUROC of 90.7% on Weld-4M benchmark

02

Effectively models process-to-result dependency using hierarchical architecture

03

Utilizes sensor-guided modulation for improved feature extraction

Abstract

Multimodal Unsupervised Anomaly Detection (UAD) is critical for quality assurance in smart manufacturing, particularly in complex processes like robotic welding. However, existing methods often suffer from process-logic blindness, treating process modalities (e.g., real-time video, audio, and sensors) and result modalities (e.g., post-weld images) as symmetric feature sources, thereby ignoring the inherent unidirectional physical generative logic. Furthermore, the heterogeneity gap between high-dimensional visual data and low-dimensional sensor signals frequently leads to critical process context being drowned out. In this paper, we propose Physic-HM, a multimodal UAD framework that explicitly incorporates physical inductive bias to model the process-to-result dependency. Specifically, our framework incorporates two key innovations: a Sensor-Guided PHM Modulation mechanism that utilizes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Human Pose and Action Recognition