Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video
Mingxiao Tu, Hoijoon Jung, Alireza Moghadam, Andre Kyme, Jinman Kim

TL;DR
Patient4D introduces a novel pipeline that leverages stationarity priors and geometric constraints to improve 3D patient body mesh recovery from monocular surgical videos, addressing occlusion and camera movement challenges.
Contribution
The paper presents Patient4D, a new method combining perception models with geometric mechanisms to enhance temporal consistency and robustness in monocular 3D mesh recovery during surgery.
Findings
Achieves 0.75 mean IoU under occlusion, significantly reducing failure frames.
Outperforms existing methods on synthetic and public benchmarks.
Demonstrates effectiveness of stationarity priors in clinical AR scenarios.
Abstract
Recovering a dense 3D body mesh from monocular video remains challenging under occlusion from draping and continuously moving camera viewpoints. This configuration arises in surgical augmented reality (AR), where an anesthetized patient lies under surgical draping while a surgeon's head-mounted camera continuously changes viewpoint. Existing human mesh recovery (HMR) methods are typically trained on upright, moving subjects captured from relatively stable cameras, leading to performance degradation under such conditions. To address this, we present Patient4D, a stationarity-constrained reconstruction pipeline that explicitly exploits the stationarity prior. The pipeline combines image-level foundation models for perception with lightweight geometric mechanisms that enforce temporal consistency across frames. Two key components enable robust reconstruction: Pose Locking, which anchors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Augmented Reality Applications · Interactive and Immersive Displays
