Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images
Tze Ho Elden Tse, Franziska Mueller, Zhengyang Shen, Danhang Tang,, Thabo Beeler, Mingsong Dou, Yinda Zhang, Sasa Petrovic, Hyung Jin Chang,, Jonathan Taylor, Bardia Doosti

TL;DR
This paper introduces a spectral graph-based transformer framework for high-fidelity two-hand reconstruction from multi-view egocentric RGB images, addressing challenges in dataset scarcity and physical plausibility.
Contribution
It presents a novel spectral graph convolution decoder and a multi-view feature fusion strategy, enabling realistic and real-time two-hand pose estimation from egocentric images.
Findings
Effective reconstruction of two hands with high fidelity.
Good generalization from synthetic to real data.
Real-time performance suitable for AR/VR applications.
Abstract
We propose a novel transformer-based framework that reconstructs two high fidelity hands from multi-view RGB images. Unlike existing hand pose estimation methods, where one typically trains a deep network to regress hand model parameters from single RGB image, we consider a more challenging problem setting where we directly regress the absolute root poses of two-hands with extended forearm at high resolution from egocentric view. As existing datasets are either infeasible for egocentric viewpoints or lack background variations, we create a large-scale synthetic dataset with diverse scenarios and collect a real dataset from multi-calibrated camera setup to verify our proposed multi-view image feature fusion strategy. To make the reconstruction physically plausible, we propose two strategies: (i) a coarse-to-fine spectral graph convolution decoder to smoothen the meshes during upsampling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Artificial Intelligence in Healthcare and Education · Anatomy and Medical Technology
MethodsConvolution
