Extract-and-Adaptation Network for 3D Interacting Hand Mesh Recovery
JoonKyu Park, Daniel Sungho Jung, Gyeongsik Moon, Kyoung Mu Lee

TL;DR
This paper introduces EANet, a novel network that improves 3D interacting hand mesh recovery by using specialized tokens to better model hand interactions, overcoming limitations of previous Transformer-based methods.
Contribution
The paper proposes EABlock with SimToken and JoinToken to enhance interaction modeling and address the distant token problem in Transformer-based hand mesh recovery.
Findings
Achieves state-of-the-art performance on 3D interacting hands benchmarks.
Effectively models hand interactions with novel token design.
Robust to different hand pose configurations.
Abstract
Understanding how two hands interact with each other is a key component of accurate 3D interacting hand mesh recovery. However, recent Transformer-based methods struggle to learn the interaction between two hands as they directly utilize two hand features as input tokens, which results in distant token problem. The distant token problem represents that input tokens are in heterogeneous spaces, leading Transformer to fail in capturing correlation between input tokens. Previous Transformer-based methods suffer from the problem especially when poses of two hands are very different as they project features from a backbone to separate left and right hand-dedicated features. We present EANet, extract-and-adaptation network, with EABlock, the main component of our network. Rather than directly utilizing two hand features as input tokens, our EABlock utilizes two complementary types of novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · 3D Shape Modeling and Analysis · Virtual Reality Applications and Impacts
MethodsMulti-Head Attention · Attention Is All You Need · Concatenated Skip Connection · EXP-$Does Expedia refund a cancelled flight? · Linear Layer · Residual Connection · Adam · Softmax · Dropout · Absolute Position Encodings
