Nonrigid Object Contact Estimation With Regional Unwrapping Transformer
Wei Xie, Zimeng Zhao, Shiying Li, Binghui Zuo, Yangang Wang

TL;DR
This paper introduces RUPs and RUFormer, a novel framework for estimating contact patterns between hands and nonrigid objects from monocular images, overcoming geometric restrictions of previous methods.
Contribution
The work proposes a new contact representation called RUPs and a transformer-based model RUFormer for accurate nonrigid contact estimation from monocular images.
Findings
Robust estimation of deformed degrees and transformations.
Effective handling of nonrigid and rigid contact scenarios.
Improved contact pattern prediction accuracy.
Abstract
Acquiring contact patterns between hands and nonrigid objects is a common concern in the vision and robotics community. However, existing learning-based methods focus more on contact with rigid ones from monocular images. When adopting them for nonrigid contact, a major problem is that the existing contact representation is restricted by the geometry of the object. Consequently, contact neighborhoods are stored in an unordered manner and contact features are difficult to align with image cues. At the core of our approach lies a novel hand-object contact representation called RUPs (Region Unwrapping Profiles), which unwrap the roughly estimated hand-object surfaces as multiple high-resolution 2D regional profiles. The region grouping strategy is consistent with the hand kinematic bone division because they are the primitive initiators for a composite contact pattern. Based on this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Nonrigid Object Contact Estimation With Regional Unwrapping Transformer· youtube
Taxonomy
TopicsRobot Manipulation and Learning · Hand Gesture Recognition Systems · Human Pose and Action Recognition
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Softmax · Dense Connections · Layer Normalization · Dropout · Byte Pair Encoding · Adam · Position-Wise Feed-Forward Layer
