AnyTouch 2: General Optical Tactile Representation Learning For Dynamic Tactile Perception
Ruoxuan Feng, Yuxuan Zhou, Siyu Mei, Dongzhan Zhou, Pengwei Wang, Shaowei Cui, Bin Fang, Guocai Yao, Di Hu

TL;DR
This paper introduces a large-scale tactile dataset and a general learning framework for optical tactile sensors, enabling robots to perceive and reason about dynamic tactile information during manipulation tasks.
Contribution
It presents ToucHD, a comprehensive tactile dataset, and AnyTouch 2, a unified model for dynamic tactile perception across various sensors and tasks.
Findings
Strong performance on static and dynamic tactile benchmarks
Effective modeling of physical force dynamics
Robust generalization across sensors and manipulation tasks
Abstract
Real-world contact-rich manipulation demands robots to perceive temporal tactile feedback, capture subtle surface deformations, and reason about object properties as well as force dynamics. Although optical tactile sensors are uniquely capable of providing such rich information, existing tactile datasets and models remain limited. These resources primarily focus on object-level attributes (e.g., material) while largely overlooking fine-grained tactile temporal dynamics during physical interactions. We consider that advancing dynamic tactile perception requires a systematic hierarchy of dynamic perception capabilities to guide both data collection and model design. To address the lack of tactile data with rich dynamic information, we present ToucHD, a large-scale hierarchical tactile dataset spanning tactile atomic actions, real-world manipulations, and touch-force paired data. Beyond…
Peer Reviews
Decision·ICLR 2026 Poster
1. [Useful tactile dataset] To the best of my knowledge, this is the first work that captures large-scale tactile data on action-specific and manipulation tasks with a systematic pipeline. The combination of dynamic tactile perception and robotic manipulation is becoming more important as the tasks requires more dexterity and fine-grained control. The dataset may potentially be a foundation upon which future tactile robotic policies can be developed. 2. [Strong pre-trained encoder] As is shown
1. [Lack of sensor-invariant evidence] Although claimed to be sensor-invariant (by performing contrastive learning on different sensors), there's no clear evidence showing that the learned represenation is generalizable across different tactile sensors. In practice, there are two major problems when using a pretrained tactile encoder: (i) is it generalizable across different sensors? and (ii) is it model performance robust when the gel pads of the sensors are replaced (since for some sensors, th
* Fig. 1 clearly explains the role and position of the ToucHD dataset. * The technical content is thorough, including pixel-level autoencoding, semantic tactile feature learning, and dynamic physical property learning. * Experiments and benchmarking are comprehensive, covering diverse datasets, methods, and hardware.
* Although the paper is thorough, the method introduces many task objectives. Are all of them necessary? How does it perform on each individual task? Additional ablations and per-task performance would be convincing. * Fig. 4 would be clearer with a consistent zero position; the current plot can be misleading. * The training loss schedule (Eq. 7) appears complex. Does this imply that training the encoder is brittle? * For readers less familiar with prior tactile datasets, a table summarizing the
Good visualization. The research on related works are comprehensive.
Minor weakness: Major weakness: 1. The proposed “tactile dynamic pyramid” is ambitious, but its practical utility is unclear. Tier boundaries are not operationally defined, and the paper does not show that organizing data by tiers yields better representations than training a single, unified model on pooled data with task-specific supervision. Please provide measurable criteria for each tier, evidence of annotation consistency, and ablations comparing (a) pyramid-driven training vs. (b) a uni
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Sensor and Energy Harvesting Materials · Tactile and Sensory Interactions · Neural Networks and Reservoir Computing
