TactEx: An Explainable Multimodal Robotic Interaction Framework for Human-Like Touch and Hardness Estimation
Felix Verstraete, Lan Wei, Wen Fan, and Dandan Zhang

TL;DR
TactEx is an explainable multimodal robotic framework that integrates vision, touch, and language to estimate object hardness and guide interactions, demonstrating high success in fruit ripeness assessment and natural language understanding.
Contribution
The paper introduces TactEx, a novel framework combining tactile, visual, and language models for human-like hardness estimation and interactive guidance in robotics.
Findings
Achieves 90% task success rate on user queries
Statistically significant class separation in ripeness levels
Generalizes to new tasks without extensive tuning
Abstract
Accurate perception of object hardness is essential for safe and dexterous contact-rich robotic manipulation. Here, we present TactEx, an explainable multimodal robotic interaction framework that unifies vision, touch, and language for human-like hardness estimation and interactive guidance. We evaluate TactEx on fruit-ripeness assessment, a representative task that requires both tactile sensing and contextual understanding. The system fuses GelSight-Mini tactile streams with RGB observations and language prompts. A ResNet50+LSTM model estimates hardness from sequential tactile data, while a cross-modal alignment module combines visual cues with guidance from a large language model (LLM). This explainable multimodal interface allows users to distinguish ripeness levels with statistically significant class separation (p < 0.01 for all fruit pairs). For touch placement, we compare YOLO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Sensor and Energy Harvesting Materials · Tactile and Sensory Interactions · Robot Manipulation and Learning
