CERNet: Class-Embedding Predictive-Coding RNN for Unified Robot Motion, Recognition, and Confidence Estimation
Hiroki Sawada, Alexandre Pitti, and Mathias Quoy

TL;DR
CERNet is a hierarchical predictive-coding RNN that unifies robot motion generation, recognition, and confidence estimation, enabling real-time interaction and intent inference on a humanoid robot.
Contribution
This paper introduces CERNet, a novel class-embedding predictive-coding RNN that simultaneously performs motion generation, recognition, and confidence estimation within a single model.
Findings
Achieves 76% lower trajectory error compared to baseline.
Maintains motion fidelity under external perturbations.
Infers trajectory class with 68% Top-1 and 81% Top-2 accuracy.
Abstract
Robots interacting with humans must not only generate learned movements in real-time, but also infer the intent behind observed behaviors and estimate the confidence of their own inferences. This paper proposes a unified model that achieves all three capabilities within a single hierarchical predictive-coding recurrent neural network (PC-RNN) equipped with a class embedding vector, CERNet, which leverages a dynamically updated class embedding vector to unify motor generation and recognition. The model operates in two modes: generation and inference. In the generation mode, the class embedding constrains the hidden state dynamics to a class-specific subspace; in the inference mode, it is optimized online to minimize prediction error, enabling real-time recognition. Validated on a humanoid robot across 26 kinesthetically taught alphabets, our hierarchical model achieves 76% lower…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Human Pose and Action Recognition · Action Observation and Synchronization
