Fast reprogramming and adaptive reproduction of contact-rich assembly

Dimitrios Rakovitis; Vamsi Krishna Origanti; Vinzenz Bargsten; Adrian Danzglock; Dennis Mronga; Frank Kirchner

PMC · DOI:10.3389/frobt.2026.1746577·March 18, 2026

Fast reprogramming and adaptive reproduction of contact-rich assembly

Dimitrios Rakovitis, Vamsi Krishna Origanti, Vinzenz Bargsten, Adrian Danzglock, Dennis Mronga, Frank Kirchner

PDF

Open Access

TL;DR

This paper introduces a new robotic framework that improves assembly tasks by learning from a few demonstrations and adapting to changes in real-world conditions.

Contribution

The novel framework enables adaptive reproduction of contact-rich assembly policies with minimal reprogramming and high success rates.

Findings

01

The framework achieved an 83% success rate in assembly tasks compared to 29.8% with traditional controllers.

02

It demonstrated robustness and transferability under geometric and pose variations.

03

The system uses only force/torque and proprioceptive sensing for adaptive contact handling.

Abstract

Modern manufacturing demands flexible, robust robotic assembly systems capable of handling variable part geometries and dynamic task configurations. Current approaches often suffer from limited generalization, high sample complexity, and the need for extensive reconfiguration or retraining when task parameters change. This paper addresses these limitations by introducing a novel framework that enables adaptive reproduction of kinesthetically taught, contact-rich assembly policies, using only force/torque and proprioceptive sensing. The approach combines three components: i. synchronized wrench–motion Dynamic Movement Primitives (wDMPs) that encode coupled motion and wrench profiles from a single demonstration; ii. an uncertainty-aware Model Predictive Controller (MPC) that updates its model online to enable compliant and adaptive contact handling using uncertainty estimated via a…

Figures8

Click any figure to enlarge with its caption.

Graphical abstract of the proposed framework.

Dataflow of the ART-based contact learning and classification.

(Top) IndustRealKit parts, (Bottom) Disc brake parts.

Output of force applied during contact exploration (Equation 10) with: fx=2.7Hz , fy=5.4Hz , fz=4.5Hz .

Exemplary data from the ART-based contact classification. From top to bottom: the six channels of the raw measurement (three forces, three torques), the corresponding frequency magnitudes stacked, the magnitudes after max-pooling and scaling, the assigned category index, and indication of novelty (mismatch).

Force measurements over the 47 assembly trials listed in Tables 2–5. (Top) 27x peg-in-hole, (Middle) 10x plug insertion, (Bottom) 10x car-parts. (Inset) Average force for all trials.

Snapshots of three assembly tasks following our policy. (Top) Medium cylinder peg, (Middle) 3-prong plug, (Bottom) Wheel bearing.

Distance to final position (Left) and orientation (Right) goals. Results over the 47 assembly trials listed in Tables 2–5: (Top) 27x peg-in-hole, (Middle) 10x plug insertion, (Bottom) 10x car-parts.

Tables6

TABLE 1. Hyperparameters selection.

Component	Group	Parameter	Value
DMP	Gains	$α_{x}, β_{x}$	$25.0, 6.25$
		$α_{ω}, β_{ω}$	$25.0, 6.25$
		$α_{λ}, β_{λ}$	$25.0, 6.25$
	Basis	$N$	50
Dither	Amplitudes	$A_{x}, A_{y}, A_{z}$	$4.0, 4.0, 3.0$
	Frequencies	$f_{x}, f_{y}, f_{z} (H z)$	$2.7, 5.4, 4.5$
	Force bounds	$F_{c, a d}^{\min}, F_{c, a d}^{\max} (N)$	$- 23.5, 0$
	Logistic params	$α_{F}, c_{F}$	$- 10, 0.5$
ART	STFT	Window size	120
	STFT	Overlap	0
	Features	Frequency magnitude range selection	1–61
	Features	Max pooling	32
	Vigilance	$ρ_{global}$	0.93
	Vigilance	$ρ_{local}$	0.94
	Bias	$α_{T}$	0.001
	Learning rate	$b$	0.7
MPC	Stage cost	$Q$	$b l k d i a g (1 0^{4} I_{n \times n}, 5 I_{n \times n})$
	Stage cost	$R$	$0.01 I_{n}$
	Terminal cost	$Q_{T}$	$b l k d i a g (1 0^{6} I_{n \times n}, 500 I_{n \times n})$
	Horizon	$T (s)$	0.45
GMM	Components	$K$	Peg	Plug		Car
GMM	Components	$K$	2	3		2
Uncertainty	Lin. stiffness bounds	$K_{l}^{\min}, K_{l}^{\max} (N / m)$	$1, 200$
	Ang. stiffness bounds	$K_{a}^{\min}, K_{a}^{\min} (N m / r a d)$	$0.01, 0.5$
	Logistic params	$α_{K}, c_{K}$	$- 10, 0.5$
	Force bounds	$F_{d, r}^{\min}, F_{d, r}^{\max} (N)$	$- 20, 0$
	Logistic params	$α_{r}, c_{r}$	$- 10, 0.5$
			Peg		Plug		Car
	Offset	$c$	$- 117.5$		$- 110.0$		$- 121.0$
	Slope	$β$	$- 0.038$		$- 0.038$		$- 0.038$

TABLE 2. Peg-in-hole: success rates.

Method	Cylinder			Orthogonal			Gears			Subtotal
Method	L	M	S	L	M	S	L	M	S	Total
CIC	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$1 / 3$	$2 / 3$	$3 / 27$
MPC	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$1 / 3$	$0 / 3$	$1 / 27$
MPVIC	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$0 / 3$	$2 / 3$	$2 / 27$
uMPC-ART	$3 / 3$	$2 / 3$	$2 / 3$	$3 / 3$	$2 / 3$	$0 / 3$	$3 / 3$	$0 / 3$	$3 / 3$	$18 / 27$
${uMPC - ART}_{r^{+}}$	$3 / 3^{# r = 0}$	$3 / 3^{# r = 1}$	$3 / 3^{# r = 1}$	$3 / 3^{# r = 0}$	$3 / 3^{# r = 1}$	$0 / 3^{# r = 3}$	$3 / 3^{# r = 1}$	$0 / 3^{# r = 3}$	$3 / 3^{# r = 0}$	$21 / 2 7^{# r = 10}$
${uMPC - ART}_{r^{+}}^{stiff}$	$2 / 3^{# r = 2}$	$3 / 3^{# r = 0}$	$1 / 3^{# r = 2}$	$3 / 3^{# r = 0}$	$3 / 3^{# r = 1}$	$1 / 3^{# r = 3}$	$3 / 3^{# r = 2}$	$0 / 3^{# r = 3}$	$2 / 3^{# r = 1}$	$18 / 2 7^{# r = 14}$

TABLE 3. Peg-in-hole: average completion time [s] of successful trials.

Method	Cylinder			Orthogonal			Gears			Subtotal
Method	L	M	S	L	M	S	L	M	S	Mean $\pm$ std
CIC	–	–	–	–	–	–	–	$31.0 \pm 0.0$ s	$30.0 \pm 1.4$ s	$30.3 \pm 1.2$ s
MPC	–	–	–	–	–	–	–	$28.0 \pm 0.0$ s	–	$28.0 \pm 0.0$ s
MPVIC	–	–	–	–	–	–	–	–	$52.5 \pm 2.1$ s	$52.5 \pm 2.1$ s
uMPC-ART	$57.3 \pm 1.2$ s	$61.5 \pm 2.1$ s	$62.0 \pm 1.4$ s	$64.0 \pm 1.7$ s	$62.0 \pm 0.0$ s	–	$59.3 \pm 1.2$ s	–	$62.0 \pm 1.0$ s	$61.1 \pm 2.5$ s
${uMPC - ART}_{r^{+}}$	$57.3 \pm 1.2$ s	$81.0 \pm 33.8$ s	$84.3 \pm 38.7$ s	$64.0 \pm 1.7$ s	$89.0 \pm 46.8$ s	–	$79.7 \pm 34.1$ s	–	$62.0 \pm 1.0$ s	$73.9 \pm 27.2$ s
${uMPC - ART}_{r^{+}}^{stiff}$	$99.0 \pm 41.0$ s	$65.0 \pm 1.73$ s	$128.0 \pm 0.0$ s	$63.3 \pm 2.9$ s	$104.7 \pm 35.3$ s	$119.0 \pm 0.0$ s	$101.3 \pm 35.9$ s	–	$62.0 \pm 2.8$ s	$87.3 \pm 30.4$ s

TABLE 4. Plug insertion and car-parts assembly: success rates.

Method	Plug		Car parts		Subtotal
Method	3-prong	2-prong	Wheel bearing	Wheel disc	Plug	Car parts
CIC	$4 / 5$	$2 / 5$	$2 / 5$	$3 / 5$	$6 / 10$	$5 / 10$
MPC	$1 / 5$	$0 / 5$	$2 / 5$	$5 / 5$	$1 / 10$	$7 / 10$
MPVIC	$4 / 5$	$1 / 5$	$4 / 5$	$5 / 5$	$5 / 10$	$9 / 10$
uMPC-ART	$5 / 5$	$0 / 5$	$5 / 5$	$2 / 5$	$5 / 10$	$7 / 10$
${uMPC - ART}_{r^{+}}$	$5 / 5^{# r = 0}$	$3 / 5^{# r = 5}$	$5 / 5^{# r = 0}$	$5 / 5^{# r = 3}$	$8 / 1 0^{# r = 5}$	$10 / 1 0^{# r = 3}$
${uMPC - ART}_{r^{+}}^{stiff}$	$4 / 5^{# r = 3}$	$5 / 5^{# r = 5}$	$3 / 5^{# r = 1}$	$5 / 5^{# r = 1}$	$9 / 1 0^{# r = 8}$	$8 / 1 0^{# r = 2}$

TABLE 5. Plug insertion and car-parts assembly: average completion time [s] of successful trials.

Method	Plug		Car parts		Subtotal
Method	3-prong	2-prong	Wheel bearing	Wheel disc	Plug	Car parts
CIC	$33.8 \pm 1.0$ s	$36.5 \pm 2.1$ s	$55.5 \pm 4.9$ s	$40.3 \pm 8.5$ s	$34.7 \pm 1.9$ s	$46.4 \pm 10.5$ s
MPC	$30.0 \pm 0.0$ s	–	$59.5 \pm 0.7$ s	$51.0 \pm 0.7$ s	$30.0 \pm 0.0$ s	$53.4 \pm 4.2$ s
MPVIC	$65.5 \pm 3.3$ s	$64.0 \pm 0.0$ s	$74.3 \pm 2.5$ s	$67.6 \pm 0.5$ s	$65.2 \pm 2.9$ s	$70.6 \pm 3.8$ s
uMPC-ART	$78.6 \pm 4.6$ s	–	$96.4 \pm 0.5$ s	$90.0 \pm 0.0$ s	$78.6 \pm 4.6$ s	$94.6 \pm 3.2$ s
${uMPC - ART}_{r^{+}}$	$78.6 \pm 4.6$ s	$151.7 \pm 17.6$ s	$96.4 \pm 0.5$ s	$139.0 \pm 44.8$ s	$106.0 \pm 39.1$ s	$117.7 \pm 37.3$ s
${uMPC - ART}_{r^{+}}^{stiff}$	$106.8 \pm 31.7$ s	$141.0 \pm 5.2$ s	$108.3 \pm 21.6$ s	$124.8 \pm 33.1$ s	$125.8 \pm 26.7$ s	$118.6 \pm 28.8$ s

TABLE 6. Overall success across all tasks.

Method	Overall success
CIC	$14 / 47$ (29.8%)
MPC	$9 / 47$ (19.1%)
MPVIC	$16 / 47$ (34.0%)
uMPC-ART	$30 / 47$ (63.8%)
${uMPC - ART}_{r^{+}}$	$39 / 47$ (83.0%)
${uMPC - ART}_{r^{+}}^{stiff}$	$35 / 47$ (74.5%)

Keywords

adaptive model predictive controladaptive resonance theoryagile manufacturingcontact-rich assemblydynamic movement primitives

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Manufacturing Process and Optimization · Teleoperation and Haptic Systems

Full text

Introduction

1

Modern manufacturing increasingly demands the ability to produce a diverse range of products, requiring assembly systems to handle frequent changes in part geometry, fixture configurations, and sequencing. This variability challenges traditional robotic assembly solutions, which are typically finely engineered or trained for well-defined tasks and variations (Luo et al., 2019; Tang et al., 2023; Noseworthy et al., 2025; Guo et al., 2025; Wu et al., 2025; Yan et al., 2021; Jha et al., 2022; Schoettler et al., 2020; Goyal et al., 2024; Kim et al., 2020; Chang et al., 2022; Morgan et al., 2023). Even then, such solutions are often labor-intensive and time-consuming, which underscores the need for systems capable of robust, adaptive behavior with minimal manual reprogramming.

To address these challenges, we propose a novel framework that enables fast, intuitive robot programming and adaptive reproduction of previously unseen, contact-rich assembly tasks using only a force/torque (F/T) sensor and error measurements. The method integrates Dynamic Movement Primitives (DMPs), Adaptive Model predictive Control (AMPC), and an Adaptive Resonance Theory (ART)–based contact classifier. DMPs encode human-taught assembly demonstrations as stable nonlinear dynamical systems, which capture the motions and wrenches required to perform a task and enable smooth generalization to new initial states and goal fixture configurations (Ijspeert et al., 2013; Kramberger et al., 2017; Steinmetz et al., 2015). AMPC is a model-based predictive controller (MPC) whose prediction model is updated online from data (Rakovitis and Mronga, 2024), enabling adaptive and compliant resolution of errors, such as misalignments caused by unexpected contacts during reproduction of the learned assembly. An ART-based contact classifier is a neural network (NN) that incrementally classifies F/T patterns under a vigilance criterion to recognize known contacts and flag novel ones on the fly (Bargsten et al., 2025). These three methods are combined as follows.

For each new assembly task family (e.g., peg-in-hole or plug-insertion), the approach lets a user train the system in two demonstrations (fast reprogramming): i. a kinesthetic teaching, and ii. an assistive reproduction. At first, the required coupled motion and wrench profiles are captured from a single nominal demonstration using synchronized wrench-motion DMPs (wDMPs). The user segments the demo into the necessary sequences, so that a separate wDMP is trained for i. aligning the parts with the fixture and ii. inserting the parts. Then, each wDMP provides MPC with the motion and wrench references needed to reproduce the learned task. In the second step, an assistive reproduction is executed for the same goal pose and parts, using MPC which models the interaction dynamics with a Cartesian Impedance Model (CIM). During this demo, the robot collects contact data (error-F/T measurements) from a human-ensured, successful assembly, for training ART to recognize correctly aligned contact patterns, and for fitting a Gaussian Mixture Model (GMM) for uncertainty estimation under nominal operation. This GMM later provides uncertainty estimates for new, autonomous assembly executions, by evaluating the likelihood of incoming observations w.r.t. a successful assembly. We refer to this training procedure as “fast reprogramming”, because each task needs to be taught only once and reproduced successfully once, reducing the required training time to essentially the length of the two demonstrations.

After learning, the method is deployed on a real system to autonomously execute diverse, previously unseen assembly tasks with varying geometries, start and goal poses (autonomous reproduction phase). In this setting, misalignments and unintened contacts caused by modelling or goal-estimation errors are inevitable. To address this, after an alignment attempt and before the actual part insertion, the contact surface is explored by applying sinusoidal dither forces at the EE, about the estimated goal direction, guided by tracking errors. The ART classifier monitors F/T data to flag undesired contacts and acts as a scheduler, triggering insertion upon confident alignment or initiating a retrial when the context deviates from known patterns. Throughout, the estimated GMM uncertainty adapts the MPC model based on the distribution of error-F/T measurements, enabling compliant and adaptive exploratory behavior in case of misalignments. A graphical abstract of the proposed approach can be seen in Figure 1.

Graphical abstract of the proposed framework.

Despite recent progress in robotic assembly, most of the literature focuses on learning or engineering nominal policies, while the problem of reliably reproducing such policies under variations in parts and real-world uncertainties, such as modeling errors and localization inaccuracies, remains comparatively underexplored. To the best of our knowledge, our method is the first to combine fast programming from only two demonstrations, uncertainty-aware adaptive compliance within MPC, and online contact-context classification that regulates stage transitions and triggers retrials. Together, these components are designed to reduce reprogramming effort, while improving the chances of successful reproduction of the learned policy in the context of real world, contact-rich assembly under variations.

The approach is evaluated experimentally using a 7-DOF KUKA LBR iiwa14 R820 manipulator, equipped with a F/T sensor at the wrist. The robot performs a wide range of contact-rich assembly tasks, starting with the standard IndustRealKit benchmark (Tang et al., 2023). This includes classic peg-in-hole tasks with cylindrical and rectangular pegs of varying sizes, two- and three-prong plug insertions (which require significant insertion force), and multi-stage gear assemblies involving both peg insertion and gear teeth alignment. Beyond these benchmarks, we evaluate the system on a real-world industrial scenario: a multi-stage disc brake assembly, requiring the handling of heavy components and the application of substantial forces. Each task is repeated across multiple trials to obtain statistically meaningful results. As our novelty lies primarly in the adaptive reproduction of kinesthetically taught policies, we benchmark our reproduction scheme against two state-of-the-art baseline controllers (classic Cartesian impedance control (Origanti et al., 2025), and error-based predictive variable impedance control (Anand et al., 2023)), as well as ablation baselines to highlight the importance of key components during reproduction.

Overall, the contributions of this work are listed below:

We propose a novel, user-friendly framework for fast programming and adaptive reproduction of contact-rich robotic assembly tasks, that combines wDMPs with AMPC. This combination allows robust generalization to varying start and goal assembly configurations via wDMPs, as well as to different part geometries via AMPC, which enables compliant and adaptive contact handling based on uncertainty.
We combine the above with an ART-based neural contact classifier for real-time detection of undesired contact patterns and misalignments, which also serves as a scheduler for assembly stage transitions and retrials.
We provide comprehensive experimental validation on diverse industrial and benchmark scenarios, demonstrating improved handling of misalignments, and adaptability over baseline approaches.

The remainder of the paper is structured as follows: Section 2 details related works, Section 3 outlines the proposed methodology, Section 4 discusses the experimental results, and Section 5 concludes with a summary of findings and potential directions for future work.

Related work

2

Prior research on learning, controls, and contact classification for contact-rich tasks is detailed in the following subsections.

Related works on learning for assembly

2.1

Recent research in robotic assembly has explored a wide range of learning-based approaches to address the challenges of contact-rich insertion tasks. Reinforcement learning (RL) methods have shown promise for learning robust, low-tolerance behaviors, e.g., with Guided policy search (Levine et al., 2015), and RL-based variable impedance control (Luo et al., 2019). Simulation-based RL pipelines such as IndustReal (Tang et al., 2023) and FORGE (Noseworthy et al., 2025) have demonstrated successful sim-to-real transfer with tight-tolerance insertions, through domain randomization, signed-distance rewards, or force conditioning. Meta-RL approaches (Schoettler et al., 2020) leverage shared structure across insertion tasks to enable rapid adaptation from simulation to real-world scenarios. Similarly, SRSA (Guo et al., 2025) retrieves relevant skills from a pre-existing policy library using predicted transfer success, and fine-tunes them on new assembly tasks using Proximal Policy Optimization (PPO) combined with self-imitation learning.

Supervised and imitation learning approaches have also proven effective. In Yan et al. (2021), a contact-state recognition model is trained on F/T data, collected from various inclined peg-in-hole configurations. A support vector machine (SVM) with a Gaussian kernel enables accurate classification of the contact state, which then informs the parameters of an adaptive impedance controller to compliantly correct misalignments. In Jha et al. (2022), DMPs are used to learn a nominal insertion trajectory from human demonstrations, with a corrective compliance policy to handle vision-based goal estimation errors. A generalized accommodation controller bounds contact forces for safe exploration and data collection, while a Gaussian Process (GP) trained on F/T data predicts misalignments to guide successful insertions despite pose errors. DMPs (Ijspeert et al., 2013) have been extensively adapted to represent forces and complex, multi-modal behaviors. Several studies have incorporated force feedback into DMP frameworks (Pastor et al., 2009; Hoffmann et al., 2009), while others have employed basis function regression to model and reproduce force profiles (Ude et al., 2010; Kramberger et al., 2017). Extensions such as Task-Parameterized DMPs (Calinon, 2016) and Probabilistic Movement Primitives (ProMPs) (Paraschos et al., 2013) enhance spatial generalization and capture variability across several demonstrations. Origanti et al. (2022) showcases automatic extraction of skill parameters from human demonstrations, to replicate both motion and force trajectories in simulation, particularly for contact-rich manipulation. Complementarily, RVT-2 (Goyal et al., 2024) introduces a multi-view vision-based system that learns high-precision manipulation from just a few demonstrations. It uses supervised behavioral cloning to map a third-person RGB-D input and language instructions to key-frame poses, with a multi-view transformer and a coarse-to-fine inference strategy enabling millimeter-level accuracy using only visual data. TacDiffusion (Wu et al., 2025) leverages diffusion models trained on expert demonstrations using cuboid pegs to map tactile observations to force-domain actions, achieving a 95.7% zero-shot transfer success rate on novel peg-in-hole tasks including cylinder, prism, and key-shaped pegs.

Imitation learning has also been combined with RL in Kim et al. (2020) by using a human demonstration to train a NN-based movement primitive (NNMP), which constrains RL to a known trajectory manifold. With a properly designed reward function, this enabled the agent to learn high-precision contact tasks while minimizing applied forces. In Chang et al. (2022), an assembly task learned via motion-force DMPs is reproduced with a low-level admittance controller, whose stiffness is tuned by RL, enabling real-time impedance adaptation. Other works, using compliance-enabled strategies (Morgan et al., 2023) or neglecting force feedback (Park et al., 2017) offer hardware-efficient alternatives that exploit passive compliance or implicit search behaviors.

Despite these advancements, several limitations are shared across the above works. Many approaches are sample-inefficient, requiring extensive manual training or simulations that can take several hours to days (Luo et al., 2019; Tang et al., 2023; Noseworthy et al., 2025; Guo et al., 2025; Wu et al., 2025; Yan et al., 2021; Jha et al., 2022; Calinon, 2016; Paraschos et al., 2013; Kramberger et al., 2017), or additional multiple real-world trials $[eqn]$ for transfer learning (Schoettler et al., 2020; Goyal et al., 2024; Kim et al., 2020; Chang et al., 2022). Generalization to arbitrary geometries or unseen part configurations is limited, often requiring timely retraining, fine re-engineering (Morgan et al., 2023), or even specifically engineered sensing conditions such as calibrated multi-view camera rigs (Goyal et al., 2024). Additionally, many of the approaches, consider cut-off forces or fixed compliance (Schoettler et al., 2020; Park et al., 2017; Morgan et al., 2023; Noseworthy et al., 2025) for safe exploration, which may result in task failure in diverse tasks involving significantly different interaction forces, e.g., resistant plug insertion or sliding on high friction surfaces. Adapting these systems for rapidly changing contact-rich assembly tasks, typically entails significant reconfiguration. For example, in a manufacturing environment this would result in costly extended production downtime (Liu et al., 2012), whenever small changes in production are introduced. This highlights the need for flexible, intuitive, and data-efficient methods that can rapidly adapt to diverse changes in assembly settings. For these reasons, in this work we draw inspiration from Steinmetz et al. (2015) and extend standard DMPs (Fabisch, 2024) to learn coupled motion–wrench profiles from a single Cartesian space demonstration. We then reproduce these profiles with an adaptive controller, enhancing generalization to varying part geometries, start and goal configurations.

Related works on controls

2.2

Typically the above works rely on compliant systems (passive or active), with many of them using Cartesian Impedance Control (CIC) to introduce the required task space compliance for safe contact interaction. However, CIC is inherently reactive; it lacks prediction and cannot natively enforce constraints (e.g., joint/torque limits), so it often relies on heuristic task-specific safety margins (Origanti et al., 2025).

Recent (non-assembly) studies combine CIC with MPC into model-based impedance- (MPIC) (Bednarczyk et al., 2020) or variable-impedance control (MPVIC) (Thelenberg and Ott, 2024; Anand et al., 2023) for tasks involving contact uncertainties or variable stiffness objectives. By embedding impedance dynamics in MPC, these approaches retain compliance while enabling look-ahead planning, explicit constraint handling, and proactive adaptation across contact transitions. In particular Thelenberg and Ott (2024) and Anand et al. (2023), leverage the impedance model inside the MPC to forecast the stiffness and damping commanded by a low-level variable-impedance controller, improving task adaptability.

However Bednarczyk et al. (2020), enforces a fixed stiffness profile over the entire task, limiting the controller’s ability to adapt compliance dynamically based on environmental conditions or task phases. Conversely Thelenberg and Ott (2024) and Anand et al. (2023), drive stiffness adaptation by the magnitude of tracking errors, increasing stiffness when the robot is far from its target. While intuitive, this heuristic can lead to unphysical or unsafe stiffness changes in contact-rich scenarios, where increased stiffness at the wrong moment may cause damage, or task failure during contact transitions.

A related line of work addresses contact uncertainty using adaptive MPC, in which the prediction model is updated online based on an estimated contact model. This contact model is either learned, e.g., via RL (Xu S. et al., 2022), supervised learning (Rakovitis and Mronga, 2024), or meta-learning (Saviolo et al., 2024; Anne et al., 2021; Arcari et al., 2023), or predicted via adaptive control or system-identification techniques (Minniti et al., 2021; Xu J. et al., 2022). Such approaches have mainly been demonstrated on mobile manipulators, quadrupeds, and quadrotors, where they are used to compensate for unknown or time-varying dynamics, including payload changes and external disturbances.

Although, all the above control approaches are achieving great results on dealing with contact uncertainties, their effectiveness on fine assembly tasks have yet to be validated, where high precision, delicate contact handling, and sub-millimeter accuracy are required. This highlights the need for more robust studies on context-aware controllers that account for both contact dynamics and compliance requirements to succeed across fine and diverse assembly tasks. Hence, in this work we extend MPC with a Cartesian impedance contact model, whose stiffness and desired wrench are continuously adapted based on uncertainty estimates. These uncertainties are derived from the likelihood of the observed errors and wrench measurements under a GMM trained on a nominal (successful) execution. This measures the proximity of the current situation to an out-of-distribution (OOD) case, thereby enabling adaptive and compliant reproduction of the learned assembly task when unexpected interactions occur. We employ GMMs, as they have been shown to be highly sample-efficient, especially in high-dimensional spaces, and very fast to train (Calinon et al., 2007; Rakovitis and Mronga, 2024). GMMs have also been used before for anomaly and OOD detection (Zong et al., 2018; Iwata and Kumagai, 2022).

Related works on contact classification

2.3

The detection and classification of contacts between a robot and its environment is essential for inferring the current state of the system. Prior research broadly falls into two focal points. The first addresses the detection of unintended collisions, aiming to mitigate impact forces via reflexive counter-actions. Classically, this type of approaches compare an estimated impact metric against a pre-defined threshold. Because robotic assembly inherently involves purposeful contact, our work aligns with the second focal point: the classification of intended contacts to obtain a more fine-grained assessment of the contact forces when performing an assembly task.

Several works have tried to bridge the gap between the two focal points by jointly distinguishing unintended collisions from intended interactions. For example (Cho et al., 2012), differentiates collisions from intended contact by monitoring the rate of change in joint torque measurements, while frequency-domain analyses have also shown promise (Kouris et al., 2016; 2018). In robotic assembly, however, the classification into only two classes (three with no-contact class) based on thresholds is insufficient to assess the successful insertion and joining of rigid and elastic parts with tight tolerances. To address this, some works such as Pankert and Hutter (2023) incorporate visual monitoring with prior knowledge to improve kinematic-level performance by fusing CAD models, tactile cues, and particle simulation to refine object localization.

In contrast, our objective is an assessment driven by contact wrenches, that does not depend on precise prior models or expert supervision. Other domains underscore the potential of such signals, including classification from joint torque measurements (Iskandar et al., 2024) and from acoustic vibration sensing (Liu and Chen, 2024). Within robotic assembly specifically, two preliminary works have shown the feasibility of employing an incremental machine learning approach for continuous classification of episodes encoded by the frequency magnitudes of joint torque measurements (Bargsten and Kirchner, 2023) or EE wrench measurements (Bargsten et al., 2025). In these approaches, time-series measurements are encoded using a short-time Fourier transform (STFT), producing compact, episodic signatures that lend themselves to real-time classification via Adaptive Resonance Theory (ART).

ART originates in cognitive science and models dynamic processes for learning and adapting short- and long-term memory in the human brain (Grossberg, 1976; Carpenter and Grossberg, 1987). ART’s learning principle is a match-based learning that relies on input similarity, in contrast to error-based batch learning such as backpropagation. Therefore, ART naturally supports continuous learning of novel patterns without catastrophic forgetting, and allows to capture rare input events. Building on this foundation, numerous algorithmic variants (Brito da Silva et al., 2019) have been developed that simplify and operationalize these principles. To this end, we adopt Distributed Dual Vigilance Fuzzy ART (DDVFA) (Brito da Silva et al., 2020), a variant of ART in which each learned class is represented by a Fuzzy ART (Carpenter et al., 1991) NN, i.e., an unsupervised clustering model that measures input similarity via fuzzy set operations. This nested design thus represents classes as groups of sub-classes and is able to capture arbitrarily shaped, heterogeneous clusters in the data. Differently from the prior applications, we leverage this property to use DDVFA as a context monitor, classifying contact patterns during assembly and triggering transitions between assembly stages.

Methodology

3

This section describes the proposed methodology in five subsections covering: 1. the DMPs formulation, 2. a force-based contact exploration strategy, 3. the contact classification module via ART, 4. the MPC problem, and 5. the estimation of uncertainty via a GMM and its use to adapt MPC in real-time. Our framework consists of two parts: i. the learning, and ii. the reproduction phase. The complete pipelines for each phase, are detailed in Algorithm 1 and Algorithm 2, respectively.

Algorithm 1Learning of contact-rich assembly.

Input : Single kinesthetic demonstration $[eqn]$ in gravity mode
Output: wDMPs, ART classifier, GMM
Learning is performed in 2 demos:
1) Kinesthetic teaching
1.1) Learn forcing terms $[eqn]$ (Equation 3) of DMPs (Equations 1–9) from $[eqn]$ .
1.2) Segment demo into alignment and insertion $[eqn]$ fit one wDMP per segment.
2) Assistive reproduction (with MPC (Section 3.4))
2.1) Execute alignment wDMP, with human assistance:
– wDMPs $[eqn]$ desired pose and wrench $[eqn]$ MPC.
– Human $[eqn]$ minimal corrections $[eqn]$ ensure parts alignment.
2.2) Once aligned $[eqn]$ axial dither on assembly Z-axis (Equation 12) $[eqn]$ collect contact F/T data $[eqn]$ train ART classifier (Section 3.3).
2.3) Execute insertion wDMP: collect error–wrench data under nominal insertion in $[eqn]$ fit a GMM (Equation 19) on $[eqn]$ .

Algorithm 2Adaptive autonomous reproduction of contact-rich assembly. Algorithm box for adaptive autonomous reproduction of contact-rich assembly showing four main steps: alignment with wDMP, contact exploration with dither forces, alignment detection by restricting dither and classifying contacts, and insertion using wDMP. Inputs and outputs are listed above policy steps.

Dynamic movement primitives

3.1

Each assembly task is specified by the parts to be assembled (whose geometry and required interaction forces may be unknown), along with an initial pose and a fixed goal pose in Cartesian space (which are known). Hence, each task is encoded by two DMPs that share a single phase variable, thereby synchronizing pose and wrench generation over time:

a Cartesian DMP that generates EE pose trajectories (3D position and 4D quaternion),
a Wrench DMP that reproduces the EE wrench (3D force and 3D torque).

We refer to this pair as a synchronized wrench–motion DMP (wDMP). By jointly learning motion and the associated interaction wrenches, wDMPs extend standard DMPs (Ijspeert et al., 2013; Fabisch, 2024) from pure kinematics to coupled motion-wrench behavior, enabling a robot to acquire the spatial and wrench dynamics required for contact-rich manipulation from a single demonstration.

Cartesian DMP

3.1.1

We follow the formulation in Fabisch (2024), which supports Cartesian trajectories, position $[eqn]$ and orientation as quaternions $[eqn]$ , to ensure smooth interpolation on the unit sphere.

The position evolves according to the standard second-order dynamical system:

[eqn]

[eqn]

where $[eqn]$ is an auxiliary state representing the Cartesian velocity, $[eqn]$ is the goal position, $[eqn]$ is the temporal scaling factor, $[eqn]$ and $[eqn]$ are positive gains, and $[eqn]$ is the nonlinear forcing term learned from demonstration and represented using a weighted sum of $[eqn]$ radial basis functions (RBFs). A forcing term is given by:

[eqn]

where $[eqn]$ are the learned weights, $[eqn]$ are the centers, and $[eqn]$ are widths of the basis functions. The phase variable $[eqn]$ evolves over time according to the canonical system:

[eqn]

with $[eqn]$ .

For orientation, using the full quaternion error formulation (Ude et al., 2014; Fabisch, 2024), we write.

[eqn]

[eqn]

Here, $[eqn]$ is the angular velocity, $[eqn]$ and $[eqn]$ denote respectively the initial and desired (goal) quaternions, and $[eqn]$ is the respective forcing term. The positive constants $[eqn]$ and $[eqn]$ control the convergence rate and damping, respectively. The logarithmic quaternion error is

[eqn]

with $[eqn]$ the quaternion product, $[eqn]$ the conjugate of $[eqn]$ , and $[eqn]$ the $[eqn]$ logarithmic map. This formulation constrains the trajectory to the unit quaternion manifold, avoiding singularities and enabling consistent learning and reproduction.

Wrench DMP

3.1.2

To model contact interaction, a similar DMP is used for the wrench vector $[eqn]$ :

[eqn]

[eqn]

where $[eqn]$ is the goal wrench, the forcing term $[eqn]$ is learned componentwise for all six dimensions, and all other parameters are similarly defined as in Cartesian DMP. This DMP shares the same phase variable $[eqn]$ as the Cartesian DMP to ensure time alignment between motion and wrench evolution (Gams et al., 2014).

Training of wDMPs

3.1.3

For each novel assembly, a human performs kinesthetic teaching of the robot in gravity-compensation mode to provide a single demonstration. The measured pose and wrench trajectories are used to fit the forcing terms $[eqn]$ , $[eqn]$ , $[eqn]$ (Equation 3) via imitation learning using ridge regression (Hoerl and Kennard, 1970). Assuming that each assembly task consists of two phases: i. alignment with the fixture, and ii. part insertion, the user segments the demonstration so that one wDMP is obtained for each phase (Equations 1–9). At execution time, the Cartesian DMP yields the desired EE pose trajectory, while the Wrench DMP simultaneously produces the expected interaction wrench.

Contact exploration via force-driven dither motions

3.2

Having an expert-learned wDMP is not enough to guarantee a successful reproduction of the learned assembly. In reality, small misalignments during reproduction could occur due to modeling inaccuracies or minor geometric mismatches. To overcome this issue, an error-driven, force-based exploration policy is employed between the alignment and insertion phase. To facilitate local contact exploration, a time-varying dither force $[eqn]$ is applied at the EE. This force consists of sinusoidal components along each Cartesian direction:

[eqn]

where $[eqn]$ and $[eqn]$ are the amplitude and frequency of the sinusoidal force in the $[eqn]$ -th direction, and $[eqn]$ is a small perturbation term added to prevent the exploration force from vanishing near the goal. The probing direction $[eqn]$ is computed as the normalized vector:

[eqn]

In Equation 11 $[eqn]$ denotes the estimated goal position (e.g., hole center), and $[eqn]$ is the current EE position obtained from forward kinematics. Selecting a low-frequency for these forces allows exploration of the contact surface, while a high-frequency refines an existing contact. These dither motions generate interaction forces that assist in resolving edge contacts and overcoming tight clearances.

Consequently, an alignment detection step is required to check if the exploration strategy resolved the misalignment after a fixed number of attempts or time interval. For this purpose, a similar dither force $[eqn]$ is applied, but only in the assembly direction $[eqn]$ given by wDMP. This is:

[eqn]

where $[eqn]$ are the bounds of the applied force, $[eqn]$ are logistic parameters, and $[eqn]$ is its frequency. These parameters, along with the ones in Equation 10, must be tuned empirically according to the desired outcome for the given robotic system.

To compute the full desired contact wrench for contact exploration, the corresponding contact torques are given as, $[eqn]$ . The exploration process continues until a maximum number of attempts is reached or a successful alignment is detected by the ART-based contact classifier described next.

Contact classification via distributed dual vigilance fuzzy ART (DDVFA)

3.3

The role of the ART classifier in our framework is to verify that the parts to be assembled are properly aligned before joining them, in order to prevent jamming caused by insertion forces applied at incorrect locations. In an ART network, learning proceeds in cycles, governed by an orienting subsystem, which controls the activation and inhibition of nodes representing the categories (classes) during a competition for the best match to the input. Besides the orienting subsystem, a typical ART NN consists of fields (layers) $[eqn]$ and $[eqn]$ that are connected through weight vectors, as well as an optional $[eqn]$ field. A cycle can then be described as follows:

Input encoding. The raw input vector is optionally preprocessed or encoded in the $[eqn]$ field before being presented to the feature representation field $[eqn]$ .
Category activation. This vector is then forwarded through bottom-up weights from the $[eqn]$ field to $[eqn]$ , the category representation field. Each node in $[eqn]$ is a computational unit (neuron) representing a learned category (class) via its weight vectors.
Competition and choice. The $[eqn]$ nodes compete for the highest activation for the current input. The best candidate is selected and the other nodes are inhibited.
Resonance (similarity) test. The chosen $[eqn]$ node sends a top-down expectation pattern back to $[eqn]$ through its top-down weights. The orienting subsystem compares this expectation with the actual input in $[eqn]$ . If their similarity exceeds a vigilance threshold, the node enters a resonant state.
Learning. When the similarity test is satisfied (resonance), the selected node’s bottom-up and top-down weights are updated, thereby incorporating the input into long-term memory. Otherwise, the orienting subsystem suppresses this node and continues the search with the node having the next-highest activation. If none of the nodes achieves a resonant state, a new $[eqn]$ node is created to represent a novel category in memory.

The DDVFA variant of ART implements a nested approach, in which the classes are represented by nodes that are themselves FuzzyART NNs. In FuzzyART, each input feature vector $[eqn]$ is firstly complement coded when passing through the $[eqn]$ field, i.e., $[eqn]$ . Using this encoded feature vector, each node $[eqn]$ then determines its activation $[eqn]$ based on its weight vector $[eqn]$ which serves as the memory:

[eqn]

where $[eqn]$ is the $[eqn]$ norm, $[eqn]$ denotes the fuzzy set $[eqn]$ operator (intersection, element-wise minimum), and $[eqn]$ biases selection toward uncommitted nodes.

Nodes are considered in descending order of $[eqn]$ until one satisfies the resonance criterion

[eqn]

with vigilance parameter $[eqn]$ and match value $[eqn]$ (similarity). The node $[eqn]$ that passes the resonance test, updates its weights according to

[eqn]

with learning rate $[eqn]$ . In DDVFA, a secondary, less strict global vigilance parameter extends this mechanism to regulate learning at the level of grouped classes.

In this work, DDVFA learns and classifies data from a stream of EE wrench samples composed of force and torque vectors, each $[eqn]$ . Before entering the ART network, the wrench stream is preprocessed as follows: i. consecutive sample blocks are windowed with a Blackman–Harris window and transformed to the frequency domain via the Fast Fourier Transform (FFT); ii. the number of features is reduced by max-pooling on the resulting frequency magnitudes; and iii. all features are scaled to [0,1]. The resulting feature vectors are then presented to the ART NN. The overall pipeline is illustrated in Figure 2.

Dataflow of the ART-based contact learning and classification.

In our framework, ART is trained during the assistive reproduction, after a wDMP has been learned. During the alignment phase, the human guides the robot slightly to align properly in case of millimeter errors. After correct alignment is ensured, Equation 12 is applied to collect F/T data characterizing correct alignment with the assembly fixture. These data are used as input to the learning and classification procedure outlined above (Equations 13–15). This results in learning a set of template classes which map to episodes in the contacts of the assembly process. When the learning and adaption is further only activated during the successful alignment check, a distinct learning and further recognition of the frequency magnitude pattern of successful alignment is achieved, mapping all other patterns to a mismatch (category index $[eqn]$ ). The trained classifier is then frozen and deployed as a real-time context monitor and scheduler, triggering insertion after identifying correct alignment or a retrial otherwise, enabling adaptive stage transitions during contact-rich assembly.

Model predictive control

3.4

In this work, MPC is used to reproduce the assembly task. For a manipulator with $[eqn]$ joints, the MPC problem is expressed in joint-space as in (Rakovitis and Mronga, 2024):

[eqn]

where, the system state is denoted by $[eqn]$ , encompassing joint positions and velocities; $[eqn]$ represents the control input in the form of joint torques; $[eqn]$ denotes the joint-space reference obtained via Inverse Kinematics (IK) on the state-space reference given by the Cartesian DMP; $[eqn]$ is the initial state; and $[eqn]$ represents the prediction horizon. The constraints ensure that both the state and input remain within defined bounds $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ . Assuming a rigid contact at the EE, the dynamics $[eqn]$ are given by:

[eqn]

here, $[eqn]$ is the inertia matrix in joint space, $[eqn]$ is the joint acceleration, $[eqn]$ represents the Coriolis, centrifugal, and gravitational effects, and $[eqn]$ is the Jacobian at the contact point. The contact wrench $[eqn]$ is modeled using a CIM:

[eqn]

where the stiffness and damping $[eqn]$ , $[eqn]$ , are diagonal, positive semi-definite matrices. Those control the state-space compliance w.r.t. the EE pose and twist errors $[eqn]$ , $[eqn]$ . The wrench $[eqn]$ is the desired wrench applied at the EE, provided in real-time either by the Wrench DMP or by the force-based contact exploration policy, according to the stage of the assembly. This formulation is chosen as it allows to control the compliance in state-space by adjusting $[eqn]$ , while preserving postural joint-space control. Moreover, to simplify the adaptation of compliance in the system we set $[eqn]$ , with $[eqn]$ . The contact wrench $[eqn]$ is adapted at each time-step before solving MPC (Equations 16, 17) based on operation uncertainty, enabling compliant and adaptive handling of unexpected contacts as outlined in the next subsection.

Uncertainty estimation

3.5

The uncertainty during execution of an assembly operation is derived using a GMM, trained during the human-assisted reproduction. After correct alignment is ensured by the operator and data collection for ART has been completed, the insertion phase takes place. The observed Cartesian pose errors and wrenches recorded at the EE during this phase are collected into a dataset $[eqn]$ , where $[eqn]$ denotes the error measurements, and $[eqn]$ represents the F/T sensor measurements. This dataset is used to train a GMM with Expectation Maximization (Dempster et al., 1977), to model the joint distribution of the error-F/T measurements. A GMM is defined as

[eqn]

where $[eqn]$ denotes a multivariate Gaussian distribution with mean $[eqn]$ and covariance matrix $[eqn]$ . Each component is weighted by a prior $[eqn]$ , such that $[eqn]$ . The number of mixture components, $[eqn]$ , is treated as a hyperparameter and selected via grid search.

The objective is to use this GMM to derive an uncertainty metric, that describes the proximity to OOD measurements during the assembly process. The log-likelihood of an observation, given by

[eqn]

measures how likely the observation belongs to the GMM distribution. To obtain a normalized uncertainty value, the likelihood is mapped to a logistic score

[eqn]

The center $[eqn]$ and slope $[eqn]$ are set by a calibration between the in-distribution (ID) data of $[eqn]$ and a synthetic near-OOD dataset $[eqn]$ , such that $[eqn]$ and $[eqn]$ , with $[eqn]$ . The set $[eqn]$ is constructed by placing points on a constant squared-Mahalanobis shell around each ID point, where the local geometry is approximated by a responsibility-weighted Gaussian (Battistelli and Chisci, 2014). By tuning the Mahalanobis radius so that the synthetic points lie in the negligible-probability tail of the local distribution, we enforce low uncertainty on ID data and a smooth increase in uncertainty as the system drifts toward unmodeled regimes.

This uncertainty is used to modulate the MPC compliance in real-time, by adapting the stiffness and desired wrench of the CIM via logistic functions. The linear and angular stiffness of the CIM adapt as,

[eqn]

with stiffness bounds $[eqn]$ , and with $[eqn]$ denoting the linear or angular part, while the desired contact wrench adapts as

[eqn]

[eqn]

here, the parameters $[eqn]$ denote the slope and midpoint of the logistics; $[eqn]$ is the nominal wrench given by the Wrench DMP or the contact exploration, depending on the stage of the assembly; and $[eqn]$ are the force bounds of the retraction wrench $[eqn]$ . The desired bounds and sigmoid parameters are selected empirically to ensure stable and desired compliant behavior during contact. This formulation (Equations 19–24) adapts compliance to the estimated uncertainty: it maximizes compliance when uncertainty is high (e.g., during unfamiliar contacts) by reducing stiffness and the desired contact wrench, and minimizes compliance when uncertainty is low.

Experimental evaluation

4

In this section, we describe the experimental evaluation of our approach in diverse assembly scenarios, where the assembled parts, as well as their start and goal configuration might change at each trial. The target assembly tasks are defined as:

IndustRealKit (Tang et al., 2023), Figure 3-top
3x Cylinder Pegs: Small (8 mm), Medium (12 mm), Large (16 mm) must be inserted into corresponding holes (clearances: 0.5–0.6 mm),
3x Orthogonal Pegs: Small (8 mm), Medium (12 mm), Large (16 mm) must be inserted into corresponding holes (clearances: 0.5–0.6 mm),
3x Gears: Small (20 mm), Medium (40 mm), Large (60 mm) must be inserted onto corresponding gearshafts (diametral clearances: 0.5 mm),
2x Plug connectors: i. 2-prong, and ii. 3-prong. Must be inserted into corresponding sockets,
Automotive (Car) parts (Disc Brake), Figure 3-bottom
Wheel Bearing: must be inserted onto the spindle,
Wheel Disc: must be placed onto the wheel hub.

(Top) IndustRealKit parts, (Bottom) Disc brake parts.

We train our framework on three parts only: i. the large cylindical peg, ii. the 3-prong plug, and iii. the wheel bearing. The goal is to generalize to related parts in each case respectively: i. other peg-in-hole variants, ii. the two-prong plug, and iii. the wheel disc. We assume the operator selects the task class (peg-in-hole, plug-insertion, or disc brake assembly) in advance. For each case, the learned, task-specific wDMPs, ART, and GMM models are used.

For each learned assembly, we run multiple trials to reproduce all variations using each of the following controllers:

CIC: a Cartesian Impedance Controller with joint limits and singularities avoidance utilizing the nullspace, as implemented in Origanti et al. (2025). This controller maintains fixed stiffness which has been determined empirically and does not use force-based contact exploration or alignment detection.
MPC: standard MPC. This method uses $[eqn]$ in Equation 18, and does not use force-based contact exploration or alignment detection.
MPVIC: MPC with a Cartesian impedance model, which increases stiffness as pose errors grow (inspired by Anand et al. (2023)). This methods sets $[eqn]$ and skips the alignment detection phase.
uMPC-ART: our approach, but without executing a retrial on misalignment.
$[eqn]$ : our approach with retrial.
$[eqn]$ : our approach with retrial and with fixed maximum stiffness $[eqn]$ , instead of adaptive via uncertainty.

For all the above, the MPC is solved at $[eqn]$ using the Feasibility-driven Differential Dynamic Programming approach of the Crocoddyl library (Mastalli et al., 2020), which computes robot dynamics via the pinocchio library (Carpentier et al., 2019). The IK solution, computed with the TRAC-IK library (Beeson and Ames, 2015), is configured to find the closest solution to the current state and provides references to the MPC at $[eqn]$ . The KUKA manipulator is equiped either with the Robotiq 2F-85 adaptive gripper (for IndustRealKit parts) or the OnRobot 3FG25 gripper (for car parts). The OnRobot 3FG25 includes an integrated F/T sensor that gives measurements at $[eqn]$ , while the Robotiq 2F-85 setup incorporates an external Robotiq FT300 sensor mounted before the gripper and working at $[eqn]$ .

For contact exploration, we tune the frequency ratios in Equation 10 to symmetrically excite the EE about the estimated goal in the XY plane—defined as the assembly (mating) surface—with Z aligned to the insertion axis (surface normal) (Figure 4).

Output of force applied during contact exploration (Equation 10) with: fx=2.7Hz , fy=5.4Hz , fz=4.5Hz .

The ART classifier is parameterized by setting the number of samples in a window for the FFT, the overlapping, the size of the blocks used for max pooling, the local and global vigilance parameters, and the scaling to use. When started, it initializes by default with deactivated learning, allowing to pre-train a linear MinMaxScaler of the scikit-learn library (Pedregosa et al., 2011), when the encoded data’s minimum and maximum cannot be estimated in advance. To obtain reasonable values for these parameters, a grid search can be performed on offline data. An exemplary result from such grid search based on a successful and a failed attempt to assemble the wheel bearing is shown in Figure 5. In particular, learning is only activated on the first part of the data (until $[eqn]$ ) corresponding to a successful assembly, and tested with the remaining part of the data corresponding to a failed assembly. We observe that in the successful case, the part is initially jammed ( $[eqn]$ ) causing high forces in Z-Axis, but the contact exploration resolves the misalignment at about $[eqn]$ . During the alignment check that follows $[eqn]$ , the classifier learns the corresponding frequency magnitude pattern during correct alignment and maps it to a unique category 4. On the other hand, in the failed attempt to align the parts $[eqn]$ , neither this pattern nor the pattern of the prior exploration phase $[eqn]$ is stably recognized due to the dissimilarity to the pattern of correct alignment. This results in a significant number of mismatches, where the current input pattern does not match any of the existing categories. In such cases the classifier returns an identifier of $[eqn]$ . Further, we use an additional median filter on the category identifier that is output by the classifier to handle spurious occurrences of patterns and instable class assignments. Equivalently, to only capture relevant patterns, the classifier can be operated such that learning is only active during an assembly or alignment check where correct alignment is ensured. In this mode of operation, all other patterns are thus mismatching, which simplifies further processing of the classifiers’ output, as a dedicated labeling of the category identifiers is avoided.

Exemplary data from the ART-based contact classification. From top to bottom: the six channels of the raw measurement (three forces, three torques), the corresponding frequency magnitudes stacked, the magnitudes after max-pooling and scaling, the assigned category index, and indication of novelty (mismatch).

A detailed summary of the selected hyperparameters for DMPs, contact exploration, ART, MPC, GMMs, and the uncertainty settings used in the experimental evaluation is provided in Table 1.

Results

4.1

Tables 2–5 summarize performance across the three task families (peg-in-hole, plug-insertion, and disc brake assembly) under varying start and goal configurations. We report i. the success rate, and ii. the average completion time computed over successful trials. The evaluation comprises three trials for each peg-in-hole instance (three for each cylindrical peg, three for each orthogonal, and three for each gear), five trials per plug type (2- and 3-prong), and five trials per car part (wheel bearing and wheel disc). We discuss the main trends evident in Tables 2, 4 below.

Overall, the proposed method markedly outperforms the baselines, exhibiting a clear robustness–speed trade-off. The CIC, MPC and MPVIC baselines rarely succeed on peg-in-hole (CIC: $[eqn]$ ; MPC: $[eqn]$ ; MPVIC: $[eqn]$ ), while on plug and car-parts performance is mixed (respectively, CIC: 6/10 and 5/10; MPC: 1/10 and 7/10; MPVIC: 5/10 and 9/10). In contrast, uMPC-ART generalizes well on all tasks, with 18/27 on pegs, 5/10 on plugs, and 7/10 on car parts. Adding the retrial mechanism, $[eqn]$ yields the best reliability overall, achieving $[eqn]$ on pegs, $[eqn]$ on plugs, and $[eqn]$ on car parts. This indicates that the ART-based misalignment detection followed by retrial, significantly improves success in uncertain tight-tolerance insertions tasks. The stiff variant, $[eqn]$ , further improves plugs to $[eqn]$ but regresses on pegs to $[eqn]$ and on car-parts to $[eqn]$ , suggesting that keeping a fixed stiffness and adapting only the contact wrench via uncertainty is not the best fit for diverse assembly tasks.

In terms of timing (Tables 3, 5), when the baselines do succeed they are comparatively fast. CIC completes successful trials the quickest on average $[eqn]$ s, followed by MPC with $[eqn]$ s. MPVIC’s few successful peg-in-hole trials complete around 53s, and on plugs/car parts it averages to $[eqn]$ s, which is comparable to uMPC-ART ( $[eqn]$ s) but without the same reliability. Adding retrials, $[eqn]$ increases these to $[eqn]$ s (with Wheel Disc peaking at $[eqn]$ s); these longer horizons arise from retrials and the additional contact-exploration steps introduced by our policy. Nevertheless, $[eqn]$ improves timing relative to $[eqn]$ ( $[eqn]$ pegs: 87s, plugs: 126s, car-parts: 119s), indicating that adaptive compliance often benefits not only success but also completion time. This is a result of the fact that the stiff approach typically applies higher interaction forces, as seen in Figure 6, which hinders misalignment resolution during contact exploration and thus leading more often to a retrial. In contrast, with adaptive compliance the interaction forces remain among the lowest in most trials and tasks, reducing them by $[eqn]$ vs. CIC, $[eqn]$ vs. MPC, $[eqn]$ vs. MPVIC, and $[eqn]$ vs. its stiff ablation, thereby enabling more efficient search of the contacted surface (Figure 7) and, in turn, more frequent success.

Force measurements over the 47 assembly trials listed in Tables 2–5. (Top) 27x peg-in-hole, (Middle) 10x plug insertion, (Bottom) 10x car-parts. (Inset) Average force for all trials.

Snapshots of three assembly tasks following our policy. (Top) Medium cylinder peg, (Middle) 3-prong plug, (Bottom) Wheel bearing.

This trend can also be seen in Figure 8, which plots the distance to the final goal pose over time for all tasks and trials. Independantly of the parts, or start and goal configurations, the $[eqn]$ variant (red) maintains the lowest steady-state residuals with tight variability. Although the retrial policy produces brief mid-trajectory spikes, corresponding to the automatic retreat and re-attempt, these are followed by renewed convergence. Notably $[eqn]$ (blue) triggers retrials more often, as evidenced by larger retreat curves and greater dispersion than $[eqn]$ .

Distance to final position (Left) and orientation (Right) goals. Results over the 47 assembly trials listed in Tables 2–5: (Top) 27x peg-in-hole, (Middle) 10x plug insertion, (Bottom) 10x car-parts.

Aggregating across all tasks in Table 6, the trend is unambiguous: $[eqn]$ achieves the highest overall success (83.0%), followed by the stiff variant, $[eqn]$ (74.5%), and non-retrial uMPC-ART (63.8%), while MPVIC (34.0%), CIC (29.8%), and MPC (19.1%) trail. Taken together, the results support the following takeaways: i. the classic CIC and MPC methods generalize poorly to different geometries, start and goal configurations of the assemblies, because even millimeter-scale misalignments due to modelling or goal-estimation errors lead to failure when the controller cannot actively explore the contact surface; ii. MPVIC performs better but still degrades under variation, because it increases stiffness during contact, which results in higher interaction forces and thus the parts being stuck because of static friction, thereby limiting effective exploration of the contact surface; iii. a similar effect is observed in our approach with fixed maximum stiffness $[eqn]$ but because the desired wrench is still adapted based on uncertainty the approach succeeds more often, although with frequent retrials; iv. Adapting both the stiffness and desired wrench via uncertainty $[eqn]$ is key for generalization across shapes and pose variations, because it enables compliant and adaptive contact exploration, which lowers interaction forces while searching for the correct alignment; v. a retrial policy after misalignment detection substantially boosts reliability at the cost of longer executions; and vi. whether the increased execution time is acceptable in industrial settings depends on the application requirements. In many production environments, a failed insertion can incur substantial overhead, e.g., because of damaging the parts, or requiring human intervention to reset the process, thus maximizing reliability can be preferable to minimizing nominal cycle time. For regimes with high part variability, uncertain pose estimates, or tight tolerances, the extra time spent on contact exploration and occasional retrials would often be recovered through reduced downtime. Under such variability, prioritizing success makes $[eqn]$ a practical default. By contrast, when cycle time is paramount and parts are well-localized and unambiguous, uMPC-ART without retrial can provide a more favorable speed–reliability trade-off.

Conclusion and outlook

5

To summarize, we presented a unified, data-efficient framework for fast reprogramming and adaptive reproduction of contact-rich assembly tasks that combines synchronized wrench–motion DMPs with a GMM-based, uncertainty-aware AMPC, and an ART-based contact classifier. Training requires only two demonstrations: i. kinesthetic teaching of wrench and motion profiles, and ii. an assistive reproduction in which the operator minimally guides the robot through a successful assembly, allowing it to collect nominal contact data; Those are used for training ART to recognize correct alignment and for fitting the GMM to model uncertainty in future, novel assemblies. At run time, the robot adapts the MPC contact model based on the GMM likelihood, which is key for enabling adaptive and compliant exploration of the contact surface using force-based dither motions. In parallel, ART infers the contact context and triggers stage transitions in the assembly process (e.g., triggering a retrial under misalignment).

Trained on a single instance each of peg-in-hole, plug insertion, and wheel-bearing assembly, the system successfully completed 39/47 $[eqn]$ of mainly novel, previously unseen assemblies with varied geometries and start/goal poses. At the cost of slightly increased completion times, our approach consistently outperformed classic CIC/MPC and an error-driven MPVIC baseline in reliability (success rate), while maintaining the lowest interaction forces overall. These results make a step towards flexible manufacturing lines where production requirements can change frequently.

Although we demonstrate the feasibility of such solution using only an F/T sensor and proprioception, our current system has several limitations that also motivate clear future work directions. First, the higher completion times observed in some cases reflect an intentional trade-off between speed and reliability through compliant exploration and recovery behaviors; future work should aim to retain or improve the achieved success rates while shortening cycle time. Importantly, two concrete routes to achieve this are to make exploration more informed and reduce avoidable retrials. The lack of visual feedback in our framework makes exploration less directed, which may increase the number of retrials; integrating vision (and/or richer tactile sensing) could improve pose initialization, guide contact exploration, and reduce unnecessary retries, thereby directly reducing cycle time. Likewise, performance is currently constrained by gripper capability: limited grasp stability can allow parts to shift within the gripper and exacerbate misalignment, suggesting that improved end-effector design or grasp sensing could reduce failure modes and retrials. Second, while requiring only two demonstrations is comparatively data-efficient, it still imposes manual effort; an important direction is to further automate the process via automatic phase segmentation and self-supervised collection of nominal contact data. Third, the approach relies on multiple empirically tuned hyperparameters, particularly in the uncertainty and adaptation mechanisms; reducing engineering effort through principled self-tuning and online calibration of uncertainty and exploration models is therefore a key avenue for future research. Finally, we considered primarily single-contact insertions; extending the framework to multi-contact and multi-stage assemblies, potentially with regrasping and more complex contact-state transitions, remains an important step toward broader industrial applicability.

Bibliography61

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Anand A. S. Gravdahl J. T. Abu-Dakka F. J. (2023). Model-based variable impedance learning control for robotic manipulation. Robotics Aut. Syst. 170, 104531. 10.1016/j.robot.2023.104531 · doi ↗
2Anne T. Wilkinson J. Li Z. (2021). “Meta-learning for fast adaptive locomotion with uncertainties in environments and robot dynamics,” in 2021 IEEE/RSJ international conference on intelligent robots and systems (IROS), 4568–4575. 10.1109/IROS 51168.2021.9635840 · doi ↗
3Arcari E. Minniti M. V. Scampicchio A. Carron A. Farshidian F. Hutter M. (2023). Bayesian multi-task learning mpc for robotic mobile manipulation. IEEE Robotics Automation Lett. 8, 3222–3229. 10.1109/LRA.2023.3264758 · doi ↗
4Bargsten V. Kirchner F. (2023). Actuator-level motion and contact episode learning and classification using adaptive resonance theory. Intell. Serv. Robot. 16, 537–548. 10.1007/s 11370-023-00481-7 · doi ↗
5Bargsten V. Rakovitis D. Origanti V. K. Danzglock A. Kirchner F. (2025). “Continuous learning of contact episodes from proprioceptive sensors in industrial assembly scenarios using adaptive resonance theory,” in Intelligent and fuzzy systems. Editors Kahraman C. Cebi S. Oztaysi B. Cevik Onar S. Tolga C. Ucal Sari I. (Cham: Springer Nature Switzerland), 342–349. 10.1007/978-3-031-98304-7_39 · doi ↗
6Battistelli G. Chisci L. (2014). Kullback–leibler average, consensus on probability densities, and distributed state estimation with guaranteed stability. Automatica 50, 707–718. 10.1016/j.automatica.2013.11.042 · doi ↗
7Bednarczyk M. Omran H. Bayle B. (2020). “Model predictive impedance control,” in 2020 IEEE international conference on robotics and automation (ICRA), 4702–4708. 10.1109/ICRA 40945.2020.9196969 · doi ↗
8Beeson P. Ames B. (2015). “Trac-ik: an open-source library for improved solving of generic inverse kinematics,” in 2015 IEEE-RAS 15th international conference on humanoid robots (humanoids), 928–935. 10.1109/HUMANOIDS.2015.7363472 · doi ↗