The interplay between haptic guidance and personality traits in robotic-assisted motor learning

Alberto Garzás-Villar; Caspar Boersma; Alexis Derumigny; J. Micah Prendergast; Arkady Zgonnikov; Jane Murray Cramm; Laura Marchal-Crespo

PMC · DOI:10.1186/s12984-025-01709-6·November 12, 2025

The interplay between haptic guidance and personality traits in robotic-assisted motor learning

Alberto Garzás-Villar, Caspar Boersma, Alexis Derumigny, J. Micah Prendergast, Arkady Zgonnikov, Jane Murray Cramm, Laura Marchal-Crespo

PDF

Open Access

TL;DR

This study explores how personality traits affect how well people learn motor skills with the help of robots, suggesting that adapting training to individual traits could improve outcomes.

Contribution

The study introduces the idea that personality traits influence the effectiveness of robotic haptic guidance in motor learning.

Findings

01

Participants with high Transform of Challenge or external Locus of Control reduced interaction forces less when receiving haptic guidance.

02

Participants with a high Free Spirit gaming style were more sensitive to guidance perception during retention.

03

Personality traits modulate motor learning outcomes and human-robot interaction metrics in robotic-assisted training.

Abstract

Robotic devices have shown promise in supporting motor (re)learning. However, there is a limited understanding of how personality traits influence the effectiveness of robot-aided training strategies. We conducted a motor learning experiment with 40 unimpaired participants who trained to control a virtual pendulum using a robotic haptic device. Before the experiment, we assessed personality traits including the perceived control over life events (Locus of Control), the tendency to turn challenges into engaging activities (Transform of Challenge), and other subscales from Autotelic and Hexad gaming style questionnaires. Participants were divided into two groups, one receiving haptic guidance during training and a second one without assistance. Short- and long-term retention was assessed, and relationships between personality traits, performance metrics, and human-robot interaction…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Figures7

Click any figure to enlarge with its caption.

The experimental setup (up left) consists of the screen and the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Delta.3$$\end{document}$ (Force Dimension, Switzerland). Down left: Game screenshot with the pendulum, walls in black and yellow, targets as vertical red lines, and the score in green numbers. Right: The device could be controlled by holding the black ball attached to the robot end-effector

Left: Front view of the pendulum with forces applied on the pivoting point. $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{HG}$$\end{document}$ represents the force from the haptic guidance while $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \beg

The study protocol included two sessions spaced by 1 to 3 days. A set comprised 20 targets. D.C.: Data collection, QUEST.: Questionnaires, s: Seconds, T1: Position transfer task, T2: Dynamics transfer task, STR: Short-Term retention, LTR: Long-Term retention, Exp.: Experimental

Barplot showing results from the M1.1 LMM regarding exclusively the main task. Each bar shows the error reduction that the model predicts a participant will show in different situations, i.e., from baseline to short-term retention (left) and from baseline to long-term retention (right). Note that the terms related to the influence of LOC in the *Experimental* group from baseline to STR or LTR were excluded from the model M1.1 to improve interpretability by reducing unnecessary complexity. White bars refer to a participant with average levels of every trait. Colored bars represent the same situ

Barplot showing results from the M1.2 LMM regarding exclusively the main task. Each bar shows the interaction force reduction that the model predicts a participant will show in different situations, i.e., from baseline to short-term retention (left) and from baseline to long-term retention (right). White bars refer to a participant with average levels of every trait. Colored bars represent the same situation for a participant with a higher (solid) or lower (striped) level of a specific trait. Error bars represent the confidence interval associated with each estimated interaction force reductio

Top: Barplot showing results from the M1.1 LMM regarding the dynamics transfer task. Bottom: Barplot showing results from the M1.2 LMM regarding the position transfer task. Each bar shows the error (top) or interaction force (bottom) reduction that the model predicts a participant will show in different situations, i.e., from the main task in the baseline to the dynamics transfer task in the baseline (top) or from the baseline to STR/LTR in both, the main and the transfer task (bottom). White and pink bars refer to a participant with average levels of every trait. Other colored bars represent

Barplot showing results from the M2 LMM, including only the *Experimental* group and the main task. Each bar shows the error reduction that the model predicts a participant will show in different situations, i.e., from baseline to long-term retention. The white bar refers to a participant with average levels of every trait. Colored bars represent the same situation for participants with a higher (solid) or lower (striped) level of a specific trait and/or high frustration. Error bars represent the confidence interval associated with each estimated error reduction. The square brackets at the bot

Funding1

—Convergence Human Mobility Centre

Keywords

Personality traitsMotor learningRobotic-assisted rehabilitationHaptic guidancePersonalization

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMotor Control and Adaptation · Stroke Rehabilitation and Recovery · Action Observation and Synchronization

Full text

Introduction

Damage of the central nervous system, e.g., after stroke, often results in motor impairments that require re-learning of lost motor skills [1]. Technological advances, such as robotics and virtual reality (VR), have shown promise in supporting motor (re)learning by providing augmented feedback, e.g., in the form of visual, auditory, haptic, or their multimodal information combinations [2]. Among these, the provision of haptic feedback is one of the most popular approaches in robot-aided training. Haptic feedback has shown potential in positively influencing motor learning when combined with other feedback forms [2, 3]. In particular, haptic guidance — where robots physically assist users in achieving correct movements [4]—has been shown to enhance motor learning, especially in those initially less skilled or in the early stages of stroke neurorehabilitation [3, 5]. There is also evidence that robotic training strategies that challenge users, e.g., by amplifying errors, may benefit more skilled users [6, 7].

Despite previous research pointing towards individual differences in robot-assisted motor learning, there is a lack of understanding of how to optimally select haptic training strategies for each individual trainee. Different training strategies affect not only motor performance during training but also trainees’ motivation [3, 8–10], which in turn influences motor learning [11]. To date, determining the most suitable training approach for each individual is hindered by insufficient knowledge of the intrinsic human factors that play a role in the effectiveness of robot-aided training [12, 13].

Factors that might differentiate trainees are multifaceted, ranging from variations in individual characteristics — e.g., physiological or psychological differences [14, 15] — to environmental and social factors [16, 17]. Adapting robotic assistance to trainees’ physiological measures and physical capacities has shown benefits in motor learning and neurorehabilitation [18, 19]. Yet, there is increasing recognition of the potential value of incorporating other characteristics, such as personality traits, personal preferences, and psychological states to provide more personalized robotic training programs [20–24].

While psychological states — e.g., motivation and engagement — are dynamic and can change depending on circumstances [14, 25], personality traits are stable characteristics that shape how people consistently think, feel, and behave over time [15]. Since personality traits are expected to be stable, such information could reduce the need for in vivo data collection. Personality traits have been shown to be relevant in different areas such as sports [26], employee productivity [27], online gaming [28], or academic learning [29]. However, despite the potential relationship between personality traits and motor learning, their interplay with robotic-aided training has not been systematically investigated.

To address this gap, we performed a parallel design experiment involving 40 unimpaired participants who learned to control a virtual pendulum with one internal degree of freedom to hit upcoming targets with the pendulum mass in a VR-based game. Participants could control the pendulum by moving the pendulum’s pivoting point through a haptic device. Participants could feel the pendulum dynamics through the haptic device. During training, the device supported half of the participants (Experimental group) through haptic guidance. This group was compared against a Control group that did not receive this guidance. Using questionnaires, we measured two personality traits and two gaming styles for each participant.

Autotelic personality. According to the challenge point framework [30], optimal learning occurs when learners face challenges that are neither too easy nor overwhelming, facilitating task engagement. Therefore, personality traits that influence how individuals perceive and approach challenges may be crucial during robotic-assisted motor learning. The autotelic personality trait gives information about the disposition to seek challenges or enter the “flow” state [31] and has already shown its potential to predict engagement in sports settings [32]. In this study, we assessed autotelic personality using the Transform of Challenge and Transform of Boredom subscales from the Autotelic Personality Questionnaire [33]. These subscales reflect an individual’s tendency to transform “challenging” or “boring” situations, respectively, into personally motivating ones. These subscales might be relevant when interacting with robotic devices, as they might interact with how the robotic assistance influences participants’ engagement in tasks whose functional difficulty — i.e., how challenging the task is perceived — can be modulated by physically assisting the movement [9].
Locus of Control (LOC) is related to individuals’ sense of control over their actions and outcomes [34]. Autonomy and perceived control can be of particular interest when interacting with robotic devices that provide physical assistance [20]. Previous research indicated that participants’ LOC (internal vs. external) can impact how they react to external feedback [35], as well as their interaction with robotic devices [36]. For instance, in this last work, Acharya et al. found that LOC was correlated with how participants behaved under different types of assistance in a robotic teleoperation task.
Gaming styles. Gamification can motivate trainees [37] but can be intricate as it intertwines with personal preferences and characteristics that can either boost or hinder engagement [38]. An established method for capturing this is to classify the user’s gaming style, which characterizes users based on their interaction preferences within game-like environments [39]. Though not a personality trait, gaming style can be potentially used to assess the trainee’s personality profile [40]. Here, we assessed participants according to two gaming styles: Achiever — reflecting a preference for challenges — and Free Spirit — reflecting a tendency for exploration [41]. These gaming styles may influence how users respond to structured interactions like haptic guidance, which can be perceived as a game element that limits free exploration or directly impacts task difficulty. The experimental protocol was divided into two sessions, 1–3 days apart. The first session included a personality traits assessment, a baseline to determine participants’ initial skill level, a training phase, and a short-term retention (STR) evaluation. The second session included a long-term assessment of motor learning. During the baseline and the two retention phases, the participants performed two transfer tasks to asses the generalization of the acquired skill [3, 42]. The transfer tasks were similar to the trained task (referred to as the main task) but included slight design variations. In the position transfer task, the targets were randomly re-located from the trained main tasks to evaluate the participants’ ability to adapt to unfamiliar spatial patterns. The second transfer task, referred to as dynamics transfer task, introduced variations in the pendulum dynamics, i.e., the pendulum rod length was reduced by 70%, while the target positions were the same as in the main task.

In this paper, we present results on the analysis of the relationship between personality traits, haptic guidance, human-robot interaction forces, and participants’ motor learning by testing the following hypotheses.

H1 Personality traits and presence of haptic guidance influence how task performance and human-robot interaction force change from baseline to retention.
H1.1 Haptic guidance: Participants were anticipated to perform worse during short-term retention if they were allocated to the Experimental group, compared to those allocated to the Control group, due to reliance on the haptic guidance.
H1.2 High Transform of Challenge participants: For participants allocated to the Experimental group, performance during short-term retention was expected to decline even more if they had high levels of the Transform of Challenge trait. This is based on findings in [43], where participants with high Transform of Challenge showed worse performance when receiving guidance compared to not receiving guidance during the training phase.
H1.3 High Achievers: Participants with high levels of Achiever gaming style were expected to outperform average participants. This is irrespective of the training group they were allocated to and based on their improved performance w.r.t. average participants during the training phase [43].
H1.4 Human-robot interaction force and Free Spirits: Participants with high Free Spirit gaming style in the Control group were expected to exhibit higher interaction forces with the haptic device during short- and long-term retention compared to an average participant, which they already showed during the training phase [43].
H1.5 Human-robot interaction force and Locus of Control: Participants with high LOC scores (external LOC) allocated in the Experimental group were expected to exhibit greater interaction forces from baseline to STR compared to a participant with low LOC scores (internal LOC), as they already showed this trend during the training phase [43].
H2 Different movement patterns (target positions) and pendulum dynamics affect task performance and human-robot interaction force and are influenced by personality traits.
H2.1 Transfer task with different target positions: During baseline, participants were expected to perform worse in terms of both performance and interaction force metrics in the position transfer task compared to the main task. This is due to the potential added challenge of adjusting to new, less familiar target positions. However, we hypothesized this effect would diminish during the retention phases due to learning after the training.
H2.2 Transfer task with different pendulum dynamics: Participants were expected to perform better during this transfer task compared to the main task in all phases. In this transfer task, as the pendulum rod was shortened, the pendulum’s natural frequency increased. In [44], Özen et al. showed how operating the system far from the pendulum’s natural frequency was correlated with higher control. We hypothesized that keeping the same target positions within walls, but increasing the pendulum’s natural frequency, would facilitate controlling it.
H2.3 Transfer task and Transform of Challenge: Participants with high Transform of Challenge were expected to improve the performance metric in a smaller amount than the average participant in those tasks potentially less challenging — i.e., the pendulum is easier to control (H2.2) — and outperform average participants in those involving new and unexpected target positions (H2.1).
H3 Subjective perception of human-robot interaction correlates with task performance metrics and personality traits.
H3.1 Interaction perception and performance: Positive responses to the robot interaction perception questions (e.g., perceiving the robot as permitting, helpful, or non-frustrating) were expected to be associated with better performance improvements after training.
H3.2 Interaction perception and personality traits: We also expected that the subjective perception of the haptic guidance would interact with personality traits and affect the performance and human-robot interaction force decrease after training. We did not include hypotheses specifically addressing the relationship between personality traits, haptic guidance, and motor performance during training, as this specific experimental phase was analyzed in a prior study published in a conference proceeding (see [43]).

Methods

Experimental setup

The experimental setup included a $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Delta.3$$\end{document}$ haptic robot (Force Dimension, Switzerland) placed on a desk next to a display monitor (Fig. 1). The device is capable of measuring positions and providing forces up to 20.0 N in the three translational directions (x, y, and z axes, Fig. 1). The device control was implemented in C++, operating at 4 kHz. Motion data was recorded at 1.67 kHz.Fig. 1. The experimental setup (up left) consists of the screen and the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Delta.3$$\end{document}$ (Force Dimension, Switzerland). Down left: Game screenshot with the pendulum, walls in black and yellow, targets as vertical red lines, and the score in green numbers. Right: The device could be controlled by holding the black ball attached to the robot end-effector

The pendulum game

The game, inspired by the work of [44] and created in Unity 3D (Unity Technologies, USA), consisted of controlling a virtual pendulum to hit moving targets approaching the participant. The pendulum consisted of a black ball (pivoting point) and a red ball (pendulum mass), with a rigid link connecting both balls, as shown in Figs. 1 and 2. The pendulum’s pivoting point could be moved horizontally and vertically (y and z axis in Fig. 2) by displacing the haptic device’s end-effector (black ball in Fig. 1 Right; 1:1 movement mapping). The pendulum could only swing in the vertical plane (y-z), and therefore, movements of the haptic device in the x-direction were not mapped to the pendulum.Fig. 2. Left: Front view of the pendulum with forces applied on the pivoting point. $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{HG}$$\end{document}$ represents the force from the haptic guidance while $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{rod}$$\end{document}$ is the force from the pendulum dynamics. Center: 3D representation of the game. Right: Top view representation of the game with exemplary trajectories of the pivoting point and the pendulum mass, in black and red dashed lines respectively. The red lines within the walls represent the targets. The variable b represents the target position with respect to the centerline, while the variable a represents the absolute error between the pendulum mass and the target

The task consisted of hitting vertical targets with the pendulum mass. The targets were located on walls approaching the participants in the x direction, i.e., perpendicular to the screen plane. The walls were spaced by 1 m and their speed was set to 1 m/s. The targets could appear in three different positions: the center point of the wall or ± 0.12 m to the right/left. The target’s width was 0.02 m, and the pendulum ball and pivoting point diameter were set to 0.03 m.

By moving the pendulum pivoting point through the device end-effector, participants influenced the swing of the pendulum, which behaved according to the equation of motion of a simple pendulum:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} \ddot{\theta }=-\frac{1}{l}((\ddot{z}+g)\sin {\theta } + \ddot{y}\cos {\theta })- \frac{c}{ml^2}\dot{\theta }, \end{aligned}$$\end{document}

where y and z are the horizontal and vertical coordinates of the robot end-effector position and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ddot{\theta }$$\end{document}$ the angular acceleration of the pendulum’s internal degree of freedom (DoF). Since the internal DoF was located at the pendulum’s pivoting point, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\theta $$\end{document}$ was defined relative to the pendulum rod, as illustrated in Fig. 2. The robot’s coordinates were referenced with respect to its initial position after calibration, similar to the one shown in Fig. 1right. The pendulum mass was set to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m = 0.6$$\end{document}$ kg, the rod length to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$l = 0.25$$\end{document}$ m, gravity to $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$g = 3.24$$\end{document}$ m/ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s^2$$\end{document}$ , and the constant $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$c = 3.00e^{-6}$$\end{document}$ N $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\cdot $$\end{document}$ s/rad. These parameters were adjusted and chosen in order to minimize passive stabilization of the pendulum and maintain task difficulty.

As the pendulum crossed a wall, a score based on the absolute distance of the pendulum’s mass to the center of the target in the y-direction (|Error|) was briefly displayed for 0.5 s to provide feedback regarding participants’ performance. The score ranged between 0 and 100 and was calculated as:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} Score = {\left\{ \begin{array}{ll} 0 & \text {if } |Error| \ge 0.2\,m, \\ 100-500\cdot |Error| & \text {if } |Error| < 0.2\,m. \end{array}\right. } \end{aligned}$$\end{document}

Each phase of the experiment was organized into wall sets, with 20 walls presented per set (see the Study protocol Section). A final score, based on the average of all 20 scores, was shown at the end of each set to inform participants of their overall performance in that set.

Haptic rendering and haptic guidance

To enhance the ecological validity of the task — ensuring the experimental conditions closely replicate real-world scenarios — we incorporated haptic rendering throughout the whole experiment, i.e., the provision of the forces originating from the pendulum dynamics on the device end-effector. Participants could feel the pendulum force dynamics ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{rod}$$\end{document}$ ), calculated as:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} F_{rod}=m((\ddot{z}+g)\cos {\theta }-\ddot{y}\sin {\theta }+\dot{\theta }^2l), \end{aligned}$$\end{document}

using the same constants as in Eq. (1).

Participants allocated to the Experimental group were also provided with haptic guidance during training to physically assist them in the target-hitting task. This was achieved by first calculating an optimal end-effector trajectory between the pendulum state at the moment of target wall collision and the target at the following wall, and then enforcing this trajectory using a Proportional-Derivative (PD) controller.

The optimal end-effector trajectory was calculated every time the pendulum hit a target wall using the ACADO toolkit [45]. The cost function included terms to maximize accuracy (i.e., minimize the distance between the pendulum ball and the next target’s centerline), maximize the pendulum stabilization (i.e., penalizing the velocity components of the pendulum ball), and minimize end-effector acceleration based on the current state of the pendulum, as described in the Appendix B.

The PD controller aimed to minimize the distance between the end-effector and the reference trajectory in the y-direction at each time point by applying a guiding force $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{HG}$$\end{document}$ at the end-effector. We only provided guidance in the y-axis as it was sufficient to achieve the target-hitting tasks. By not guiding in the z-direction, we also reduced the potential masking effects of the guidance on the perception of the haptic rendering of the pendulum dynamics. The resulting equation for the PD controller is as follows:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} F_{HG} = K_p e(t) + K_d\frac{d}{dt}e(t), \end{aligned}$$\end{document}

where the y-axis error between the actual and the reference trajectory was denoted as e(t), and the proportional ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_p$$\end{document}$ ) and derivative ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_d$$\end{document}$ ) gains were set to 75.0 N/m and 15 N $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\cdot $$\end{document}$ s/m, respectively. The guiding force was added to the haptic rendering force ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F_{rod}$$\end{document}$ in Eq. (3)). The order of magnitude of the guidance force was around four times the haptic rendering.

Participants

Forty-two unimpaired participants performed the experiment. Data from two participants were excluded from further analysis. One participant exhibited errors three standard deviations higher than the average of all participants. We encountered a technical problem when recording the data of a second participant, leading to missing data within the dataset. Thus, 40 participants were included in the analysis (age = $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$27 \pm 6\,yrs$$\end{document}$ ; 19 identified as female, 21 identified as male; no participants identified as non-binary). The target sample size of approximately 40 participants was determined based on a power analysis, as detailed in Appendix A. Handedness was assessed using the Short-Form Edinburgh Handedness Inventory [46], resulting in 35 right-handed, four left-handed, and one ambidextrous participant. All participants signed the informed consent to participate in the study, which was approved by the TU Delft Human Research Ethics Committee (HREC).

Participants were allocated into two training groups: Control or Experimental. The Experimental group received haptic guidance during some parts of the training phase (see the Study protocol Section), while the Control group practiced without any physical assistance. To promote an even distribution between groups, we used an adaptive randomization method. We randomly allocated the first twenty participants into one of the two training groups and distributed new ones into each group based on their sex and results from the Locus of Control questionnaire (see the Outcome metrics Section), similar to [7]. The Locus of Control was employed as it directly relates to the perception of control, which aligned with the groups’ training conditions (guidance vs. no guidance).

Study protocol

The experiment was conducted in two locations: Delft University of Technology, Delft, the Netherlands, and Alten Netherlands B.V., Rotterdam, the Netherlands. The experimental setup and protocol were identical in both locations. Minor environmental differences (e.g., room layout, lighting) were not expected to systematically affect performance. The overview of the experimental protocol can be found in Fig. 3.Fig. 3. The study protocol included two sessions spaced by 1 to 3 days. A set comprised 20 targets. D.C.: Data collection, QUEST.: Questionnaires, s: Seconds, T1: Position transfer task, T2: Dynamics transfer task, STR: Short-Term retention, LTR: Long-Term retention, Exp.: Experimental

The experiment took place in two sessions on different days, with one to three days between sessions, following recommendations to evaluate motor learning [3]. At the beginning of the first session, participants were invited to sit at the set-up table. The chair height was adjusted based on personal preferences to ensure a comfortable arm movement within the robot’s workspace. The haptic device was placed at a reachable distance with a relaxed posture on the dominant hand’s side. The screen was placed on the opposite side of the device in front of the participant. Participants were informed about the goal of the pendulum task at the beginning of the first session. No extra information was given during the rest of the experiment.

The experiment began by inviting participants to fill out the first block of questionnaires, including demographic data collection and the questionnaires to quantify the personality traits (see the Outcome metrics Section). Participants were then invited to familiarize themselves with the haptic device and the virtual environment for 40 s. During this familiarization phase, they were asked to move the pendulum freely in the virtual environment without loading any target. They could observe the pendulum moving and feel the haptic rendering of the pendulum dynamics through the haptic device end-effector. Once the 40 s were over, participants were instructed again about the game goal: move the pendulum such that the red ball hits each wall as close as possible to the target’s center. They were then invited to play a first set of 20 targets.

Once the familiarization was completed, the main experiment began. Participants underwent three main phases during Session 1: baseline, training, and short-term retention. During these phases, participants performed one or three different tasks. The main task consisted of playing 20 targets in a specific order. Each time the main task was played, targets were set in the same sequence of positions, except during the training phase, in which some of the sets were mirrored (further explained later in this section). Participants also played two transfer tasks to asses the generalization of the acquired skill. Those tasks were similar to the main task but included slight design variations. In the position transfer task, the targets were randomly re-located (still appearing every 1 s) to introduce new movement sequences. During the dynamics transfer task, the target positions were kept the same as during the main task, but the pendulum dynamics were changed. While the appearance of the pendulum did not change, the pendulum rod length was reduced by 70% in Eq. (1) and (3). This variation affected the pendulum’s natural frequency, which increased from 0.573 Hz to 0.685 Hz. For both transfer tasks, the goal remained the same: Use the pendulum mass to hit each target.

The baseline included two sets of 20 targets for the main task. The game did not stop between sets within the same task, but there was an extended pause of three seconds in which no new targets were loaded. After the second set was finalized, participants completed a new set of questionnaires to assess motivation and agency. To keep our analysis focused on answering the listed hypotheses, the analysis of motivation and agency are out of the scope of this work. Participants were then asked to play the game two more times, for two sets of the position transfer task and two sets of the dynamics transfer task, with an on-demand break offered between the two tasks.

After the baseline trials were complete, participants began the training. They completed two rounds of 15 sets of the main task of 20 targets each. Participants had the opportunity to take a break between rounds. During training, the Experimental group received haptic guidance on top of the haptic rendering. However, to avoid participant reliance on the guidance, guidance was removed during the first set and once every five sets (“catch sets”). In addition, to avoid only learning the specific movement patterns and target positions, mirrored sets were interspersed within the non-catch sets for both groups. The position of the targets during these sets was mirrored with respect to the walls’ y-axis. The distribution of “catch sets” and mirrored sets can be found in Fig. 3. Participants were informed that they might or might not be assisted during training to promote active participation. The Control group only experienced the haptic rendering from the pendulum dynamics during training.

Immediately following the last training set, participants took a 10-minute break. During this time, they filled in a new set of questionnaires. The Experimental group was asked questions about their subjective experience with the robotic guidance, i.e., how disturbing, frustrating or restrictive it was perceived (see the Outcome metrics Section). Following the break, a washout set of the main task was conducted by both groups to mitigate any temporary effects from training with haptic guidance, e.g., “slacking” [47].

Right after the washout set, participants performed the short-term retention phase. The structure was similar to the baseline but without the questionnaire. Participants returned after one to three days to perform a long-term retention phase, which was structured identically to the baseline tests.

Outcome metrics

Personality traits questionnaires

Before the familiarization phase, participants completed a battery of questionnaires assessing the selected personality traits to study. These personality traits included the LOC scale [34], the Transform of Challenge and Transform of Boredom sub-scales from the Autotelic personality questionnaire [33], and the Achiever and Free Spirit sections of the Hexad Gaming style questionnaire [39]. All the questionnaires, except for the LOC, were formed by seven-point-based questions and normalized between 0 and 1 (low to high level of trait/characteristic). The LOC questionnaire was formed by 23 multiple-choice questions, and the overall score for the whole questionnaire ranged from 0 to 23. To improve interpretability and facilitate later modeling (see the Statistical analysis Section), this range was normalized from -1 to 1 to reflect the continuum between Internal LOC (-1) and External LOC (1), which are widely recognized classifications in literature and commonly used to interpret behavior. In addition, the LOC scores usually follow an approximate Gaussian distribution centered near zero. This makes this range statistically practical and close to the centered scale. Internal and external LOC differ in whether outcomes from an action are attributed to oneself or external circumstances, respectively. The employed questions for all the questionnaires can be found in the Appendix C.

Human-robot interaction experience questionnaire

Three questions were filled in by only the Experimental group after training. These questions related to frustration, disturbance, and restrictiveness perception during the training (see Appendix C). They were answered on a seven-point scale, which was then normalized between 0 and 1 (low to high).

Task performance: absolute error

To assess motor learning, the distance between the pendulum’s mass position and each target’s centerline at the time of pendulum-wall contact was calculated (|Error|), in meters. This was used as our performance metric and one value per wall was obtained.

Human-robot interaction: interaction force

To assess participants’ interaction with the haptic device, the human-robot interaction force was estimated. This estimate was computed using Reaction Torque Observers based on recorded motor currents and the robot dynamic model, as implemented in [44]. For the analysis, we used the average force per target in Newtons. We calculated this average force within the interval from consecutive midpoints between walls.

Statistical analysis

To evaluate the hypotheses outlined in the Introduction Section, we used Linear Mixed Models (LMMs). These models were fitted using the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\texttt{lmer}$$\end{document}$ function from the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\texttt{lmerTest}$$\end{document}$ package in $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\texttt{R}$$\end{document}$ . Statistical significance was set at $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p < 0.05$$\end{document}$ , and p-values were adjusted for multiple comparisons using Bonferroni correction.

The employed LMMs were selected as outlined in Appendix D. We group them throughout the current section depending on the hypotheses they are tailored to evaluate. Table 1 summarizes the variables that can be included in the models. Task performance (|Error|) and human-robot interaction force (|IntForce|) metrics were analyzed as dependent variables depending on the model. Logarithmic transformations were applied to correct skewed distributions and achieve normality requirements.Table 1. Variables employed for the LMM. Data was structured at the target level; However, in models where this structure was not applicable, the dataset was reduced to eliminate duplicate entries. The variables |Error| and |IntForce| were log-transformed to address skewness in their distributionsVariableTypeDescriptionValuesIDIntParticipant number1 to 40wIndexIntWall number within a set0 to 19TaskCat $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textrm{Task type}^1$$\end{document}$ (main task or transfer task)mn, t1, t2StageCat $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\textrm{Experimental stage}^1$$\end{document}$ (baseline, training, short- and long-term retention)BL, TR, STR, LTRsIndexIntSet identifier within a Task in a specific Stage0 to ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$N_{set}$$\end{document}$ -1) $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$XX_c$$\end{document}$ ContNormalized and mean-centered value of the trait score. XX are substituted by the corresponding personality trait.0- $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\overline{XX_{norm}}$$\end{document}$ to 1- $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\overline{XX_{norm}}$$\end{document}$ |Error|ContAbsolute error values–|IntForce|ContInteraction force values–NewWallsIntWall number within the position transfer task0 to 40 Int: Integer, Cat: Categorical, Cont: Continuous^1^Depending on the model, some variables may not include all levels, e.g., when the training phase was excluded from a model, the variable “Stage” would only include BL, STR, and LTR

Models to infer motor learning (M1.1 and M1.2)

To evaluate the impact of personality traits and the training condition on motor learning outcomes across different experimental phases (related to hypotheses H1 and H2) we employed two models, one for each dependent variable. These models include independent variables regarding the training group, the task type, the stage, and the personality traits (see description in Table 1). Given the extensive number of variables and potential interactions, a stepwise comparison between models of different complexity was performed using the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) to prevent overfitting and ensure stability (see Appendix D).

For the performance metric (|Error|), the following model (M1.1) with the smallest AIC and BIC was chosen:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\begin{aligned} log_{{10}} |{\text{Error}}| & = Group \times (Task + Stage) \\ & + (TC_{c} + LOC_{c} ) \\ & \times (Task + Stage + sIndex) \\ & + Task \times Stage + Group \\ & \times TC_{c} \times Stage + (1|ID) \\ & + (1|wIndex:NewWalls). \\ \end{aligned} $$\end{document}

In this equation, the normalized and centered results from the Transform of Challenge ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TC_c$$\end{document}$ ) and Locus of Control (LOCc) were included as traits of interest, as others did not show statistical significance during model selection. Therefore, the hypothesis related to the Achiever gaming style (H1.3) is not supported by the data. A nested random effect for wall index (wIndex : NewWalls) was adjusted to account for changes in wall positioning during the position-based transfer task.

A similar model (M1.2) was developed for the interaction force metric. Notably, this model included an additional interaction term, $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Group \times LOC_c \times Stage$$\end{document}$ . When compared with other configurations, the model with this extra interaction led to a lower AIC and slightly increased BIC (see Appendix D). In view of these competing results, previous literature was used to guide the choice. This extra relationship was considered of interest as LOC was found to correlate to the interaction force metric during the training phase when the haptic guidance was active (see [43]). The final model has the form:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \begin{aligned} \log _{{10}} |{\text{IntForce}}| & = Group \times (Task + Stage) \\ & + (TC_{c} + LOC_{c} ) \\ & \times (Task + Stage + sIndex) \\ & + Task \times Stage + Group \times TC_{c} \\ & \times Stage + Group \times LOC_{c} \\ & \times Stage + (1|ID) \\ & + (1|wIndex:NewWalls). \\ \end{aligned} $$\end{document}

Note that while this model does not include the Free Spirit gaming style, results from an alternative model (see Appendixes D and E) suggest a potential relationship between this trait and interaction force outcomes, which can be considered of interest for hypothesis H1.4. Yet, fully evaluating this effect would require studying the model complexity beyond the current study’s scope, complicating the understanding of the results. As such, we leave a thorough investigation of H1.4 for future work.

Human-robot interaction perception model (M2)

To investigate whether subjective perceptions of human-robot interaction (HRI) influenced task performance (H3), we employed the following model:

\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \begin{aligned} \log _{{10}} |{\text{Error}}| & = (AC_{c} + FS_{c} + TC_{c} + LOC_{c} ) \\ & \times HRIQuestion \times Stage \\ & + (1|ID) + (1|wIndex), \\ \end{aligned} $$\end{document}

where the HRIQuestion represents the normalized and centered response to each of the specific HRI perception questions. The normalized and centered version of four of the traits were considered of interest for this model, i.e., the Achiever ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$AC_c$$\end{document}$ ) and Free Spirit ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$FS_c$$\end{document}$ ) gaming styles, Transform of Challenge ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$TC_c$$\end{document}$ ), and Locus of Control ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$LOC_c$$\end{document}$ ). All the phases were included in this dataset (baseline, training, short- and long-term retention) while the transfer tasks were excluded as HRI questions were asked exclusively after the training phase, which did not include the transfer tasks.

Results

The results from the LMM are visually summarized in Figs. 4, 5, 6 and 7, and tables with the full outputs from the models are available in the Appendix E. The figures within this section show bar plots illustrating expected changes in performance or interaction force between two conditions. These changes compare the relative difference between a reference case, e.g., a participant in baseline with average trait levels (or a specific trait value), and a case of interest, e.g., the same participant in a later phase. The expected performance or interaction force in these individual conditions was calculated based on the corresponding LMM estimates, with the differences reflecting the predicted percentage of improvement or decline. Furthermore, we provide 95% confidence intervals around these estimated changes to show the uncertainty around our estimates.

Note that some of the changes shown in the figures may not be significantly different from each other after applying the Bonferroni correction (see the corresponding corrected and uncorrected p-values in Appendix E). We decided to include them for consistent comparisons across conditions and to highlight emerging patterns that could guide future work.

Throughout this section, to illustrate the comparison cases, we refer to high or low levels of personality traits or gaming styles. We set those values to be 10% above (high level of a trait/gaming style) or below (low level of a trait/gaming style) the average participant levels. This percentage was selected as it represents a small, realistic shift within the observed data range, allowing us to meaningfully illustrate potential individual differences. This does not apply to the Locus of Control, in which we will compare external LOC (LOC = 1) or internal LOC (LOC = -1), as these well-established endpoints represent psychologically meaningful behavioral profiles. For all the cases, this is explicitly indicated in each figure.

Motor learning results (M1.1 and M1.2)

Outputs from model M1.1 were used to calculate expected |Error| evolution across phases. Those predictions are shown in Fig. 4 (main task) and Fig. 6 top (transfer tasks). Outputs from the model M1.2 were used to calculate the expected |InteractionForce| evolution and are shown in Fig. 5 (main task) and Fig. 6 bottom (transfer tasks).

When focusing on the Control group, Fig. 4 illustrates how a participant with average levels of every personality trait (av. participant) is expected to show an error reduction of 38% from baseline to short-term retention (STR) and 39% from baseline to long-term retention (LTR) if completing the main task.

Regarding the effect of personality traits in the Control group, a higher Transform of Challenge reduces even more the error from baseline to STR/LTR compared to an average participant; i.e., these participants showed improved performance. However, the p-values associated with these effects were only significant before Bonferroni’s correction (after the correction: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p = 0.147$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=1$$\end{document}$ respectively). A high level of this trait corresponded with around 5% performance improvement in addition to the error reduction expected from an average participant from baseline to STR. LOC also slightly influenced performance changes from baseline to LTR in the Control group (p-value only significant before Bonferroni’s correction, after: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.277$$\end{document}$ ). An external LOC was associated with degraded performance from baseline to LTR compared to an average participant from baseline to LTR.

The Experimental group exhibits a degraded performance metric, with reductions of around 28% from baseline to STR (corrected p-value: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.05$$\end{document}$ ) and around 31.5% from baseline to LTR (corrected p-value: $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=1$$\end{document}$ ). Those values can be compared to the 38% (STR) and 39% (LTR) error reduction seen in the Control group. In the Experimental group, a high level of the Transform of Challenge trait did not show better improvement compared with the improvement shown in the Control group (Corrected p-values for the STR and LTR were $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=0.729$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=1$$\end{document}$ respectively).Fig. 4. Barplot showing results from the M1.1 LMM regarding exclusively the main task. Each bar shows the error reduction that the model predicts a participant will show in different situations, i.e., from baseline to short-term retention (left) and from baseline to long-term retention (right). Note that the terms related to the influence of LOC in the Experimental group from baseline to STR or LTR were excluded from the model M1.1 to improve interpretability by reducing unnecessary complexity. White bars refer to a participant with average levels of every trait. Colored bars represent the same situation for a participant with a higher (solid) or lower (striped) level of a specific trait. Error bars represent the confidence interval associated with each estimated error reduction. The square brackets at the bottom represent the corrected p-values associated with those comparisons. Note that the ones in gray represent triple interactions; therefore, comparison of comparisons. STR: Short-term retention, LTR: Long-term retention, Av.: Average, Exp.: Experimental, LOC: Locus of Control

We found that the human-robot interaction force was correlated with personality traits. Figure 5 shows how high levels of Transform of Challenge were associated with different changes from baseline to STR depending on the training group. Participants in the Control group with higher levels of this trait exhibited a larger reduction in the interaction forces from baseline to STR, compared to participants with this trait in the Experimental group (corrected p-value $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=1.56e^{-9}$$\end{document}$ ). For participants allocated to the Experimental group, LOC also had a notable influence. While participants with an internal LOC showed larger interaction force reductions from baseline to STR and LTR, those participants with external LOC demonstrated a force increase. In this case, the p-values associated with both personality trait effects showed statistically significant after the Bonferroni correction ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=5.84e^{-2}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=5.17e^{-5}$$\end{document}$ respectively).Fig. 5. Barplot showing results from the M1.2 LMM regarding exclusively the main task. Each bar shows the interaction force reduction that the model predicts a participant will show in different situations, i.e., from baseline to short-term retention (left) and from baseline to long-term retention (right). White bars refer to a participant with average levels of every trait. Colored bars represent the same situation for a participant with a higher (solid) or lower (striped) level of a specific trait. Error bars represent the confidence interval associated with each estimated interaction force reduction. The square brackets at the bottom represent the corrected p-values associated with those comparisons. Note that the ones in gray represent triple interactions; therefore, comparison of comparisons. STR: Short-term retention, LTR: Long-term retention, Av.: Average, Exp.: Experimental, LOC: Locus of Control

Results regarding the transfer tasks can be found in Fig. 6. Both metrics (performance and interaction force) were affected by the transfer task. Some tendencies were found in the Control group. Participants in this group showed better performance in the dynamics transfer task w.r.t. the main task during baseline ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=6.303e^{-4}$$\end{document}$ ). Participants with high levels of Transform of Challenge in the same group showed smaller differences ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=4.719e^{-2}$$\end{document}$ ). The position-based transfer task was associated with differences in the interaction force metric. In particular, participants in the Control group reduced, to a smaller degree, the interaction force from baseline to STR and LTR in the position transfer task as compared to the main task ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=6.417e^{-3}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=7.124e^{-7}$$\end{document}$ respectively).Fig. 6. Top: Barplot showing results from the M1.1 LMM regarding the dynamics transfer task. Bottom: Barplot showing results from the M1.2 LMM regarding the position transfer task. Each bar shows the error (top) or interaction force (bottom) reduction that the model predicts a participant will show in different situations, i.e., from the main task in the baseline to the dynamics transfer task in the baseline (top) or from the baseline to STR/LTR in both, the main and the transfer task (bottom). White and pink bars refer to a participant with average levels of every trait. Other colored bars represent the same situation for a participant with a higher or lower level of a specific trait. Error bars represent the confidence interval associated with each estimated error or interaction force reduction. The square brackets at the bottom represent the corrected p-values associated with those comparisons. Note that the ones in gray represent triple interactions; therefore, comparison of comparisons. STR: Short-term retention, LTR: Long-term retention, Av.: Average

Human-robot interaction perception results (M2)

We did not find significant effects between the subjective experience of the haptic guidance (HRI questions) and changes in performance or interaction force from baseline to any of the other phases. However, we found that a high frustration combined with a high Free Spirit will cause a significantly higher error in the long-term retention (LTR) for those participants allocated in the Experimental group ( $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$p=5.772e^{-3}$$\end{document}$ ), i.e., they showed degraded performance metric. While participants in this group showed error reduction from baseline to LTR, this improvement was no longer statistically significant when considering participants with both high frustration and high Free Spirit scores. Interestingly, those participants who perceived high frustration but scored lower in the Free Spirit gaming style questionnaire showed an error reduction of almost 20% more than an average participant when comparing baseline to LTR (Fig. 7).Fig. 7. Barplot showing results from the M2 LMM, including only the Experimental group and the main task. Each bar shows the error reduction that the model predicts a participant will show in different situations, i.e., from baseline to long-term retention. The white bar refers to a participant with average levels of every trait. Colored bars represent the same situation for participants with a higher (solid) or lower (striped) level of a specific trait and/or high frustration. Error bars represent the confidence interval associated with each estimated error reduction. The square brackets at the bottom represent the corrected p-values associated with those comparisons. Note that the ones in white represent triple interactions; therefore, comparison of comparisons. LTR: Long-term retention, Av.: Average, Exp.: Experimental

Discussion

The influence of personality traits in motor learning (H1)

Overall, participants demonstrated learning progress, i.e., reduced the error and interaction forces from baseline to STR and LTR. We observed slightly lower learning for those participants who trained with haptic guidance (Experimental group) compared to the ones who trained without it (Control group). Still, the p-values associated with these effects did not remain statistically significant after Bonferroni correction, contrary to our first hypothesis (H1.1). This aligns with our previous findings in a similar pendulum task, where we did not find significant differences between training with haptic guidance and without it in terms of accuracy improvements after training [44]. Yet, although the power analysis (Appendix A) confirmed that 40 participants were enough to detect the effect size, it could be that we overestimate that effect. This effect might be too small to be detected for this sample size; repeating the experiment with a larger sample size is left for future work.

From the evaluated personality traits and gaming styles, only the Transform of Challenge (H1.2) and Locus of Control (H1.5) were found to influence motor learning. The Achiever gaming style was excluded from the LMM M1.1, as its inclusion did not improve model fit and resulted in higher AIC and BIC values. Therefore, hypothesis H1.3 is not supported by the data. Although exploratory analyses suggested a potential relationship relevant to H1.4 (related to the Free Spirit gaming style), fully evaluating this effect would require studying the model complexity beyond the current study’s scope. As such, we leave a thorough investigation of H1.4 for future work (see Appendices D and E for completeness).

The effect of autotelic personality in motor learning depends on the training strategy (H1.2)

In line with our second hypothesis (H1.2), we observed that those scoring high in Transform of Challenge showed different patterns in both performance and human-robot interaction force changes from baseline to STR depending on the training group they were allocated to. Those with high Transform of Challenge levels in the Control group exhibited slightly greater improvement in performance (lower error) from baseline to STR when compared to those with average trait levels. However, this effect did not remain statistically significant after applying Bonferroni correction to the specific cases, likely due to the conservative nature of the adjustment and the small sample size. However, differences remained significant after the Bonferroni correction when evaluating the interaction force. Participants with a high level of Transform of Challenge in the Control group reduced the interaction force substantially more from baseline to STR than those in the Experimental group. This greater force reduction in the Control group is in line with literature that shows an association between Autotelic behaviors and high-intensity athletes [32], who are experts in movement economy and efficiency [48]. For these subjects moved by challenges, the haptic guidance may have reduced their perception of the task difficulty, leading to a smaller reduction of the interaction force in those allocated in the Experimental group. The haptic guidance may have interfered with this group of participants’ engagement and focus, making the learning environment potentially less ideal for this personality type. This aligns with our observations during training: Participants with high Transform of Challenge demonstrated worse performance under guidance than average participants, likely due to a lack of engagement caused by the reduced perceived difficulty [43].

These findings suggest that individuals with Autotelic traits, particularly Transform of Challenge, could excel in economizing their effort. However, they may be sensitive to interventions that reduce the perceived challenge, such as haptic guidance. That finding positions this trait as an intriguing subject for further study. Future research should explore the interplay between this trait and perceived difficulty and attentional focus. Additionally, employing methods like error amplification, which increases task difficulty, could help sustain the motivation of such participants and further enhance their learning outcomes [7].

The effect of locus of control in motor learning depends on the training strategy (H1.5)

In line with our hypothesis H1.5, we found a significant relationship between the LOC and the reduction in human-robot interaction forces from baseline to the retention phases. For participants in the Experimental group, we found contrasting tendencies in interaction force reduction after training depending on whether participants exhibited a more external or internal LOC orientation. In particular, participants with an external LOC (i.e., believe that outcomes from actions are attributed to external circumstances) increased their interaction force between baseline and STR & LTR. In contrast, those with an internal LOC (i.e., believe that outcomes from actions are attributed to oneself) demonstrated significantly larger reductions when compared to participants with average trait levels allocated in the Experimental group.

These results align with our previous findings during training: Participants in the Experimental group with external LOC exhibited higher interaction forces during training than those with internal LOC [43]. A possible explanation is that participants with an internal LOC perceived the haptic guidance as a means to enhance their personal control over the pendulum dynamics, which aligns with their intrinsic belief in self-control. By attributing successful performance improvements to their own actions, internal LOC participants may have been more engaged in refining their interaction strategy. This interpretation is consistent with previous studies showing that individuals with an internal LOC tend to perform better when provided with positive feedback, or in this case, perceiving better performance by obtaining a higher score while receiving guidance [35].

Conversely, participants with a more external LOC may have perceived the haptic guidance as the primary driver of task success, leading to less motivation to actively improve their performance/interaction with the device. Prior research in educational and behavioral psychology has shown that individuals with an external LOC often demonstrate lower persistence when facing new challenges or scenarios [49]. In the context of haptic guidance, this may translate to a tendency to “lean on” the robotic support, resulting in higher interaction forces from baseline to retention phases. However, this was not accompanied by a statistically significant decline in performance. One interpretation is that these participants, having attributed earlier improvements to the robotic guidance, may have been unprepared for its removal, resulting in inefficient strategies that increased physical interaction with the device without necessarily worsening performance outcomes.

The type of transfer task affects performance and interaction metrics (H2)

When examining the performance and interaction metrics during the transfer tasks, distinct patterns emerged. During baseline, participants exhibited smaller errors in the different dynamics transfer task as compared to the main task, in line with our hypothesis (H2.2). The dynamics transfer task involved a pendulum with a shorter rod, resulting in a higher natural frequency. Since the hand movement requirements remained unchanged (i.e., identical target positions as in the main task), the altered dynamics may have made it easier for participants to move outside the natural frequency of the pendulum. Prior work from our group showed that deviation from the pendulum’s natural frequency is indeed correlated with higher control in this particular target-hitting task [44]. The improved control could have contributed to better performance. Interestingly, those participants with a high Transform of Challenge score showed smaller error reduction from the main to the dynamics transfer task in baseline. This suggests that these individuals may have perceived the altered dynamics task as less difficult and, therefore, less engaging, in line with hypothesis H2.3 and the previously discussed findings related to motor learning. The better performance observed in the dynamics transfer task compared to the main task was no longer significant in the subsequent STR and LTR phases. This likely reflects participants’ increased mastery of the main task, which was the only task trained during the training phase. As a result, improved overall skill during the training phase may have reduced the performance gap between the two tasks.

Contrary to hypothesis H2.1, we did not find differences in performance or interaction force between the main and the position-based transfer task during baseline. However, the human-robot interaction force was reduced less in the position-based transfer task than in the main task when comparing the baseline to STR in both cases. This transfer task was designed to promote wider movements, obligating the trainees to deal with bigger pendulum amplitudes. This could have hindered the generalization process, as they needed to increase the interaction force to overcome the challenge.

The interaction between personality traits and human-robot interaction perception affect performance (H3)

Participants in the Experimental group were asked three additional questions about how restricting or permitting they found the guidance, how disturbing or helpful they found the guidance, and their level of frustration during the task (HRI questions). Contrary to our hypothesis H3.1, none of the questions showed a statistically significant effect on performance improvement. Yet, when the information about personality traits was included in the analysis, we found that participants with varying levels of the Free Spirit gaming style were particularly sensitive to frustration, affecting their performance and aligning with hypothesis H3.2.

The performance improvement from baseline to long-term retention in the Experimental group — potentially associated with learning effects — was no longer statistically significant when accounting for participants with both high frustration and high Free Spirit scores. Individuals with high Free Spirit gaming style scores — characterized by strong exploratory tendencies [41] — did not experience a performance improvement from baseline to long-term retention if they felt frustrated. This might be due to the fact that the guidance restricted their exploration tendencies. The Free Spirit gaming style has been found to negatively correlate with social design game elements, suggesting a preference for autonomy and self-expression over structured interactions [50]. In contrast, participants who reported high frustration but scored low in Free Spirit showed greater performance improvements from baseline to LTR. With a lower tendency toward exploration, these participants with a low Free Spirit gaming style may have been less inclined to deviate from the guidance and more likely to adhere to the training structure. Consequently, their frustration may have driven them to focus on refining familiar, learned movements rather than seeking alternative approaches, ultimately enhancing their task performance over time. These findings also align with prior research in human-in-the-loop systems, which emphasizes the importance of incorporating human preferences to maintain optimal human performance [20, 21].

Implications for robot-aided motor learning and rehabilitation and future work

Our findings suggest that autotelic personality, Locus of Control, and the Free Spirit gaming style may shape how individuals respond to haptic guidance, highlighting the need for personalized robot-aided rehabilitation protocols. Understanding and extending our knowledge about how these trait-dependent differences interact with robotic assistance could help refine robotic rehabilitation approaches by tailoring assistance levels and feedback mechanisms to enhance performance and motor (re)learning. Personality traits and individuals’ desires and needs might change before and after an acquired brain injury and through the recovery process. Yet, we expect these changes to follow a slowed temporal scale compared to changes in mental states, which vary largely depending on the day, the time of the day, or the task to be completed, reducing the need for in-vivo measurements to provide tailored rehabilitation programs.

Although this work represents a first step toward understanding the potential value of incorporating personality traits into personalized rehabilitation protocols, future work should examine whether similar personality-dependent effects influence motor recovery in individuals post-stroke. This entails broadening the evaluated traits, as stroke frequently induces new or altered psychological traits that can be of importance when regulating their interaction with robotic devices [51]. Additionally, future studies could explore the relationship between personality traits and psychological states like perceived difficulty, attention focus, and engagement across diverse tasks and haptic training methods such as error-enhancing strategies, in both unimpaired and clinical populations.

Study limitations

This study has several limitations. Although the statistical power computation showed that 40 participants were sufficient to detect some effects of interest, the model selection process led to the inclusion of more complex interaction terms than originally anticipated. As a result, the effective power of the statistical tests was reduced, which is reflected in the confidence intervals and non-statistically significant p-values observed in some results. This suggests that, while 40 participants is a reasonable sample size within motor learning experiments, it may be relatively small when investigating more complex effects, such as those involving personality traits and their interactions. Future studies could address this limitation by using the findings presented here to inform the design of future experiments, with larger sample sizes or simplified models to maintain adequate power.

Additionally, the values used to estimate high or low levels of personality trait effects in this work may not fully represent real-world distributions. Moreover, participants were not classified into distinct personality groups but rather exhibited a mix of traits simultaneously, making it difficult to isolate individual effects. Finally, we acknowledge that individual differences in motor learning may be shaped by a range of sociocultural influences, including those related to gender identity. We did not investigate gender-related effects in this study due to the already high complexity of the models. Yet, we encourage future work to explore this further using more inclusive and representative gender frameworks.

Conclusion

This study highlights the multifaceted relationship between personality traits, motor learning, and human-robot interaction (HRI) in the context of robotic-assisted training. We conducted a motor learning experiment with forty unimpaired participants, half of whom received physical guidance from a robotic device. We found that trainees with high Transform of Challenge personalities showed less error reduction after training with haptic guidance, probably due to a lower perceived task difficulty. In addition, those with high Transform of Challenge characteristics who trained without haptic guidance improved their interaction with the robot as compared to the average participants. We theorize that participants with Autotelic personalities hold a high capacity to reduce effort but are substantially penalized by a loss of interest due to tasks that are too easy. Locus of Control (LOC) also showed the nuanced impact of personality on human-robot interaction. Internal LOC participants exhibited a substantial reduction in the human-robot interaction force after training. In contrast, external LOC participants demonstrated only minimal improvement, underscoring the interplay between perceived control and the effectiveness of guidance systems. Finally, participants with a low Free Spirit gaming style showed sensitivity to the subjective perception of frustration due to overly restrictive haptic guidance, resulting in an improvement in their performance after training.

Our work contributes to the understanding of how personality traits and human-robot interaction perception affect motor learning to inform the design of future personalized rehabilitation systems.

Bibliography2

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Reinkensmeyer DJ, Akoner OM, Ferris DP, Gordon KE. Slacking by the human motor system: computational models and implications for robotic orthoses. In: 2009 Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2009. pp. 2129–2132. Ieee.10.1109/IEMBS.2009.533397819964581 · doi ↗ · pubmed ↗
2Özen Ö, Traversa F, Gadi S, Buetler, KA, Nef T, Marchal-Crespo L. Multi-purpose robotic training strategies for neurorehabilitation with model predictive controllers. In: 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR), 2019. pp. 754–759. IEEE.10.1109/ICORR.2019.877939631374721 · doi ↗ · pubmed ↗