A study protocol for a systematic review and meta-analysis on macroscopic body movements as a marker for acute stress
Miriam Kurz, Veronika Ringgold, Luca Abel, Felicitas Burkhardt, Robert Richer, Lena Schindler-Gmelch, Anne I. Turner, Bjoern M. Eskofier, Nicolas Rohleder

TL;DR
This paper outlines a systematic review and meta-analysis to explore if body movements can serve as a new way to measure acute stress.
Contribution
The study introduces a protocol for evaluating macroscopic body movements as a novel, non-intrusive marker for acute stress.
Findings
The study will assess the feasibility of using macroscopic body movements as a stress marker.
It will synthesize findings from existing studies using meta-analysis and GRADE evaluation.
The protocol includes a comprehensive search strategy and bias assessment to ensure robust results.
Abstract
Experiencing stress is a normative part of human life. Due to its major impact on health and longevity, effort has been put into understanding underlying determinants and consequences. For stress measurement, a vast number of different methods, including physiological, subjective, and behavioral assessments, can be used. Previous measurement methods have already provided valuable insights into the role of the stress systems. However, their application has some drawbacks, highlighting the need for more convenient, less expensive, less intrusive, and continuous methods. A promising alternative could be the assessment of macroscopic body movements, which we define as body movements that are actively, consciously, or unconsciously, initiated by the muscular system. The study protocol was registered with PROSPERO, and PRISMA-P guidelines were followed. We will screen MEDLINE, PsycInfo, Web…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
- —http://dx.doi.org/10.13039/501100001659Deutsche Forschungsgemeinschaft
- —Friedrich-Alexander-Universität Erlangen-Nürnberg (1041)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Heart Rate Variability and Autonomic Control · Stress Responses and Cortisol
Background
The primary objective of the human body is to preserve a state of homeostasis, which involves regulating inner processes to achieve a “state of balance” [1]. When facing a stressor, its perception via the central nervous system (CNS) ultimately activates the locus coeruleus–norepinephrine system [2] and the paraventricular nucleus of the hypothalamus [3]. In turn, the two human stress systems, the sympathetic nervous system (SNS) as part of the autonomic nervous system (ANS) and the hypothalamic–pituitary–adrenal axis (HPA axis), are activated. The SNS reacts within seconds and prepares the body for fight-or-flight by releasing adrenaline and noradrenaline. The HPA axis reacts more slowly and releases the stress hormone cortisol, which, in complex orchestration with the ANS, enables the organism to cope with the stressor [4–9]. Short-term stress responses are beneficial and necessary to ensure survival by enabling an adequate response to threatening stimuli [10, 11]. However, long-term stress, and stressors that are recurring and associated with emotional or physical strain, can exert harmful effects on human health [12–14] due to, e.g., dysregulation of the HPA axis and the ANS [15], and increased inflammation [11].
Given the negative impact that particularly long-term stress can exert, it is important to understand its determinants, mitigate its consequences, and provide a basis for designing more effective interventions to buffer the effects of stress and therefore foster well-being. Laboratory stress protocols allow for standardized stress exposure and assessment of the individual’s behavioral, physiological, and self-reported stress responses [16, 17]. Thus, they can help to understand underlying mechanisms of the human stress response and draw causal conclusions [10, 18, 19]. Alongside protocols such as the Socially Evaluated Cold Pressor Test (SE-CPT; [20]) or the Montreal Imaging Stress Task (MIST; [21]), the Trier Social Stress Test (TSST; [22]) is considered the gold standard to induce acute stress in the laboratory [10, 23]. Of these laboratory stress protocols, the stress response can be assessed in various ways. Subjective self-reports, for instance via standardized state questionnaires like the Short Stress State Questionnaire (SSSQ; [24, 25]), Positive Affect Negative Affect Schedule (PANAS; [26]), or the Primary Appraisal Secondary Appraisal (PASA) questionnaire [27], offer a reliable and cost-effective, but time-delayed and non-continuous method to interrogate inner states and thus quantify the subjectively perceived stress [26, 28]. In addition, they, like all self-reports, might suffer from recall bias [29]. Further, biological measures, such as hormone (e.g. cortisol; [30, 31]) or enzyme (e.g. alpha-amylase; [32]) levels in blood, urine, or saliva, provide insights into the (re-)activity of the HPA axis or of the SNS, respectively [33]. Although such hormonal or enzymatic measures offer a dependable and non-invasive means of assessment [34], they are also subject to confounding influences such as hormonal diurnal cycles [35, 36], medication or food intake [37], or mental and physical conditions [38–40]. Further, they do not allow for continuous assessments [28, 41], and are associated with considerable laboratory cost and work [42, 43]. Physiological variables, such as electrocardiogram (ECG), skin conductance, respiration, or electroencephalogram (EEG) data may provide insights into the activity of the SNS [28, 44]. Heart rate (HR) and HR variability (HRV), for instance, have been found to be a reliable marker for stress assessment [45–48]. However, while HR(V) can now be measured in a minimally obtrusive manner using wearable trackers like smartwatches [45, 49], more complex ECG data can so far only be collected by a varying number of electrodes on the upper body [46], which potentially interferes with participants’ natural experience and behavior.
Thus, it can be summarized that while there is a plethora of possibilities to study stress in a standardized fashion, there is a lack of methods allowing for non-invasive, non-intrusive, continuous, and inexpensive assessment of stress. One innovative approach that may bridge this gap is the contactless assessment and interpretation of body movements. Body movements can be classified into macroscopic and microscopic movements. We define macroscopic movements as body movements that are actively initiated by voluntary and/or involuntary activation of skeletal muscles, and microscopic movements as all body movements that are indirectly caused by internal physiological processes (such as the beating of the heart or respiration-induced chest movements). With regard to macroscopic movements, a long-established methodology entails the evaluation of facial expressions [50, 51]. A limited number of studies are available that describe how facial expressions are associated with stress responses. Lupus et al. (2014) report links between the expression of anger and HPA axis and SNS activity following the TSST in men, but not in women [52]. Blasberg et al. (2023) report correlations between movements of different regions of the face, such as the upper eyelid, the upper lip, or the lip corner, so-called Action Units (AUs), with the psychosocial stress response, including cortisol, to the TSST for both women and men [53]. From such AUs, emotion indices may be retrieved [54]. Using Machine Learning algorithms, Giannakakis et al. (2017) have established a pipeline to automatically detect and analyze AUs from video recordings [55].
Cues about people’s inner states may not only be visible in their facial expressions but may also be inferred from their body posture. The concept of “freezing”, which is characterized by a decrease in heart rate as well as a reduction in body sway, has received attention as a marker for social threat [56]. Participants were found to show more freezing behavior in response to stimuli such as angry faces [56] or unpleasant films [57]. While freezing refers to rapid reductions in body movements and HR in the immediate response to a threat, we and others have found that stress responses that develop more slowly are also associated with a reduction of body movements. We found a decrease in head, upper extremities, and upper body movements in response to the TSST [58]. Similarly, Pisanski et al. (2018) reported a decrease in hand movement, following the TSST [59].
Besides the approaches using facial expressions and reduction of macroscopic body movements, a smaller body of research has looked at further possibilities to assess stress and related psychological states. For instance, one study detected variations in pressure distribution on an office chair after a laboratory stress intervention [60]. By assessing the angular movement of a steering wheel, Paredes et al. (2018) reported that this allowed them to distinguish between a stressed and a non-stressed state of participants driving a car [61]. Some studies have explored alterations in keyboard typing patterns before and after stressful events. While some of the results suggest a possibility of differentiating between high and low stress conditions based on keyboard typing behavior [62, 63], others found no universal patterns [64, 65].
Thus, it can be stated that there currently are a number of different approaches aiming to utilize body movements as stress markers. In light of recent technological advances, such as the prediction of human pose through images or videos (see e.g., [66]), this research field can be expected to grow rapidly over the next few years. Hence, it is necessary to provide an overview of the current literature, and, thus, a framework for further research. This systematic review and meta-analysis examine the reliability and validity of macroscopic body movements as a marker of acute stress or related psychological constructs in humans, either by comparing responses under experimentally induced stress to baseline or non-stressful conditions, or by including studies that assessed traditional stress markers alongside macroscopic movements.
Methods
Protocol registration and reporting information
This systematic review was registered with PROSPERO CRD42024539659. This protocol, as well as the ensuing systematic review/meta-analysis, is reported in accordance with PRISMA-P guidelines [67] (see Additional file 1).
Classification of body movements
In our research methodology, we employ a categorization approach to identify suitable studies. Specifically, we distinguish between macroscopic movements and microscopic movements. Macroscopic movements are, for example, body posture and/or its change over time, or behaviors such as keyboard typing. Moreover, macroscopic movements may further be classified into movements that involve an object, like a steering-wheel or a keyboard, and object-free movements, such as body sway. An example of a microscopic movement (which is not part of the planned review) is the upward and downward motion of the rib cage during inhalation and exhalation, which is caused by respiratory activity. Although these chest movements are visible to the human eye and could be classified as either a macroscopic or a microscopic movement, we have explicitly chosen to exclude them from the current review. This decision was made to narrow the scope of the review and to maintain a clear focus on movement patterns primarily driven by motor behavior.
Eligibility criteria
We will select studies based on the criteria study design, participants, exposure/interventions, and outcomes (PICO/PECO framework; [68]).
- Study design, comparison, and* exposures/interventions*: We will include studies that actively induce acute stress or related psychological states, i.e., perception of threat or negative affect, in human study participants via a standardized protocol (e.g., via laboratory stress protocols such as the TSST or MIST, or via experimental paradigms that systematically manipulate real-life environmental factors such as office noise), and that report a comparison between a stressed group or condition and an appropriate control group or condition in the desired outcome variables. This could be achieved by comparing different experimental groups (between-subject) or conditions (within-subject). We will also include studies reporting only on pre-post comparisons before and after the stressor if their measures include both an established marker of stress (e.g., saliva-based markers such as cortisol, blood-based markers such as adrenaline/noradrenaline, or electrophysiological markers such as HR(V) or skin conductance) and a measure of macroscopic movement. Studies that investigate the opposite relationship between body movement and stress, e.g., the effect of physical exercise on the stress response, will be excluded from the analysis.
- Participants: In the systematic review, we will include studies that exclusively involve human participants. Animal studies will be excluded from our analysis. With regards to the human sample, there will be no limitation, i.e., we will include studies reporting outcomes in children, adolescents, and adults as well as studies that focus on either the entire population or specific sub-populations. Furthermore, we will include patient groups independent of the presence/absence of any disease/disorder.
- Outcomes: We will include studies that report on at least one macroscopic movement which we classify as any movement that is, consciously or unconsciously, evoked by the muscular system. Studies may also include established indicators of the human stress response. However, if only traditional measures of stress are reported, we will not include the study in this review.
There will be no restrictions based on the publication date. This is due to our anticipation that relevant research is more likely to be recent, and we aim to avoid excluding valuable foundational literature in an already limited pool of literature. However, we will only consider articles published in peer-reviewed journals or peer-reviewed conference proceedings, written in either English or German. Conference abstracts, dissertations, and other gray literature will not be included in the study selection.
Information sources and search strategy
As a primary source for identifying eligible papers related to our systematic review on stress and macroscopic movements, we will conduct a comprehensive search in the following electronic databases: MEDLINE, PsycInfo, Web of Science, Embase, and Google Scholar. These databases were chosen for their extensive coverage of scientific literature in the fields of psychology, medicine, and related disciplines. We will screen the title and abstract of each paper to determine its eligibility based on a previously developed guideline. Additionally, we will employ a citation chaining technique to identify additional relevant studies. This approach involves reviewing the reference lists of included studies to find other potentially eligible articles that might have been missed in the initial database search. The systematic search will be performed by using combinations of relevant keywords related to stress and macroscopic movements. These keywords were derived based on an exploratory literature scan and refined through team consensus (MK, VR, LA, FH, LSG, RR, AT, and NR), informed by prior relevant publications and iterative testing in different databases. The final search strategies for PsycInfo, MEDLINE Complete, and Embase are displayed in Tables 1, 2, and 3, respectively. For Google Scholar, due to its large volume of search results, we will screen only the first 10 pages of search results (i.e., the top 100 entries), as has been done previously [69]. This step is intended to identify additional potentially relevant studies that may not have been indexed by the other databases. We will use Web of Science for forward and backward citation searches. Table 1. Search string for PsycInfo. The search combines the concept stress with the concept macroscopic movementsPsycInfo searchFilterSearch stringConcept 1: StressTitle/Abstract("acute stress" OR "psycho* stress" OR "stress respons*" OR "stress react*" OR "stress exposure*" OR "stress detection*" OR "stress measur*" OR “stress intervention*” OR “stress induc*” OR “Trier Social Stress Test” OR TSST OR “social* threat*” OR “threat* social”)ORSubjects/Mesh terms"Cardiovascular Reactivity" OR "Stress" OR "Acute Stress" OR "Perceived Stress" OR "Physiological Stress" OR "Stress Reactions" OR "Distress" OR "Emotional Exhaustion"Concept 2: Macroscopic body movementsANDTitle/Abstract(motion* OR gait OR postur* OR sway OR gesture* OR "Facial Action Coding System" OR "facial expression*" OR “action unit*” OR freezing OR biometrics OR "pressure distribution" OR "capacitive mouse" OR "pressure sensitive")ORSubjects/Mesh terms"Facial Recognition (Artificial Intelligence)" OR "Sensor Technology"Table 2. Search string for MEDLINE complete. The search combines the concept stress with the concept macroscopic movementsMEDLINE complete searchConcept 1: StressTitle("acute stress" OR "psycho* stress" OR "stress respons*" OR "stress react*" OR "stress exposure*" OR "stress detection*" OR "stress measur*" OR “stress intervention*” OR “stress induc*” OR “Trier Social Stress Test” OR TSST OR “social* threat*” OR “threat* social”)ORAbstract("acute stress" OR "psycho* stress" OR "stress respons*" OR "stress react*" OR "stress exposure*" OR "stress detection*" OR "stress measur*" OR “stress intervention*” OR “stress induc*” OR “Trier Social Stress Test” OR TSST OR “social* threat*” OR “threat* social”)ORMesH TermsStress, Physiological OR Stress, Psychological OR Trier Social Stress Test OR Trier Stress TestConcept 2: Macroscopic body movementsANDTitle(motion* OR gait OR postur* OR sway OR gesture* OR "Facial Action Coding System" OR "facial expression*" OR “action unit*” OR freezing OR biometrics OR "pressure distribution" OR "capacitive mouse" OR "pressure sensitive")ORAbstract(motion* OR gait OR postur* OR sway OR gesture* OR "Facial Action Coding System" OR "facial expression*" OR “action unit*” OR freezing OR biometrics OR "pressure distribution" OR "capacitive mouse" OR "pressure sensitive")ORMesH terms"Facial Recognition" OR "Automated Facial Recognition" OR "Digital Health"Table 3. Search string for Embase. The search combines the concept stress with the concept macroscopic movementsEmbase searchConcept 1: StressTitle("acute stress" OR "psycho* stress" OR "stress respons*" OR "stress react*" OR "stress exposure*" OR "stress detection*" OR "stress measur*" OR “stress intervention*” OR “stress induc*” OR “Trier Social Stress Test” OR TSST OR “social* threat*” OR “threat* social”)ORAbstract("acute stress" OR "psycho* stress" OR "stress respons*" OR "stress react*" OR "stress exposure*" OR "stress detection*" OR "stress measur*" OR “stress intervention*” OR “stress induc*” OR “Trier Social Stress Test” OR TSST OR “social* threat*” OR “threat* social”)Concept 2: Macroscopic body movementsANDTitle(motion* OR gait OR postur* OR sway OR gesture* OR "Facial Action Coding System" OR "facial expression*" OR “action unit*” OR freezing OR biometrics OR "pressure distribution" OR "capacitive mouse" OR "pressure sensitive")ORAbstract(motion* OR gait OR postur* OR sway OR gesture* OR "Facial Action Coding System" OR "facial expression*" OR “action unit*” OR freezing OR biometrics OR "pressure distribution" OR "capacitive mouse" OR "pressure sensitive")
The entire search process will be conducted by two reviewers (MK, VR) independently, and any discrepancies in study selection will be resolved through discussion and consensus. Thereby, bias will be minimized and reliability and reproducibility of the search results will be enhanced.
Screening and selection procedure
All identified records from the databases will be imported into Rayyan [70] and duplicates will be removed. Two reviewers (MK, VR) will then screen the titles and abstracts of the remaining records to identify potentially eligible studies. Any disagreements or uncertainties during this stage will be resolved through discussion or, if necessary, by consulting a third reviewer (LA). In the next step, full texts of the selected articles will be retrieved and assessed by the two reviewers against the predetermined eligibility criteria.
Data collection process
Data of eligible studies will be extracted independently by two authors (MK, VR) and imported into Excel [71]. This step will be pre-tested with five articles to ensure feasibility and comprehensiveness. We will contact authors in case of missing data. If there are multiple reports on one set of data, only one report will be included. We will report on the percentage of agreement between the two raters as well as calculate Cohen’s kappa (K) to determine the inter-rater reliability.
Data processing and classification of outcome variables
Extracted data will include information on the title of the paper, author(s), year of publication, country of recruitment, sample size, number of exclusions, reason(s) for exclusion, final participant number, mean age, SD age, percentage of female participants, sample type (general population vs. specific sub-population), inclusion criteria, measure(s) of stress, type of macroscopic movements, description of the stress intervention, description of the control intervention, order of conditions (in case of within-subject design), mean and standard deviation of used stress metric at pre-assessment and post-assessment for the experimental group and control group, and received compensation for participation. Considered studies will be assessed for their methodological quality. We will use appropriate tools, such as the revised Cochrane Risk of Bias tool [72] or ROBINS-I [73] to evaluate the quality of randomized and non-randomized trials, respectively. The assessment will be performed independently by the two reviewers (MK, VR), and any discrepancies will be resolved through consensus or consultation with a third reviewer (LA).
Data synthesis
First, a qualitative summary of the extracted information from each study and of our risk of bias assessment will be provided in narrative and tabular form. If there is significant variability and unsuitability for statistical pooling and, thus, meta-analytic analyses prove to not be feasible, we will utilize graphical summary methods for synthesizing evidence when a meta-analysis is not feasible. This includes employing harvest plots, effect directions, or bubble plots to summarize information in a manner that is both accessible and user-friendly. Second, if an adequate number of high-quality studies with relatively low heterogeneity are obtained, we will quantitatively synthesize data from primary studies through a meta-analysis. Heterogeneity will be assessed by using I^2^ statistics [74] and Cochran’s Q test [75]. For those studies that report on both traditional markers and a macroscopic movement, we will compare effect sizes between the two measures. In addition, when correlations between traditional markers and macroscopic movements are reported, we will analyze the correlation coefficients meta-analytically. All analyses will be conducted using Rstudio (version 2023.12.1 + 402 or later). Meta-analyses will be performed using the meta [76] and/or metafor [77] packages, and additional analyses will use relevant packages as appropriate.
Additional analyses
Meta-biases
The results of this review and meta-analyses will undergo critical examination towards meta-biases, such as selective reporting within studies or publication bias across studies. We will construct a funnel plot and conduct tests for asymmetry, such as Egger’s test [78], with a minimum of 10 studies (whenever feasible) to assess the presence of small-study effects [79, 80].
Confidence in cumulative evidence
The robustness of the evidence base will be evaluated using the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) system, a framework for appraising the quality of evidence and strength of recommendations [81, 82]. Quality of evidence refers to the certainty that estimated effects are accurate and can be categorized into high, moderate, low, and very low [81, 83]. We will assess the certainty of evidence separately for each primary and secondary outcome. Factors that compromise the quality of results are, among others, study limitations, inconsistent results, indirect evidence, imprecision, and publication bias; if these five factors are present, the quality rating can be downgraded [84]. The strength of the recommendation is characterized by the confidence that the positive effects of an intervention outweigh its negative effects and can be classified as either strong or weak [81, 85].
Discussion
We propose a study protocol for a systematic review and meta-analysis on stress assessment based on body movements and behavior. As a first step, we propose to discriminate between macroscopic movements and microscopic movements. While systematic reviews focusing on internal physiological processes that affect microscopic movements are available (e.g., [86]), findings on macroscopic movements to this day have, to the best of our knowledge, not been assembled in a systematic way. Hence, to offer a more profound insight into this novel approach to stress assessment, and to provide a sound basis for further research, we plan to conduct a systematic review and (if possible) meta-analysis on the effects of acute stress on macroscopic movements, i.e., all body movements that are actively, consciously, or unconsciously initiated by the muscular system. The aim is to determine whether macroscopic movements can be an indicator for acute stress and, if so, which movements best distinguish between a stressful and non-stressful state in human beings. In light of recent technological advances, our findings could further advance unobtrusive and continuous stress detection using tools such as motion sensors, wearable devices, or cameras. This could not only be beneficial in laboratory stress research, but may also be applied in real-world settings, where early interventions or adaptive support are conceivable.
Supplementary Information
Additional file 1: PRISMA-P guidelines.
