# Performance of active and passive ambulatory assessment measures and mood monitoring in bipolar disorder: a systematic review

**Authors:** Laurence Astill Wright, Eduard Bakstein, Kate Saunders, Boliang Guo, Richard Morriss

PMC · DOI: 10.1186/s40345-025-00407-5 · 2026-01-23

## TL;DR

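This review evaluates how well active and passive digital tools track mood in bipolar disorder. Some active measures perform well against established clinical measures, evidence for passive measures is sparse, and inconsistent performance metrics point to a need for shared reporting standards.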

## Contribution

The study systematically reviews the performance of ambulatory assessment tools in bipolar disorder, highlighting the lack of standardized metrics and evidence for passive measures.

## Key findings

- Some active ambulatory assessment approaches showed good performance compared with established clinical measures.
- Passive ambulatory assessment measures lack sufficient evidence for validity and reliability.
- High variability in metrics limits meaningful comparisons and replication across studies.

## Abstract

Ambulatory assessment uses digital technology to capture real-time data on mood, mental state and behaviour. It has the potential to enhance traditional clinical outcome measures, but the practical application of these tools fundamentally depends on their performance.

This systematic review aimed to assess the performance of active and passive ambulatory assessment and mood monitoring outcome measures in non-randomised and randomised studies in bipolar disorder over 3 months or longer. We aimed to evaluate their performance against established clinical measures and through inter-ambulatory assessment comparisons.
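To make "performance against established clinical measures" concrete, the sketch below computes a Spearman rank correlation between a self-reported mood score and a clinician-rated score. This is only one plausible kind of performance metric and is not taken from the review. The function names and all data values are hypothetical and chosen purely for illustration.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman_rho(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    if len(x) != len(y):
        raise ValueError("x and y must have the same length")
    return pearson_r(average_ranks(x), average_ranks(y))


# Hypothetical paired observations for six assessment occasions:
# an app-based self-report score and a clinician-rated scale score.
self_report = [2, 5, 3, 8, 7, 4]
clinician = [4, 9, 6, 15, 14, 5]

rho = spearman_rho(self_report, clinician)
print(round(rho, 3))
```

A rank-based statistic is used here because self-report and clinician scales rarely share units or a linear relationship. A real validation study would also need to account for repeated measurements within participants, which this sketch ignores.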

We conducted a systematic review (PROSPERO: CRD42023396473) of the performance of mood monitoring and ambulatory assessment protocols in RCTs and non-randomised studies in bipolar disorder. Identified studies were assessed for risk of bias. Because of the very high heterogeneity in the included studies and performance metrics, we could not aggregate the data via meta-analysis.

The review included 42 studies with a combined sample of 7,813 participants. We included 28 distinct ambulatory assessment protocols which reported 487 different smartphone-based performance metrics. The considerable variability and inconsistency across these metrics limited our ability to make definitive comparisons of performance. Overall, some active ambulatory assessment approaches showed good performance when compared with established clinical measures. There was a paucity of data examining the performance of passive ambulatory assessment measures. Most studies were rated as having low to moderate risk of bias.

While ambulatory assessment holds significant promise, current evidence fails to establish the validity and reliability of passive ambulatory assessment to measure mood. The substantial methodological variation—particularly in how performance metrics are defined and reported—limits meaningful comparison and replication. Greater consistency in ambulatory assessment design and reporting standards is essential to support reliable evaluation and broader adoption of these behavioural assessment tools.

The online version contains supplementary material available at 10.1186/s40345-025-00407-5.

## Linked entities

- **Diseases:** bipolar disorder (MONDO:0004985)

## Full-text entities

- **Diseases:** hypomania (MESH:D000087122), DSM-IV (MESH:D006011), Mental Disorders (MESH:D001523), Mental Health (OMIM:603663), Affective Disorder (MESH:D019964), GAD-7 (MESH:C537955), mental (MESH:D008607), Anxiety (MESH:D001007), mental distress (MESH:D012128), Anxiety Disorder (MESH:D001008), BD (MESH:D001714), Depression (MESH:D003866)
- **Chemicals:** CGI-BP (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** NU23-04-00534 — Homo sapiens (Human), Transformed cell line (CVCL_H046)

---
Source: https://tomesphere.com/paper/PMC12852565