# A hybrid CNN-spectral architecture for non-contact respiratory rate estimation using multi-region optical-flow analysis

**Authors:** Sreya Deb Srestha, Sungho Kim

PMC · DOI: 10.1371/journal.pone.0325340 · PLOS One · 2026-02-20

## TL;DR

This paper introduces a new non-contact method to accurately estimate respiratory rate using a hybrid CNN-spectral framework and optical-flow analysis across multiple body regions.

## Contribution

The novel contribution is a multi-region CNN-spectral architecture with adaptive Kalman filtering and SNR-based fusion for robust respiratory rate estimation.

## Key findings

- The method achieves a mean average error of 0.61–0.95 bpm across different skin tones and ages.
- Multi-region optical-flow analysis improves accuracy in varying lighting and motion conditions.
- Data augmentation enhances generalizability for real-world healthcare applications.

## Abstract

Respiratory rate (RR) is a key indicator for assessing health conditions, yet noncontact measurement remains challenging due to motion artifacts, lighting variability, and skin-tone differences. This study presents a robust framework combining a custom convolutional neural network (CNN) with spectral analysis of optical-flow signals to estimate RR accurately across diverse population. Respiration-induced motion is extracted from six anatomical regions: forehead, cheeks, upper chest, and shoulders. Adaptive Kalman filtering combined with signal-to-noise ratio (SNR)-based weighted fusion enables reliable RR estimation. To improve generalizability, extensive data augmentation was applied, simulating illumination conditions ranging from normal to low light. The experimental results indicate that the proposed method achieves a mean average error (MAE) of 0.61–0.95 breaths per minute (bpm) for different skin tones and ages, within the clinically relevant range. These findings support the effectiveness of the multi-region CNN-spectral framework as a reliable, noncontact, real-time respiratory monitoring solution with potential for continuous healthcare and telemedicine applications.

## Full-text entities

- **Diseases:** RSA (MESH:D001146), skin irritation (MESH:D012871), cardiopulmonary arrest (MESH:D006323), skin pigmentation (MESH:D010859), pneumonia (MESH:D011014), respiratory (MESH:D012131), respiratory distress (MESH:D012128), COVID-19 (MESH:D000086382), heart failure (MESH:D006333), chronic pulmonary diseases (MESH:D002908), sleep apnea (MESH:D012891)
- **Chemicals:** PPG (-), melanin (MESH:D008543)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12923011/full.md

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12923011/full.md

## References

56 references — full list in the complete paper: https://tomesphere.com/paper/PMC12923011/full.md

---
Source: https://tomesphere.com/paper/PMC12923011