# Denoising and Baseline Correction of Low-Scan FTIR Spectra: A Benchmark of Deep Learning Models Against Traditional Signal Processing

**Authors:** Azadeh Mokari, Shravan Raghunathan, Artem Shydliukh, Oleg Ryabchykov, Christoph Krafft, Thomas Bocklitz

PMC · DOI: 10.3390/bioengineering13030347 · Bioengineering · 2026-03-17

## TL;DR

This paper introduces a new deep learning model that improves the quality of fast FTIR imaging, making it much faster while maintaining accuracy for medical diagnostics.

## Contribution

A physics-informed cascade Unet with a deterministic Physics Bridge that separates noise and baseline drift more effectively than existing methods.

## Key findings

- The cascade model reduced RMSE by 51.3% compared to raw single-scan inputs.
- It outperformed traditional workflows and standard Unet models in spectral accuracy and speed.
- The model preserves peak intensity better and avoids spectral hallucinations.

## Abstract

High-quality Fourier Transform Infrared (FTIR) imaging usually needs extensive signal averaging to reduce noise and drift, which severely limits clinical speed. Deep learning can accelerate imaging by reconstructing spectra from rapid, single-scan inputs. However, separating noise and baseline drift simultaneously without ground truth is an ill-posed inverse problem. Standard black-box architectures often rely on statistical approximations that introduce spectral hallucinations or fail to generalize to unstable atmospheric conditions. To solve these issues, we propose a physics-informed cascade Unet that separates denoising and baseline correction tasks using a new, deterministic Physics Bridge. This architecture forces the network to separate random noise from chemical signals using an embedded SNIP layer to enforce spectroscopic constraints instead of learning statistical approximations. We benchmarked this approach against a standard single Unet and a traditional Savitzky–Golay smoothing followed by SNIP baseline correction workflow. We used a dataset of human hypopharyngeal carcinoma cells (FaDu). The cascade model outperformed all other methods, achieving a 51.3% reduction in RMSE compared to raw single-scan inputs, surpassing both the single Unet (40.2%) and the traditional workflow (33.7%). Peak-aware metrics show that the cascade architecture eliminates spectral hallucinations found in standard deep learning. It also preserves peak intensity with much higher fidelity than traditional smoothing. These results show that the cascade Unet is a robust solution for diagnostic-grade FTIR imaging. It enables imaging speeds 32 times faster than current methods.

## Linked entities

- **Diseases:** hypopharyngeal carcinoma (MONDO:0005216)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** hypopharyngeal carcinoma (MESH:D007012)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13023615/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13023615/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC13023615/full.md

---
Source: https://tomesphere.com/paper/PMC13023615