# Machine Learning Calibration of Smartphone-Based Infrared Thermal Cameras: Improved Bias and Persistent Random Error

**Authors:** Jayroop Ramesh, Tom Loney, Stefan Du Plessis, Homero Rivas, Assim Sagahyroon, Fadi Aloul, Thomas Boillat

PMC · DOI: 10.3390/s26041295 · 2026-02-17

## TL;DR

Smartphone thermal cameras like FLIR One Pro have large errors in measuring skin temperature, and machine learning calibration only partially improves accuracy.

## Contribution

The study demonstrates that smartphone-based thermal cameras have significant systematic bias and variability, and that machine learning calibration has limited effectiveness in correcting these issues.

## Key findings

- The FLIR One Pro showed a mean bias of -1.42°C and wide agreement limits of ≈6°C compared to a reference thermometer.
- Machine learning models reduced bias but could not fully correct the device's high variability, with the best model achieving an R2 of 0.152.
- The study concludes that the FLIR One Pro is unsuitable for clinical use requiring precise temperature measurements.

## Abstract

Low-cost, smartphone-based thermal cameras offer unprecedented accessibility for physiological monitoring, yet their validity and reliability for absolute skin temperature measurement in clinical settings remain contentious. This study aims to quantify the agreement and repeatability of a widely used smartphone thermal camera, the FLIR One Pro, against a consumer-grade, non-contact infrared thermometer, the iHealth PT3. A method comparison study was conducted with 40 healthy adult participants, yielding a total of 2400 temperature measurements. Skin temperature of the hand dorsum was measured concurrently with the FLIR One Pro and the iHealth PT3. The protocol involved two rounds: Round 1 (R1) in a stable, static environment to assess baseline repeatability, and Round 2 (R2) in a dynamic environment mimicking clinical repositioning. The performance of the instruments was compared using paired t-tests for mean differences and Bland–Altman analysis for assessing agreement. The iHealth PT3 demonstrated superior precision, with an average intra-participant standard deviation (SD) of 0.030 °C in R1 and 0.092 °C in R2. In stark contrast, the FLIR One Pro exhibited significantly higher variability, with an average SD of 0.34 °C in R1 and 0.30 °C in R2. Bland–Altman analysis revealed a substantial mean bias of −1.42 °C in R1 and −1.15 °C, with critically wide 95% limits of agreement ranges of ≈6 °C. The substantial systematic bias and poor agreement of the FLIR One Pro far exceed both its manufacturer-stated accuracy and clinically acceptable error margins for absolute temperature measurement. To further examine whether calibration could mitigate these deficiencies, we applied a suite of ten machine learning regressors to map FLIR readings onto iHealth PT3 values. Calibration reduced systematic bias across all models, with Quantile Gradient-Boosted Regression Trees achieving the lowest MAE (1.162 °C). The Extra Trees model yielded the lowest RMSE (1.792 °C) and the highest explained variance (R2 = 0.152), yet this relatively low value confirms that the device’s high intrinsic variability limits the effectiveness of algorithmic correction. As such the device has limited utility for longitudinal patient monitoring or for diagnostic decisions that rely on precise, absolute temperature thresholds. These findings inform medical practitioners in low-resource settings of the profound limitations of using this device as a standalone clinical thermometer and emphasize that algorithmic correction cannot compensate for fundamental hardware and measurement noise constraints.

## Full-text entities

- **Genes:** ZNF135 (zinc finger protein 135) [NCBI Gene 7694] {aka ZNF61, ZNF78L1, pHZ-17, pT3}
- **Diseases:** rheumatoid arthritis (MESH:D001172), inflammation (MESH:D007249), injury to (MESH:D014947), COVID-19 (MESH:D000086382), diabetic (MESH:D003920), infection (MESH:D007239), diabetic neuropathy (MESH:D003929), rheumatic diseases (MESH:D012216), vascular disorders (MESH:D002561), febrile (MESH:D000071072), fever (MESH:D005334), metabolic dysregulation (MESH:D021081), burn (MESH:D002056), inflammatory chronic disease (MESH:D002908)
- **Chemicals:** SEEK (-), nitrogen (MESH:D009584)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12944422/full.md

---
Source: https://tomesphere.com/paper/PMC12944422