# Breath-based lung cancer detection using an ML-driven low-cost sensor array

**Authors:** Dhruv Iyer, Kavin Gobinath, Krish Kowkuntla, Vitthalrao Vijaykumar Wanjari, Gokulakrishna Banumurthy

PMC · DOI: 10.1038/s41598-025-26416-z · Scientific Reports · 2025-11-24

## TL;DR

A low-cost breath sensor system using machine learning can detect lung cancer with high accuracy and speed.

## Contribution

A novel e-nose system with data augmentation achieves 96.26% accuracy for lung cancer detection.

## Key findings

- The system achieved 96.26% accuracy, 92.88% sensitivity, and 97.75% specificity using ML analysis.
- Data augmentation with Gaussian noise improved performance while preserving data properties.
- The system outperforms existing e-nose methods by over 5% and completes classification in ~5 minutes.

## Abstract

Lung cancer is the leading cause of cancer-related mortality worldwide. Lately, electronic nose (e-nose) systems have emerged as a promising method for non-invasive lung cancer detection. These systems, however, have several limitations, including low accuracy rates and long detection times. To address these challenges, we conducted a pilot study involving the development of an affordable e-nose device that can detect more than 30 volatile organic compounds, using twelve metal oxide semiconductor sensors and one chemi-resistive alkane sensor. The device recorded data for 28 healthy controls and 18 lung cancer breath samples that were then analyzed using a multilayer perceptron neural network. The dataset was expanded through a novel use of data augmentation, where Gaussian noise was applied to generate synthetic samples while preserving the original data’s statistical properties. The model was evaluated by 5-fold cross-validation and achieved an accuracy of 96.26%, sensitivity of 92.88%, specificity of 97.75%, and an area under the curve of 0.9286. Our system outperforms existing e-nose detection methods by more than 5% and is capable of classifying in approximately 5 minutes. These findings highlight the potential of this breath analyzer system as a rapid and cost-effective tool for preliminary lung cancer screening.

## Linked entities

- **Diseases:** lung cancer (MONDO:0005138)

## Full-text entities

- **Diseases:** Lung cancer (MESH:D008175), cancer (MESH:D009369)
- **Chemicals:** metal oxide (-), alkane (MESH:D000473)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12644706/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12644706/full.md

## References

5 references — full list in the complete paper: https://tomesphere.com/paper/PMC12644706/full.md

---
Source: https://tomesphere.com/paper/PMC12644706