# Early Remaining Useful Life Prediction of Lithium-Ion Batteries Based on a Hybrid Machine Learning Method with Time Series Augmentation

**Authors:** Jingwei Zhang, Jian Huang, Taihua Zhang, Erbao He, Sipeng Wang, Liguo Yao

PMC · DOI: 10.3390/s26041238 · 2026-02-13

## TL;DR

This paper introduces a hybrid machine learning method to accurately predict lithium-ion battery life early on, even with limited and noisy data.

## Contribution

A novel hybrid framework combining signal decomposition, time series augmentation, and deep forecasting for early battery RUL prediction.

## Key findings

- The proposed method achieves high accuracy with R2 scores between 0.9643 and 0.9972 using only 20% of historical data.
- RMSE and MAE values are below 0.0296 and 0.0198, respectively, indicating strong predictive performance.
- The framework is robust to noise and data scarcity in early-stage battery life prediction.

## Abstract

Early and accurate prediction of the remaining useful life (RUL), defined as the number of operational cycles a battery can continue to function before reaching its end-of-life threshold, is crucial for improving the reliability of new energy vehicles. To address noise contamination, capacity regeneration effects, and data scarcity in early-stage prognostics, this paper proposes a hybrid framework integrating signal decomposition, time series augmentation, and deep forecasting. The raw capacity sequence is decomposed using Complete Ensemble Empirical Mode Decomposition with Adaptive Noise (CEEMDAN) to separate multi-scale components. A Transformer-enhanced time series generative adversarial network (HyT-GAN) is then employed to augment decomposed components, improving robustness under small-sample conditions. A CNN-BiGRU predictor is trained for capacity forecasting, and key hyperparameters are tuned via the Dung Beetle Optimizer (DBO). Experiments on NASA and CALCE benchmark datasets demonstrate that the proposed method achieves accurate early-stage prediction using only 20% historical data, with R2 ranging from 0.9643 to 0.9972 and RMSE/MAE below 0.0296/0.0198. These results indicate that the proposed framework can deliver reliable RUL estimates under data-limited and noisy measurement conditions.

## Full-text entities

- **Genes:** PDGFRB (platelet derived growth factor receptor beta) [NCBI Gene 5159] {aka CD140B, IBGC4, IMF1, JTK12, KOGS, OPDKD}, GAN (gigaxonin) [NCBI Gene 8139] {aka GAN1, GIG, KLHL16}
- **Diseases:** OI (MESH:C566784), injury to (MESH:D014947), RUL (MESH:D000071298), CEEMDAN (MESH:C537734), EMD (MESH:D020389)
- **Chemicals:** CEEMDAN (-), Lithium (MESH:D008094)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943947/full.md

---
Source: https://tomesphere.com/paper/PMC12943947