HybridoNet-Adapt: A domain-adapted framework for accurate lithium-ion battery RUL prediction

Khoa Tran; Bao Huynh; Tri Le; Lam Pham; Vy-Rin Nguyen; Duong Tran Anh; Hung-Cuong Trinh; Zhibin Zhao; Zhibin Zhao; Zhibin Zhao

PMC · DOI:10.1371/journal.pone.0335066·October 31, 2025

HybridoNet-Adapt: A domain-adapted framework for accurate lithium-ion battery RUL prediction

Khoa Tran, Bao Huynh, Tri Le, Lam Pham, Vy-Rin Nguyen, Duong Tran Anh, Hung-Cuong Trinh, Zhibin Zhao, Zhibin Zhao, Zhibin Zhao

PDF

Open Access

TL;DR

This paper introduces HybridoNet-Adapt, a new framework that improves the accuracy of predicting lithium-ion battery life under different conditions.

Contribution

The novel use of Maximum Mean Discrepancy and hybrid prediction with domain-specific predictors enhances RUL prediction under domain shifts.

Findings

01

HybridoNet-Adapt reduces RMSE by up to 152 cycles on battery datasets compared to non-adaptive models.

02

The framework outperforms baselines like XGBoost and Elastic Net in domain-shifted scenarios.

03

Combining LSTM, attention, and Neural ODE blocks with domain adaptation improves robustness.

Abstract

Accurate prediction of the Remaining Useful Life (RUL) of lithium-ion batteries is critical for safe, reliable Battery Health Management in diverse operating conditions. Existing RUL models often fail to generalize when test data diverge from the training distribution. To address this, we introduce HybridoNet-Adapt, a domain-adaptive RUL prediction framework that explicitly bridges the gap between labeled source and unlabeled target domains. During training, we minimize the Maximum Mean Discrepancy (MMD) between feature distributions to learn domain-invariant representations. Simultaneously, we employ two parallel predictors—one tailored to the source domain and one to the target domain—and balance their outputs via two learnable trade-off parameters, enabling the model to dynamically weight domain-specific insights. Our architecture couples this adaptation strategy with LSTM,…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Chemicals1

lithium

Figures12

Click any figure to enlarge with its caption.

Fig 1 — The overall RUL prediction process for Lithium-ion battery cells.

Fig 2 — Architecture of the proposed HybridoNet-Adapt model during training process with domain adaptation.

Fig 3 — Comparison of maximum charge and discharge capacities over cycle life for battery cells.(a) Maximum charge and discharge capacities over charge-discharge cycles for a single battery cell. (b) Maximum discharge capacities over charge-discharge cycles for many battery cells.

Fig 4 — Comparison of RUL prediction performance with and without median filtering.

Fig 5 — Feature contribution analysis: (a) feature importance ranking, and (b) RMSE comparison across feature extraction groups.

Fig 6 — Performance comparison of LSTM-based blocks for feature extractor.Experiment on the testing data of the second dataset.

Fig 7 — Comparison of different feature loss methods.Experiment on the testing data of the second dataset.

Fig 8 — Comparison of RMSE across different number of LSTM layers, hidden dimensions, and dropout configurations.Experiment on the testing data of the second dataset.

Fig 9 — Comparison of RMSE for different NODE discrete time steps (t) and Multihead Attention output time step selections.Experiment on the testing data of the second dataset.

Fig 10 — Comparison of model predictions with observed RUL for Cells 4-5 and 3-1 from the testing data of the second dataset.

Fig 11 — PCA-based comparison of embedding features between HybridoNet-Adapt and DANN from Cell 3-1 the testing data of the second dataset.

Fig 12 — Comparison of our proposed models with existing state-of-the-art methods.(a) Results on the secondary testing data of the first dataset. (b) Results on the testing data of the second dataset. (a) The first dataset. (b) The second dataset.

Equations24

Funding1

—http://dx.doi.org/10.13039/100018950Đại học Tôn Đức Thắng

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Battery Technologies Research · Advancements in Battery Materials · Machine Fault Diagnosis Techniques

Full text

1 Introduction

1.1 Motivations

Lithium-ion batteries (LIBs) [1], renowned for their affordability and high energy density, are extensively utilized [2–5] in electric vehicles (EVs), portable devices, and energy storage stations. The global lithium-ion battery (LIB) market is projected to surpass 170 billion dollars by 2030 [6]. With the wide-ranging adoption of LIBs, interest in battery health management (BHM) has surged within both academia and industry in recent years. In a BHM system, several common and essential techniques are employed, including thermal management [7,8], fault diagnosis/detection [9], state of charge (SOC) [10–12] and state of health (SOH) [13] estimation, remaining useful life (RUL) prediction [14,15], and cycle life early prediction [16–19]. Among these, RUL prediction plays a crucial role in ensuring the proactive maintenance, minimizing downtime, and enhancing the operational efficiency of LIBs over their lifespan. RUL can be assessed based on the number of remaining cycles the battery can undergo before reaching its EOL. RUL prediction falls into three categories: model-based, data-driven, and hybrid approaches.

Model-based approaches often utilize physics-based degradation models, such as the Double Exponential Model (DEM) [20], two-phase degradation models [21], and Markov Models [22], constructed using early-cycle data (200-500 cycles) to forecast the entire battery’s capacity degradation curve. However, relying solely on maximum discharge capacity degradation during early cycles often leads to inaccuracies due to the influence of various factors (current, voltage, temperature, time) and sudden changes in degradation trends [20,21].

Data-driven models predict the RUL of LIBs by analyzing data during current cycles. Techniques like dual-input Deep Neural Networks (DNN) [23], 1D Convolutional Neural Networks (1DCNN) [24], Dense layers [25], Long Short-Term Memory (LSTM) networks [26], and Echo State Networks (ESN) [27] have shown superior performance.

Hybrid approaches combine model-based and data-driven methods to improve RUL prediction. For instance, a hybrid model using the Double Exponential Degradation Model (DEDM) and Gated Recurrent Unit (GRU) network fused with a Bayesian neural network (BNN) offers enhanced predictions [28].

Accurately predicting the RUL of LIBs remains a significant challenge due to the complex, non-linear, and cycle-dependent degradation behavior of battery features. This variability necessitates a highly adaptive prediction model capable of tracking and learning degradation patterns over time throughout charge–discharge cycles. The specific challenges associated with this task will be discussed in the following section.

1.2 Problem statement and potential

As summarized in Table 1, current state-of-the-art studies categorize RUL prediction methods into two primary approaches: historical data-independent methods, which estimate the current RUL based on current and preceding few cycles, and historical data-dependent methods, which leverage early-cycle data to predict the battery’s full lifespan.

Table 1: Overview of RUL prediction methods in LIB research.

Historical data-dependent methods estimate the future capacity trajectory. The EOL cycle index is then determined, typically defined as 70% [33] or 80% [20] of the nominal capacity, and the RUL is calculated as the difference between the current cycle index and the EOL cycle index. These approaches primarily rely on early-cycle data. While historical data-dependent methods can achieve reasonable accuracy in benchmark evaluations [28,38], they face practical limitations such as the unavailability of early-cycle records, varying operational conditions [17] throughout a battery’s lifespan, and challenges in battery repurposing [43]. Therefore, historical data-independent approaches are considered more suitable for real-world scenarios.

Small datasets, like the Oxford Battery dataset (13 cells) and NASA battery datasets (4–34 cells) limit model robustness in real-world failure prediction. In contrast, large datasets such as the TRI dataset (124 cells, fast-charging) and the LHP dataset (77 cells, diverse discharge) provide extensive charge-discharge scenarios, making them well-suited for both training and testing of data-independent models.

Signal preprocessing can be limited by high dimensionality, especially with variational decomposition methods like EMD and VMD, which preserve or expand the original signal size. In contrast, statistical feature extraction methods—such as mean and standard deviation—offer a low-dimensional, efficient alternative that captures essential characteristics, making them ideal for real-time industrial applications.

Model-based and hybrid approaches typically rely on early-cycle data for RUL prediction, yet each battery exhibits unique degradation patterns over its lifespan requiring adaptive data-driven strategies. In data-driven approaches, domain adaptation (DA) techniques such as domain-adversarial neural networks (DANN)[44,45] and generative adversarial networks (GANs)[46] offer effective solutions for transferring degradation patterns from a source domain to improve prediction in the target domain. [31] proposed a transferable RUL prediction method using DA that enforces cycle-consistency of degradation trends across batteries, aligning feature representations to mitigate domain shifts. Their approach improves cross-battery generalization, it relies on comparable degradation levels between source and target domains. This work demonstrates that domain adaptation techniques benefit RUL prediction in the battery domain, highlighting an direction for further exploration. To address these mentioned challenges and leverage the identified potential, our proposed approach is introduced in the next section.

1.3 Main contribution

The main contributions of this work are summarized as follows:

We propose a historical data-independent RUL prediction framework for lithium-ion batteries that relies solely on current and recent cycling data, eliminating the need for early-cycle information. The prediction model integrates advanced deep learning components—including Long Short-Term Memory (LSTM), Multihead Attention, and Neural Ordinary Differential Equations (NODE) blocks—as a powerful feature extractor, along with linear layers in the predictors. Furthermore, a domain adaptation strategy is employed, combining two predictors with trainable trade-off parameters and an MMD-based loss to learn domain-invariant features, thereby enhancing transferability from source to target domains. This strategy improves the generalizability of the prediction model.The framework includes a lightweight yet robust preprocessing pipeline-noise reduction, statistical feature extraction (e.g., mean, standard deviation), and normalization—to improve signal quality and reduce dimensionality for efficient, real-time prediction.Extensive evaluations on the two largest publicly available datasets of A123 APR18650M1A cells [16,23], covering diverse charging and discharging conditions, validate the superior performance of our approach in real-world battery health management.

The remainder of this paper is organized as follows. Sect 2 introduces the preliminaries. Sect 3 presents the proposed method. Sect 4 describes the experiments and discussion. Finally, Sect 5 concludes the paper and outlines future work.

2 Preliminaries

This section presents an overview of the key components of the prediction model architecture, including LSTM, Multihead Attention, and NODE blocks.

2.1 LSTM

The LSTM network [26] is a recurrent architecture designed to mitigate the vanishing gradient problem by introducing gating mechanisms. Its operations at time step t are defined as:

[eqn]

[eqn]

[eqn]

[eqn]

[eqn]

[eqn]

where $[eqn]$ is the input vector at time step t. $[eqn]$ and $[eqn]$ are the hidden and cell states, respectively. $[eqn]$ denote the input, forget, and output gates. $[eqn]$ is the candidate cell state. $[eqn]$ , $[eqn]$ , and $[eqn]$ are trainable weight matrices and bias vectors. $[eqn]$ is the sigmoid activation, $[eqn]$ is the hyperbolic tangent, and $[eqn]$ denotes element-wise multiplication.

2.2 Multihead attention

Multihead Attention [47] is a critical mechanism in Transformer models, enabling the network to attend jointly to information from different subspaces. The basic building block is the scaled dot-product attention:

[eqn]

where $[eqn]$ , $[eqn]$ , and $[eqn]$ denote the query, key, and value matrices, respectively, and dk is the dimensionality of the keys. In a multihead setting, multiple attention heads are computed as

[eqn]

where $[eqn]$ , $[eqn]$ , and $[eqn]$ are trainable projection matrices. Finally, the outputs of all heads are concatenated and linearly transformed:

[eqn]

where $[eqn]$ is a trainable output projection matrix, and h is the number of attention heads.

2.3 NODE

NODE is a framework that extends deep learning architecture by modeling continuous-time dynamics instead of discrete transformations between layers. In NODE, the evolution of a hidden state h(t) is governed by an ordinary differential equation (ODE):

[eqn]

where f is a neural network parameterized by θ. The final state $[eqn]$ is the hidden state at time t, obtained by solving this ODE over a time interval, which provides a flexible and memory-efficient representation.

3 Proposed method

3.1 Overall architecture

Fig 1 illustrates the RUL prediction process for Lithium-ion battery cells. In the data collection phase, Lithium Iron Phosphate (LFP)/graphite cells are monitored to capture voltage, current, and capacity signals for each individual charge-discharge cycle.

The overall RUL prediction process for Lithium-ion battery cells.

Regarding cycle life degradation, the cycle life of a battery is defined as the total number of charge-discharge cycles from the Beginning of Life (BOL) to the End of Life (EOL). The EOL is typically identified when the battery’s maximum capacity in a charge-discharge cycle degrades to 70% [33] or 80% [20] of its nominal capacity. The RUL, expressed in terms of the remaining number of cycles, is computed as:

[eqn]

where $[eqn]$ is the total cycle life of the battery, $[eqn]$ is the number of aging cycles already completed.

In the signal preprocessing phase, the raw signals—voltage, current, and capacity from the most recent charge-discharge cycles—are passed through a noise-reduction filter named median filter [48] to smooth out sudden peaks. The filtered signals are then processed using feature extraction methods, including mean, standard deviation (Std), minimum (Min), maximum (Max), variance (Var), and median (Med) [49,50]. The extracted features for each cycle are represented as

[eqn]

where the 3 correspond to the three signal types (voltage, current, and capacity), and the 6 represent the extracted features (Mean, Std, Min, Max, Var, Med). Each input sample to the prediction model consists of 10 selected cycles, uniformly sampled from a 30-cycle window (i.e., one cycle every three cycles) [29]. The input sample is represented as

[eqn]

Thus, the shape of the total target input data after the feature extraction step becomes

[eqn]

where N denotes the number of samples. During normalization, a MinMaxScaler is fitted and applied to scale each feature across all time steps and samples between 0 and 1.

In the prediction phase, the prediction model, named HybridoNet-Adapt, maps the target input $[eqn]$ to the predicted RUL $[eqn]$ . The details of the proposed RUL prediction model are presented in the following section.

3.2 HybridoNet-Adapt: A proposed RUL prediction model with novel domain adaptation

As shown in Fig 2, HybridoNet-Adapt is composed of four key components: the source predictor $[eqn]$ , the target predictor $[eqn]$ , and the feature extractor GF, which is equipped with a DA technique to bridge the distribution gap between the source and target domains.

Architecture of the proposed HybridoNet-Adapt model during training process with domain adaptation.

The feature extractor integrates a LSTM (Sect 2.1), a Multihead Attention mechanism (Sect 2.2), and a NODE block (Sect 2.3). The NODE block models the hidden state h(t), which evolves continuously over time according to the following ODE: $[eqn]$ where h(t) denotes the hidden state at time t, f is a trainable function parameterized by $[eqn]$ , and t represents the continuous time variable. In our implementation, f is a single linear layer to strike a balance between performance and computational efficiency. The initial condition for the NODE block is given by h(t0), and the final transformed state h(t1) is obtained by solving the ODE over the time interval $[eqn]$ :

[eqn]

In our experiments, the time bounds are set to t0 = 0 and t1 = 1, based on empirical results (see Fig 9). The function h(t) thus represents the dynamic trajectory of the hidden state under continuous transformation, enabling the model to capture nuanced temporal dependencies.

The target and source predictions in HybridoNet-Adapt are respectively computed as follows:

[eqn]

[eqn]

where $[eqn]$ and $[eqn]$ are learnable trade-off parameters that balance the contributions from the source and target predictors. The outputs $[eqn]$ and $[eqn]$ denote the target and source predictions, respectively. The source predictor $[eqn]$ and the feature extractor GF are trained using both source and target data, enabling the model to transfer domain-invariant features from the source domain to the target domain. This promotes robust prediction performance in the target domain, especially in scenarios where the model would otherwise underperform if trained solely on target data.

The hyperparameters of the proposed HybridoNet-Adapt are summarized in Table 2. LayerNorm denotes layer normalization [51], and Dropout refers to a dropout layer [52] with a rate of 0.1. FC represents a fully connected layer [53], while in the NODE block, h(t) is parameterized by an FC layer. The learnable trade-off parameters $[eqn]$ and $[eqn]$ are initialized at 0.5 and updated during training. ReLU refers to the rectified linear unit activation function [54], Sigmoid refers to the sigmoid activation function [55], and BN denotes batch normalization [56]. The columns Input Dim. and Output Dim. specify the dimensionality of the inputs and outputs of each module, respectively. These values are determined empirically, as discussed in Sect 4.7. Note: The output from the Multihead Attention block is taken from the last step along the time dimension.

Table 2: Hyperparameters of HybridoNet-Adapt.

To optimize the model, we employ a domain adaptation strategy that combines two loss functions: the mean squared error (MSE) loss, $[eqn]$ , used for regression targets $[eqn]$ and $[eqn]$ ; and the maximum mean discrepancy (MMD) [57] loss, $[eqn]$ , which encourages alignment between the feature distributions extracted from the source and target domains. The total loss function is defined as:

[eqn]

where $[eqn]$ and $[eqn]$ are the input samples from the source and target domains, respectively, and $[eqn]$ and $[eqn]$ are their corresponding RUL labels. The hyperparameter $[eqn]$ controls the weight of the MMD loss in the overall objective. The MMD loss quantifies the discrepancy between the distributions of source and target feature embeddings. Given extracted feature sets $[eqn]$ from the source domain $[eqn]$ and $[eqn]$ from the target domain $[eqn]$ , it is defined as:

[eqn]

where $[eqn]$ is a kernel function, commonly chosen as the Gaussian kernel: $[eqn]$ with σ as the kernel bandwidth parameter. n and m denote the number of training samples from source and target domains, respectively. The MSE loss is used to optimize the regression outputs by penalizing the squared differences between predicted and ground truth values. It is defined as:

[eqn]

where $[eqn]$ denotes the predicted value, and Yi is the corresponding label for the RUL. u is the number of training samples.

In the following section, a series of experiments are conducted to identify the optimal configuration of HybridoNet-Adapt and to demonstrate its superiority over state-of-the-art methods.

For validating the proposed domain adaptation used in HybridoNet-Adapt, we construct a supervised learning model named HybridoNet, consisting of the target predictor $[eqn]$ and the feature extractor GF. This model is trained solely on labeled target data using the MSE loss function. By comparing between HybridoNet (without domain adaptation) and HybridoNet-Adapt (with domain adaptation), we highlight the performance improvements achieved through the incorporation of our proposed Domain Adaptation technique.

4 Experiments and discussion

4.1 Experimental setup

Our proposed RUL model is implemented using the PyTorch framework and optimized using the AdamW algorithm [58] to minimize the respective loss functions. All experiments are conducted on an NVIDIA A100 GPU with 80GB of memory. Each experiment is trained for 10 epochs with a batch size of 128 and a fixed learning rate of 0.0005. To reduce variability in the training process, each experiment is repeated 10 times, and the final prediction is computed as the average of these runs. The training data is divided into 90% for training and 10% for validation, with the model selected based on the lowest RMSE on the validation set (see Sect 4.3). The weighting factor λ in Eq 18 is dynamically adjusted during training using the following schedule [59]:

[eqn]

where $[eqn]$ denotes the current training epoch, and $[eqn]$ is the total number of training epochs.

4.2 Datasets

4.2.1 First dataset: Varied fast-charging conditions, with consistent discharging conditions.

The first dataset, referred to as the TRI dataset [16], encompasses a detailed study of 124 LFP/graphite lithium-ion batteries. Each LIB in the dataset has a nominal capacity of 1.1 Ah and a nominal voltage of 3.3 V. The cycle life span of these batteries ranges from 150 to 2,300 cycles, showcasing a wide spectrum of longevity. In terms of operational conditions, all LIBs were subjected to uniform discharge protocols. Specifically, they were discharged at a constant current rate of 4 C until the voltage dropped to 2 V, followed by a constant voltage discharge at 2 V until the current diminished to C/50. The LIBs were charged at rates between 3.6 C and 6 C, under a controlled temperature of 30^°^C within an environmental chamber. The dataset contains approximately 96,700 cycles, making it one of the largest datasets to consider various fast-charging protocols. The dataset is divided into three distinct parts: a training set with 41 LIBs, a primary test set with 43 LIBs, and a secondary test set comprising 40 LIBs.

4.2.2 Second dataset: Varied discharge conditions, with consistent fast-charging conditions.

The second dataset, referred as the LHP dataset [29], was developed through a battery degradation experiment involving 77 cells (LFP/graphite A123 APR18650M1A) with a nominal capacity of 1.1 Ah and a nominal voltage of 3.3 V. Each of the 77 cells was subjected to a unique multi-stage discharge protocol, while maintaining an identical fast-charging protocol for all cells. The experiment was conducted in two thermostatic chambers at a controlled temperature of 30^°^C. The dataset encompasses a total of 146,122 discharge cycles, making it one of the largest datasets to consider various discharge protocols. The cells exhibit a cycle life ranging from 1,100 to 2,700 cycles, with an average of 1,898 cycles and a standard deviation of 387 cycles. The discharge capacity as a function of cycle number reveals a wide distribution of cycle lives. The dataset is divided into two distinct parts: a training set with 55 LIBs, and a test set with 22 LIBs.

4.3 Evaluation metrics

To evaluate RUL prediction, we use Root Mean Square Error (RMSE) [60], R-squared (R^2^) [38,61], and Mean Absolute Percentage Error (MAPE) [62]. These are calculated as follows:

[eqn]

[eqn]

[eqn]

Where yi and $[eqn]$ are the observed and predicted RUL, respectively. y is cycle life.The smaller the RMSE and MAPE, and the larger the R^2^, the better the performance.

4.4 Signal analysis

Fig 3a and 3b analyze battery cycle life. Fig 3a tracks an individual cell’s charge and discharge capacities, marking EOL when the maximum capacity degrades to 80% of nominal capacity. Fig 3b compares cycle life across cells, revealing significant variation in discharge capacity. This variability challenges prediction models for RUL, emphasizing the need for accurate and adaptable RUL predictions for BHM systems.

Comparison of maximum charge and discharge capacities over cycle life for battery cells.(a) Maximum charge and discharge capacities over charge-discharge cycles for a single battery cell. (b) Maximum discharge capacities over charge-discharge cycles for many battery cells.

4.5 Signal preprocessing

Before feature extraction step in the signal preprocessing phase (as mentioned in Sect 3), the raw signals exhibit sudden peaks and fluctuations, resembling noise. Smoothing the time-series data can help reduce noise and enhance the key characteristics of the signal. To achieve this, a median filtering method is applied to eliminate abrupt peaks in the signals before feature extraction. As a result, the application of median filtering improves overall model performance. The filtered data leads to better RMSE, R^2^, and MAPE (%) values compared to the unfiltered data, as illustrated in Fig 4.

Comparison of RUL prediction performance with and without median filtering.

Fig 5a presents the feature importance ranking derived from XGBoost [63] trained and evaluated on the second dataset. The analysis considers 19 common feature extraction methods: 75th, 90th, 50th, 25th, and 10th percentiles; Maximum; Range; Energy; Mean; Interquartile Range (IQR); Median; Skewness; Standard Deviation (Std); Kurtosis; Root Mean Square (RMS); Minimum; Variance; Zero-Crossing Rate; and Autocorrelation (implemented using NumPy [64] and SciPy [65]). Among these, the 75th, 90th, and 10th percentiles demonstrate the highest contribution. We group the feature extraction methods into five categories: three based on high importance (Groups 1–3), one consisting of fundamental statistics (Group 4), and one hybrid group (Group 5):

Group 1: Percentiles 75th, 90th, 10th, Range, IQR.Group 2: Group 1 + Maximum, Percentiles 25th, Skewness, Kurtosis, Standard Deviation (Std).Group 3: Group 2 + Energy, Median, RMS, Mean, Variance, Minimum.Group 4: Mean, Std, Min, Max, Variance, Median.Group 5: Group 4 + Percentiles 75th, 90th, 10th.

Fig 5b shows that although Group 1 and 2 contain features with the highest importance scores, they do not yield strong overall prediction performance. Group 4 (fundamental statistics), which is selected as the feature extraction step during the Signal Preprocessing phase of the proposed framework, combines both high- and low-importance features but achieves the best performance, with an RMSE of 181.45, significantly outperforming others.

Feature contribution analysis: (a) feature importance ranking, and (b) RMSE comparison across feature extraction groups.

4.6 Feature extractor

The feature extractor is progressively developed, starting with an LSTM architecture and sequentially integrating Multihead Attention (MA) and a NODE block. To evaluate the effectiveness of each component, we assess the performance of HybridoNet-Adapt at different stages. With each addition as shown in Fig 6, the model’s predictive capability improves. Ultimately, HybridoNet-Adapt achieves an RMSE of 166.33, an R^2^ score of 0.86, and a MAPE of 7.44%, demonstrating its superior performance.

Performance comparison of LSTM-based blocks for feature extractor.Experiment on the testing data of the second dataset.

4.7 HybridoNet-Adapt with domain adaption

HybridoNet-Adapt is evaluated with various feature loss functions, including CORAL Loss, Domain Loss [44], MMD, as well as combinations such as MMD with Domain Loss, and MMD with Domain Loss and CORAL Loss, as shown in Fig 7. The results indicate that using only MMD as the feature loss function yields the best performance, achieving an RMSE of 160.05.

Comparison of different feature loss methods.Experiment on the testing data of the second dataset.

To determine the optimal hyperparameters, including hidden dimension of all layers, the number of recurrent LSTM layers, and the dropout rate, 27 experiments were conducted. The results are presented in Fig 8. In the graph, L represents the number of recurrent layers, H denotes the hidden dimension size. Based on RMSE score, the best performance is achieved with 2 recurrent LSTM layers, a hidden dimension of 64, and a dropout rate of 0.1.

Comparison of RMSE across different number of LSTM layers, hidden dimensions, and dropout configurations.Experiment on the testing data of the second dataset.

To identify the optimal time step in the sequence dimension for both Multihead Attention and NODE outputs, a comprehensive evaluation was conducted. Various NODE output time steps ranging from 2 to 6 were tested, along with different Multihead Attention output time step selections, including the last, the second-to-last, and the mean time step. As shown in Fig 9, the best performance was achieved when using the second-to-last time step of the Multihead Attention output and a NODE output time step of 2.

Comparison of RMSE for different NODE discrete time steps (t) and Multihead Attention output time step selections.Experiment on the testing data of the second dataset.

The proposed HybridoNet-Adapt model is systematically evaluated under various scenarios by experimenting with four different target sets, each derived from the training data of the second dataset. The source data is the training data from the first dataset. Below are four target groups of battery cells selected from the training data of the second dataset. These groups are carefully formed to ensure each set represents a diverse range of battery performance. For instance, Group 1 includes both high-cycle cells (e.g., 2-2 with 2,651 cycles) and low-cycle cells (e.g., 1-6 with 1,143 cycles), ensuring a comprehensive representation of aging behaviors.

Group 1: 1-3 (1,858 cycles), 1-6 (1,143 cycles), 2-2 (2,651 cycles), 2-6 (1,572 cycles), 3-2 (2,283 cycles), 3-6 (2,491 cycles), 4-3 (1,142 cycles), 5-4 (1,962 cycles)Group 2: 1-5 (1,971 cycles), 1-8 (2,285 cycles), 2-4 (1,499 cycles), 2-7 (2,202 cycles), 3-3 (1,649 cycles), 3-7 (2,479 cycles), 4-4 (1,491 cycles), 5-5 (1,583 cycles)Group 3: 2-8 (1,481 cycles), 3-4 (1,766 cycles), 3-8 (2,342 cycles), 4-1 (2,217 cycles), 4-7 (2,216 cycles), 5-1 (2,507 cycles), 5-6 (2,460 cycles), 6-3 (1,804 cycles)Group 4: 4-8 (1,706 cycles), 5-2 (1,926 cycles), 5-7 (1,448 cycles), 6-4 (1,717 cycles), 6-5 (2,178 cycles), 7-2 (2,030 cycles), 7-7 (1,685 cycles), 8-2 (2,041 cycles)All: All battery cells from the training set of the second dataset.

Table 3 shows that HybridoNet-Adapt outperforms both HybridoNet (without DA) and DANN (with DA) across all groups. It achieves the lowest RMSE and MAPE while maintaining the highest R^2^, demonstrating better generalization. For instance, in Group 1, HybridoNet-Adapt reduces RMSE from 368.99 to 356.46 and improves R^2^ from 0.21 to 0.30. On the full dataset, it achieves the best RMSE of 153.24 and R^2^ of 0.88, significantly outperforming DANN, which shows degraded performance (RMSE = 835.35, R^2^ = −1.37). DANN struggles with large variations in battery aging, while HybridoNet-Adapt effectively adapts to different distributions, leading to consistently better predictions.

Table 3: Comparison of HybridoNet, DANN, and HybridoNet-Adapt across four target data groups from the testing data of the second dataset.

Table 4 presents the evaluation metrics for RUL prediction on the test data from the second dataset, comparing Elastic Net, A1, A2 of paper [29] (see Table S4), with our HybridoNet, and HybridoNet-Adapt methods. The results indicate that HybridoNet-Adapt achieves competitive RMSE values, particularly in cases where Elastic Net exhibits high errors. The R^2^ values show that HybridoNet-Adapt generally improves predictive accuracy compared to the baseline methods. Additionally, MAPE results suggest that HybridoNet-Adapt provides more stable and reliable predictions, especially in challenging scenarios. Overall, these findings demonstrate the potential of HybridoNet-Adapt for enhanced RUL estimation.

Table 4: Evaluation metrics for RUL prediction performance using existing Elastic Net (Ela), A1, A2 results of paper [29] (see Table S4), along with HybridoNet (H), and our proposed HybridoNet-Adapt (H-Adapt).Experiment on the testing data of the second dataset.

Fig 10 illustrates the RUL predictions of XGBoost, HybridoNet, HybridoNet-Adapt, and DANN, compared to the true (observed) RUL for Cell 4-5 and Cell 3-1 in the testing set of the second dataset. Among all methods, HybridoNet-Adapt demonstrates the closest alignment with the observed RUL, highlighting its superior predictive accuracy. This improvement is attributed to HybridoNet-Adapt’s ability to align feature representations from the source domain to the target domain, as shown in Fig 11. By effectively increasing the amount of target-relevant data through our domain adaptation technique, HybridoNet-Adapt enhances robustness, making it more adaptable to diverse real-world battery degradation scenarios.

Comparison of model predictions with observed RUL for Cells 4-5 and 3-1 from the testing data of the second dataset.

PCA-based comparison of embedding features between HybridoNet-Adapt and DANN from Cell 3-1 the testing data of the second dataset.

4.8 Comparison with state-of-the-art methods

Fig 12a presents a performance comparison of different models on the secondary testing data from the first dataset. The Multi-Time Scale Feature Extraction Hybrid (MSFEH) model [42], XGBoost, and HybridoNet were trained using the training data from the first dataset. HybridoNet-Adapt, in contrast, was trained with the training data of the second dataset as the source input and the training data of the first dataset as the target input. HybridoNet-Adapt achieves the best results, with the lowest RMSE (146.52), demonstrating its superior predictive accuracy through domain adaptation. Moreover, HybridoNet outperforms both XGBoost and MSFEH, highlighting the effectiveness of deep learning–based approaches. The additional improvements achieved by HybridoNet-Adapt further validate the benefits of domain adaptation in enhancing RUL prediction performance. It should be noted that in the MSFEH paper [42], the MAPE formula differs from the one used in our work; therefore, we report only the RMSE comparison with MSFEH.

Comparison of our proposed models with existing state-of-the-art methods.(a) Results on the secondary testing data of the first dataset. (b) Results on the testing data of the second dataset. (a) The first dataset. (b) The second dataset.

Fig 12b presents a comparison of our HybridoNet and HybridoNet-Adapt models with state-of-the-art methods, including Elastic Net [29], $[eqn]$ [29], $[eqn]$ [29], Ridge Linear [23], Random Forest [23], and Structural Pruning [66], evaluated on the testing data from the second dataset. HybridoNet-Adapt, was trained with the training data of the first dataset as the source input and the training data of the second dataset as the target input, whereas the other methods were trained solely on the training data from the second dataset. The results demonstrate that HybridoNet-Adapt achieves the lowest RMSE (153.24), outperforming all other approaches. This highlights the effectiveness of our proposed method in enhancing predictive performance. Overall, HybridoNet-Adapt consistently outperforms across large datasets with diverse charging and discharging profiles.

In future work, we plan to explore Physics-Informed Neural Networks (PINNs) as a approach to enhance interpretability and physical consistency. By embedding governing equations and domain-specific constraints directly into the learning process, PINNs can reduce reliance on purely data-driven correlations while improving extrapolation to unseen conditions. Such a framework would allow the model to not only achieve strong predictive performance but also provide insights grounded in physical principles, thereby addressing the gap between predictive accuracy and practical interpretability.

5 Conclusion

In this paper, we proposed HybridoNet-Adapt, a novel domain-adaptive framework for accurate Remaining Useful Life (RUL) prediction of lithium-ion batteries. Our approach addresses the challenge of distribution shift between training and testing data by leveraging domain adaptation techniques, specifically Maximum Mean Discrepancy (MMD), to align feature representations between source and target domains. By integrating a hybrid prediction mechanism with trainable trade-off parameters, the model effectively balances contributions from both domain-specific predictors. The proposed architecture combines LSTM, Multi-head Attention, and NODE blocks within a feature extractor, enabling the model to capture both temporal and dynamic characteristics of battery degradation. Extensive experiments on two large-scale benchmark datasets demonstrate that HybridoNet-Adapt consistently outperforms state-of-the-art baselines, such as XGBoost and Elastic Net, as well as state-of-the-art deep learning models like Structural Pruning and Multi-Time Scale Feature Extraction Hybrid (MSFEH) model, affirming its effectiveness in the RUL prediction task. For future work, we plan to enhance model generalization through Physics-Informed Neural Networks, and explore multi-modal data integration to improve scalability and robustness across diverse BHM applications. We will investigate the integration of multi-modal data to enhance scalability and robustness across a wide range of BHM applications.

Preprint availability

A preprint of this manuscript is available at: https://arxiv.org/pdf/2503.21392.

Contact information

For access to the code and further information about this proposed system, please contact AIWARE Limited Company at: https://aiware.website/Contact.

Bibliography60

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Goodenough JB, Park K-S. The Li-ion rechargeable battery: a perspective. J Am Chem Soc. 2013;135(4):1167–76. doi: 10.1021/ja 3091438 23294028 · doi ↗ · pubmed ↗
2Guo R, Wang F, Akbar Rhamdhani M, Xu Y, Shen W. Managing the surge: a comprehensive review of the entire disposal framework for retired lithium-ion batteries from electric vehicles. Journal of Energy Chemistry. 2024;92:648–80. doi: 10.1016/j.jechem.2024.01.055 · doi ↗
3Dunn B, Kamath H, Tarascon J-M. Electrical energy storage for the grid: a battery of choices. Science. 2011;334(6058):928–35. doi: 10.1126/science.1212741 22096188 · doi ↗ · pubmed ↗
4Zhang J, Fan T, Yuan S, Chang C, Wang K, Song Z, et al. Patent-based technological developments and surfactants application of lithium-ion batteries fire-extinguishing agent. Journal of Energy Chemistry. 2024;88:39–63. doi: 10.1016/j.jechem.2023.08.037 · doi ↗
5Larcher D, Tarascon J-M. Towards greener and more sustainable batteries for electrical energy storage. Nat Chem. 2015;7(1):19–29. doi: 10.1038/nchem.2085 25515886 · doi ↗ · pubmed ↗
6Karimi G, Li X. Thermal management of lithium-ion batteries for electric vehicles. Int J Energy Res. 2012;37(1):13–24. doi: 10.1002/er.1956 · doi ↗
7Zhang G, Wei X, Chen S, Wei G, Zhu J, Wang X, et al. Research on the impact of high-temperature aging on the thermal safety of lithium-ion batteries. Journal of Energy Chemistry. 2023;87:378–89. doi: 10.1016/j.jechem.2023.08.040 · doi ↗
8Du X, Meng J, Amirat Y, Gao F, Benbouzid M. Exploring impedance spectrum for lithium-ion batteries diagnosis and prognosis: a comprehensive review. Journal of Energy Chemistry. 2024;95:464–83. doi: 10.1016/j.jechem.2024.04.005 · doi ↗