Fault Detection and Monitoring in Induction Machines Using Data-Driven Model Drift Detection
Abdiel Ricaldi-Morales, Camilo Ramírez, Jorge F. Silva, Manuel A. Duarte-Mermoud, Marcos E. Orchard

TL;DR
This paper introduces a new data-driven method for detecting early stator faults in induction motors using statistical tests, without needing labeled data or extra sensors.
Contribution
A novel fault detection framework using the Residual Information Value (RIV) principle and integration into existing Variable Speed Drives (VSDs) without manual data collection.
Findings
The proposed method detects 1% incipient faults in about 61 ms with high accuracy.
It achieves better statistical separability and lower false alarm rates than traditional methods.
The framework is validated on an industrial test bench and is suitable for real-world predictive maintenance.
Abstract
Stator short-circuit faults (SSCFs) account for a significant portion of induction motor failures, yet their early detection remains a challenge in industrial environments where labeled fault data is scarce and installing additional sensors is often impractical. This paper proposes a novel, data-driven fault detection and diagnosis framework grounded in the Residual Information Value (RIV) principle to overcome reliability limitations of traditional spectral and residual energy methods. By redefining fault detection as a statistical test of independence between control inputs (voltages) and current residuals, the proposed method identifies incipient faults as model drifts without relying on prior knowledge of fault distributions. A key contribution of this work is the seamless integration of the diagnostic scheme into standard Variable Speed Drives (VSDs): the healthy nominal model (a…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —National Agency for Research and Development (ANID) through the Scholarship Program “DOCTORADO BECAS CHILE 2023”
- —Advanced Center for Electrical and Electronic Engineering (AC3E)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Fault Diagnosis Techniques · Sensorless Control of Electric Motors · Multilevel Inverters and Converters
1. Introduction
Induction motors are among the most critical assets in modern industry due to their robustness, relatively low cost, and simple construction, which have led to their widespread adoption across a wide range of applications [1,2,3]. Given their central role in industrial processes, the reliability of these machines is of paramount importance. Faults in induction motors not only reduce efficiency but can also trigger unplanned shutdowns, with severe economic consequences in the case of catastrophic failures [4,5]. For this reason, predictive maintenance strategies that enable the early detection of incipient faults are essential, as they allow corrective actions before faults evolve into severe or irreversible damage. Therefore, predictive maintenance strategies are essential, as they enable early detection and timely corrective actions before faults evolve into severe or irreversible damage.
Induction motor failures are typically classified into stator, rotor, bearing, and miscellaneous faults, with stator faults (SFs) drawing particular attention due to their rising incidence, representing more than 30% of induction motor failures [5,6,7]. Such statistics highlight the importance of developing reliable diagnostic techniques for stator-related faults, especially under realistic operating conditions where voltage imbalance, load variability, and non-Gaussian disturbances are frequently present. In this context, the capability to detect faults across diverse operating regimes—including light loading or variable speeds—is essential. Such faults are technically defined as “incipient” when they involve a limited number of shorted turns (typically less than 3% of the phase winding) [4]. Unlike severe failures, incipient faults do not immediately trigger conventional protection systems; instead, they generate localized hotspots that progressively degrade the insulation, inevitably leading to catastrophic breakdown if left unaddressed [1].
Recent research on induction motor fault detection (FD) has primarily advanced along two methodological directions, data-driven and model-based approaches, with hybrid techniques also emerging in recent years. Data-driven methods rely exclusively on signal measurements and employ statistical or machine learning tools to infer the health state of the machine, without requiring a detailed physical model of its dynamics. Conversely, model-based approaches exploit mathematical representations of the motor—such as equivalent circuits or state-space models—to identify discrepancies between expected and observed behavior. Finally, hybrid approaches combine signal-driven indicators with theoretical models to enhance robustness and interpretability.
In the realm of data-driven strategies, machine learning has played a transformative role in fault detection across many fields, including induction motor diagnosis [4,5,6,7,8,9,10,11,12,13]. Surveys such as [4,5] provide comprehensive overviews of these techniques, most of which rely on artificial neural networks (ANNs), recurrent neural networks (RNNs), gated recurrent units (GRUs), long short-term memory (LSTM) networks, and convolutional neural networks (CNNs) and their variants. In particular, ref. [4] highlights diagnostic methods that monitor mainly current signals (and occasionally voltage) under both line-fed induction motors and variable-speed drives (VSD), covering stator faults and other failures. Meanwhile, ref. [5] specifically addresses induction motors controlled by direct torque control (DTC), emphasizing the challenges posed by stator-related faults and their effect on system reliability.Crucially, it highlights that the closed-loop control action tends to compensate for the fault effects, masking the symptoms and making detection significantly more challenging than in open-loop systems.
Several works have proposed machine learning schemes tailored for different induction motor operating conditions. For instance, in the context of direct field-oriented control (DFOC) drives, ref. [8] introduced an integrated method combining hierarchical CNNs with support vector machines (SVMs), optimized by particle swarm optimization (PSO), where current signals are transformed into images for incipient fault detection. Similarly, for induction motors under open-loop V/f control, ref. [6] developed a CNN-based FD algorithm complemented by encoder measurements. Addressing grid-connected induction motors, ref. [9] explored two composite architectures—CNN-LSTM and CNN-GRU—under balanced and unbalanced supply conditions.
A prominent trend involves image-based transformations of current signals. For example, ref. [11] employed CNNs fed with images from phasor magnitudes of fundamental components. Along the same line, ref. [12] proposed a recurrence plot CNN (RPCNNet) that encodes three-phase currents into RGB recurrence plots after Park transformation. In a more advanced control setting, ref. [7] analyzed an induction motor controlled by model predictive control (MPC) using RNN and CNN classifiers. On a different complexity scale, ref. [13] demonstrated a simpler yet effective strategy using a k-NN classifier on PCA-reduced signals, implemented in real-time on an FPGA.
Regarding hybrid approaches—which integrate the aforementioned signal-driven indicators or theoretical models with machine learning—distinct strategies have been proposed to enhance diagnostic performance [1,2,3,14,15,16,17]. A common strategy in this domain is a two-stage process of feature extraction followed by classification.
For instance, refs. [1,2] adopt this two-stage strategy using a mutual information estimator between delayed stator currents as a feature, classified with decision trees (DTs) or multilayer perceptrons (MLPs). However, these works do not establish a rigorous theoretical connection between mutual information and the physical fault mechanism. Both approaches were tested in line-fed induction motors, with [2] also demonstrating feasibility in industrial environments using a TMS320F28388D microcontroller. Similarly, ref. [3] presented a real-time method for a VSD-fed induction motor using Discrete Wavelet Transform (DWT) and the L2 norm of statistical parameters for feature extraction, followed by an SVM for classification.
Other feature-engineering approaches include [14], which combined Fast Fourier Transform (FFT), wavelet transform, and perceptual linear prediction, and [15], which proposed features based on spectral peak detection and band power ratios. A more recent approach, ref. [16], introduced a multi-scale active noise cancellation network (ANC-Net). Finally, illustrating the integration of physical principles, ref. [17] leveraged prior knowledge by integrating a diagnostic model with a machine learning technique for FD in a VSD-controlled induction motor.
Despite the high accuracy and ability to model complex fault signatures demonstrated by these data-driven and hybrid strategies, significant challenges remain for their widespread industrial adoption. Regarding deployment in real-world systems, while some studies have successfully implemented these algorithms on embedded platforms such as FPGAs [13] or high-performance microcontrollers [2] for real-time monitoring, the dominant workflow relies on offline training using extensive historical datasets. However, a common and critical limitation persists across most of the reviewed methods: their strong dependence on large amounts of labeled data representing both healthy and, most problematically, faulty conditions. As explicitly noted in [4], despite the good results reported, these methodologies require a “very large and diverse database.” Furthermore, ref. [5] emphasizes that acquiring such experimental data across different load conditions and failure levels is “extremely arduous in practical terms.”
In practice, industrial environments rarely provide access to such comprehensive datasets, as motors are not deliberately operated in degraded states. Consequently, most data-driven methods are trained on data derived from laboratory test benches under controlled conditions. When transferred to dynamic industrial environments, the discrepancy in data distribution—a challenge technically known as domain shift—often leads to performance degradation [5,18]. This fundamental limitation highlights the pressing need for novel approaches that can reduce dependency on faulty data and establish stronger theoretical links between the extracted features and the underlying physical fault mechanisms.
Conversely, following the model-based paradigm, a substantial body of research has focused on model-based and specialized signal analysis strategies for FD [19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35]. A prominent strategy involves residual analysis, where deviations between measured signals and a model-based observer are monitored. In this domain, recent theoretical frameworks have formalized the Residual Information Value (RIV) [19] as a rigorous statistical metric to quantify these deviations based on information theory, characterizing faults as model drifts. In the specific context of induction machines, for instance, ref. [20] proposes a residual spectrum analysis based on a state-space model, while digital twin approaches [21] and other methods [22] also calculate fault indices from this discrepancy. This concept is central to FD within advanced control frameworks like Model Predictive Control (MPC) [23,24]. However, a critical limitation of these methods is their high dependency on accurate model parameters ( , etc.), often requiring full motor knowledge [23], complex online identification routines [25], or specialized estimators like MRAS [26].
Alongside these observer-based methods, specialized signal-processing techniques focus on extracting specific fault signatures derived from physical principles. This includes classical harmonic analysis via FFT [27], time-frequency analysis using discrete wavelet transforms [28], and the analysis of sequence components [29,30]. Other strategies analyze specific control-related signals, such as PWM-induced common-mode voltage (CMV) sidebands [31] or internal signals from the DFOC controller [32]. While often effective, these methods can be sensitive to operational variability and may require deep domain expertise to select and interpret the relevant signal features.
However, a significant practical limitation shared by many of these approaches is their reliance on additional or non-standard sensors. For example, some methods require dedicated magnetic flux sensors mounted on the machine [33,34], which are rarely available in industrial setups. Others employ external antennas to measure radiated electromagnetic fields [35], dedicated torque sensors [24], or additional voltage sensors [30]. Even methods integrated into VSDs may require feedback from encoders [23,32] or current/voltage measurements at multiple points [29], increasing system cost, complexity, and potential points of failure.
These works, while demonstrating high sensitivity, highlight a clear trade-off: detection robustness is often achieved at the cost of requiring additional sensors [24,30,33,35] or precise, detailed motor parameters [20,23,26]. Both of these requirements represent significant barriers to practical adoption in diverse industrial environments.
In light of these discussed limitations—specifically, the dependency on extensive fault data in data-driven methods and the need for additional sensors or precise parameters in model-based approaches—this work introduces a novel fault detection framework based on the RIV principle, specifically targeting the challenging scenario of closed-loop operation where fault signatures are attenuated by the controller. By rigorously quantifying the information shared between the supply voltages and the current residuals, the proposed method detects faults as statistical model drifts, addressing the data scarcity and sensor cost barriers simultaneously. The main contributions are summarized as follows:
- A Novel RIV-Based Fault Detection and Phase Discrimination Framework: We propose a diagnostic scheme that adapts the theoretical RIV concept [19] for induction machines. This method estimates the mutual information between the input vector (stator voltages) and the residual vector (the difference between measured and estimated stator currents). By characterizing faults as statistical changes in this dependency, the framework achieves robust detection and identifies the faulty phase without assuming specific fault distributions or requiring labeled fault data.
- Experimental Validation of an Industry-Ready Strategy: The proposed framework is rigorously validated on a laboratory test bench driven by a standard industrial VSD. Addressing the implementation gap identified in the literature, this validation demonstrates the method’s applicability to realistic industrial scenarios by utilizing a self-commissioning routine for the nominal model generation. This approach allows for the diagnostic system to be deployed on loaded motors without mechanical decoupling or offline tests, bridging the gap between the theoretical RIV concept and practical, non-intrusive industrial implementation.
The remainder of this paper is organized as follows. Section 2 presents the theoretical background, including the motor modeling and sensorless control framework, the mutual information formulation, and the model drift–based fault detection concept. Section 3 details the proposed methodology and its integration into a unified monitoring architecture. Section 4 reports the experimental validation on an industrial VSD platform, and Section 5 concludes the paper with final remarks and perspectives for future work.
2. Background
2.1. Fault Detection as Statistical Model Drift
The recent work of Ramírez et al. [19] establishes a rigorous connection between fault detection and the detection of model drift, i.e., deviations in the deterministic input-output relationship of a system. This framework provides a powerful, distribution-free tool for fault detection.
2.1.1. Additive Noise Model (ANM) Formulation
Let and denote the input and output variables of a system. An additive noise model (ANM) is defined as
where is the deterministic model, W is a noise variable independent of X, and is the noise model. Under this structure, any change in the deterministic function are defined as model drifts. Since faults correspond to undesired alterations in the system’s physical dynamics, they can be equivalently characterized as the presence of model drift.
2.1.2. Fault Detection as Hypothesis Testing
Let be the nominal model of the system (whose data-driven construction is detailed later in Section 3). The core idea is to test whether the real-time system behavior, denoted by an unknown function , matches the nominal model . This is formalized as the following hypothesis test:
A fundamental result from [19] establishes a direct equivalence between this test and the statistical independence of the input and the residual. If we define the residual as , then:
This equivalence holds because under , the residual is just the system noise, which is independent of the input X. Conversely, under , the residual contains a model-mismatch term , which makes R statistically dependent on X.
Therefore, the complex engineering problem of fault detection is rigorously transformed into a statistical problem: testing for independence between the input X and the residual R. To operationalize this theoretical test, we adopt the RIV framework. In the specific context of induction motor diagnostics, as established in the subsequent sections, the input variable X corresponds to the stator voltage vector and the output variable Y to the stator current vector.
2.2. The Residual Information Value (RIV) Detector
To implement the hypothesis test in (3), we adopt the RIV method introduced in [19]. The RIV detector operationalizes this test by using the mutual information (MI) as its test statistic, since if and only if X and R are independent.
The RIV detection procedure, applied to a batch of n new samples, is as follows:
- Collect data: Acquire n real-time input-output samples from the drive.
- Compute residuals: Calculate the residuals using the nominal model (detailed in Section 2.4):
- Estimate MI: Compute the MI value using the distribution-free estimator described in Section 2.3 on the sample set .
- Make decision: Compare the estimated MI to a predefined threshold , which determines the system’s False Alarm Rate (FAR):
This methodology is supported by strong theoretical guarantees [19]. These include strong consistency (the detector almost surely converges to the correct decision as ) and exponentially fast detection bounds (the probability of a false alarm decays exponentially with the sample size n, while the detection power tends to one).
The RIV methodology provides a robust and unsupervised fault detection strategy. It does not require examples of faulty data, is agnostic to the system’s probability distribution (admitting non-linear and non-Gaussian cases), and can be applied using only the nominal model. These properties make it especially attractive for real-world industrial setups, such as induction motors, where failures often manifest as subtle model drifts and labeled faulty data are scarce or impossible to generate without damaging the equipment. However, the practical implementation of Step 3 requires a specialized estimator capable of handling these unknown distributions, which is presented next.
2.3. Distribution-Free Mutual Information Estimation
To implement the RIV detector described in Section 2.2, it is necessary to quantify the shared information between the inputs and residuals. Mutual information (MI), originally introduced by Shannon and Weaver, is the ideal metric for this task as it measures the reduction in uncertainty of one variable when another is known, capturing both linear and nonlinear statistical dependencies.
Formally, for two random variables X and R, MI is defined as:
where and are the marginal probability functions, and is the joint probability.
However, in real-time motor monitoring, the underlying probability distributions are unknown and non-Gaussian. Therefore, we employ the non-parametric and distribution-free estimator presented in [19,36]. Unlike computationally expensive kernel-based methods, this estimator computes from an i.i.d. sample without assuming any specific distributional form, operating in three key stages optimized for online efficiency:
Stage 1: Data-Driven Partitioning. The estimator constructs a partition of the joint space using a complexity-regularized binary tree. The space is recursively split by axis-parallel hyperplanes until each cell contains at most data points, where controls the minimum cell mass. The tree is subsequently pruned using a regularization parameter to prevent overfitting, ensuring a compact model representation suitable for real-time processing.Stage 2: Empirical Probability Estimation. For each resulting cell , the empirical probabilities are computed as:
where is the indicator function. The marginal probabilities and are computed analogously.Stage 3: MI Calculation. Finally, the mutual information is estimated as:
where represents the probability under the independence assumption. This estimator is consistent given appropriate parameter choices ( ), providing a reliable statistic for the RIV test.
2.4. Data-Driven Nominal Modeling and Sensorless Control
The foundation of the proposed RIV framework is a precise nominal model of the healthy motor. To construct this model without disassembling the drive or relying on often-unavailable manufacturer datasheets, we leverage the online self-commissioning algorithm presented in [37].
2.4.1. Self-Commissioning for Parameter and Data Acquisition
This method is pivotal, as it estimates the set of induction motor and load parameters in approximately 180 s without disconnecting the motor from its load. Unlike traditional offline techniques, this approach employs a discrete normalized MRAS implemented on the TMS320F28388D microcontroller.
As illustrated in the dashed block labeled “Self-Commissioning Routine” at the top of Figure 1, which directly references the offline identification workflow detailed in Section 3, this routine serves a dual purpose. First, the “Parameter Estimator” block calculates the estimated parameters ( , and ), which are collectively denoted by the tag Par and automatically injected into the control loops and observers for normal operation (switch position 2). Second, and crucially for this work, the rich time-series data (voltages and currents ) collected during this process (executed in switch position 1) provides the training dataset for the nominal model, establishing the healthy baseline against which model drift is later quantified.
2.4.2. Integrated Control Architecture
Once the parameters are identified, they are deployed within a Sensorless DFOC architecture, encapsulated by the blue dashed box labeled “Sensorless DFOC Scheme” in Figure 1. This control strategy, which operates when the switch is in position 2, is selected for its high dynamic performance and elimination of mechanical sensors [8]. The control structure, depicted in the central green block within this scheme, uses a cascade arrangement with outer loops for speed/flux regulation and an inner loop for current control. To enable sensorless operation, a State Observer (specifically an Extended Kalman Filter) estimates the rotor speed and flux . As shown by the connection tag Par in Figure 1, this observer relies directly on the values ( , etc.) obtained from the self-commissioning stage. It is worth noting that this complete sensorless topology corresponds to the functional block simply labeled “DFOC” in the monitoring architecture of Figure 2. The complete integrated system provides a stable and known operational context, ensuring that the RIV detector monitors the motor health under controlled conditions.
2.5. Baseline Methods and Performance Metrics
To validate the proposed RIV framework, a comparative analysis is conducted against established diagnostic techniques. This section defines the physical principles and operational limitations of these baseline methods.
2.5.1. Spectral Signature Analysis
Motor Current Signature Analysis (MCSA) remains the industrial standard for induction motor diagnosis due to its non-intrusive nature [4]. This method relies on the detection of characteristic harmonic sidebands in the stator current spectrum induced by the magnetic asymmetry of the fault. For inter-turn short circuits (ITSC), the characteristic fault frequencies ( ) are typically located at [15,32]:
where is the supply frequency, s is the slip, p is the pole pair number, and are integers.
While effective in stationary regimes, MCSA faces a fundamental physical trade-off between frequency resolution ( ) and detection latency. To distinguish fault sidebands that are close to the fundamental frequency (i.e., small ), the FFT requires a data acquisition window inversely proportional to the desired resolution ( ). Consequently, obtaining high spectral resolution inevitably introduces a significant detection delay. Furthermore, as noted in [33], for incipient faults or low-severity short circuits, the amplitude variations in these sidebands are often negligible (see Figure 4 in [33]), rendering the method insensitive to early-stage failures and susceptible to false negatives. However, the sensitivity of MCSA strongly depends on the selected time-window length and signal-to-noise ratio, which limits its effectiveness for incipient faults.
2.5.2. Residual Energy Monitoring (Model-Based Baseline)
To rigorously benchmark the proposed method against model-based approaches, we implemented the residual energy monitor following the stationary frame formalism described in Digital Twin frameworks [21]. The detection logic proceeds in three steps:
First, the instantaneous residual vector is computed as the difference between the measured stator currents and the nominal model predictions ( ).
Second, the three-phase residuals are projected onto the stationary two-phase reference frame ( ). This transformation aligns the diagnostic signals with the vector control variables, facilitating the analysis of the residual space vector magnitude.
Third, the Fault Detection Index (FDI) is calculated as the instantaneous energy of the stationary residual vector. This metric, denoted here as , captures the total deviation of the current space vector from its expected trajectory:
Finally, the detection algorithm compares this energy index against a statistical threshold , which is calibrated based on the peak noise observed during healthy operation (conventionally set at of the healthy baseline noise):
While theoretically sound, the reliance on the magnitude of the residual vector makes this method highly sensitive to transient modeling errors. As will be shown in the Section 4, rapid load or speed changes cause momentary spikes in that can be misclassified as faults if the threshold is not set conservatively high.
2.5.3. Quantitative Performance Metrics
To objectively evaluate the proposed method against the baselines and address the need for rigorous validation, we adopt three quantitative metrics. These metrics are grounded in the theoretical performance analysis for hypothesis testing established in [19]:
- Detection Delay ( ): Formally related to the finite-sample collapsing time defined in [19], this metric measures the time elapsed between the physical fault inception ( ) and the moment the decision rule stabilizes on the rejection of :
where . Comparison of highlights the trade-off between the fast statistical convergence of RIV versus the window-dependent latency of MCSA. 2. False Alarm Rate (FAR): This metric corresponds to the empirical significance level ( ) of the test [19]. It quantifies the robustness of the detector against dynamic transients (e.g., load impacts) when the motor is healthy:
where is the number of samples where a fault is incorrectly declared during healthy operation. 3. Separability Index (Fisher Score ): To quantify the discriminative power of the fault indicator beyond simple thresholding, we utilize the Fisher Score. This index measures the statistical distance between the healthy distribution ( ) and the faulty distribution ( ) of the diagnostic index:
A higher value indicates superior fault sensitivity and a lower probability of classification error.
With the theoretical foundations, baseline comparison methods, and quantitative evaluation metrics fully defined, the subsequent section presents the experimental validation of the proposed framework.
3. Proposed Methodology
This section describes the proposed fault detection and phase discrimination methodology, which integrates a data-driven nominal model, residual generation, and RIV estimation into a unified online monitoring architecture.
To address the limitations identified in Section 1, this work proposes a fault detection and diagnosis framework grounded in the RIV principle. Unlike traditional methods that rely on labeled fault data or intrusive signal injection, our approach leverages the statistical concept of model drift to identify incipient stator faults using only standard control signals. The proposed strategy is integrated directly into the drive’s control architecture, enabling reliable operation under realistic industrial conditions.
Integrated RIV-Based Monitoring Scheme
The core of the proposed methodology is the real-time monitoring loop illustrated in Figure 2. To visually distinguish the novel contributions of this work, the diagnostic functional blocks are highlighted in green. This distinction also corresponds to the hardware architecture: the standard control loops (blue blocks) are executed on the embedded DSP to guarantee real-time stability, while the diagnostic algorithms (green blocks) are hosted on an external monitoring station (e.g., an industrial PC). This decoupled design ensures that the statistical analysis does not burden the control processor and facilitates the method’s extension to other VSDs capable of streaming and data. This system operates in parallel with the control structure, passively processing the input voltages and output currents to detect statistical anomalies without affecting the controller performance. The framework is composed of four sequential stages: nominal model prediction, residual generation, RIV estimation, and faulty phase discrimination.
The process begins at the Nominal Model Prediction block. Here, the real-time voltage vector is fed into a pre-trained nominal model to estimate the expected healthy currents . Crucially, the proposed framework operates in the natural reference frame rather than the synchronous frame. This design choice is strategic: unlike the transformation which distributes a single phase fault across both axes, the representation preserves the unique spatial signature of each winding. This feature is essential for the direct Faulty Phase Discrimination stage (detailed later) and simplifies the neural model structure by removing the dependency on the rotor flux angle for synchronization. The construction of this specific block is a critical component of the methodology. To ensure the model accurately represents the healthy baseline without requiring manual data collection, we utilize the data extracted from the self-commissioning routine described in [37] and depicted in Figure 3. The raw waveforms ( ) collected during this 180-s initialization process cover the entire dynamic excitation range of the motor. This dataset is used to train the mapping from voltage history to current response.
It is important to emphasize that the proposed RIV framework is model-agnostic; the nominal model could be implemented using analytical physics-based equations, RNNs, or other regression tools. However, in this implementation, an MLP is selected specifically for its structural simplicity and low computational burden. The MLP offers sufficient approximation capacity to ensure that the residuals are uncorrelated with inputs under healthy conditions, which is the only requirement for the RIV test to function correctly. Thus, the model deployed in this block is essentially a data-driven, open-loop observer of the healthy stator impedance.
Remark 1. It is assumed that the motor operates under healthy or near-healthy conditions during the self-commissioning procedure used to train the MLP. If the motor already exhibits significant stator degradation at this stage, the resulting nominal model may embed these faulty dynamics, potentially masking subsequent fault detection.
Following the prediction, the Residual Generation stage computes the instantaneous deviation between the measured currents and the model estimates . The residual vector is defined as:
The use of current residuals is advantageous as these variables are already measured by the VSD for control purposes, eliminating the need for additional hardware such as vibration sensors or flux probes.
These residuals are accumulated in a sliding window buffer to form the dataset necessary for statistical analysis. The RIV Estimator block then acts as the core engine of the method. It computes the statistical dependency between the voltage inputs and the current residuals using the distribution-free estimator described in Section 2.3. This step operationalizes the RIV test: under healthy conditions, the residuals should be independent of the inputs (noise); under a fault, the physical modification of the machine creates a detectable dependency ( ).
The decision logic compares the estimated RIV metric against a threshold . This threshold is calibrated empirically using the statistics of the RIV distribution observed during a brief healthy initialization period. If exceeds , the system triggers the Faulty Phase Discrimination stage.
Instead of complex pattern recognition, this stage leverages the localized nature of the RIV to identify the fault location. We compute three separate, per-phase mutual information values:
The global fault indicator can be defined as their aggregation (e.g., maximum or sum). Once a fault is declared, the faulty phase is identified by finding the phase that contributes the most to this statistical dependency:
This identification principle relies on the physical premise that a short circuit induces the strongest model drift—and consequently the highest statistical dependency—in its corresponding phase residual.
It is important to note that this monitoring scheme is fully integrated with the Sensorless DFOC architecture shown in Figure 1. The control loops and the Extended Kalman Filter (EKF) observer [37] continue to regulate the machine using the parameters identified during self-commissioning, while the RIV framework operates in the background. This ensures that the diagnosis is performed within a stable, controlled context.
4. Experimental Results
4.1. Experimental Setup and Test Description
To validate the proposed framework, extensive experiments were conducted on a laboratory test bench under realistic operating conditions. The objective of this evaluation is to rigorously assess the detection sensitivity, response time, and phase discrimination capability of the proposed method when addressing SSCF. All tests were executed under controlled and repeatable conditions to ensure the reliability of the reported results. To mitigate the effects of thermal degradation on the stator winding resistance during the experimental campaign, the test duration was kept short (approx. 15 s per trial), and a cool-down period of 20 min was enforced between consecutive experiments to allow the machine to return to its ambient thermal equilibrium.
Figure 4 illustrates the laboratory test bench employed for the experimental validation. The setup relies on a Siemens induction motor (model 1LE0101-0DA22-2AA4), structurally modified to emulate stator faults. The detailed technical specifications of this machine are listed in Table 1. The motor is driven by a custom-designed Variable Speed Drive (VSD) developed at the university laboratory, which utilizes a Texas Instruments TMS320F28388D DSP to command a two-level voltage-source inverter. Regarding the software implementation, MATLAB R2024a and Simulink 24.1 were used to design, compile, and deploy the discrete-time control algorithms onto the VSD’s embedded processor. During operation, the DSP streams voltage and current measurements to the host computer via a high-speed serial interface. The diagnostic algorithm, running in parallel on the host, processes the sliding window buffer sequentially. Since the execution time of the RIV estimator and MLP inference is negligible compared to the data acquisition period, the setup operates as a real-time monitoring system, providing instantaneous fault feedback.
All diagnostic signals employed in this study are directly acquired from the drive’s internal measurements, without the need for additional sensors or hardware modifications.
During the experimental tests, the induction machine operated at a mechanical speed of approximately 1000 rpm. The motor was mechanically loaded using a Prony brake system, as visible in Figure 4. This mechanism applies a frictional load to the motor shaft, which is manually regulated via an adjustment handle to establish the desired torque operating point.
To validate the proposed method, Stator Short-Circuit Faults (SSCF) were physically emulated on the test motor. As illustrated in Figure 5, this was achieved by connecting calibrated external resistors between dedicated taps in the stator winding (specifically in Phase B). Figure 5a shows the electrical schematic of this connection, while Figure 5b provides a photograph of the modified motor terminals. This approach enables the controlled and repeatable replication of incipient faults while preserving the electromagnetic behavior of true insulation degradation. The fault severity is defined by the percentage of shorted turns relative to the total turns in the phase; severities ranging from 1% to 4% were tested. Although the fault-emulation taps were installed exclusively on Phase B due to manufacturing constraints, the symmetrical nature of the induction machine ensures that these results are representative of faults occurring in any of the three phases.
Finally, to ensure the reproducibility of the proposed data-driven framework, the specific hyperparameters configured for both the Nominal Model (MLP) and the RIV Estimator are detailed in Table 2.
Regarding the MLP, the input size corresponds to the time-delay structure defined in the Section 3 (15 inputs). Although the proposed RIV framework is model-agnostic, an MLP was selected here for its computational efficiency. The choice of a single hidden layer is grounded in the Universal Approximation Theorem [38], which posits that this structure is sufficient to represent continuous motor dynamics. Furthermore, the specific size of 64 neurons was determined via an empirical grid search to minimize the validation error, prioritizing a compact model to prevent overfitting. For the RIV estimator, the parameters ( ) were tuned based on the consistency conditions [19] where different values were tested to ensure convergence, specifically setting the cell mass decay exponent to . As shown in the table, three different sliding window lengths ( ) were evaluated, with selected as the optimal operating point for the reported results.
To validate the performance against established methods, two baselines were configured following standard literature guidelines. The MCSA method was implemented using a Hanning window of s to ensure sufficient frequency resolution for sideband detection as recommended in [15,32]. Simultaneously, the Residual Energy monitor was set up according to the digital twin principles described in [21], utilizing a fixed detection threshold set at of the steady-state noise observed during healthy operation.
4.2. RIV Calibration and Detection Results
This subsection evaluates the ability of the proposed framework to detect SSCF at its incipient stages and establishes the optimal hyperparameter configuration for the subsequent comparative analysis. Figure 6 reports the system’s response to a representative 1% inter-turn short-circuit fault induced in Phase B at s. The figure simultaneously illustrates the detection sensitivity and the critical trade-off between detection speed and estimator noise, which is governed by the sliding window (SW) length.
The figure is organized into three plots. The top plot shows the measured three-phase stator currents ( ). Even after the fault inception at , the currents exhibit only a very subtle asymmetry, demonstrating the challenge of detecting such an incipient fault using simple current monitoring.
The middle plot shows the evolution of the estimated global mutual information, , for three different SW lengths (100, 180, and 250 samples). This plot clearly demonstrates the core of the proposed method:
- Healthy State ( s): Before the fault, all three estimators remain stable and fluctuate below their respective empirically-calibrated detection thresholds ( ). This confirms the method’s robustness against false alarms during normal operation.
- Fault State ( s): Immediately after the fault, the statistical dependency increases, causing all three estimators to rise and promptly cross their thresholds.
The bottom plot confirms this logic by showing the resulting binary fault alarms. Each alarm is triggered precisely when its corresponding estimator in the middle plot crosses its threshold. This visualization captures the fundamental trade-off of the window length:
- SW 100 (Blue): Although theoretically faster, it exhibits the highest variance (noise). As observed in the middle plot, this volatility can momentarily delay the threshold crossing, resulting in a detection time of 0.25 s in this trial.
- SW 180 (Orange): Provides the fastest detection in this scenario (95 ms) while maintaining a balanced compromise between response speed and signal stability.
- SW 250 (Green): Provides the smoothest signal (lowest noise) but results in the longest detection delay (approx. 0.28 s) due to the higher filtering effect.
These results confirm that the proposed framework successfully detects a 1% SSCF—a challenging incipient-level event—in under 0.3 s. Based on this analysis, a sliding window of 180 samples was selected as the optimal operating point for the subsequent tests.
The discrimination logic relies on the three separate MI estimators defined in Equations (10)–(12): , , and . The faulty phase is identified by finding the estimator that shows the largest deviation from its nominal behavior, following the rule.
Figure 7 provides a clear validation of this mechanism for the 1% SSCF in Phase B (using the optimal SW 180 configuration). The plot shows the evolution of the three per-phase MI estimators, which naturally operate at different baseline offsets during healthy operation.
As shown in Figure 7, during the Healthy Operation phase ( s), all three estimators fluctuate within their established nominal bands. Although the estimators exhibit inherent statistical variance (particularly Phase A), they remain bounded. However, immediately following the Faulty State inception, the estimator corresponding to the faulty phase ( , orange) exhibits a clear and sustained increase, breaking away from its healthy baseline. In contrast, the estimators for the healthy phases ( , blue, and , green) continue to fluctuate within their original ranges, showing no correlation with the fault event.
This result visually confirms that the model drift caused by the fault is almost exclusively captured by the MI estimator of the affected phase. This clear divergence allows the rule to robustly and correctly identify the faulty phase. With the detection parameters calibrated and the discrimination capability verified, this configuration serves as the basis for the comparative benchmarking against the state-of-the-art methods presented next.
4.3. Performance of Baseline Methods
To establish a comparative benchmark for incipient fault detection, the experimental scenarios were first analyzed using the classical Motor Current Signature Analysis (MCSA) and the Model-Based Residual Energy method.
4.3.1. Spectral Analysis Limitations (MCSA)
To evaluate the feasibility of classical spectral methods, the stator current ( ) was analyzed using Fast Fourier Transform (FFT). The tests were conducted at the nominal speed of 1000 rpm ( Hz), ensuring the same experimental conditions used for the proposed RIV evaluation. Following the methodology in [32], the analysis focuses on the amplitude of the sideband component at , which is characteristic of inter-turn short-circuit faults.
Figure 8 illustrates the impact of the acquisition window length ( ) on the detectability of this fault signature. The results reveal a critical trade-off between frequency resolution and detection latency:
As observed in Figure 8a, applying a short time window of s—desirable for fast detection—results in significant spectral leakage. The main lobe of the fundamental frequency widens considerably, masking the fault harmonic. Extending the window to s (Figure 8b) improves resolution but fails to isolate incipient faults; the 1% severity case (orange line) remains indistinguishable from the healthy baseline.
A clear separation of the fault harmonic is only achieved by increasing the window length to s, as shown in Figure 8c. These results are consistent with those reported in [32], confirming that MCSA is effective for detecting incipient faults (1%, 3%, and 4%) provided that sufficient spectral resolution is available. However, the required acquisition time (>8 s) introduces a significant latency, making the method unsuitable for real-time protection against rapidly evolving faults.
4.3.2. Residual Energy Sensitivity
To ensure a fair comparison with the proposed RIV method, the Residual Energy strategy was evaluated under identical experimental conditions, specifically at a constant speed of 1000 rpm. Implemented using the same MLP architecture described in Section 4.1, this approach aligns with standard digital twin methodologies [21]. As detailed in Table 2, a moving average filter ( ) was applied to the raw residuals to suppress high-frequency sensor noise. The detection threshold ( ) was then empirically calibrated based on the filtered healthy baseline to minimize the False Alarm Rate (FAR), ensuring that normal noise fluctuations do not trigger spurious detections.
The detection capabilities are visually confirmed in Figure 9a, where the instantaneous residuals for the 4% severity case exhibit a distinct deviation from zero after the fault injection. This clear signature translates into the energy domain shown in Figure 9b, where faults with severities of 3% and 4% rapidly exceed the threshold after s. In contrast, the 1% severity case (cyan trajectory) remains comparable to the noise floor, masked by the residual variance. This confirms that while energy-based metrics are effective for moderate faults, they lack the necessary sensitivity for incipient stages under realistic noise conditions.
MCSA and residual-energy-based detection were selected as baselines because they represent widely adopted spectral and model-based diagnostic strategies in industrial practice.
4.4. Overall Performance Comparison and Discussion
To conclude the experimental study, a comprehensive comparison is presented between the proposed RIV method and the baseline strategies (MCSA and Residual Energy) discussed in Section 4.3. This comparison is based on the key performance indicators defined in Section 2.5.3: detection delay, statistical separability (Fisher Score), and the False Alarm Rate (FAR). Additionally, the capability to correctly identify the affected phase is verified as a fundamental requirement for all diagnostic methods. The results for the different fault severities at 1000 rpm are summarized in Table 3.
The proposed RIV-based method consistently achieves lower detection delay and higher statistical separability than the baseline techniques, particularly for incipient fault conditions. The detection delay analysis reveals a clear advantage for the proposed RIV method. For the most critical case (1% severity), the RIV achieves a stable detection in only ms. In contrast, the MCSA requires an acquisition window of at least 8.0 s to resolve the harmonic from the fundamental spectral leakage, while the Residual Energy method fails to consistently exceed the threshold, rendering the incipient fault undetectable. This difference of two orders of magnitude in latency highlights the RIV’s suitability for near-instantaneous protection.
Regarding statistical separability, the Fisher Score ( ) confirms the superiority of the information-based approach. The RIV maintains a significantly higher score ( ) compared to the Residual Energy method ( ) at 1% severity. This indicates that the RIV effectively extracts fault signatures that are otherwise masked by noise in the time-domain energy calculation. It should be noted that the Fisher Score is not applicable to MCSA, as the latter relies on frequency-domain snapshots rather than continuous time-series distributions.
The False Alarm Rate (FAR) was evaluated using a large healthy dataset characterized by frequent variations in speed and load to test the detectors under dynamic transients. The MCSA exhibits the lowest FAR, as its long-term averaging naturally filters out transient oscillations; however, this comes at the cost of high data requirements. Among the fast-response methods, the Residual Energy approach showed poor robustness with a high FAR (< ), as load impacts were frequently misinterpreted as faults. The RIV method significantly mitigates this issue (< ), providing a robust diagnostic even under varying operating conditions.
Finally, all three methods demonstrated the capability to correctly identify the faulty phase ( ), confirming that the proposed RIV method preserves the fundamental diagnostic features of traditional baselines while significantly improving sensitivity and response speed.
5. Conclusions
This work presents a high-sensitivity fault detection framework based on the Residual Information Value (RIV) principle, specifically designed to address the data bottleneck in industrial diagnostics. By treating fault detection as a statistical test of independence between control signals and current residuals, the proposed method breaks a major industry obstacle: the heavy dependency on large, labeled fault datasets for training. The framework’s primary advantage lies in its seamless integration into standard industrial VSDs, leveraging existing self-commissioning routines to train a lightweight MLP nominal model without requiring additional sensors or intrusive signal injection.
Experimental results on an industrial-grade motor confirm that the RIV method substantially advances the state of the art. While traditional MCSA requires acquisition windows of at least 8.0 s to resolve incipient signatures, the RIV framework achieves reliable detection of 1% stator short-circuit faults in only 61 ms. Furthermore, the proposed method successfully overcomes the reliability issues of conventional residual-energy schemes: whereas standard energy indicators exhibit an unacceptable False Alarm Rate of up to 25% for incipient faults, the RIV approach reduces this metric to below 0.6%. This robustness is further supported by the Fisher Score, which reveals that the RIV extracts fault information that is over 130 times more statistically significant than time-domain energy indicators for 1% severity cases, providing a precise and model-agnostic tool for early protection.
Looking forward, several research directions emerge to further extend these contributions. Future studies will explore a hybrid diagnostic architecture by integrating the proposed statistical drift indicator with physical-model residuals derived from the DFOC’s internal Extended Kalman Filter (EKF). The deployment of the framework as a fully embedded edge-computing function within the VSD’s DSP constitutes a promising path for immediate industrial adoption. Because the underlying hypothesis is generic, future work will also investigate the applicability of this model-drift paradigm to other fault classes, including stator open-circuit faults, bearing degradation, broken rotor bars, and mechanical imbalances. Furthermore, since the methodology relies on fundamental input-output constitutive relations rather than specific machine parameters, it is theoretically transferable to other AC motor types, such as PMSM or SynRM, as long as the requisite signals are available. To further enhance robustness against long-term thermal variations, future iterations of the nominal model will incorporate real-time stator resistance estimates ( ) as an auxiliary input to the MLP, ensuring the framework’s stability in diverse and demanding industrial environments.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bazan G.H. Scalassara P.R. Endo W. Goedtel A. Palácios R.H.C. Godoy W.F. Stator short-circuit diagnosis in induction motors using mutual information and intelligent systems IEEE Trans. Ind. Electron.2018663237324610.1109/TIE.2018.2840983 · doi ↗
- 2Bazan G.H. Goedtel A. Scalassara P.R. Endo W. Nunes E.A. Takase V.T.F. Guedes J.J. Gentil M.G. An embedded system for stator short-circuit diagnosis in three-phase induction motors using information theory and artificial neural networks IEEE Trans. Syst. Man Cybern. Syst.2022526582659210.1109/TSMC.2022.3149851 · doi ↗
- 3Akhil Vinayak B. Anjali Anand K. Jagadanand G. Wavelet-based real-time stator fault detection of inverter-fed induction motor IET Electr. Power Appl.202014829010.1049/iet-epa.2019.0273 · doi ↗
- 4Garcia-Calva T. Morinigo-Sotelo D. Fernandez-Cavero V. Romero-Troncoso R. Early detection of faults in induction motors—A review Energies 202215785510.3390/en 15217855 · doi ↗
- 5Zaparoli I.O. Êvo M.T.A. de Paula H. Reliability of Industrial Drive Systems: A Review on Inter-Turn Short-Circuit Detection in DTC Driven Motors Proceedings of the 2021 IEEE Industry Applications Society Annual Meeting (IAS)Vancouver, BC, Canada 10–14 October 2021110
- 6Skowron M. Orlowska-Kowalska T. Wolkiewicz M. Kowalski C.T. Convolutional neural network-based stator current data-driven incipient stator fault diagnosis of inverter-fed induction motor Energies 202013147510.3390/en 13061475 · doi ↗
- 7Oner M.U. Şahinİ. Keysan O. Neural networks detect inter-turn short circuit faults using inverter switching statistics for a closed-loop controlled motor drive IEEE Trans. Energy Convers.2023382387239510.1109/TEC.2023.3274052 · doi ↗
- 8Husari F. Seshadrinath J. Incipient interturn fault detection and severity evaluation in electric drive system using hybrid HCNN-SVM based model IEEE Trans. Ind. Inform.2021181823183210.1109/TII.2021.3067321 · doi ↗
