Machine-Learning-Assisted Viscoelastic Characterization of PC/ABS Blends via Multi-Frequency Dynamic Mechanical Analysis
Yancai Sun, Wenzhong Deng, Haoran Wang, Ranran Jian, Wenjuan Bai, Dianming Chu, Peiwu Hou, Yan He

TL;DR
This paper uses machine learning and dynamic mechanical analysis to study and predict the viscoelastic behavior of a PC/ABS plastic blend.
Contribution
The study introduces a physics-informed machine learning model (NeuralWLF) that outperforms data-driven models in cross-frequency predictions.
Findings
NeuralWLF achieved R² > 0.92 for all targets in cross-frequency generalization.
MLP performed best in interpolation with R²¯=0.989, but its performance dropped significantly with larger validation gaps.
A physics-data crossover threshold was identified at a gap/FWHM ratio of ≈2, beyond which NeuralWLF outperformed data-driven models.
Abstract
This study combines multi-frequency dynamic mechanical analysis (DMA) with machine learning (ML) to characterize and predict the viscoelastic properties of a commercial polycarbonate/acrylonitrile–butadiene–styrene (PC/ABS) blend. DMA temperature sweeps at four frequencies (1–10 Hz) in single cantilever mode yielded a glass transition range of 115.8–123.2 °C (E″ peak), frequency sensitivity of 7.18 °C/decade, and an apparent activation energy of 335±85 kJ mol−1. Time–temperature superposition master curves were parameterized with a six-term Prony series (R2=0.998). Four data-driven models (RF, XGB, SVR, MLP) and a physics-informed NeuralWLF model were evaluated through a hierarchical validation framework. Temperature-blocked CV ranked MLP (R2¯=0.989) above RF (0.950) for interpolation; LOFO validation revealed that NeuralWLF achieved the best cross-frequency generalization (R2>0.92 for…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15- —Guangxi Natural Science Foundation
- —Shandong Province Key R&D Program, Major Science and Technology Innovation Project
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPolymer crystallization and properties · Machine Learning in Materials Science · Material Dynamics and Properties
1. Introduction
Polycarbonate/acrylonitrile–butadiene–styrene (PC/ABS) blends constitute one of the most commercially significant families of engineering thermoplastics, combining the impact resistance and heat distortion resistance of polycarbonate (PC) with the processability and cost advantages of ABS [1,2]. These blends are extensively deployed in automotive, electronics, and consumer product applications and have increasingly been adopted as feedstock materials for fused deposition modeling and other additive manufacturing processes [3,4]. Across these applications, the mechanical performance under service conditions is governed by the time- and temperature-dependent viscoelastic response, making accurate characterization across broad frequency and temperature domains essential for engineering design.
Dynamic mechanical analysis (DMA) is the principal experimental technique for measuring the viscoelastic properties of polymeric materials [5,6]. By subjecting a specimen to small-amplitude sinusoidal deformation at controlled frequency and temperature, DMA provides simultaneous determination of the storage modulus ( ), loss modulus ( ), and loss tangent ( ). Multi-frequency DMA experiments enable systematic investigation of the frequency dependence of relaxation processes, yielding kinetic parameters such as the apparent activation energy for segmental motion [7,8]. The glass transition temperature ( ), the key parameter governing the service performance of amorphous polymers, can be determined from the temperature at which or reaches its maximum [9].
For polymer blends, the glass transition behavior is inherently more complex than for homopolymers. In immiscible systems such as PC/ABS, the blend typically exhibits distinct glass transitions corresponding to the PC-rich and ABS-rich phases [10,11]. The PC-phase , which dictates the upper service temperature, is commonly observed in the range of 120–135 in blends, depressed from the 145–155 characteristic of neat PC [6,11]. The overlap of the PC- and SAN-phase (the matrix component of ABS) relaxations complicates both the extraction of and the application of analytical superposition methods.
Time–temperature superposition (TTS) exploits the equivalence between the effects of frequency and temperature on viscoelastic response to construct master curves spanning frequency ranges far exceeding those accessible in a single experiment [5,12]. For thermorheologically simple materials, the shift factors follow the Williams–Landel–Ferry (WLF) equation or an Arrhenius relationship near [13,14]. However, immiscible polymer blends are inherently thermorheologically complex: the distinct relaxation mechanisms in each phase possess different temperature dependencies, leading to imperfect superposition [15,16,17]. For PC/ABS systems, the degree of TTS failure depends on composition and morphology, with PC-rich blends exhibiting more satisfactory superposition due to the dominance of the PC-phase relaxation [11].
Once TTS master curves are obtained, the relaxation behavior can be parameterized through a Prony series representation, expressing the relaxation modulus as a sum of exponential decay terms suitable for direct implementation in finite element method (FEM) software [18,19,20]. For immiscible blends where TTS is imperfect, the resulting Prony parameters carry inherent uncertainties that are difficult to quantify within the traditional analytical framework.
Machine learning (ML) methods have demonstrated growing capability in predicting the properties of polymeric materials [21,22]. Supervised regression algorithms can capture complex nonlinear relationships between testing conditions and material properties without requiring explicit physical models. ML approaches have been applied to predict glass transition temperatures from molecular descriptors [23], to establish structure–property relationships for polymer composites [24], and to forecast mechanical performance under varying environmental conditions [25]. More broadly, recent materials–ML studies have expanded toward integrated discovery workflows, including surrogate modeling, inverse design, and data-driven exploration at scale [26,27]. Recent polymer-focused work has also moved toward physics-guided learning, including large-deformation viscoelastic prediction [28], physics-enforced neural modeling of polymer melt viscosity [29], and systematic reviews of physics-informed neural networks in polymer research [30]. These advances motivate the use of hybrid ML models for polymer mechanics, but they also show that model validity must be tied to well-defined experimental domains. In particular, direct learning of the DMA triplet over coupled temperature–frequency space with explicit cross-frequency extrapolation diagnostics remains limited.
ML-based prediction of DMA properties offers distinct advantages over the traditional TTS framework. First, ML models do not require the assumption of thermorheological simplicity, which is inherently violated in immiscible blends. Second, once trained, ML models provide instantaneous predictions at arbitrary temperature–frequency combinations. Third, feature importance metrics from tree-based methods offer quantitative insight into the relative contributions of temperature and frequency to the viscoelastic response. Nevertheless, the extrapolation capabilities of ML models beyond the training domain have not been systematically assessed.
In this study, we present an integrated DMA–ML framework for the viscoelastic characterization of a commercial PC/ABS blend. Multi-frequency DMA measurements in single cantilever mode are supplemented by dual cantilever geometry verification. Traditional analysis methods—frequency-dependent determination, Arrhenius activation energy, TTS master curves, and Prony series fitting—are applied alongside four data-driven ML regression models (RF, XGB, SVR, MLP) and a physics-informed NeuralWLF model that embeds a differentiable WLF layer within a neural network. A leave-one-frequency-out (LOFO) validation strategy is introduced to assess frequency extrapolation capability. Relative to prior polymer–ML studies, our emphasis is on experimentally grounded prediction of under standard DMA conditions and on making the in-domain/extrapolation boundary explicit. The objectives are threefold: (1) to establish a comprehensive DMA characterization with engineering-approximate constitutive parameters; (2) to evaluate interpolation and extrapolation performance of data-driven and physics-informed ML models; and (3) to compare ML insights with physical understanding from traditional analysis.
2. Materials and Methods
2.1. Materials
The material investigated was a commercial-grade PC/ABS blend. The supplier trade name and technical datasheet were not available for the specific grade; the nominal composition of approximately 40:60 PC:ABS by weight is based on supplier verbal communication and is used as a working assumption throughout this study. Accordingly, the material identity is characterized primarily by its measured DMA fingerprint ( , at 1 , at 1 ). No resolved secondary peak was observed in the DMA traces, suggesting either peak merging between the PC and SAN phases or a dominant single-phase response; additive content (fillers, flame retardants) cannot be inferred from DMA alone, and the detailed formulation is not disclosed.
Material characterization limitations. Complementary thermal analysis (DSC, TGA) and spectroscopic characterization (FTIR) were not performed in this study. Consequently, (i) the PC:ABS ratio remains unverified beyond nominal supplier information, (ii) the phase morphology (miscibility, domain size) is unknown, and (iii) the presence and concentration of additives (stabilizers, flame retardants, impact modifiers) cannot be assessed. The single observed in DMA could reflect either a miscible blend, overlapping phase transitions, or PC-dominated response; distinguishing among these scenarios would require DSC with controlled heating rates or dynamic rheology at lower frequencies. Readers seeking to reproduce or extend this work should treat the material as a “black-box” commercial formulation characterized by its DMA response rather than a compositionally defined system.
Test specimens were injection molded into rectangular bars of approximately ( ) for DMA testing. Prior to measurement, all specimens were dried at 80 for a minimum of 4 to remove absorbed moisture.
2.2. DMA Measurements
Dynamic mechanical analysis was performed using a commercial dynamic mechanical analyzer operated in both single cantilever and dual cantilever bending modes. Temperature sweep experiments were conducted over the range of 30–170 at a heating rate of 3 min^−1^ under a nitrogen atmosphere.
Single cantilever mode. The primary characterization was performed in single cantilever geometry. Multi-frequency measurements were carried out simultaneously at five test frequencies: 0.5, 1, 2, 5, and 10 . The oscillation amplitude was maintained within the linear viscoelastic region, confirmed by amplitude sweeps (5–50 μm) showing variation in . For the single cantilever geometry employed (free length , thickness ), the applied amplitude of 15 μm corresponds to a maximum surface strain of approximately 0.015%, well within the typical linear viscoelastic limit for glassy polymers. Storage modulus ( ), loss modulus ( ), and loss tangent ( ) were recorded as functions of temperature at each frequency. Approximately 930 data points were acquired per frequency, with the exception of the channel (94 points; see Section 3.2). The 1–10 analysis window was selected for three practical reasons. First, it provided dense and stable sampling in the multi-frequency acquisition mode under a fixed thermal ramp. Second, it covered the main transition kinetics with acceptable signal quality across all retained frequencies. Third, this single-decade band is commonly used for comparative DMA characterization under engineering test conditions. We emphasize that model conclusions in this work are therefore validated within this band; extrapolation beyond 1–10 should be treated as out-of-domain unless additional data are acquired.
Dual cantilever mode. To provide an independent geometry verification of the single cantilever results, three additional specimens (designated A, B, and C) were tested in dual cantilever mode at frequencies of 0.5, 1.0, and , respectively. The dual cantilever geometry constrains both specimen ends and applies a central deflection, producing a different stress distribution from the single cantilever configuration. Comparison of values and damping characteristics between geometries provides a quantitative robustness check (see Section 3.2).
2.3. Traditional Analysis Methods
Glass transition temperature. was determined by two standard methods: (i) the peak maximum ( ) and (ii) the peak maximum ( ) [9].
Frequency dependence and activation energy. The frequency sensitivity was quantified as via linear regression. The apparent activation energy ( ) was calculated from the Arrhenius equation:
where R = 8.314 J mol^−1^ K^−1^.
Time–temperature superposition. TTS master curves were constructed at . Shift factors ( ) were determined empirically and analyzed using the Arrhenius formalism.
Prony series fitting. The storage modulus master curve was parameterized using a generalized Maxwell model:
where is the equilibrium modulus, and are the modulus and relaxation time of the i-th element, is the angular frequency corresponding to the reduced frequency , and terms were employed, optimized by nonlinear least-squares regression. Throughout this work, master curves and Prony fits are plotted against (Hz) rather than (rad/s) for consistency with the experimental frequency axis.
2.4. Machine Learning Methodology
Data preparation. DMA data from four reliable test frequencies (1, 2, 5, and 10 ) were compiled into a unified dataset of 3751 samples (1 Hz: 951; 2 Hz: 935; 5 Hz: 930; 10 Hz: 935). The data were excluded due to sparse sampling (94 points). No data points were removed during cleaning (no missing values, no outlier rejection); the count differences arise from minor variations in instrument sampling across frequencies. No interpolation or temperature-grid alignment was applied; each frequency retained its native temperature sampling. All four frequencies cover the full 30–170 range, ensuring that LOFO train/test partitions differ only in frequency, not in temperature coverage. Each sample comprised two input features—temperature (T) and —and three targets: , , and .
Model selection. Five regression models were evaluated:
- (1)Random forest (RF): bootstrap-aggregated decision tree ensemble [31];
- (2)Extreme gradient boosting (XGB): sequential ensemble with regularization [32];
- (3)Support vector regression (SVR): kernel-based hyperplane fitting [31];
- (4)Multilayer perceptron (MLP): feedforward neural network [31];
- (5)NeuralWLF (physics-informed): a hybrid model embedding the WLF equation as a differentiable PyTorch layer, described below.
All data-driven models were implemented using standard hyperparameters selected a priori from preliminary experiments on a dedicated tuning block (the 70–80 glassy-region block, which was excluded from all 14 reported CV test folds to ensure strict separation between tuning and evaluation data); no per-fold hyperparameter re-optimization was performed. Input features were standardized (zero mean, unit variance), with the standardization parameters fitted exclusively on the training partition of each fold and applied to the corresponding test partition to prevent information leakage through preprocessing.
NeuralWLF architecture. The NeuralWLF model is a hybrid model embedding the WLF equation as a differentiable PyTorch layer. The architecture consists of (a) a differentiable WLF layer computing ; (b) a shared fully connected backbone receiving ; and (c) three output heads for , , and . The backbone uses width 128 with GELU activations (num_layers = 4 in code, corresponding to three hidden transforms before the output heads). The head uses Softplus to enforce non-negative outputs. For CV/LOFO experiments, optimization used Adam with separate learning rates for neural parameters (5 × 10^−3^) and WLF parameters (1 × 10^−3^), warmup of 50 epochs, ReduceLROnPlateau scheduling (patience 100, factor 0.5), early stopping patience 600, and 5 random restarts per fold under deterministic seeds. Additional full-physics runs for K-K plausibility used , , and curriculum training with WLF freezing for the first 300 epochs [33,34]. Detailed network/training settings are summarized in Table 1.
Cross-validation strategy. Two complementary validation strategies were employed:
(i) Temperature-blocked CV: The temperature axis was partitioned into 10 blocks (14 blocks total). In each fold, all data within one block were held out as the test set, preventing data leakage from the densely sampled DMA data (autocorrelation at spacing).
(ii) Leave-one-frequency-out (LOFO) validation: Each of the four training frequencies was held out in turn. LOFO assesses extrapolation capability—predicting properties at a frequency entirely absent from the training data.
Reproducibility and limitations. All stochastic operations (RF bootstrap, MLP weight initialization, permutation importance shuffling) used a fixed global random seed (RANDOM_STATE = 42). Temperature-blocked CV boundaries were generated deterministically as np.arange (30, 180, 10), yielding 14 blocks of 10 width. All models were trained from scratch within each CV or LOFO fold; no pre-trained weights were carried across folds. The entire dataset derives from a single injection-molded specimen per test condition; consequently, the results characterize measurement variability (instrument noise, within-specimen heterogeneity) but do not capture batch-to-batch or specimen-to-specimen variation. Cross-batch generalization should not be assumed without additional replication.
Performance metrics. and RMSE were calculated separately for each target variable. For the modulus targets, both metrics were computed in the domain (i.e., RMSE is in units of log_10_ MPa); for , metrics were computed directly in the untransformed domain. A negative indicates that the model’s predictions are worse than a constant-mean baseline (i.e., the residual sum of squares exceeds the total sum of squares), as can occur when a model is evaluated on data outside its training distribution.
3. Results and Discussion
3.1. DMA Temperature Sweep Characteristics
Figure 1 presents the storage modulus ( ), loss modulus ( ), and loss tangent ( ) as functions of temperature at five frequencies. The storage modulus exhibited a glassy plateau of approximately 1871 below 100 , followed by a steep decrease through the glass transition region to a rubbery plateau of approximately above 150 . The modulus decrease of more than two orders of magnitude is characteristic of cooperative segmental relaxation ( -relaxation).
The loss modulus displayed a well-defined peak at (1 ), with a peak magnitude of 330 . The peak occurred at with a maximum value of 2.36 and a full width at half maximum (FWHM) of . The offset between the and peak positions reflects the mathematical relationship [5,6].
No clearly resolved secondary peak attributable to the ABS phase was observed in the measured temperature range (30–170 ). Visual inspection of the traces in the 100–120 range revealed no discernible shoulder or inflection that would indicate a separate SAN-phase relaxation; however, peak deconvolution was not attempted, and a low-amplitude SAN contribution partially merged with the dominant PC peak cannot be excluded at the present signal-to-noise level and temperature resolution. The absence of a resolved secondary peak is consistent with—but does not uniquely prove—a scenario where the SAN-phase (typically 100–115 ) overlaps with the onset of the PC relaxation [10,11]; confirming the phase structure would require complementary techniques such as DSC or dynamic rheology at lower frequencies. Table 2 summarizes the key DMA parameters.
3.2. Frequency Dependence and Activation Energy
Both and shifted to higher temperatures with increasing frequency, consistent with the principle that higher frequencies probe shorter relaxation times [5,12]. Figure 2 presents versus with linear fits.
For the method (excluding the anomalous data), ; the method yielded / (Table 3). The -based value of / lies at the upper bound of the 3–7 / range commonly reported for PC/ABS blends [6,11,13]. This near-upper-bound value is compatible with a PC-dominated relaxation response typical of moderate PC fractions, though it may also reflect the limited frequency window (1–10 , i.e., 4 data points for the regression), which amplifies sensitivity to individual determinations. No quantitative inference about the PC:ABS ratio can be drawn from the frequency sensitivity alone. The -based value of / falls more centrally within the reported range.
The apparent activation energy was estimated from the Arrhenius analysis of the peak shift (Figure 3b). Linear regression of versus yielded a slope of ( , frequencies), where the abscissa has units of . Converting to activation energy via : , with a 95% confidence interval of from the regression slope uncertainty (Table 4). A leave-one-point-out sensitivity check on the 4-point regression yielded estimates of 261, 339, and 360 kJ mol^−1^ when the 10, 2, and 5 points were individually omitted; however, omitting 1 produced an outlier of 713 kJ mol^−1^ because the remaining three values span only , making the regression ill-conditioned. This confirms that the 4-point Arrhenius estimate is order-of-magnitude stable (261–360 kJ mol^−1^ for the three well-conditioned subsets) but should be interpreted as an approximate value rather than a precisely determined quantity. The central estimate of 335 kJ mol^−1^ falls within the 200–400 kJ mol^−1^ range commonly reported for PC/ABS systems [11,13], consistent with PC-dominated cooperative segmental relaxation.
Anomalous data. The channel contained only 94 data points compared with ∼930 at each of the other four frequencies. This order-of-magnitude difference is attributed to the instrument’s multi-frequency acquisition mode, in which the lowest-frequency channel is recorded at a decimated rate to accommodate the longer oscillation period; upon data export, only one point per approximately was retained (versus ∼0.15 at higher frequencies). The sparse sampling produced an anomalously sharp peak (4.22 versus 2.36 at 1 ) and narrow FWHM ( versus ); the corresponding of violates the expected monotonic frequency– relationship. The glassy plateau at (1565 , Table 2) is also ∼16% below the ∼1800 MPa level observed at 1–10 ; this depression is consistent with the glassy plateau not being fully resolved by the sparse temperature grid rather than a genuine material difference, because the overall curve shape is consistent across all frequencies. These data were excluded from all quantitative analyses and ML training.
Geometry robustness check with dual cantilever mode. Independent specimens tested in dual cantilever mode at 1 (specimen B) yielded and , compared with and from single cantilever ( for both definitions). The peak value was 2.43 (dual) versus 2.36 (single), a difference of approximately 3%. At 2 (specimen C), versus ( ) and versus ( ). The larger discrepancy at 2 may reflect specimen-to-specimen variability or the different stress distributions between the two clamping geometries. Overall, the close agreement at 1 confirms that the extracted values are robust with respect to measurement geometry, while the 2 discrepancy underscores that dual cantilever data serve as an independent geometry verification rather than a strict cross-validation.
3.3. TTS Master Curves and Prony Series Fitting
Master curves were constructed at (Figure 4). The master curve spans approximately five decades of reduced frequency, from the rubbery plateau (∼5–10 ) to the glassy plateau (∼1800 MPa). The glassy-to-rubbery modulus ratio of is comparable to reported values for PC/ABS blends [6,11]. Minor superposition imperfections in reflect the thermorheological complexity of the immiscible blend [15,16].
A six-term Prony series achieved (Table 5, Figure 5). The equilibrium modulus is consistent with the measured rubbery plateau. The dominant terms – ( – ) capture the primary glass transition and correspond to time scales observable within the experimental frequency band ( –10 , equivalent to – ). Term ( s) represents the fast glassy response; its relaxation time falls below the observable window and should be regarded as a weakly identifiable, extrapolative term that captures the steep high-frequency rise rather than a uniquely determined relaxation process. Terms and , with relaxation times of and 3777 , lie far outside the observable frequency window and similarly serve as mathematical baseline-correction terms that stabilize the fit at the low-frequency (rubbery) limit; their individual values are not physically identifiable from the present data and should not be interpreted as discrete relaxation processes. Restricting the fit to s (N = 4 terms) yielded , confirming that the observable-range terms capture the essential response with only marginal loss of accuracy. Notably, and share nearly identical relaxation times ( ), indicating practical non-identifiability; a merged single term ( , ) provides an equivalent representation. The Prony parameters in Table 5 should be regarded as engineering-equivalent fits valid at within the experimental frequency band, rather than uniquely identifiable material constants. For FEM applications, these parameters should be used only within the calibrated reduced-frequency range ( – Hz); extrapolation beyond this range inherits the TTS imperfections of the immiscible blend and the non-identifiability of the boundary terms ( , , ).
3.4. Damping Analysis
Figure 6 presents the damping characteristics for the four reliable frequencies (1–10 ). The peak height decreased systematically from 2.36 (1 ) to 1.70 (10 ), while the FWHM broadened from to . At higher frequencies, polymer chains have less time to respond within each cycle, resulting in broader, lower-amplitude transitions [5,7]. The effective damping range ( , a pragmatic engineering threshold commonly used in vibration damping literature [5]) narrows from at 1 to at 10 ; the trend is robust to modest threshold variations (e.g., 0.2–0.5) (The damping range ( ) is shown in Figure 6 for visual reference but is excluded from trend discussion due to sparse sampling; see Section 3.2.).
The preceding sections established the traditional DMA characterization of the PC/ABS blend: temperature sweep behavior across five frequencies, frequency-dependent determination with Arrhenius activation energy, time–temperature superposition master curves with Prony series parameterization, and damping performance characterization. While these analytical methods provide engineering-ready constitutive parameters, they rely on assumptions—most critically thermorheological simplicity—that may not strictly hold for immiscible blends such as PC/ABS. The following sections evaluate data-driven and physics-informed ML approaches as complementary tools for viscoelastic property prediction and assess their interpolation, extrapolation, and generalization capabilities.
3.5. ML Model Comparison: Temperature-Blocked Cross-Validation
The four data-driven ML models and NeuralWLF were evaluated using temperature-blocked CV (Figure 7). Table 6 summarizes the results.
MLP achieved the highest , followed by RF (0.950). Both models captured the steep modulus transition through with high accuracy. MLP outperformed RF for all targets, with the largest advantage observed for ( versus 0.889), suggesting that continuous function approximation is better suited than piecewise-constant tree-based representation for reproducing the smooth but steep damping peak. XGB ( ) and SVR (0.731) showed markedly lower accuracy, particularly for . Figure 8 presents the MLP parity plots. The physics-informed NeuralWLF is evaluated primarily via LOFO in Section 3.8. For comparability, NeuralWLF was also subjected to blocked CV restricted to the transition zone (90–150 , 6 of 14 blocks), where target variance is sufficient for meaningful . Within this restricted range, NeuralWLF achieved ( ), 0.998 ( ), and 0.985 ( )—comparable to MLP. However, the full-range (all 14 blocks) MAE was 0.21 ( ), 0.04 ( ), and 0.11 ( ), substantially exceeding MLP’s 0.03, 0.04, and 0.04, respectively. To enable direct comparison within the same temperature zone, the transition-zone RMSE (100–150 blocks only) was computed: NeuralWLF achieved 0.029/0.040/0.065 and MLP achieved 0.048/0.055/0.058 for / / , respectively. Within the transition zone, the two models perform comparably; the full-range gap confirms that the WLF layer provides excellent frequency–temperature coupling within the transition zone but offers no advantage in the thermally flat plateau regions. The NeuralWLF model’s primary design goal—cross-frequency generalization—is assessed via LOFO in Section 3.8; its performance advantage is contingent on the narrow 1–10 window where the WLF constraint is maximally informative.
The adoption of temperature-blocked CV is essential for reliable performance assessment. Preliminary experiments with conventional 5-fold CV yielded , indicative of data leakage arising from the densely sampled data ( between adjacent -spaced points).
3.6. Frequency Extrapolation via LOFO Validation
All four data-driven models and the NeuralWLF model were evaluated under LOFO to compare extrapolation behavior across model architectures. In the following discussion, “edge frequencies” refer to those at the boundaries of the training input space in : 1 ( , the low-frequency boundary) and 10 ( , the high-frequency boundary); “interior frequencies” (2 and 5 ) lie within the convex hull of the training values. Predicting at an edge frequency requires extrapolation beyond the training support, whereas predicting at an interior frequency is interpolation. Results are presented in Table 7 and Figure 9 and Figure 10.
RF extrapolation. When interior frequencies (5, 10 ) are held out, RF predicts with and with . At lower-edge frequencies (1, 2 ), remains strong ( ) but degrades ( and 0.49). Critically, RF maintains positive at all held-out frequencies and achieves the highest mean (0.75) among the four data-driven models.
XGB extrapolation. XGB exhibits a similar pattern to RF but with generally lower accuracy. At interior frequencies, XGB achieves – , comparable to RF. At the 1 edge, (versus RF 0.55); at 10 , (versus RF 0.98). The sequential boosting strategy provides moderate extrapolation robustness (mean ), slightly below RF, but does not exhibit the catastrophic failures observed in MLP and SVR at edge frequencies.
SVR extrapolation. SVR exhibited the worst extrapolation behavior among all models, with catastrophic failure at both edge frequencies: at 1 and at 10 ; at 1 and at 10 . Even at interior frequencies, SVR underperformed ( – ). The kernel-based extrapolation is fundamentally limited by the decay of the RBF kernel outside the training support, which causes predictions to collapse toward the training-set mean.
MLP extrapolation. MLP achieves excellent interpolation at 2 and 5 ( and 0.97) but exhibits marked degradation at edge frequencies: at 1 and 0.38 at 10 . In this implementation (fixed architecture, no monotonic constraints), MLP exhibits edge-frequency instability characteristic of neural networks operating outside the convex hull of training inputs.
NeuralWLF extrapolation. The physics-informed NeuralWLF achieved for all targets at all held-out frequencies (Table 7; Figure 10), including the challenging 1 Hz edge case where MLP collapsed to and SVR to for . This robustness is attributable to the embedded WLF shift layer, which constrains the frequency dependence to follow a physically motivated functional form. The advantage is most pronounced at edge frequencies, where all four data-driven models exhibit degradation. However, this benefit is contingent on the validity of the WLF assumption within the tested frequency window; broader ranges or non-thermorheologically simple regimes could erode the physics-informed advantage.
Interpolation versus extrapolation tradeoff. Among purely data-driven models, MLP is the superior interpolator (blocked CV ) but the second-worst extrapolator (mean LOFO ). RF provides the best extrapolation–interpolation balance ( , mean LOFO ). XGB offers comparable extrapolation to RF (0.74) with lower interpolation accuracy (0.810). SVR is unsuitable for frequency extrapolation (mean LOFO ). NeuralWLF achieves the best cross-frequency generalization ( everywhere) at the cost of reduced interpolation in thermally flat regions (Section 3.5). The complementarity suggests that model selection should be guided by the prediction task: MLP for within-domain interpolation, RF or XGB for edge-frequency prediction, and NeuralWLF when physically interpretable frequency extrapolation is the primary objective.
Directional asymmetry and physical origin. All four data-driven models exhibit stronger performance at the high-frequency edge (10 ) than at the low-frequency edge (1 ). For RF, at 10 versus 0.55 at 1 ; for XGB, 0.80 versus 0.43. This asymmetry has a physical origin: at lower frequencies, polymer chains have more time to relax within each cycle, producing a sharper, taller peak (2.36 at 1 versus 1.70 at 10 ) with narrower FWHM ( versus ). The sharper peak concentrates the signal in a narrower temperature window, making precise prediction more sensitive to small temperature offsets—an inherently harder extrapolation target than the broader, lower-amplitude peak at 10 .
3.7. Feature Importance and Physical Interpretation
The mean decrease in impurity (MDI) feature importance for the RF model trained on the full dataset showed that temperature accounted for 96.6% of the total MDI, while contributed 3.4%. This pronounced dominance is partly physical—temperature sweeps spanning drive modulus changes exceeding two orders of magnitude, whereas the 1–10 frequency range induces only a shift per decade—but also partly artifactual: MDI is known to be biased toward high-cardinality continuous features [35], and temperature (∼940 unique values) offers far more split opportunities than (4 discrete levels).
As a robustness check, permutation importance (10 repeats per fold, computed on each blocked-CV test fold and size-weighted averaged) yielded an inverted ranking: temperature 26.3%, 73.7%. This reversal is explained by the blocked-CV design: within each 10 °C test block, temperature varies over a narrow range, whereas frequency spans all four levels, so shuffling frequency destroys most of the predictive signal in that context. Neither metric alone captures the complete picture; their disagreement exposes the known MDI cardinality bias while also reflecting the restricted within-block temperature variance of the blocked-CV evaluation. Taken together, both features are necessary: temperature drives the dominant physical variation across the glass transition, while frequency encodes the complementary time–temperature equivalence exploited by TTS analysis. The importance partition is conditional on the tested frequency window (1–10 ); a broader range (e.g., 0.01–100 ) would likely shift the balance.
3.8. NeuralWLF and Approximate Kramers–Kronig Plausibility
The NeuralWLF model is evaluated via LOFO rather than temperature-blocked CV. Per-fold neural-network training is ill-conditioned in the glassy plateau region where is nearly constant ( standard deviation ) and becomes undefined. LOFO, which holds out an entire frequency (25% of data), therefore provides a more meaningful assessment of cross-frequency generalization. In each LOFO fold, the NeuralWLF model—including all neural network weights and the learnable WLF parameters ( , , )—is trained from scratch using only the three non-held-out frequencies. The held-out frequency participates in neither parameter optimization nor hyperparameter selection, ensuring strict anti-leakage separation.
Figure 11 presents LOFO parity plots for the held-out 5 frequency (representative example). Across all four held-out frequencies, NeuralWLF achieved for both and and for (Table 7). The lowest performance occurred at the edge frequency (1 ), where —a known challenge for frequency extrapolation. The learned WLF parameters were consistent across LOFO folds ( – , – ), confirming robust physical parameter recovery. Notably, the NeuralWLF model outperformed both RF and MLP on LOFO for edge frequencies, suggesting that physics-informed constraints provide a distinct advantage for frequency extrapolation that compensates for its limitations in temperature generalization.
Empirical uncertainty across held-out frequencies. To make the uncertainty level explicit, we summarize LOFO variability across the four held-out frequencies (computed from Table 7). NeuralWLF achieved mean performance of (mean ± standard deviation), while MLP showed much larger spread ( ) because of edge-frequency collapse. For NeuralWLF, the corresponding means were for and for . This fold-to-fold dispersion provides an empirical uncertainty envelope for cross-frequency deployment within the tested 1–10 window.
Approximate Kramers–Kronig plausibility check. Important caveat: Rigorous Kramers–Kronig validation requires master-curve data spanning multiple frequency decades; the single-decade bandwidth of the present study (1–10 ) is fundamentally insufficient for quantitative K-K compliance assessment. The following analysis is presented as a qualitative plausibility check of – covariation, not as thermodynamic validation.
Figure 12 shows an approximate Kramers–Kronig plausibility check for the full training model. The local derivative was evaluated by central finite differences on a uniform grid (step size ), with a Savitzky–Golay smoothing filter (window = 11 points, polynomial order = 3) applied to suppress high-frequency numerical noise. The Pearson correlation was computed over the central band in , which spans the steepest portion of the master curve and thus provides the most informative derivative signal; at the boundaries ( or ), the derivative approaches zero in the plateau regions, and endpoint truncation artifacts from the finite-difference stencil inflate the numerical noise. The central-band correlation is ; over the full evaluated range , . This indicates that and predictions share qualitatively correct covariation; however, the derivative approximation overestimates the peak by approximately , consistent with the known tendency of the local derivative method to overshoot narrow loss peaks [5]. The K-K check provides moderate evidence of internal consistency within the limited bandwidth but should not be interpreted as rigorous thermodynamic validation, which would require master-curve data spanning multiple decades of frequency. The learned WLF parameters ( , ) are physically reasonable, with close to the universal value of °C [12].
3.9. Validation Stringency Analysis and Physics-Data Crossover
To investigate the sensitivity of model evaluation to validation design, a systematic block size sweep was conducted using temperature blocks of 5, 10, 15, 20, 25, and 30 °C (Table 8, Figure 13). This experiment reveals how the ratio of the validation gap to the characteristic feature width—specifically, the FWHM of the peak ( at 1 )—determines the stringency of the validation and the relative performance of data-driven versus physics-informed models.
Validation inflation. At small block sizes (5 °C, gap/FWHM ), MLP achieved , suggesting near-perfect prediction. However, as the block size increased to 30 °C (gap/FWHM ), MLP performance dropped to —a decline of 0.394 in absolute . This dramatic sensitivity to block size exposes a validation inflation effect: when the gap between training and test data is smaller than the peak width, adjacent training blocks provide sufficient information for the model to interpolate across the gap without genuinely reconstructing the transition behavior. Conventional random k-fold CV, which does not create spatial gaps at all, would yield even more inflated metrics. Detailed architecture-level comparison at strict block settings is shown in Figure 14.
Physics–data crossover. The block size sweep reveals a crossover phenomenon (Figure 15a): below a critical gap/FWHM ratio of approximately 2.0, MLP (the best data-driven model) outperforms NeuralWLF in prediction; above this threshold, NeuralWLF becomes superior. At 20 °C blocks (gap/FWHM ), NeuralWLF achieved versus MLP’s 0.819 ( ). At 25 °C blocks (gap/FWHM ), the gap widened to 0.866 versus 0.566 ( )—NeuralWLF’s largest advantage. This crossover has a clear physical interpretation: the WLF layer encodes the constraint that the peak shape must conform to a time–temperature superposition structure. When the validation gap is large enough that the test block contains the peak maximum while the training data do not, this structural constraint becomes the only available information for reconstructing the transition. Data-driven models, lacking such prior knowledge, cannot extrapolate the peak shape from monotonic regions alone.
Training strategy effects and architecture sensitivity. Deep exploration of NeuralWLF variants (Table 9, Figure 14) revealed clear architecture and objective sensitivity at the most stringent validation level (30 °C blocks). Curriculum learning—freezing the WLF layer for the first 300 epochs and then enabling joint optimization—achieved the highest of 0.731. Standard joint training reached 0.660, while -weighted loss ( ) dropped to 0.455. This ordering indicates that parameter-regularization strategy is more influential than simply increasing the damping-loss weight under large train–test gaps.
The weighted-loss failure is instructive. Overemphasizing fitting drove and to adapt too strongly to the local peak shape in training blocks, which reduced transferability when the held-out block contained a different part of the transition region. Conversely, the curriculum schedule stabilized early feature learning and preserved cross-gap generalization. These results support the view that physics-informed models require balanced optimization of data fit and physically interpretable parameter stability.
Target-dependent model requirements. Ensemble experiments combining MLP predictions for / with NeuralWLF predictions for (blending weight ) demonstrated that the optimal model architecture depends on the target property (Table 9). At 30 °C blocks, the ensemble achieved , , and —combining the strengths of both approaches. The storage modulus , which exhibits a smooth monotonic decrease through the transition, is well-captured by data-driven models without physics constraints. In contrast, involves a sharp peak arising from the ratio of two rapidly changing quantities, making it inherently more sensitive to the physical structure of the relaxation process.
4. Conclusions
An integrated experimental–computational framework combining multi-frequency DMA characterization with hierarchical ML validation was applied to a commercial PC/ABS blend. The principal findings are as follows:
- 1.DMA characterization and engineering parameters. Multi-frequency DMA (1–10 ) yielded a glass transition range of 115.8– °C ( peak), a frequency sensitivity of °C/ , and an apparent activation energy of , consistent with PC-dominated relaxation. A six-term Prony series ( ) provides FEM-ready constitutive parameters at .
- 2.Interpolation–extrapolation tradeoff. Temperature-blocked CV ranked MLP as the best interpolator ( ), while LOFO validation revealed that the physics-informed NeuralWLF achieved the best cross-frequency generalization ( at all held-out frequencies, including the 1 edge where MLP collapsed to ). Among data-driven models, RF provided the best interpolation–extrapolation balance.
- 3.Validation inflation and the gap/FWHM criterion. A block size sweep (5–30 °C) exposed a validation inflation effect: MLP dropped from 0.986 to 0.592 as the gap-to-FWHM ratio increased from 0.5 to 3.1. Honest evaluation of DMA–ML models requires validation gaps comparable to or exceeding the characteristic feature width ( FWHM).
- 4.Physics–data crossover. A crossover at gap/FWHM separates a data-driven regime (where MLP suffices) from a physics-informed regime (where NeuralWLF becomes essential). At gap/FWHM , NeuralWLF outperformed MLP by in ; curriculum learning (freezing the WLF layer for 300 epochs) further improved the most stringent validation ( ), demonstrating that physics-parameter regularization outweighs loss weighting for extrapolation.
- 5.Target-dependent model selection. and (smooth, monotonic) are well-captured by data-driven MLP ( ), whereas (sharp peak, ratio quantity) benefits from physics-informed constraints. The gap/FWHM ratio and target property together provide a quantitative criterion for selecting between data-driven and physics-informed architectures, generalizable to other polymer systems and relaxation-dominated property predictions.
Scope statement. This work is positioned as a methodological DMA–ML validation study on a single commercial PC/ABS system rather than as a fundamental advancement of polymer viscoelastic theory.
Limitations. The present study is constrained by (i) a single-decade frequency range limiting the precision of activation energy estimation and TTS validation, (ii) incomplete material characterization (no DSC, TGA, or compositional analysis), and (iii) single-specimen testing precluding assessment of batch-to-batch variability.
Future work should extend this framework to broader frequency ranges (0.01–100 ), explore domain adaptation for cross-material transfer, and investigate whether the gap/FWHM crossover criterion applies to other relaxation phenomena beyond glass transitions.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Balart R. López J. García D. Dolores Salvador M. Recycling of ABS and PC from electrical and electronic waste. Effect of miscibility and previous degradation on final performance of industrial blends Eur. Polym. J.2005412150216010.1016/j.eurpolymj.2005.04.001 · doi ↗
- 2Seo J.S. Jeon H.T. Han T.H. Rheological Investigation of Relaxation Behavior of Polycarbonate/Acrylonitrile-Butadiene-Styrene Blends Polymers 202012191610.3390/polym 1209191632854352 PMC 7563493 · doi ↗ · pubmed ↗
- 3Dawoud M. Taha I. Ebeid S.J. Mechanical behaviour of ABS: An experimental study using FDM and injection moulding techniques J. Manuf. Process.201621394510.1016/j.jmapro.2015.11.002 · doi ↗
- 4Sood A.K. Ohdar R.K. Mahapatra S.S. Parametric appraisal of mechanical property of fused deposition modelling processed parts Mater. Des.20103128729510.1016/j.matdes.2009.06.016 · doi ↗
- 5Ferry J.D. Viscoelastic Properties of Polymers 3rd ed.John Wiley & Sons New York, NY, USA 1980
- 6Menard K.P. Menard N.R. Dynamic Mechanical Analysis 3rd ed.CRC Press Boca Raton, FL, USA 202010.1201/9780429190308 · doi ↗
- 7Mc Crum N.G. Read B.E. Williams G. Anelastic and Dielectric Effects in Polymeric Solids John Wiley & Sons New York, NY, USA 1967
- 8Liu X. Tian J. Li X. Chen J. Li J. Temperature and Frequency Dependence of the Dynamic Viscoelastic Properties of Silicone Rubber Polymers 202315300510.3390/polym 1514300537514395 PMC 10383995 · doi ↗ · pubmed ↗
