Adaptive Impulse Reconstruction of Seismic Signals Induced by TBM Drilling Noise via CEEMDAN-Assisted MDD Interferometry
Lei Zhang, Guowei Zhu

TL;DR
This paper introduces a new method to reconstruct seismic signals from TBM drilling noise, improving tunnel prospecting by enabling clearer interpretation of subsurface structures.
Contribution
The novel approach combines CEEMDAN and MDD interferometry to adaptively reconstruct impulse responses from noisy TBM data.
Findings
The proposed method produces more compact and coherent virtual-source gathers compared to existing techniques.
Field tests confirm the workflow's feasibility and realistic moveout behavior in real tunneling conditions.
Abstract
Tunnel ahead prospecting is important for reducing construction risks associated with faults, fractured zones, and cavities ahead of the tunnel face, but controlled active-source surveys are often impractical during continuous TBM operation. TBM drilling-noise records provide persistent passive excitation; however, strong nonstationarity and narrowband tonal contamination can hinder stable retrieval of interpretable impulse-like responses. We propose an adaptive impulse reconstruction algorithm that couples CEEMDAN-based mode screening with MDD interferometry. CEEMDAN screening suppresses quasi-stationary tonal components while preserving coherent propagation-related wavefields, producing effective signals suitable for interferometric processing. The MDD stage is stabilized using band-limited inversion, phase-only whitening, and a multi-reference strategy. Numerical experiments with a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10- —Major Program of the National Natural Science Foundation of China
- —National Key Research and Development Program of China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSeismic Waves and Analysis · Tunneling and Rock Mechanics · Seismic Imaging and Inversion Techniques
1. Introduction
Unexpected geological anomalies (e.g., faults, fractured zones, karst cavities and water-bearing structures) ahead of the tunnel face may cause severe construction delays and safety risks during mechanized tunneling. To reduce such uncertainties, active-source ahead-prospecting techniques have been widely applied in practice (e.g., Tunnel Seismic Prediction, TSP, using explosive/hammer sources in boreholes), providing reflection-based indications of adverse structures in front of the excavation face [1,2,3,4,5]. However, these methods often require dedicated source operations, interrupting excavation or demanding additional working space and manpower, which may limit their timeliness and deployment efficiency in continuously advancing tunnel boring machine (TBM) tunnels.
In contrast, TBM rock-breaking signals naturally provide persistent vibration energy during excavation and thus offer an attractive opportunity for near-real-time seismic probing without additional active sources. Early studies demonstrated that tunnel seismic data can be acquired while drilling by treating TBM noise as a passive excitation [6], and subsequent developments further improved interface prediction and wavefield reconstruction for practical tunneling scenarios [7,8]. Recent work has continued to explore TBM-noise-based forward prospecting and scatterer localization, highlighting the potential of passive/operational sources for tunnel-scale sensing [9,10,11,12,13]. Nevertheless, compared with controlled hammer or explosive shots, TBM rock-breaking signals usually exhibit strong spectral line components, time-varying nonstationarity, and complex mixtures of direct, scattered and reflected wavefields, making stable retrieval of interpretable impulse-like responses challenging [14].
Seismic interferometry provides a theoretical foundation for reconstructing Green’s functions from uncontrolled or operational sources by exploiting wavefield correlations in space and time [15,16,17]. Correlation-based interferometry has been widely used for ambient-noise applications and engineering monitoring, yet its results may be affected by source directivity, incomplete illumination and non-ideal wavefield assumptions [18,19]. To mitigate these issues, interferometry by deconvolution has been introduced to reduce source-signature imprint and improve temporal focusing [20]. Furthermore, multidimensional deconvolution (MDD) extends the deconvolution concept by incorporating multi-receiver constraints, effectively compensating for the point-spread effect and reducing the dependence on a single reference trace, thereby yielding more robust and better-balanced responses [21,22,23]. These characteristics are particularly attractive for TBM tunnel observations, where reference-channel quality may vary, and the recorded wavefield can be severely contaminated by machine-induced oscillations.
However, for TBM rock-breaking data, directly applying interferometric deconvolution or MDD to raw recordings may still lead to unstable solutions when the input is dominated by persistent oscillatory components and narrow-band interference. Therefore, it is necessary to adaptively suppress TBM-related line spectra and enhance effective wavefield components prior to interferometric solving, so that the subsequent MDD inversion is fed with signals that are more interpretable and physically consistent. Empirical mode decomposition (EMD) and its noise-assisted variants provide a fully data-driven way to handle nonlinear and nonstationary signals [24,25,26]. In particular, ensemble-based schemes such as Ensemble Empirical Mode Decomposition (EEMD)/complete ensemble empirical mode decomposition (CEEMDAN) and their improved versions have demonstrated strong capability in alleviating mode mixing and extracting meaningful intrinsic mode functions (IMFs) under noisy conditions [27]. This motivates a combined strategy: using CEEMDAN-style adaptive decomposition to construct an effective input for MDD-based interferometry, aiming at a more stable impulse-like reconstruction from TBM rock-breaking signals.
In this paper, we propose an adaptive impulse reconstruction algorithm for TBM rock-breaking seismic signals by integrating CEEMDAN-based mode screening with MDD-assisted interferometry. The algorithm first performs noise-assisted EMD decomposition and multi-domain IMF selection to suppress oscillatory interference and preserve coherent wavefield components, and then applies MDD interferometry to obtain an impulse-like virtual shot gather with improved interpretability. The proposed algorithm is validated using both numerical modeling and a field experiment in an operating TBM tunnel, demonstrating its effectiveness and improving the interpretability and practical potential of TBM rock-breaking seismic records for ahead-of-face probing.
2. Materials and Methods
2.1. TBM Drilling-Noise Seismic Signals and Numerical Validation Setup
This study focuses on seismic signals generated by TBM rock breaking and aims to improve their interpretability for tunnel-ahead prospecting. To provide a controlled benchmark for evaluating the proposed CEEMDAN-assisted MDD interferometric impulse reconstruction, we constructed a 3-D elastic forward-modeling framework and generated synthetic three-component seismic records under two excitation types: a controlled active-source benchmark and a TBM-equivalent drilling-noise excitation.
2.1.1. Numerical Simulation Scheme and Stability
We performed 3-D elastic forward modeling using an explicit velocity–stress formulation on a staggered grid. Spatial derivatives were evaluated with a second-order finite-difference stencil, and the wavefield was advanced in time with an explicit scheme. Boundary reflections were suppressed by applying a convolutional perfectly matched layer (CPML) on all sides of the model.
The time step was selected according to a conservative Courant–Friedrichs–Lewy (CFL) setting based on the maximum wave speed in the model vmax and the minimum grid spacing Δ_min_, i.e., . With Δt = 0.02 ms, Δ_min_ = 1 m, and vmax = 5200 m/s, the Courant number is C = 0.104, providing a conservative stability margin for the adopted simulations. No numerical blow-up was observed in the simulated wavefields. The key numerical settings are listed in Table 1.
2.1.2. Numerical Model and Observation Geometry
A 3-D elastic model with dimensions of 100 m × 200 m × 100 m was constructed, in which intact granite was assumed as the background medium. The tunnel axis is aligned with the -direction, and a cylindrical excavation cavity is embedded at the model center (Figure 1). To account for excavation-induced loosening and enhanced scattering near the boundary, an excavation-damaged zone (EDZ) was included as a 2 m-thick annulus surrounding the tunnel [28]. A fractured zone was further placed ahead of the tunnel face to represent a distributed low-velocity heterogeneity. In the reference configuration (Figure 1), the leading interface of the fractured zone is located 50 m in front of the face, and the zone thickness is 20 m. The corresponding elastic parameters assigned to the host rock, tunnel, EDZ, and fractured zone are listed in Table 2.
A single receiver line composed of 24 three-component receivers was deployed along the tunnel sidewall at x = 56 m and z = 50 m, with receiver positions spanning y = 67–90 m at a spacing of 1 m. For the controlled active-source benchmark, forward modeling was performed at the initial tunnel-face position (#1) under the same acquisition geometry. For the TBM-equivalent drilling-noise excitation, the source position was stepped along the tunnel axis to form ten locations from y = 100 m to y = 109 m. This compact one-sided geometry is used throughout the numerical validation.
2.1.3. Source Design and Benchmark Dataset
Two excitation types are simulated for a controlled validation. The target case adopts a TBM-equivalent drilling-noise excitation, which is quasi-continuous and nonstationary in time (Figure 2a). Its spectrum exhibits persistent narrowband peaks with surrounding energy spread (Figure 2b), with dominant components around ~76 Hz, ~143 Hz, ~273 Hz, and ~375 Hz. The corresponding time–frequency map shows long-lasting tonal lineations at these bands (Figure 2c), consistent with the superposition of narrowband components related to TBM rotation or rock-breaking cycles and broadband fluctuations commonly observed in drilling-noise records [9,10].
At each TBM-equivalent source position, a distributed point-source set was used to emulate spatially distributed loading and radiation near the tunnel face. Specifically, 10 grid points were randomly selected in the vicinity of the tunnel-face center, and the same waveform was injected at these points in a Y-dominant manner. The random selection was performed independently for each position to reflect the spatial variability of the TBM-induced loading.
As a controlled reference, an impulsive benchmark dataset was generated under the same model and receiver layout by exciting the model with a band-limited Ricker wavelet (f0 = 200 Hz) at the initial tunnel-face position (#1). This impulsive benchmark provides clear arrivals and reflection timing and serves as a consistent reference for assessing interpretability improvements brought by interferometric impulse reconstruction.
2.2. TBM-Adaptive CEEMDAN Mode Screening and Effective-Signal Construction
TBM drilling-noise records are strongly nonstationary and often contain persistent narrowband tonal interference and pronounced amplitude fluctuations. These features can degrade trace-to-trace consistency and hinder subsequent interferometric processing. We therefore apply CEEMDAN trace-wise to obtain a multiscale representation and then retain physically meaningful modes to construct an effective signal for later interferometric reconstruction.
2.2.1. Energy-Calibrated CEEMDAN Decomposition
Let denote the time-domain signal at receiver i for the s-th source position (or time segment). CEEMDAN decomposes each trace into a set of IMFs and a residual trend,
where denotes the k-th IMF and is the final residual.
TBM drilling-noise records commonly exhibit strong energy fluctuations. To make the ensemble construction more consistent under such variability, we use an energy-calibrated noise-injection amplitude. Specifically, the perturbation level is scaled by a local energy measure,
where β is a tuning coefficient and is estimated from a sliding-window mean-square energy of . With this scaling, perturbations are strengthened in low-energy intermittent intervals to improve statistical robustness, while being reduced during high-energy impact segments to better preserve transient details and stabilize the subsequent mode screening. The key CEEMDAN parameters used in this study are summarized in Table 3.
2.2.2. TBM-Adaptive Mode Screening and Effective-Signal Construction
The CEEMDAN decomposition yields a pool of candidate IMFs, whereas only a subset exhibits propagation-related characteristics associated with TBM rock breaking. Modes dominated by persistent tonal components, slow trends, or receiver-dependent unstable oscillations can reduce cross-trace consistency and introduce artifacts in subsequent interferometric processing. We therefore perform TBM-adaptive mode screening by jointly enforcing complementary criteria in the frequency, time, and multi-trace domains, and then construct an effective signal by summing only the retained IMFs.
- (i)Frequency-domain relevance with anti-tonal control.
For each IMF, we compute the in-band energy ratio rband within the target band relative to its total spectral energy. IMFs with negligible in-band contribution ( ) are rejected. For IMFs that satisfy the in-band requirement, we further quantify tonal dominance using the peak-to-band energy ratio Rpeak (energy of the strongest spectral peak divided by the total in-band energy). Line-dominated IMFs ( ) are excluded to prevent persistent TBM line spectra from dominating the retained set.
- (ii)Time-domain transience and energy concentration.
To favor arrival-like components, each IMF is evaluated by its excess kurtosis (computed from the time-domain samples) and by a short-time energy concentration ratio p_E_, defined as the maximum fraction of IMF energy captured in a sliding window of length Tconc. IMFs with weak impulsiveness and poor concentration ( ) are rejected because they tend to represent long-lasting oscillations or trends and blur interferometric products.
- (iii)Cross-trace consistency within a shot gather.
Physically meaningful components should be coherent across receivers under the compact one-sided array. We compute a multi-trace coherency curve using a sliding window Tcoh and select coherent samples by a quantile threshold qcoh. For each IMF, we then calculate the coherency-energy ratio Ccoh, defined as the mean IMF energy within coherent samples divided by that within incoherent samples. IMFs with insufficient cross-trace consistency ( ) are rejected.
After screening, the retained IMFs are more concentrated on propagation-related arrivals associated with TBM rock breaking, rather than quasi-stationary tonal interference or slow trends. The effective signal is constructed as follows:
It serves as the default input for the subsequent interferometric processing. The retained index set is stored to ensure full reproducibility of the effective-signal construction. The key parameters for mode screening and effective-signal construction are summarized in Table 4.
2.3. Seismic Interferometry for Wavefield Retrieval from TBM Drilling Noise
The energy-calibrated CEEMDAN decomposition and TBM-adaptive mode screening construct effective signals that suppress quasi-stationary tonal components to a certain extent and enhance propagation-related consistency across receivers. Building on these effective signals, we compute virtual-source responses using interferometric operators. In this study, we report three baseline reconstructions, cross-correlation (CC), its phase-weighted variant (PHAT-CC), and deconvolution interferometry (DC), and then focus on multidimensional deconvolution interferometry (MDD) as the primary operator for wavefield retrieval from TBM drilling-noise data.
Let denote the effective signal at receiver i for the s-th source position (or time segment), and let r denote a selected reference receiver.
2.3.1. Cross-Correlation Interferometry
We compute a CC estimate between i and r from the effective signals and [16]. The CC response is defined as follows:
where N_s_ is the number of source positions (or segments).
Because TBM records can remain influenced by residual narrowband peaks even after effective-signal construction, we additionally consider a phase-weighted correlation baseline (PHAT-CC) [19]. In the frequency domain, let denote the cross-spectrum for the s-th realization. The PHAT-CC response is computed as
where denotes the inverse Fourier transform and ϵ is a small constant for numerical stability. This phase-weighting down-weights amplitude-dominated tonal peaks and places more emphasis on phase consistency, which is more directly tied to relative timing and moveout patterns.
2.3.2. Deconvolution Interferometry
DC is introduced to obtain a more compact virtual wavelet by normalizing each channel spectrum with that of the reference receiver [20]. Let denote the Fourier transform of . The stabilized DC estimate in the frequency domain can be written as follows:
where denotes complex conjugation and ϵ is a stabilization term to avoid numerical instability. In our implementation, DC is computed within the working band and shares the same band and taper settings as the other operators for consistent comparison. While DC often yields a sharper wavelet than correlation baselines, it can remain sensitive to the quality and spectral characteristics of the reference channel.
2.3.3. Multidimensional Deconvolution Interferometry
While DC relies on a single reference trace, MDD interferometry formulates wavefield retrieval as a multichannel deconvolution problem so that the constraints can be distributed across multiple reference receivers [21,22,23]. For each angular frequency ω within the working band, we arrange the effective signal spectra into a receiver–realization matrix , whose (i, s)-th entry corresponds to . We select a compact set of reference receivers with , and form the corresponding reference submatrix by extracting the rows indexed by .
Under an interferometric forward model, the observed matrix is approximated by
where represents an impulse-like virtual-source response matrix. We estimate by solving a Tikhonov-regularized least-squares problem, leading to
where is the conjugate transpose, λ is a stabilization parameter, and I is the identity matrix. Finally, is transformed back to the time domain to obtain virtual-source gathers for subsequent interpretation.
2.3.4. TBM-Adaptive Robustification Strategies
TBM drilling-noise records are commonly contaminated by persistent narrowband tonal components and time-varying amplitudes. These features can distort spectral estimates and weaken waveform focusing in correlation- and deconvolution-based interferometry. To keep the comparisons consistent across operators, we apply the same working band and taper settings to all methods and use phase-only whitening where appropriate. In this study, Hz is adopted as a common working band (consistent with Section 2.2.2), which we treat as the effective frequency range of TBM rock-breaking excitation for subsequent interferometric processing. In our implementation, the following measures are adopted:
- (i)Band-limited inversion with a short taper.
The processing is restricted to the working band , and a short symmetric time-domain taper is applied prior to the Fourier transform. This combination reduces spectral leakage and boundary effects and yields more stable spectral estimates within the working band. By limiting the operation to the band where the effective signal is reliable, out-of-band components are prevented from dominating subsequent interferometric products.
We use [50, 500] Hz throughout for cross-operator comparability, and apply a short symmetric taper of 5–15 ms (default 10 ms). If attenuation is strong or the SNR above 400–500 Hz is low, the high-frequency end can be conservatively reduced, but the same band should be kept for all operators within a given comparison.
- (ii)Phase-only whitening within the working band.
To limit the influence of persistent tonal peaks, we apply phase-only whitening in , which reduces spectral contrast while preserving the phase information that governs relative timing and moveout. For the correlation baseline, this corresponds to PHAT-weighting of the cross-spectrum (PHAT-CC). For the deconvolution-type operators (DC and MDD), the same phase-only whitening can be applied to the effective-signal spectra before forming the normal-matrix terms. This normalization down-weights amplitude-dominated tonal peaks and makes the result depend more on phase consistency across realizations.
Phase-only whitening is applied only within Hz. We recommend enabling it for TBM drilling-noise records with prominent tonal peaks; if tonal peaks are weak, it can be disabled for a closer-to-amplitude reconstruction.
- (iii)Multi-reference constraints.
A multi-reference set is used instead of a single reference trace so that the constraints are distributed across multiple receivers. This design reduces the dependence on any single channel and provides more stable solutions when individual traces are affected by local coupling variations or intermittent spectral anomalies. In the least-squares sense, multiple references provide redundant constraints, which reduces variance in when individual reference traces are imperfect.
We recommend using 4–8 reference traces (or roughly 20–35% of the receiver channels) that are spatially distributed and avoid visibly unstable or poorly coupled sensors. Using too few references increases sensitivity to individual traces, whereas using many highly similar references often yields diminishing returns.
- (iv)Conditioning-aware regularization.
The stabilization parameter λ in Equation (8) is selected to balance temporal focusing and numerical stability. During parameter selection, the conditioning of is monitored as an auxiliary indicator to mitigate ill-conditioned inversions and spurious artifacts. A larger λ effectively damps poorly conditioned spectral components, whereas a smaller λ favors temporal focusing when the system is well conditioned.
We select λ from a small candidate set on a logarithmic scale and recommend (default 0.1 for this study). In practice, λ can be chosen as the smallest value that avoids obvious ringing or instability in the retrieved gathers while remaining consistent with the conditioning statistics reported in Section 3.4.
2.4. Evaluation Metrics
Based on the effective signal construction in Section 2.2 and the interferometric operators in Section 2.3, Figure 3 summarizes the proposed algorithm. To evaluate the retrieved impulse-like gathers from complementary perspectives, we use three metrics throughout this study: (i) a reflection-coherence proxy to quantify the relative prominence of reflection-window energy over the early-time/background level, (ii) an active-source-referenced consistency index to quantify the agreement with controlled active-source gathers and thus assess the interpretability of reflection-related patterns, and (iii) a band-limited conditioning statistic to characterize the numerical stability of the MDD inversion.
- (i)Reflection coherence proxy.
To quantify whether the reconstructed gathers exhibit more prominent reflection-window energy relative to the early-time level, we introduce a coherence proxy based on the envelope RMS measured within two time windows. Let denote the reconstructed trace at receiver i (i = 1, …, N_r_).
We then define
where is the envelope operator (magnitude of the Hilbert transform), and and represent the early-time and reflection-analysis windows, respectively. Larger Rcoh indicates that energy becomes more concentrated within the reflection window relative to the early-time level.
- (ii)Active-source-referenced consistency.
To assess the interpretability of reflection-related patterns, we measure the agreement between the reconstructed gathers and a controlled active-source reference. Let denote the gather from controlled active source excitation, and denote the gather reconstructed by method for each given TBM source position group. Because the two gathers may not share the same time zero, we first align them by their picked direct-wave arrivals and then compute the similarity within the reflection-analysis window as
where is the envelope operator and stacks all traces within the reflection-analysis window into a single vector. A larger value of indicates stronger agreement of reflection-related patterns with the active source reference.
- (iii)Band-limited conditioning.
For MDD interferometry, the stability of the frequency-domain inversion is governed by the normal matrix . We quantify its conditioning by
where σmax and σmin are the largest and smallest singular values, respectively. A band-limited summary over the processing band is reported as
where denotes the discrete frequency samples within . In general, a lower indicates a better-conditioned system and a more stable MDD solution.
3. Results
3.1. Baseline Raw Records: Active-Source Versus TBM-Equivalent Excitation
Figure 4 compares the baseline raw Y-component shot gathers of the fracture-zone model under an active impulsive source (Figure 4a) and a TBM-equivalent drilling-noise excitation (Figure 4b). In the active-source case, the gather exhibits clear and coherent moveout patterns, enabling straightforward recognition of the main wave groups (Direct_P, Direct_S, and the surface wave) as well as reflection-related event bands associated with the front and back interfaces (Front-interface Reflected_P/PS/S and Back-interface Reflected_P/PS/S). These annotated arrivals provide a convenient timing and interpretation reference for subsequent comparisons.
In contrast, the TBM-equivalent gather is characterized by more persistent and superposed energy with reduced lateral coherence, making reflection-related events less distinguishable. To highlight this behavior, Figure 4b presents the gather induced by TBM-equivalent drilling-noise excitation in a longer time window (0–1000 ms) together with a zoomed view of 100–200 ms (marked by the box). Importantly, this 100–200 ms interval is adopted as the fixed analysis window for subsequent TBM-equivalent numerical experiments.
3.2. Effective-Signal Construction by TBM-Adaptive CEEMDAN
Following the fixed 100–200 ms analysis window adopted for TBM-equivalent records (Figure 4b), we apply the proposed TBM-adaptive CEEMDAN screening trace by trace and construct an effective gather by summing the retained IMFs.
Figure 5 presents the CEEMDAN result for the TBM-equivalent drilling-noise excitation record. Figure 5a shows the CEEMDAN-screened effective gather constructed by TBM-adaptive CEEMDAN screening, while Figure 5b shows the corresponding residual gather . Compared with the raw gather in Figure 4b, the effective gather preserves the major propagation-related event bands and makes them more traceable across receivers, thereby improving interpretability. Nevertheless, the wavelet in Seff is still strongly controlled by the TBM-equivalent drilling-noise excitation, exhibiting a relatively broad main lobe and pronounced coda/dragging; it is therefore not yet an impulse-narrowed input for reliable event interpretation, motivating the subsequent interferometric reconstruction for further impulse-like sharpening and robustification.
As a complementary view of the screening outcome, the residual gather in Figure 5b is dominated by more dispersed, background-like components and lacks the stable event bands and lateral continuity seen in Figure 5a, indicating that the rejected IMFs mainly capture non-target persistent/unstable oscillations. Based on these observations, Seff is used as the default input for the interferometric reconstructions in the following sections.
3.3. Interferometric Reconstruction Results: CC, PHAT-CC, DC, and MDD Interferometry
Figure 6 compares four interferometric reconstructions obtained from the effective signals Seff (Y component), including the conventional cross-correlation baseline (CC, Figure 6a), its phase-weighted variant (PHAT-CC, Figure 6b), stabilized deconvolution interferometry (DC, Figure 6c), and multidimensional deconvolution interferometry (MDD, Figure 6d). All panels are displayed with consistent plotting settings to enable direct visual comparison.
In the CC result (Figure 6a), the response is dominated by relatively broad wavelets and noticeable trace-to-trace variability. Coherent energy can be identified around the early-time window (approximately 10–20 ms), whereas later arrivals appear more diffuse and show limited lateral continuity, which makes reflection-related bands difficult to track reliably.
Compared with CC, PHAT-CC (Figure 6b) suppresses amplitude-dominated narrowband influence and yields a more phase-consistent gather, most clearly in the early-time window. The early arrivals become cleaner and more stable across traces, but reflection-related bands remain only partially continuous and can still be affected by residual coda-like oscillations.
The DC reconstruction (Figure 6c) further compresses the wavelet relative to the correlation baselines, with stronger energy concentration. However, localized dominance and residual oscillatory artifacts may still appear on several traces, indicating that the deconvolution outcome remains sensitive to the spectral characteristics of the reference channel even under a good-reference setting, where the reference trace is selected as the channel exhibiting stable coupling and the highest early-time signal-to-noise ratio within the analysis window.
In contrast, the MDD-based reconstruction (Figure 6d) provides the most interpretable virtual gather among the four methods. Multiple reflection-related event bands are distinctly traceable across the array, and the associated wavelets appear more compact with reduced trailing oscillations. When compared with the controlled active-source benchmark (Figure 4a), the prominent reflection bands in the MDD gather exhibit comparable relative arrival times with respect to the direct arrivals and show clearer moveout trends across traces. By comparison, the correlation baselines and DC retain more dispersed or locally distorted reflection-related bands.
3.4. Quantitative Assessment of Interpretability and Retrieval Stability
To quantitatively evaluate the retrieval quality of the interferometric reconstructions, we use the three metrics defined in Section 2.4 and summarize the results in Figure 7. For Rcoh and Cact, statistics are computed position-wise across the available TBM source position groups (n = 10). For , conditioning is evaluated for the MDD normal system under different robustification settings, with position groups providing the sample distribution. All measurements use the same fixed time windows ms and ms.
As summarized in Figure 7a, the reflection-coherence proxy Rcoh increases from CC to PHAT-CC and then to DC, and reaches the highest level for MDD. This indicates that the envelope energy becomes progressively more concentrated within the reflection analysis window relative to the early-time level. The boxplots and position-wise scatter plots further suggest a tighter distribution for MDD, consistent with a more stable enhancement of reflection-window energy across TBM source-position groups.
The interpretability of the reconstructed gathers is further assessed using the active-source-referenced consistency Cact (Figure 7b). A consistent ranking is observed across the correlation-based operators and the deconvolution-based operators: PHAT-CC improves over CC, DC yields a higher median Cact, and MDD achieves the highest median with a relatively compact interquartile range. Notably, DC shows a wider spread than PHAT-CC, which is consistent with the reference sensitivity of deconvolution-type estimates—spectral division may amplify intermittent spectral anomalies on the reference trace—whereas PHAT-CC emphasizes phase consistency. This trend is consistent with the waveform comparison: the MDD reconstructed impulse-like gather (Figure 6d) more closely reproduces the event-band continuity and moveout patterns of the controlled active-source benchmark (Figure 4a) than the baseline operators (Figure 6a–c).
In addition, the numerical stability of the MDD inversion is quantified by . As shown in Figure 7c, both phase-only whitening (PW) and the multi-reference strategy (MR) reduce relative to the baseline configuration, and their combination (PW + MR) yields the lowest values, suggesting a better-conditioned normal system and a more stable inversion across position groups.
3.5. Algorithm Validation in an Operating TBM Tunnel
To further verify the practical applicability of the proposed algorithm under real tunneling conditions, we conducted a field experiment in an operating TBM tunnel, where seismic data were acquired using the same receiver array under two excitation conditions: (i) TBM rock-breaking excitation and (ii) an active hammer-impact benchmark. The hammer-impact record serves as an impulsive reference, enabling a direct and intuitive comparison in terms of waveform compactness, lateral coherence, and overall interpretability of the retrieved virtual-source gathers.
As illustrated in Figure 8, a single-sided linear array consisting of 12 receivers with a spacing of 2 m was deployed along the tunnel sidewall. To ensure stable coupling and reduce sensor-to-sensor variability, the receivers were installed in pre-drilled boreholes and fixed using gypsum filling, providing robust contact with the surrounding rock mass. The advancing direction of the tunnel and the TBM equipment side are also marked to document the acquisition geometry.
Figure 9 compares the field records before and after impulse reconstruction. The raw TBM rock-breaking record in Figure 9a is dominated by persistent background oscillations and pronounced ringing, which obscures coherent moveout trends and complicates phase tracking. After applying the proposed impulse reconstruction, the retrieved virtual-source gather in Figure 9b becomes noticeably more compact, with energy concentrated into a clearer main-arrival band; the direct-wave moveout highlighted by the red dashed box also exhibits improved continuity and is easier to follow across the array. In comparison, the hammer-impact record in Figure 9c naturally shows impulsive and compact arrivals, providing a meaningful benchmark for field validation. Overall, the impulse-reconstructed gather exhibits waveform features more consistent with the hammer-impact benchmark, suggesting improved interpretability of field TBM drilling-noise records in practical environments.
It is also observed that, compared with the hammer-impact data, the reconstructed gather still retains noticeable energy over a longer traveltime range. This is consistent with the stronger and more continuous nature of TBM rock-breaking excitation, which is typically low-frequency dominant; such low-frequency components attenuate more slowly in tunnel-surrounding rock, potentially benefiting longer-range propagation.
To further assess the physical plausibility of the reconstructed moveout structure, we estimate the apparent direct-wave velocity from the main arrival band in Figure 9b and compare it with that derived from the hammer-impact record in Figure 9c. The apparent velocity is computed by multi-trace averaging as follows:
The estimated apparent velocity from the impulse-reconstructed gather is = 3355 m/s, which is close to that obtained from the hammer-impact record = 3056 m/s. This value falls within the expected range for body-wave propagation in tunnel surrounding rock, indicating that the reconstructed virtual-source gather not only exhibits improved waveform compactness but also preserves physically reasonable moveout behavior.
4. Discussion
4.1. CEEMDAN Screening for Interferometry-Ready Effective Signals
Deconvolution-type interferometry can be sensitive to inputs that contain persistent tonal interference and strong time variability. In such cases, mode mixing becomes more likely, and the retrieved gathers may be biased toward the reference trace. Variational mode decomposition (VMD) has been used to isolate narrowband components from nonstationary vibration signals, and benchmark tests suggest that it can offer robust mode identification at a lower computational cost than CEEMDAN [29].
In TBM tunneling, however, the wavefield is not purely tonal. Propagation-related transients are intermittent. They coexist with direct, scattered, and reflected contributions. For this reason, our objective is not only to isolate narrowband tones. We also need to preserve propagation-consistent transients that are suitable for interferometric retrieval.
We therefore adopt CEEMDAN as a front-end step to construct an effective signal, rather than to perform generic denoising. Its data-adaptive multiscale IMFs enable targeted screening against components that are inconsistent with coherent, propagation-related wavefields. The retained IMFs are selected to support cross-trace coherence and physically plausible transients, while suppressing quasi-stationary tones and incoherent fluctuations. We also avoid strong band-pass filtering before CEEMDAN, because aggressive pre-filtering can reshape intrinsic scale separation and shift IMF allocation, weakening physical interpretability. Instead, frequency-domain relevance and anti-tonal control are enforced during screening within the working band . The interferometric stage then operates within the same working band and applies the stabilization measures described in Section 2.3.4, so that CEEMDAN focuses on adaptive mode separation while interferometry focuses on stable wavefield retrieval.
4.2. Reflection Interpretability Beyond Wavelet Compression
To evaluate whether the proposed algorithm improves reflection interpretability, rather than merely narrowing a virtual wavelet, we combine visual inspection with complementary quantitative indicators to summarize gather-level event-band behavior (continuity, stability, and traceability). The reflection-coherence proxy Rcoh describes the relative prominence of energy in the reflection-analysis window compared with an early-time/background baseline, and it characterizes whether reflection-relevant energy becomes more concentrated and band-like. After alignment by direct arrivals, the active-source-referenced index Cact provides a physics-based check against the controlled active-source benchmark by assessing whether event-band continuity and moveout trends are consistent with that reference.
Under this evaluation perspective, the differences among the four operators mainly reflect different sensitivities to reference dependence and narrowband tonal disturbance. CC is inherently reference-based and can be vulnerable to illumination imbalance and tone-dominated spectra [15,16,17,18]. PHAT-CC down-weights amplitude-dominated narrowband peaks by emphasizing phase consistency and often stabilizes timing and moveout, but residual correlation-type fluctuations may remain [19]. DC can further compress the wavelet, yet it is more sensitive to the spectral characteristics and stability of the chosen reference trace, so unfavorable references may introduce ringing or localized dominance [20]. In contrast, MDD distributes constraints across receivers through a multichannel inverse formulation, reducing single-reference dominance and supporting more interpretable late-time event bands under complex TBM-equivalent drilling-noise excitation [21,22,23].
4.3. TBM-Adaptive Robustification for MDD: Mitigating Reference Sensitivity and Improving Stability
In deconvolution-type interferometry, reference selection remains a practical bottleneck. Even after CEEMDAN suppresses quasi-stationary tonal components, a single reference trace may still carry residual line peaks or exhibit spectral/phase instability. Under single-reference normalization, these traits can be amplified and imprinted onto the retrieved gather, which helps explain the pronounced degradation of DC in the illustrative case shown in Figure 10 as reference quality deteriorates. This observation motivates the use of MDD, which distributes constraints across channels and reduces the chance that any single trace dominates the solution.
For MDD, robustness is closely tied to the conditioning of the band-limited normal system. Two complementary strategies are used to address common failure modes in interferometry based on TBM drilling-noise records. Phase-only whitening (PW) reduces excessive spectral contrast associated with narrowband line components, making the inversion less prone to being driven by a few dominant peaks. The multi-reference strategy (MR) reduces reference dependence by distributing constraints across multiple reference traces. Used together, PW primarily targets spectral dominance, while MR targets reference sensitivity. This combination provides a practical route to more stable late-time retrieval under tonal and nonstationary excitation. The associated improvements are consistent with the quantitative conditioning and interpretability measures summarized in Section 3.4.
4.4. Practical Implications for Risk Mitigation and Applicability Limits
During excavation, TBM drilling-noise records provide persistent passive observations with much denser temporal sampling than controlled active-source surveys. Yet the raw data are often dominated by tonal components and nonstationary background vibrations, which undermines interpretability and repeatable reflection recognition. By producing virtual-source gathers that are more compatible with reflection-oriented analysis, the proposed method enables coherent late-time events to be evaluated using consistent windows and objective measures. This improved interpretability supports earlier recognition of potentially adverse features ahead of the face and informs risk-aware construction decisions.
CEEMDAN-assisted MDD is computationally heavier than CC/PHAT-CC and DC because it combines ensemble-based decomposition with a band-limited frequency-domain inversion. The workflow is, however, well-suited to parallel computing and windowed processing. For long continuous TBM records, low-cost screening can be used to locate candidate intervals, after which CEEMDAN+MDD is applied only to the selected segments. With short windows and parallel resources, this design supports near-real-time or quasi-online operation. In practice, the additional overhead is most meaningful in tonal, nonstationary noise conditions where correlation- and DC-based gathers remain difficult to interpret, and stable reflection-oriented patterns are needed.
Reconstruction quality can be site-dependent and is influenced by attenuation, scattering strength, and velocity heterogeneity. In strongly attenuative or highly fractured conditions, more conservative retention criteria and stronger stabilization are often beneficial, whereas in clearer propagation environments, a broader working band and weaker constraints may be sufficient. The recommended ranges reported in this study are guided by empirical observations and stability diagnostics on the available dataset. Broader evaluation under different operating conditions and geological regimes would therefore be beneficial, together with case-specific adjustment of key settings, including effective-band selection, screening thresholds, reference strategy, and regularization strength. In addition, the present study validates the method primarily at the gather level; coupling the reconstructed gathers with imaging and interpretation analyses in future studies would help quantify practical resolution and decision-support benefit.
5. Conclusions
This study proposes a TBM-adaptive impulse reconstruction algorithm for TBM drilling-noise seismic records by combining CEEMDAN-based mode screening with MDD interferometric retrieval. Based on the comparative results with CC, PHAT-CC, and DC, as well as field validation in an operating TBM tunnel, we draw the following conclusions:
- CEEMDAN-based screening suppresses quasi-stationary tonal components and constructs an effective signal with improved cross-trace consistency for subsequent interferometric processing;
- Compared with CC, PHAT-CC, and DC, MDD reduces dependence on a single reference trace and yields more interpretable propagation-related event patterns in the retrieved gathers;
- The TBM-adaptive robustification, implemented via phase-only whitening and a multi-reference strategy, improves inversion stability and mitigates reference-dominated artifacts;
- Field measurements in an operating TBM tunnel provide evidence that the proposed algorithm is feasible under real tunneling conditions and can produce a virtual-source gather with a physically plausible travel-time structure.
These results improve the interpretability and stability of TBM drilling-noise seismic records, thereby enhancing the efficiency and reliability of tunnel seismic ahead-of-face prospecting.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Sattel G. Predicting Ahead of the Face by Tunnel Seismics: The Tunnel Seismic Prediction (TSP) Method Tunn. Undergr. Space Technol.199611453458
- 2Alimoradi A. Moradzadeh A. Naderi R. Zare S. Rakhshandehroo M. Prediction of Geological Hazardous Zones in Front of a Tunnel Face Using TSP-203 and Artificial Neural Networks Tunn. Undergr. Space Technol.20082371171710.1016/j.tust.2008.01.001 · doi ↗
- 3Jiao Y.-Y. Zhang X.-L. Zeng Z.-P. Zhang J. Li Y.-S. Wu K. Predicting Tunneling Hazardous Geological Zones Using a Seismic Method and Device Near Surf. Geophys.20151342143210.3997/1873-0604.2015017 · doi ↗
- 4Li S. Chen L. Liu B. Xu X. Liu L. Chen Y. Geologic Forward Prospecting Using Improved Tunnel-Seismic-While-Drilling Method: A Case Study of the Water Supply Project at Songhua River, Jilin, China Geophysics 202287 B 93B 10610.1190/geo 2021-0015.1 · doi ↗
- 5Amberg Technologies Tunnel Seismic Prediction—TSP 2020 Company Technical Report Amberg Technologies Regensdorf, Switzerland 2021
- 6Poletto F. Petronio L. Seismic Interferometry with a TBM Source of Transmitted and Reflected Waves Geophysics 200671 SI 85SI 9310.1190/1.2213947 · doi ↗
- 7Petronio L. Poletto F. Schleifer A. Interface Prediction Ahead of the Excavation Front by the Tunnel-Seismic-While-Drilling (TSWD) Method Geophysics 200772 G 39G 4410.1190/1.2740712 · doi ↗
- 8Wessén M. Österberg M. Tunnel Seismic Prediction in Stockholm Bypass Proceedings of the World Tunnel Congress 2014 Iguassu Falls, Brazil 9–15 May 2014
