Advanced Hybrid Transformer LSTM Technique with Attention and TS Mixer for Drilling Rate of Penetration Prediction
Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering, Applied Sciences (UEAS), Swat, Pakistan)

TL;DR
This paper introduces a novel hybrid deep learning framework combining LSTM, Transformer, and TS-Mixer modules with attention mechanisms to improve the accuracy of drilling rate of penetration prediction, addressing the complex, nonlinear drilling data characteristics.
Contribution
The study develops a new hybrid LSTM-Trans-Mixer-Attention model that effectively captures multi-scale temporal dependencies and feature interactions for ROP prediction, outperforming existing models.
Findings
Achieved an Rsquare of 0.9991 in real-world datasets.
Attained a MAPE of 1.447%, indicating high prediction accuracy.
Demonstrated significant performance improvements over baseline models.
Abstract
Rate of Penetration (ROP) prediction is critical for drilling optimization yet remains challenging due to the nonlinear, dynamic, and heterogeneous characteristics of drilling data. Conventional empirical, physics-based, and standard machine learning models rely on oversimplified assumptions or intensive feature engineering, constraining their capacity to model long-term dependencies and intricate feature interactions. To address these issues, this study presents a new deep learning Hybrid LSTM-Trans-Mixer-Att framework that first processes input data through a customized Long Short-Term Memory (LSTM) network to capture multi-scale temporal dependencies aligned with drilling cycles. Subsequently, an Enhanced Transformer encoder with drilling-specific positional encodings and real-time optimization refines the features. Concurrently, a parallel Time-Series Mixer (TS-Mixer) block…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
