Advanced Hybrid Transformer LSTM Technique with Attention and TS Mixer for Drilling Rate of Penetration Prediction

Saddam Hussain Khan (Artificial Intelligence Lab; Department of Computer Systems Engineering; University of Engineering; Applied Sciences (UEAS); Swat; Pakistan)

arXiv:2508.05210·cs.LG·November 10, 2025

Advanced Hybrid Transformer LSTM Technique with Attention and TS Mixer for Drilling Rate of Penetration Prediction

Saddam Hussain Khan (Artificial Intelligence Lab, Department of Computer Systems Engineering, University of Engineering, Applied Sciences (UEAS), Swat, Pakistan)

PDF

TL;DR

This paper introduces a novel hybrid deep learning framework combining LSTM, Transformer, and TS-Mixer modules with attention mechanisms to improve the accuracy of drilling rate of penetration prediction, addressing the complex, nonlinear drilling data characteristics.

Contribution

The study develops a new hybrid LSTM-Trans-Mixer-Attention model that effectively captures multi-scale temporal dependencies and feature interactions for ROP prediction, outperforming existing models.

Findings

01

Achieved an Rsquare of 0.9991 in real-world datasets.

02

Attained a MAPE of 1.447%, indicating high prediction accuracy.

03

Demonstrated significant performance improvements over baseline models.

Abstract

Rate of Penetration (ROP) prediction is critical for drilling optimization yet remains challenging due to the nonlinear, dynamic, and heterogeneous characteristics of drilling data. Conventional empirical, physics-based, and standard machine learning models rely on oversimplified assumptions or intensive feature engineering, constraining their capacity to model long-term dependencies and intricate feature interactions. To address these issues, this study presents a new deep learning Hybrid LSTM-Trans-Mixer-Att framework that first processes input data through a customized Long Short-Term Memory (LSTM) network to capture multi-scale temporal dependencies aligned with drilling cycles. Subsequently, an Enhanced Transformer encoder with drilling-specific positional encodings and real-time optimization refines the features. Concurrently, a parallel Time-Series Mixer (TS-Mixer) block…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.