Robust representations of oil wells' intervals via sparse attention mechanism
Alina Ermilova, Nikita Baramiia, Valerii Kornilov, Sergey Petrakov,, Alexey Zaytsev

TL;DR
This paper introduces Regularized Transformers (Reguformers), an efficient and robust attention-based model for multivariate time series data, specifically applied to oil well logs, outperforming existing models in classification and representation tasks.
Contribution
The paper proposes a new class of efficient, regularized Transformer models tailored for industrial time series, enhancing robustness to noise and missing data while reducing computational complexity.
Findings
Reguformers outperform RNNs, Transformer, Informer, and Performer in well-interval classification.
They demonstrate high accuracy with a PR~AUC score of 0.983 on well-interval similarity.
Models show significant robustness to missing and noisy data.
Abstract
Transformer-based neural network architectures achieve state-of-the-art results in different domains, from natural language processing (NLP) to computer vision (CV). The key idea of Transformers, the attention mechanism, has already led to significant breakthroughs in many areas. The attention has found their implementation for time series data as well. However, due to the quadratic complexity of the attention calculation regarding input sequence length, the application of Transformers is limited by high resource demands. Moreover, their modifications for industrial time series need to be robust to missing or noised values, which complicates the expansion of the horizon of their application. To cope with these issues, we introduce the class of efficient Transformers named Regularized Transformers (Reguformers). We implement the regularization technique inspired by the dropout ideas to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Anomaly Detection Techniques and Applications
MethodsMulti-Head Attention · Attention Is All You Need · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · Absolute Position Encodings · Fast Attention Via Positive Orthogonal Random Features · Linear Layer · Adam · Layer Normalization
