Soft Sensing Transformer: Hundreds of Sensors are Worth a Single Word
Chao Zhang, Jaswanth Yella, Yu Huang, Xiaoye Qian, Sergei Petrov,, Andrey Rzhetsky, Sthitie Bom

TL;DR
This paper introduces a Soft Sensing Transformer model that effectively processes high-dimensional industrial sensor data, outperforming traditional models like auto-encoders and LSTMs, and provides large-scale datasets for the field.
Contribution
It presents the first benchmarking of transformer models on large-scale industrial soft sensing data, demonstrating superior performance over existing methods.
Findings
Transformer outperforms auto-encoder and LSTM models in soft sensing tasks.
Provides large-scale, high-dimensional sensor datasets for industrial applications.
First to benchmark transformer models on such data in academia and industry.
Abstract
With the rapid development of AI technology in recent years, there have been many studies with deep learning models in soft sensing area. However, the models have become more complex, yet, the data sets remain limited: researchers are fitting million-parameter models with hundreds of data samples, which is insufficient to exercise the effectiveness of their models and thus often fail to perform when implemented in industrial applications. To solve this long-lasting problem, we are providing large scale, high dimensional time series manufacturing sensor data from Seagate Technology to the public. We demonstrate the challenges and effectiveness of modeling industrial big data by a Soft Sensing Transformer model on these data sets. Transformer is used because, it has outperformed state-of-the-art techniques in Natural Language Processing, and since then has also performed well in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTime Series Analysis and Forecasting · Stock Market Forecasting Methods · Big Data and Digital Economy
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Label Smoothing · Position-Wise Feed-Forward Layer · Adam · Layer Normalization · Byte Pair Encoding · Dense Connections
