LISTA-Transformer Model Based on Sparse Coding and Attention Mechanism and Its Application in Fault Diagnosis
Shuang Liu, Lina Zhao, Tian Wang, Huaqing Wang

TL;DR
This paper introduces a novel LISTA-Transformer model that combines sparse coding and attention mechanisms, enhancing feature extraction for fault diagnosis and outperforming traditional and existing Transformer-based methods.
Contribution
The paper proposes a new LISTA-Transformer architecture integrating sparse coding with Transformers for improved local and global feature modeling in fault diagnosis.
Findings
Fault recognition rate reached 98.5% on CWRU dataset
Outperformed traditional methods by 3.3%
Demonstrated superiority over existing Transformer approaches
Abstract
Driven by the continuous development of models such as Multi-Layer Perceptron, Convolutional Neural Network (CNN), and Transformer, deep learning has made breakthrough progress in fields such as computer vision and natural language processing, and has been successfully applied in practical scenarios such as image classification and industrial fault diagnosis. However, existing models still have certain limitations in local feature modeling and global dependency capture. Specifically, CNN is limited by local receptive fields, while Transformer has shortcomings in effectively modeling local structures, and both face challenges of high model complexity and insufficient interpretability. In response to the above issues, we proposes the following innovative work: A sparse Transformer based on Learnable Iterative Shrinkage Threshold Algorithm (LISTA-Transformer) was designed, which deeply…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Fault Diagnosis Techniques · Generative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
