Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography
Mohammed Salah, Naoufel Werghi, Davor Svetinovic, and Yusra, Abdulrahman

TL;DR
This paper introduces PT-Fusion, a multi-modal attention network that fuses PCA and TSR data for improved subsurface defect segmentation and depth estimation in pulse thermography, outperforming existing models.
Contribution
The paper presents a novel multi-modal attention-based fusion network with new feature fusion modules for enhanced PT defect detection and depth estimation.
Findings
PT-Fusion outperforms state-of-the-art models by 10% in accuracy.
Introduces a new data augmentation technique for PT datasets.
Demonstrates improved segmentation and depth estimation results.
Abstract
AI-driven pulse thermography (PT) has become a crucial tool in non-destructive testing (NDT), enabling automatic detection of hidden anomalies in various industrial components. Current state-of-the-art techniques feed segmentation and depth estimation networks compressed PT sequences using either Principal Component Analysis (PCA) or Thermographic Signal Reconstruction (TSR). However, treating these two modalities independently constrains the performance of PT inspection models as these representations possess complementary semantic features. To address this limitation, this work proposes PT-Fusion, a multi-modal attention-based fusion network that fuses both PCA and TSR modalities for defect segmentation and depth estimation of subsurface defects in PT setups. PT-Fusion introduces novel feature fusion modules, Encoder Attention Fusion Gate (EAFG) and Attention Enhanced Decoding Block…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsThermography and Photoacoustic Techniques · Industrial Vision Systems and Defect Detection · Laser Material Processing Techniques
MethodsSoftmax · Attention Is All You Need · Max Pooling · Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Concatenated Skip Connection · U-Net · Principal Components Analysis
