Audio-visual cross-modality knowledge transfer for machine learning-based in-situ monitoring in laser additive manufacturing
Jiarui Xie, Mutahar Safdar, Lequn Chen, Seung Ki Moon, Yaoyao Fiona, Zhao

TL;DR
This paper introduces cross-modality knowledge transfer methods for laser additive manufacturing monitoring, enabling high-accuracy defect detection with reduced sensor requirements by transferring knowledge between audio and visual data modalities.
Contribution
It proposes three novel CMKT methods, including semantic alignment, fully supervised, and semi-supervised mapping, to improve feature representation and reduce hardware costs in LAM monitoring.
Findings
Semantic alignment achieves 98.6% accuracy without audio sensors.
CMKT methods outperform traditional multimodal fusion in accuracy.
Explainable AI reveals improved feature relevance and noise reduction.
Abstract
Various machine learning (ML)-based in-situ monitoring systems have been developed to detect anomalies and defects in laser additive manufacturing (LAM) processes. While multimodal fusion, which integrates data from visual, audio, and other modalities, can improve monitoring performance, it also increases hardware, computational, and operational costs. This paper introduces a cross-modality knowledge transfer (CMKT) methodology for LAM in-situ monitoring, which transfers knowledge from a source modality to a target modality. CMKT enhances the representativeness of the features extracted from the target modality, allowing the removal of source modality sensors during prediction. This paper proposes three CMKT methods: semantic alignment, fully supervised mapping, and semi-supervised mapping. The semantic alignment method establishes a shared encoded space between modalities to facilitate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsThermography and Photoacoustic Techniques
MethodsALIGN
