Audio-visual cross-modality knowledge transfer for machine   learning-based in-situ monitoring in laser additive manufacturing

Jiarui Xie; Mutahar Safdar; Lequn Chen; Seung Ki Moon; Yaoyao Fiona; Zhao

arXiv:2408.05307·cs.CE·February 11, 2025

Audio-visual cross-modality knowledge transfer for machine learning-based in-situ monitoring in laser additive manufacturing

Jiarui Xie, Mutahar Safdar, Lequn Chen, Seung Ki Moon, Yaoyao Fiona, Zhao

PDF

Open Access

TL;DR

This paper introduces cross-modality knowledge transfer methods for laser additive manufacturing monitoring, enabling high-accuracy defect detection with reduced sensor requirements by transferring knowledge between audio and visual data modalities.

Contribution

It proposes three novel CMKT methods, including semantic alignment, fully supervised, and semi-supervised mapping, to improve feature representation and reduce hardware costs in LAM monitoring.

Findings

01

Semantic alignment achieves 98.6% accuracy without audio sensors.

02

CMKT methods outperform traditional multimodal fusion in accuracy.

03

Explainable AI reveals improved feature relevance and noise reduction.

Abstract

Various machine learning (ML)-based in-situ monitoring systems have been developed to detect anomalies and defects in laser additive manufacturing (LAM) processes. While multimodal fusion, which integrates data from visual, audio, and other modalities, can improve monitoring performance, it also increases hardware, computational, and operational costs. This paper introduces a cross-modality knowledge transfer (CMKT) methodology for LAM in-situ monitoring, which transfers knowledge from a source modality to a target modality. CMKT enhances the representativeness of the features extracted from the target modality, allowing the removal of source modality sensors during prediction. This paper proposes three CMKT methods: semantic alignment, fully supervised mapping, and semi-supervised mapping. The semantic alignment method establishes a shared encoded space between modalities to facilitate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsThermography and Photoacoustic Techniques

MethodsALIGN