ArtifactNet: Detecting AI-Generated Music via Forensic Residual Physics
Heewon Oh

TL;DR
ArtifactNet is a lightweight forensic physics-based framework that detects AI-generated music by analyzing residual artifacts from neural audio codecs, achieving high accuracy and generalizability.
Contribution
The paper introduces ArtifactNet, a novel physics-inspired approach for AI music detection, with a new benchmark and improved cross-codec robustness over prior methods.
Findings
ArtifactNet achieves F1 = 0.9829 on unseen test data.
Codec-aware training reduces cross-codec drift by 83%.
ArtifactNet uses significantly fewer parameters than prior methods.
Abstract
We present ArtifactNet, a lightweight framework that detects AI-generated music by reframing the problem as forensic physics -- extracting and analyzing the physical artifacts that neural audio codecs inevitably imprint on generated audio. A bounded-mask UNet (ArtifactUNet, 3.6M parameters) extracts codec residuals from magnitude spectrograms, which are then decomposed via HPSS into 7-channel forensic features for classification by a compact CNN (0.4M parameters; 4.0M total). We introduce ArtifactBench, a multi-generator evaluation benchmark comprising 6,183 tracks (4,383 AI from 22 generators and 1,800 real from 6 diverse sources). Each track is tagged with bench_origin for fair zero-shot evaluation. On the unseen test partition (n=2,263), ArtifactNet achieves F1 = 0.9829 with FPR = 1.49%, compared to CLAM (F1 = 0.7576, FPR = 69.26%) and SpecTTTra (F1 = 0.7713, FPR = 19.43%) evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
