MELD: Multi-Task Equilibrated Learning Detector for AI-Generated Text
Chenjun Li, Cheng Wan, Johannes C. Paetzold

TL;DR
MELD is a multi-task learning detector for AI-generated text that enhances robustness, transferability, and low false-positive rates by auxiliary supervision and adversarial training.
Contribution
It introduces MELD, a novel multi-task framework with auxiliary heads and uncertainty weighting, improving detection robustness and transferability over existing methods.
Findings
MELD achieves 99.9% TPR at 1% FPR on MELD-eval without finetuning.
MELD outperforms supervised baselines across benchmarks.
MELD is the strongest open-source detector on the RAID leaderboard.
Abstract
Large language models are now embedded in everyday writing workflows, making reliable AI-generated text detection important for academic integrity, content moderation, and provenance tracking. In practice, however, a detector must do more than achieve high aggregate AUROC on clean, in-distribution human and AI text: it should remain robust to attacks and adversarial rewrites, transfer to unseen generators and domains, and operate at low false-positive rates (FPR). Most existing detectors optimize a single AI/Human objective, giving the representation little incentive to learn generator, attack, or domain structure once the binary task saturates. We introduce MELD (Multi-Task Equilibrated Learning Detector), a deployable detector for AI-generated text that enriches binary detection with auxiliary supervision. MELD attaches generator-family, attack-type, and source-domain heads to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
