Group-Adaptive Threshold Optimization for Robust AI-Generated Text Detection
Minseok Jung, Cynthia Fuertes Panizo, Liam Dugan, Yi R. (May) Fung, Pin-Yu Chen, Paul Pu Liang

TL;DR
This paper introduces FairOPT, a group-specific threshold optimization algorithm that improves the robustness and fairness of AI-generated text detectors across diverse text attributes, reducing misclassification disparities.
Contribution
The paper proposes a novel group-adaptive threshold optimization method, FairOPT, to address distributional biases in AI-text detection, enhancing fairness and robustness.
Findings
Significant reduction in detection discrepancy across subgroups.
Decreased overall discrepancy by 27.4% across multiple metrics.
Minimal impact on detection accuracy, only 0.005% sacrifice.
Abstract
The advancement of large language models (LLMs) has made it difficult to differentiate human-written text from AI-generated text. Several AI-text detectors have been developed in response, which typically utilize a fixed global threshold (e.g., ) to classify machine-generated text. However, one universal threshold could fail to account for distributional variations by subgroups. For example, when using a fixed threshold, detectors make more false positive errors on shorter human-written text, and more positive classifications of neurotic writing styles among long texts. These discrepancies can lead to misclassifications that disproportionately affect certain groups. We address this critical limitation by introducing FairOPT, an algorithm for group-specific threshold optimization for probabilistic AI-text detectors. We partitioned data into subgroups based on attributes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Text and Document Classification Technologies
