Measuring Human Involvement in AI-Generated Text: A Case Study on Academic Writing
Yuchen Guo, Zhicheng Dou, Huy H. Nguyen, Ching-Chun Chang, Saku Sugawara, Isao Echizen

TL;DR
This paper introduces a novel method using BERTScore and a multi-task RoBERTa regressor to measure human involvement in AI-generated academic texts, addressing the limitations of binary detection methods.
Contribution
It proposes a new approach to quantify human involvement in AI-generated content, moving beyond binary detection to a continuous measurement framework.
Findings
The proposed method achieved an F1 score of 0.9423 in detecting human involvement.
It demonstrated strong generalizability across different generative models.
Existing detectors failed to accurately measure human involvement, highlighting the novelty of this approach.
Abstract
Content creation has dramatically progressed with the rapid advancement of large language models like ChatGPT and Claude. While this progress has greatly enhanced various aspects of life and work, it has also negatively affected certain areas of society. A recent survey revealed that nearly 30% of college students use generative AI to help write academic papers and reports. Most countermeasures treat the detection of AI-generated text as a binary classification task and thus lack robustness. This approach overlooks human involvement in the generation of content even though human-machine collaboration is becoming mainstream. Besides generating entire texts, people may use machines to complete or revise texts. Such human involvement varies case by case, which makes binary classification a less than satisfactory approach. We refer to this situation as participation detection obfuscation.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Topic Modeling · Computational and Text Analysis Methods
