Towards an Understanding and Explanation for Mixed-Initiative Artificial Scientific Text Detection
Luoxuan Weng, Minfeng Zhu, Kam Kwai Wong, Shi Liu, Jiashun Sun, Hang, Zhu, Dongming Han, and Wei Chen

TL;DR
This paper investigates the differences between machine-generated and human-written scientific texts, proposing a mixed-initiative detection approach that combines human expertise with machine analysis, supported by visual analytics and validated through case studies and user studies.
Contribution
It introduces a novel mixed-initiative workflow and visual analytics tool for scientific text detection, addressing interpretability and generalization challenges in AI-based detection methods.
Findings
Quantitative analysis of text differences
Effective human-AI collaborative detection workflow
Validated approach through case studies and user study
Abstract
Large language models (LLMs) have gained popularity in various fields for their exceptional capability of generating human-like text. Their potential misuse has raised social concerns about plagiarism in academic contexts. However, effective artificial scientific text detection is a non-trivial task due to several challenges, including 1) the lack of a clear understanding of the differences between machine-generated and human-written scientific text, 2) the poor generalization performance of existing methods caused by out-of-distribution issues, and 3) the limited support for human-machine collaboration with sufficient interpretability during the detection process. In this paper, we first identify the critical distinctions between machine-generated and human-written scientific text through a quantitative experiment. Then, we propose a mixed-initiative workflow that combines human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Topic Modeling
