AI vs. Human -- Differentiation Analysis of Scientific Content Generation
Yongqiang Ma, Jiawei Liu, Fan Yi, Qikai Cheng, Yong Huang, Wei Lu,, Xiaozhong Liu

TL;DR
This paper analyzes the differences between AI-generated and human-written scientific texts, identifying stylistic and factual gaps, and proposes features for detection to improve AI content quality and address ethical concerns.
Contribution
It introduces a feature framework to distinguish AI from human scientific writing and analyzes the stylistic and factual gaps, aiding detection and model optimization.
Findings
AI-generated scientific texts are less deep and comprehensive.
Factual errors are more common in AI-generated content.
A writing style gap exists between AI and human scientific texts.
Abstract
Recent neural language models have taken a significant step forward in producing remarkably controllable, fluent, and grammatical text. Although studies have found that AI-generated text is not distinguishable from human-written text for crowd-sourcing workers, there still exist errors in AI-generated text which are even subtler and harder to spot. We primarily focus on the scenario in which scientific AI writing assistant is deeply involved. First, we construct a feature description framework to distinguish between AI-generated text and human-written text from syntax, semantics, and pragmatics based on the human evaluation. Then we utilize the features, i.e., writing style, coherence, consistency, and argument logistics, from the proposed framework to analyze two types of content. Finally, we adopt several publicly available methods to investigate the gap of between AI-generated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques
