A Comprehensive Framework for Semantic Similarity Analysis of Human and AI-Generated Text Using Transformer Architectures and Ensemble Techniques
Lifu Gao, Ziwei Liu, Qi Zhang

TL;DR
This paper introduces a novel semantic similarity framework combining transformer models, LSTMs, and ensemble techniques to improve detection of AI-generated text and analyze human versus machine content.
Contribution
It presents a multi-layered architecture integrating DeBERTa-v3-large, Bi-LSTMs, and attention pooling, with augmentation methods for enhanced performance in AI-generated text detection.
Findings
Outperforms traditional detection methods
Effective across diverse domains
Captures nuanced semantic differences
Abstract
The rapid advancement of large language models (LLMs) has made detecting AI-generated text an increasingly critical challenge. Traditional methods often fail to capture the nuanced semantic differences between human and machine-generated content. We therefore propose a novel approach based on semantic similarity analysis, leveraging a multi-layered architecture that combines a pre-trained DeBERTa-v3-large model, Bi-directional LSTMs, and linear attention pooling to capture both local and global semantic patterns. To enhance performance, we employ advanced input and output augmentation techniques such as sector-level context integration and wide output configurations. These techniques enable the model to learn more discriminative features and generalize across diverse domains. Experimental results show that this approach works better than traditional methods, proving its usefulness for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Topic Modeling
MethodsSoftmax · Attention Is All You Need · Attention Pooling
