Large Language Model (LLM) AI text generation detection based on transformer deep learning algorithm
Yuhong Mo, Hao Qin, Yushan Dong, Ziyi Zhu, Zhenglin Li

TL;DR
This paper presents a transformer-based deep learning tool that effectively detects AI-generated text with high accuracy, combining preprocessing techniques and a hybrid model of LSTM, Transformer, and CNN layers.
Contribution
The study introduces a novel transformer-based detection model with a comprehensive preprocessing pipeline, achieving over 99% accuracy in identifying AI-generated text.
Findings
Model accuracy reaches 99.8% during validation.
Test set results show 99% prediction accuracy.
High precision, recall, and F1 score demonstrate robust detection capability.
Abstract
In this paper, a tool for detecting LLM AI text generation is developed based on the Transformer model, aiming to improve the accuracy of AI text generation detection and provide reference for subsequent research. Firstly the text is Unicode normalised, converted to lowercase form, characters other than non-alphabetic characters and punctuation marks are removed by regular expressions, spaces are added around punctuation marks, first and last spaces are removed, consecutive ellipses are replaced with single spaces and the text is connected using the specified delimiter. Next remove non-alphabetic characters and extra whitespace characters, replace multiple consecutive whitespace characters with a single space and again convert to lowercase form. The deep learning model combines layers such as LSTM, Transformer and CNN for text classification or sequence labelling tasks. The training and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods
MethodsAttention Is All You Need · Sparse Evolutionary Training · Sigmoid Activation · Linear Layer · Position-Wise Feed-Forward Layer · Tanh Activation · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
