ConvNLP: Image-based AI Text Detection
Suriya Prakash Jambunathan, Ashwath Shankarnarayan, Parijat Dube

TL;DR
This paper introduces ConvNLP, a novel visual-based CNN approach with a specialized scheduler for detecting AI-generated text, achieving high accuracy, efficiency, and strong generalization across multiple LLMs to uphold academic integrity.
Contribution
It presents a new convolutional neural network architecture and scheduler for effective, fast, and generalizable detection of LLM-generated text using visual word embedding representations.
Findings
Achieves an average detection rate of 88.35% across diverse LLMs.
Provides a lightweight detection model with inference latency below 2.5ms per sentence.
Improves performance by nearly 4% over standard ResNet architecture.
Abstract
The potentials of Generative-AI technologies like Large Language models (LLMs) to revolutionize education are undermined by ethical considerations around their misuse which worsens the problem of academic dishonesty. LLMs like GPT-4 and Llama 2 are becoming increasingly powerful in generating sophisticated content and answering questions, from writing academic essays to solving complex math problems. Students are relying on these LLMs to complete their assignments and thus compromising academic integrity. Solutions to detect LLM-generated text are compute-intensive and often lack generalization. This paper presents a novel approach for detecting LLM-generated AI-text using a visual representation of word embedding. We have formulated a novel Convolutional Neural Network called ZigZag ResNet, as well as a scheduler for improving generalization, named ZigZag Scheduler. Through extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques
MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Average Pooling · Linear Layer · Label Smoothing · Adam · Dropout · Multi-Head Attention · Dense Connections
