ConvNLP: Image-based AI Text Detection

Suriya Prakash Jambunathan; Ashwath Shankarnarayan; Parijat Dube

arXiv:2407.07225·cs.LG·July 11, 2024

ConvNLP: Image-based AI Text Detection

Suriya Prakash Jambunathan, Ashwath Shankarnarayan, Parijat Dube

PDF

Open Access

TL;DR

This paper introduces ConvNLP, a novel visual-based CNN approach with a specialized scheduler for detecting AI-generated text, achieving high accuracy, efficiency, and strong generalization across multiple LLMs to uphold academic integrity.

Contribution

It presents a new convolutional neural network architecture and scheduler for effective, fast, and generalizable detection of LLM-generated text using visual word embedding representations.

Findings

01

Achieves an average detection rate of 88.35% across diverse LLMs.

02

Provides a lightweight detection model with inference latency below 2.5ms per sentence.

03

Improves performance by nearly 4% over standard ResNet architecture.

Abstract

The potentials of Generative-AI technologies like Large Language models (LLMs) to revolutionize education are undermined by ethical considerations around their misuse which worsens the problem of academic dishonesty. LLMs like GPT-4 and Llama 2 are becoming increasingly powerful in generating sophisticated content and answering questions, from writing academic essays to solving complex math problems. Students are relying on these LLMs to complete their assignments and thus compromising academic integrity. Solutions to detect LLM-generated text are compute-intensive and often lack generalization. This paper presents a novel approach for detecting LLM-generated AI-text using a visual representation of word embedding. We have formulated a novel Convolutional Neural Network called ZigZag ResNet, as well as a scheduler for improving generalization, named ZigZag Scheduler. Through extensive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques

MethodsAttention Is All You Need · Byte Pair Encoding · Layer Normalization · Average Pooling · Linear Layer · Label Smoothing · Adam · Dropout · Multi-Head Attention · Dense Connections