Clickbait detection: quick inference with maximum impact

Soveatin Kuntur; Panggih Kusuma Ningrum; Anna Wr\'oblewska; Maria Ganzha; Marcin Paprzycki

arXiv:2604.08148·cs.CL·April 10, 2026

Clickbait detection: quick inference with maximum impact

Soveatin Kuntur, Panggih Kusuma Ningrum, Anna Wr\'oblewska, Maria Ganzha, Marcin Paprzycki

PDF

TL;DR

This paper introduces a lightweight hybrid method for clickbait detection combining semantic embeddings and heuristic features, optimized for fast inference with competitive accuracy.

Contribution

It presents a novel hybrid approach that reduces embedding dimensionality and employs graph-based classifiers for efficient, accurate clickbait detection.

Findings

01

Graph-based models achieve high ROC-AUC with reduced inference time.

02

Simplified features slightly lower F1-scores but maintain strong detection performance.

03

Embedding reduction via PCA improves efficiency without significant accuracy loss.

Abstract

We propose a lightweight hybrid approach to clickbait detection that combines OpenAI semantic embeddings with six compact heuristic features capturing stylistic and informational cues. To improve efficiency, embeddings are reduced using PCA and evaluated with XGBoost, GraphSAGE, and GCN classifiers. While the simplified feature design yields slightly lower F1-scores, graph-based models achieve competitive performance with substantially reduced inference time. High ROC--AUC values further indicate strong discrimination capability, supporting reliable detection of clickbait headlines under varying decision thresholds.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.