AdTEC: A Unified Benchmark for Evaluating Text Quality in Search Engine   Advertising

Peinan Zhang; Yusuke Sakai; Masato Mita; Hiroki Ouchi; Taro Watanabe

arXiv:2408.05906·cs.CL·April 29, 2025

AdTEC: A Unified Benchmark for Evaluating Text Quality in Search Engine Advertising

Peinan Zhang, Yusuke Sakai, Masato Mita, Hiroki Ouchi, Taro Watanabe

PDF

Open Access 1 Repo 2 Datasets 1 Video

TL;DR

AdTEC is a comprehensive benchmark for evaluating the quality of automatically generated ad texts across multiple dimensions, aiding the development of better natural language generation models for advertising.

Contribution

This paper introduces the first public benchmark for ad text evaluation, including a Japanese dataset, and analyzes the performance of PLMs and humans on these tasks.

Findings

01

PLMs perform well in some tasks but lag behind humans in others

02

The benchmark reveals specific challenges in automatic ad text evaluation

03

Humans still outperform models in certain quality assessment domains

Abstract

With the increase in the fluency of ad texts automatically created by natural language generation technology, there is high demand to verify the quality of these creatives in a real-world setting. We propose AdTEC (Ad Text Evaluation Benchmark by CyberAgent), the first public benchmark to evaluate ad texts from multiple perspectives within practical advertising operations. Our contributions are as follows: (i) Defining five tasks for evaluating the quality of ad texts, as well as building a Japanese dataset based on the practical operational experiences of building a Japanese dataset based on the practical operational experiences of advertising agencies, which are typically kept in-house. (ii) Validating the performance of existing pre-trained language models (PLMs) and human evaluators on the dataset. (iii) Analyzing the characteristics and providing challenges of the benchmark. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cyberagentailab/adtec
noneOfficial

Datasets

Videos

AdTEC: A Unified Benchmark for Evaluating Text Quality in Search Engine Advertising· underline

Taxonomy

TopicsSpam and Phishing Detection · Consumer Market Behavior and Pricing · Web Data Mining and Analysis