Does It Capture STEL? A Modular, Similarity-based Linguistic Style   Evaluation Framework

Anna Wegmann; Dong Nguyen

arXiv:2109.04817·cs.CL·September 13, 2021

Does It Capture STEL? A Modular, Similarity-based Linguistic Style Evaluation Framework

Anna Wegmann, Dong Nguyen

PDF

Open Access 1 Repo

TL;DR

This paper introduces STEL, a modular framework for evaluating linguistic style in text, which controls for content and compares models based on style similarity across various dimensions.

Contribution

The paper presents STEL, a novel, flexible, and content-controlled style evaluation framework that enables comparison of different style measures and models.

Findings

01

BERT-based methods outperform traditional style measures

02

STEL effectively evaluates style across multiple dimensions

03

Framework encourages development of style-sensitive evaluation tools

Abstract

Style is an integral part of natural language. However, evaluation methods for style measures are rare, often task-specific and usually do not control for content. We propose the modular, fine-grained and content-controlled similarity-based STyle EvaLuation framework (STEL) to test the performance of any model that can compare two sentences on style. We illustrate STEL with two general dimensions of style (formal/informal and simple/complex) as well as two specific characteristics of style (contrac'tion and numb3r substitution). We find that BERT-based methods outperform simple versions of commonly used style measures like 3-grams, punctuation frequency and LIWC-based approaches. We invite the addition of further tasks and task instances to STEL and hope to facilitate the improvement of style-sensitive measures.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nlpsoc/stel
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Authorship Attribution and Profiling