WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Iddo Yosha; Dorin Shteyman; Yossi Adi

arXiv:2505.19103·cs.CL·May 27, 2025

WHISTRESS: Enriching Transcriptions with Sentence Stress Detection

Iddo Yosha, Dorin Shteyman, Yossi Adi

PDF

Open Access 1 Models 1 Datasets

TL;DR

WHISTRESS is a novel alignment-free method for sentence stress detection in transcriptions, trained on synthetic data, that outperforms existing approaches and generalizes well across benchmarks.

Contribution

Introduces WHISTRESS, an alignment-free sentence stress detection model trained on synthetic data, with strong zero-shot generalization capabilities.

Findings

01

Outperforms existing methods in stress detection accuracy.

02

Requires no additional priors during training or inference.

03

Demonstrates strong zero-shot generalization across benchmarks.

Abstract

Spoken language conveys meaning not only through words but also through intonation, emotion, and emphasis. Sentence stress, the emphasis placed on specific words within a sentence, is crucial for conveying speaker intent and has been extensively studied in linguistics. In this work, we introduce WHISTRESS, an alignment-free approach for enhancing transcription systems with sentence stress detection. To support this task, we propose TINYSTRESS-15K, a scalable, synthetic training data for the task of sentence stress detection which resulted from a fully automated dataset creation process. We train WHISTRESS on TINYSTRESS-15K and evaluate it against several competitive baselines. Our results show that WHISTRESS outperforms existing methods while requiring no additional input priors during training or inference. Notably, despite being trained on synthetic data, WHISTRESS demonstrates strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
slprl/WhiStress
model· ♡ 5
♡ 5

Datasets

slprl/TinyStress-15K
dataset· 162 dl
162 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHumor Studies and Applications