SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation
Daniel Cer, Mona Diab, Eneko Agirre, I\~nigo Lopez-Gazpio, Lucia, Specia

TL;DR
The paper presents the 2017 SemEval STS shared task focusing on multilingual and cross-lingual sentence similarity, introduces the STS Benchmark dataset, and analyzes current model performance and limitations.
Contribution
It introduces a new multilingual and cross-lingual semantic similarity evaluation framework and the STS Benchmark dataset for improved model training and assessment.
Findings
High participation from 31 teams across language tracks
Analysis reveals common errors and limitations in current models
Introduction of the STS Benchmark dataset for future research
Abstract
Semantic Textual Similarity (STS) measures the meaning similarity of sentences. Applications include machine translation (MT), summarization, generation, question answering (QA), short answer grading, semantic search, dialog and conversational systems. The STS shared task is a venue for assessing the current state-of-the-art. The 2017 task focuses on multilingual and cross-lingual pairs with one sub-track exploring MT quality estimation (MTQE) data. The task obtained strong participation from 31 teams, with 17 participating in all language tracks. We summarize performance and review a selection of well performing methods. Analysis highlights common errors, providing insight into the limitations of existing models. To support ongoing work on semantic representations, the STS Benchmark is introduced as a new shared training and evaluation set carefully selected from the corpus of English…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗google-t5/t5-smallmodel· 1.9M dl· ♡ 5381.9M dl♡ 538
- 🤗google-t5/t5-largemodel· 451k dl· ♡ 253451k dl♡ 253
- 🤗google-t5/t5-11bmodel· 22k dl· ♡ 6922k dl♡ 69
- 🤗google-t5/t5-3bmodel· 428k dl· ♡ 52428k dl♡ 52
- 🤗google-t5/t5-basemodel· 1.8M dl· ♡ 7701.8M dl♡ 770
- 🤗Kamrani/t5-largemodel· 6 dl6 dl
- 🤗qiaoyi/Comment_Summarization4DesignTutormodel· 11 dl11 dl
- 🤗ybelkada/t5-11b-shardedmodel· 11 dl· ♡ 211 dl♡ 2
- 🤗michellehbn/brrrrmodel· ♡ 1♡ 1
- 🤗BrainStormersHakton/question-gen-T5-basemodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
