TL;DR
This paper presents a gamified crowdsourcing method in which a messaging bot, played as a game by native speakers, collects idiomatic expressions and usage examples. It reports that the method is effective across languages and across motivational strategies for building idiom corpora.
Contribution
It introduces the first crowdcreating and crowdrating approach for idiom corpus construction, combining gamification and crowdsourcing in a language-independent manner.
Findings
The approach effectively collects targeted idiomatic data.
Gamification and rewards enhance crowd engagement.
The method accelerates idiom corpus development for applications such as second language learning, machine translation, and parsing.
Abstract
Learning idiomatic expressions is seen as one of the most challenging stages in second language learning because of their unpredictable meaning. A similar situation holds for their identification within natural language processing applications such as machine translation and parsing. The lack of high-quality usage samples exacerbates this challenge not only for humans but also for artificial intelligence systems. This article introduces a gamified crowdsourcing approach for collecting language learning materials for idiomatic expressions; a messaging bot is designed as an asynchronous multiplayer game for native speakers who compete with each other while providing idiomatic and nonidiomatic usage examples and rating other players' entries. As opposed to classical crowdprocessing annotation efforts in the field, for the first time in the literature, a crowdcreating & crowdrating approach…
