What Can We Learn From Almost a Decade of Food Tweets

Uga Sproģis; Matīss Rikters

arXiv:2007.05194 · cs.CL · September 2, 2020 · 1 citation

TL;DR

This paper introduces a large, annotated Latvian food-related Twitter corpus collected over 8 years, and demonstrates its usefulness for training domain-specific question-answering and sentiment analysis models.

Contribution

The paper presents the Latvian Twitter Eater Corpus, a comprehensive, multi-annotated dataset for food-related tweets, and showcases its application in NLP tasks.

Findings

01

Successful training of domain-specific question-answering models

02

Effective sentiment analysis using the corpus data

03

Demonstration of corpus utility for NLP research

Abstract

We present the Latvian Twitter Eater Corpus, a set of tweets in the narrow domain of food, drinks, eating and drinking. The corpus was collected over a time span of more than 8 years and includes over 2 million tweets accompanied by additional useful data. We also separate out two sub-corpora: question and answer tweets, and sentiment-annotated tweets. We analyse the contents of the corpus and demonstrate use cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models on data from the corpus.

Code & Models

Repositories

Usprogis/Latvian-Twitter-Eater-Corpus
none · Official

Models

matiss/Latvian-Twitter-Sentiment-Analysis
Hugging Face model · 36 downloads

Datasets

matiss/Latvian-Twitter-Eater-Corpus-Sentiment
dataset · 28 downloads

Taxonomy

Topics: Sentiment Analysis and Opinion Mining · Topic Modeling · Natural Language Processing Techniques