Neologisms on Facebook
Nikita Muravyev, Alexander Panchenko, Sergei Obiedkov

TL;DR
This study analyzes neologisms and loan words in Russian Facebook posts from 2006-2013, creating a curated list of 168 recent words, classifying their origins, themes, and derivational patterns to understand lexical evolution.
Contribution
The paper introduces a methodology for identifying and classifying neologisms in social media data, focusing on Russian Facebook posts, and provides a curated list of 168 recent words with etymological insights.
Findings
Identified 168 potential neologisms in Russian Facebook posts.
Most neologisms are borrowed from English or derived from borrowed stems.
Classified words into thematic areas like internet, marketing, and multimedia.
Abstract
In this paper, we present a study of neologisms and loan words frequently occurring in Facebook user posts. We have analyzed a dataset of several million publically available posts written during 2006-2013 by Russian-speaking Facebook users. From these, we have built a vocabulary of most frequent lemmatized words missing from the OpenCorpora dictionary the assumption being that many such words have entered common use only recently. This assumption is certainly not true for all the words extracted in this way; for that reason, we manually filtered the automatically obtained list in order to exclude non-Russian or incorrectly lemmatized words, as well as words recorded by other dictionaries or those occurring in texts from the Russian National Corpus. The result is a list of 168 words that can potentially be considered neologisms. We present an attempt at an etymological classification of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistics, Language Diversity, and Identity · Digital Communication and Language · Lexicography and Language Studies
