UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning
Mihaela Gaman, Sebastian Cojocariu, Radu Tudor Ionescu

TL;DR
This paper presents UnibucKernel, an ensemble learning approach for geolocating Swiss German Jodels, achieving competitive median distance errors by combining diverse models and features.
Contribution
We introduce a novel ensemble method using XGBoost to combine multiple regression models and features for improved geolocation accuracy of Swiss German social media posts.
Findings
Achieved median distance of 23.6 km on test data
Placed third in the 2021 VarDial geolocation task
Ensemble outperforms individual models in accuracy
Abstract
In this work, we describe our approach addressing the Social Media Variety Geolocation task featured in the 2021 VarDial Evaluation Campaign. We focus on the second subtask, which is based on a data set formed of approximately 30 thousand Swiss German Jodels. The dialect identification task is about accurately predicting the latitude and longitude of test samples. We frame the task as a double regression problem, employing an XGBoost meta-learner with the combined power of a variety of machine learning approaches to predict both latitude and longitude. The models included in our ensemble range from simple regression techniques, such as Support Vector Regression, to deep neural models, such as a hybrid neural network and a neural transformer. To minimize the prediction error, we approach the problem from a few different perspectives and consider various types of features, from low-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Authorship Attribution and Profiling · Speech Recognition and Synthesis
MethodsLinear Layer · Multi-Head Attention · Refunds@Expedia|||How do I get a full refund from Expedia? · Attention Dropout · Weight Decay · WordPiece · Layer Normalization · Dense Connections · Adam · Linear Warmup With Linear Decay
