Comparative Performance of Advanced NLP Models and LLMs in Multilingual Geo-Entity Detection
Kalin Kopanov

TL;DR
This study evaluates and compares the performance of advanced NLP models and large language models in multilingual geo-entity detection, highlighting their strengths and challenges across different languages and datasets.
Contribution
It provides a comprehensive benchmarking of NLP and LLMs for multilingual geo-entity detection, informing future development of more effective geospatial NLP tools.
Findings
GPT-4 outperforms other models in accuracy and F1 scores.
Multilingual models show varied effectiveness across languages.
Challenges remain in achieving high precision in Arabic and Russian datasets.
Abstract
The integration of advanced Natural Language Processing (NLP) methodologies and Large Language Models (LLMs) has significantly enhanced the extraction and analysis of geospatial data from multilingual texts, impacting sectors such as national and international security. This paper presents a comprehensive evaluation of leading NLP models -- SpaCy, XLM-RoBERTa, mLUKE, GeoLM -- and LLMs, specifically OpenAI's GPT 3.5 and GPT 4, within the context of multilingual geo-entity detection. Utilizing datasets from Telegram channels in English, Russian, and Arabic, we examine the performance of these models through metrics such as accuracy, precision, recall, and F1 scores, to assess their effectiveness in accurately identifying geospatial references. The analysis exposes each model's distinct advantages and challenges, underscoring the complexities involved in achieving precise geo-entity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Cosine Annealing · Linear Layer · Residual Connection · Weight Decay · Multi-Head Attention · Adam · Layer Normalization · Linear Warmup With Cosine Annealing · Attention Dropout
