NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish

Aleksandra Tomaszewska; Dariusz Czerski; Bartosz \.Zuk; Maciej Ogrodniczuk

arXiv:2505.15426·cs.CL·May 22, 2025

NeoN: A Tool for Automated Detection, Linguistic and LLM-Driven Analysis of Neologisms in Polish

Aleksandra Tomaszewska, Dariusz Czerski, Bartosz \.Zuk, Maciej Ogrodniczuk

PDF

Open Access

TL;DR

NeoN is an innovative tool that automates the detection and analysis of Polish neologisms using linguistic filters, LLM-driven validation, and visualization, significantly reducing manual effort and enabling real-time lexical innovation tracking.

Contribution

The paper introduces NeoN, a novel multi-layered system combining linguistic processing and LLMs for efficient, automated Polish neologism detection and analysis.

Findings

01

High accuracy in neologism detection

02

Reduced manual review time

03

Effective categorization and definition generation

Abstract

NeoN, a tool for detecting and analyzing Polish neologisms. Unlike traditional dictionary-based methods requiring extensive manual review, NeoN combines reference corpora, Polish-specific linguistic filters, an LLM-driven precision-boosting filter, and daily RSS monitoring in a multi-layered pipeline. The system uses context-aware lemmatization, frequency analysis, and orthographic normalization to extract candidate neologisms while consolidating inflectional variants. Researchers can verify candidates through an intuitive interface with visualizations and filtering controls. An integrated LLM module automatically generates definitions and categorizes neologisms by domain and sentiment. Evaluations show NeoN maintains high accuracy while significantly reducing manual effort, providing an accessible solution for tracking lexical innovation in Polish.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and Culture · Linguistics, Language Diversity, and Identity · Natural Language Processing Techniques