An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs

Deshan Sumanathilaka; Nicholas Micallef; Julian Hough

arXiv:2603.05400·cs.CL·March 6, 2026

An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs

Deshan Sumanathilaka, Nicholas Micallef, Julian Hough

PDF

Open Access

TL;DR

This paper shows that small-scale, low-parameter LLMs can perform effective Word Sense Disambiguation comparable to large models like GPT-4-Turbo by using reasoning-focused fine-tuning and Chain-of-Thought prompting, reducing computational costs.

Contribution

It introduces a reasoning-driven fine-tuning approach for low-parameter LLMs that achieves competitive WSD performance and strong cross-domain generalization.

Findings

01

Low-parameter LLMs with reasoning strategies match GPT-4-Turbo in zero-shot WSD.

02

Gemma-3-4B and Qwen-3-4B outperform larger baselines on FEWS.

03

Models generalize well to unseen senses and domains without task-specific fine-tuning.

Abstract

Word Sense Disambiguation (WSD) remains a key challenge in Natural Language Processing (NLP), especially when dealing with rare or domain-specific senses that are often misinterpreted. While modern high-parameter Large Language Models (LLMs) such as GPT-4-Turbo have shown state-of-the-art WSD performance, their computational and energy demands limit scalability. This study investigates whether low-parameter LLMs (<4B parameters) can achieve comparable results through fine-tuning strategies that emphasize reasoning-driven sense identification. Using the FEWS dataset augmented with semi-automated, rationale-rich annotations, we fine-tune eight small-scale open-source LLMs (e.g. Gemma and Qwen). Our results reveal that Chain-of-Thought (CoT)-based reasoning combined with neighbour-word analysis achieves performance comparable to GPT-4-Turbo in zero-shot settings. Importantly, Gemma-3-4B…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Machine Learning and Data Classification