# Learning to Weight for Text Classification

**Authors:** Alejandro Moreo Fern\'andez, Andrea Esuli, Fabrizio Sebastiani

arXiv: 1903.12090 · 2021-09-22

## TL;DR

This paper introduces Learning to Weight (LTW), a novel supervised term weighting method that learns an optimal weighting function from training data, leading to improved text classification performance over existing approaches.

## Contribution

It proposes a data-driven approach to learn term weights directly from training data, challenging assumptions of traditional predefined weighting formulas.

## Key findings

- LTW outperforms previous term weighting methods in multiple benchmarks.
- The learned weighting functions adapt better to specific datasets.
- Experimental results demonstrate significant accuracy improvements.

## Abstract

In information retrieval (IR) and related tasks, term weighting approaches typically consider the frequency of the term in the document and in the collection in order to compute a score reflecting the importance of the term for the document. In tasks characterized by the presence of training data (such as text classification) it seems logical that the term weighting function should take into account the distribution (as estimated from training data) of the term across the classes of interest. Although `supervised term weighting' approaches that use this intuition have been described before, they have failed to show consistent improvements. In this article we analyse the possible reasons for this failure, and call consolidated assumptions into question. Following this criticism we propose a novel supervised term weighting approach that, instead of relying on any predefined formula, learns a term weighting function optimised on the training set of interest; we dub this approach \emph{Learning to Weight} (LTW). The experiments that we run on several well-known benchmarks, and using different learning methods, show that our method outperforms previous term weighting approaches in text classification.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1903.12090/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1903.12090/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/1903.12090/full.md

---
Source: https://tomesphere.com/paper/1903.12090