Multi-D Kneser-Ney Smoothing Preserving the Original Marginal Distributions
Andr\'as Dob\'o

TL;DR
This paper introduces a refined version of Modified Kneser-Ney smoothing that preserves the original marginal distributions, combining the benefits of previous methods while maintaining comparable performance in language modeling.
Contribution
The paper proposes a new smoothing technique that preserves marginal distributions in MKNS, addressing a key limitation of existing methods.
Findings
Achieves similar results to MKNS in language modeling tasks.
Preserves marginal distributions of the original model.
Offers improved theoretical properties over existing smoothing methods.
Abstract
Smoothing is an essential tool in many NLP tasks, therefore numerous techniques have been developed for this purpose in the past. One of the most widely used smoothing methods are the Kneser-Ney smoothing (KNS) and its variants, including the Modified Kneser-Ney smoothing (MKNS), which are widely considered to be among the best smoothing methods available. Although when creating the original KNS the intention of the authors was to develop such a smoothing method that preserves the marginal distributions of the original model, this property was not maintained when developing the MKNS. In this article I would like to overcome this and propose such a refined version of the MKNS that preserves these marginal distributions while keeping the advantages of both previous versions. Beside its advantageous properties, this novel smoothing method is shown to achieve about the same results as the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
