Multinomial Loss on Held-out Data for the Sparse Non-negative Matrix Language Model
Ciprian Chelba, Fernando Pereira

TL;DR
This paper introduces a multinomial loss-based estimation method for Sparse Non-negative Matrix language models, enabling better handling of mismatched training and test data, and improving performance by leveraging multiple data sources.
Contribution
It presents a new training approach using multinomial loss on held-out data, allowing richer meta-features and effective data source combination for language modeling.
Findings
Slight improvement over previous results on the one billion words benchmark.
Meta-features without lexical information perform comparably to lexicalized ones.
Combining data sources based on relevance improves performance by up to 15%.
Abstract
We describe Sparse Non-negative Matrix (SNM) language model estimation using multinomial loss on held-out data. Being able to train on held-out data is important in practical situations where the training data is usually mismatched from the held-out/test data. It is also less constrained than the previous training algorithm using leave-one-out on training data: it allows the use of richer meta-features in the adjustment model, e.g. the diversity counts used by Kneser-Ney smoothing which would be difficult to deal with correctly in leave-one-out training. In experiments on the one billion words language modeling benchmark, we are able to slightly improve on our previous results which use a different loss function, and employ leave-one-out training on a subset of the main training set. Surprisingly, an adjustment model with meta-features that discard all lexical information can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDistributed systems and fault tolerance · Data Quality and Management · Advanced Database Systems and Queries
