Do not forget: Full memory in memory-based learning of word pronunciation
Antal van den Bosch (ILK / Computational Linguistics, Tilburg, University), and Walter Daelemans (ILK / Computational Linguistics, Tilburg, University)

TL;DR
This paper explores partial memory-based learning methods for NLP, finding that excluding low-confidence instances improves efficiency without sacrificing accuracy in English word pronunciation tasks.
Contribution
It introduces and evaluates three heuristics for removing exceptional instances in memory-based learning, identifying the most effective approach for NLP tasks.
Findings
Removing low prediction strength instances maintains accuracy
Full memory of types is preferable to tokens
Excluding minority ambiguities enhances performance
Abstract
Memory-based learning, keeping full memory of learning material, appears a viable approach to learning NLP tasks, and is often superior in generalisation accuracy to eager learning approaches that abstract from learning material. Here we investigate three partial memory-based learning approaches which remove from memory specific task instance types estimated to be exceptional. The three approaches each implement one heuristic function for estimating exceptionality of instance types: (i) typicality, (ii) class prediction strength, and (iii) friendly-neighbourhood size. Experiments are performed with the memory-based learning algorithm IB1-IG trained on English word pronunciation. We find that removing instance types with low prediction strength (ii) is the only tested method which does not seriously harm generalisation accuracy. We conclude that keeping full memory of types rather than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · AI-based Problem Solving and Planning · Topic Modeling
