TL;DR
OCRR introduces a benchmark for online correction recovery that measures how quickly models can adapt to distribution shifts with user corrections, outperforming existing methods in accuracy retention and novel class recognition.
Contribution
The paper presents OCRR, a new benchmark for evaluating online correction recovery under distribution shift, along with baseline algorithms and extensive evaluation results.
Findings
The substrate achieves 88.7% novel-class accuracy and 95.4% retention of original accuracy.
It outperforms continual-learning baselines by 32.6 percentage points at equal memory.
Classification accuracy remains stable at 99% despite retrieval recall degradation.
Abstract
Static benchmarks measure a model frozen at training time. Real systems face distribution shift: new categories, paraphrased queries, drift: and must recover online via user corrections. No existing benchmark measures recovery speed under correction streams. We introduce OCRR (Online Correction Recovery Rate): a benchmark that streams a corpus through a classification system, applies oracle or stochastic corrections to wrong predictions, and reports two curves: novel-class accuracy and original-distribution accuracy versus correction count. We evaluate the substrate alongside nine baseline algorithms from five families plus seven bounded-storage variants of the substrate for the Pareto sweep, including standard online-learning baselines (river), continual-learning methods (EWC, A-GEM, LwF), retrieval/parametric hybrids (kNN-LM), parameter-efficient fine-tuning of a 1.5 B-parameter…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
