OCRR: A Benchmark for Online Correction Recovery under Distribution Shift

Adrian Grassi

arXiv:2605.03153·cs.LG·May 6, 2026

OCRR: A Benchmark for Online Correction Recovery under Distribution Shift

Adrian Grassi

PDF

1 Repo

TL;DR

OCRR introduces a benchmark for online correction recovery that measures how quickly models can adapt to distribution shifts with user corrections, outperforming existing methods in accuracy retention and novel class recognition.

Contribution

The paper presents OCRR, a new benchmark for evaluating online correction recovery under distribution shift, along with baseline algorithms and extensive evaluation results.

Findings

01

The substrate achieves 88.7% novel-class accuracy and 95.4% retention of original accuracy.

02

It outperforms continual-learning baselines by 32.6 percentage points at equal memory.

03

Classification accuracy remains stable at 99% despite retrieval recall degradation.

Abstract

Static benchmarks measure a model frozen at training time. Real systems face distribution shift: new categories, paraphrased queries, drift: and must recover online via user corrections. No existing benchmark measures recovery speed under correction streams. We introduce OCRR (Online Correction Recovery Rate): a benchmark that streams a corpus through a classification system, applies oracle or stochastic corrections to wrong predictions, and reports two curves: novel-class accuracy and original-distribution accuracy versus correction count. We evaluate the substrate alongside nine baseline algorithms from five families plus seven bounded-storage variants of the substrate for the Pareto sweep, including standard online-learning baselines (river), continual-learning methods (EWC, A-GEM, LwF), retrieval/parametric hybrids (kNN-LM), parameter-efficient fine-tuning of a 1.5 B-parameter…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adriangrassi/ocrr-benchmark
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.