N$^2$: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion
Caleb Chin, Aashish Khubchandani, Harshvardhan Maskara, Kyuseong Choi, Jacob Feitelberg, Albert Gong, Manit Paul, Tathagata Sadhukhan, Anish Agarwal, Raaz Dwivedi

TL;DR
This paper presents N$^2$, a Python package for benchmarking and developing nearest neighbor-based matrix completion methods, demonstrating their robustness and superior performance on real-world datasets.
Contribution
Introduction of N$^2$, a modular, extensible framework for NN-based matrix completion, along with a new state-of-the-art NN variant and a comprehensive real-world benchmark suite.
Findings
NN methods outperform classical approaches on real-world data.
The new NN variant achieves state-of-the-art results.
Classical methods excel mainly on synthetic or idealized data.
Abstract
Nearest neighbor (NN) methods have re-emerged as competitive tools for matrix completion, offering strong empirical performance and recent theoretical guarantees, including entry-wise error bounds, confidence intervals, and minimax optimality. Despite their simplicity, recent work has shown that NN approaches are robust to a range of missingness patterns and effective across diverse applications. This paper introduces N, a unified Python package and testbed that consolidates a broad class of NN-based methods through a modular, extensible interface. Built for both researchers and practitioners, N supports rapid experimentation and benchmarking. Using this framework, we introduce a new NN variant that achieves state-of-the-art results in several settings. We also release a benchmark suite of real-world datasets, from healthcare and recommender systems to causal inference and LLM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Machine Learning and Data Classification · Stochastic Gradient Optimization Techniques
MethodsCausal inference
