Surface-Form Neural Sparse Retrieval: Robust Fuzzy Matching for Industrial Music Search

Paul Greyson; Zhichao Geng; Wei Zhang; Yang Yang

arXiv:2605.17762 · cs.AI · May 19, 2026

TL;DR

This paper introduces a robust neural sparse retrieval system for music search that handles misspellings and phonetic variations efficiently, achieving high recall and low latency in industrial-scale applications.

Contribution

The work adapts a state-of-the-art inference-free sparse retrieval architecture with domain-specific tokenization, improving exploration and robustness over traditional n-gram methods.

Findings

01. Achieves 91.4% recall@10 on a 6M-document corpus

02. Outperforms trigram-based methods, which reach 57.7% recall

03. Demonstrates improved exploration efficiency in production simulations
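
For readers unfamiliar with the metric, recall@10 can be computed roughly as in the sketch below. It assumes one common definition (the fraction of queries whose single relevant entity appears among the top 10 retrieved results); the paper's exact evaluation protocol is not stated on this page, and the function name, entity IDs, and toy data are all hypothetical.

```python
from typing import Dict, List


def recall_at_k(retrieved: Dict[str, List[str]],
                relevant: Dict[str, str],
                k: int = 10) -> float:
    """Fraction of queries whose relevant entity is in the top-k results.

    Assumes exactly one relevant entity per query (illustrative assumption).
    """
    if not relevant:
        return 0.0
    hits = sum(
        1
        for query, target in relevant.items()
        if target in retrieved.get(query, [])[:k]
    )
    return hits / len(relevant)


# Toy data (hypothetical): misspelled queries mapped to ranked entity IDs.
retrieved = {
    "tayler swift": ["artist:taylor_swift", "artist:tyler_childers"],
    "beetles": ["artist:beetlejuice_cast", "artist:the_beatles"],
    "colplay": ["artist:cosplay_band"],
}
relevant = {
    "tayler swift": "artist:taylor_swift",
    "beetles": "artist:the_beatles",
    "colplay": "artist:coldplay",
}

print(recall_at_k(retrieved, relevant, k=10))
print(recall_at_k(retrieved, relevant, k=1))
```

In this toy set, two of the three queries retrieve the right entity within the top 10, and only one retrieves it at rank 1, so the metric drops as k shrinks.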

Abstract

Music search at the scale of Amazon Music presents a unique challenge: queries frequently deviate from indexed metadata due to misspellings, transpositions, and phonetic variations, yet the retrieval system must operate under strict millisecond-level latency constraints. Our existing learning-to-retrieve system, the High Confidence Index (HCI), learns query-entity associations from customer behavior, relying on continual "exploration" to choose candidates. Traditional n-gram matching enables this exploration but suffers from poor semantic robustness and high noise, limiting the system's ability to learn from long-tail queries. In this work, we present a robust neural sparse retrieval system designed to maximize exploration efficiency. We adapt a state-of-the-art inference-free sparse retrieval architecture to the music domain, combining it with an effective…
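
To make the term "inference-free" concrete: in this family of sparse retrievers, documents are expanded into weighted token vectors offline, while the query side needs only tokenization plus a precomputed per-token weight lookup, so no neural model runs at query time. The sketch below illustrates that scoring pattern under loud assumptions. The character-trigram tokenizer, the uniform document weights, the query weight table, and every name in it are hypothetical stand-ins, not the paper's tokenizer, learned weights, or implementation.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def char_trigrams(text: str) -> List[str]:
    """Hypothetical surface-form tokenizer: padded character trigrams."""
    padded = f"  {text.lower()} "
    return [padded[i:i + 3] for i in range(len(padded) - 2)]


def build_index(doc_vectors: Dict[str, Dict[str, float]]) -> Dict[str, List[Tuple[str, float]]]:
    """Invert sparse document vectors into token -> [(doc_id, weight)] postings."""
    index: Dict[str, List[Tuple[str, float]]] = defaultdict(list)
    for doc_id, vector in doc_vectors.items():
        for token, weight in vector.items():
            index[token].append((doc_id, weight))
    return index


def search(query: str,
           index: Dict[str, List[Tuple[str, float]]],
           query_weights: Dict[str, float],
           k: int = 10) -> List[Tuple[str, float]]:
    """Score documents by a sparse dot product.

    Query-side work is tokenization plus a table lookup only;
    tokens missing from the table default to weight 1.0 (an assumption).
    """
    scores: Dict[str, float] = defaultdict(float)
    for token in set(char_trigrams(query)):
        q_weight = query_weights.get(token, 1.0)
        for doc_id, d_weight in index.get(token, []):
            scores[doc_id] += q_weight * d_weight
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))[:k]


# In a real system the document vectors would come from a neural encoder run
# offline; here each trigram simply gets weight 1.0 as a stand-in.
catalog = ["coldplay", "cosplay"]
doc_vectors = {name: {t: 1.0 for t in char_trigrams(name)} for name in catalog}
index = build_index(doc_vectors)

# Hypothetical precomputed query-side weight table.
query_weights = {"col": 2.0}

results = search("colplay", index, query_weights, k=10)
print(results)
```

The misspelled query "colplay" shares more weighted tokens with "coldplay" than with "cosplay", so the intended artist ranks first even though no exact string match exists.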
