Surface-Form Neural Sparse Retrieval: Robust Fuzzy Matching for Industrial Music Search
Paul Greyson, Zhichao Geng, Wei Zhang, Yang Yang

TL;DR
This paper introduces a robust neural sparse retrieval system for music search that handles misspellings and phonetic variations efficiently, achieving high recall and low latency in industrial-scale applications.
Contribution
The work adapts a state-of-the-art inference-free sparse retrieval architecture with domain-specific tokenization, improving exploration and robustness over traditional n-gram methods.
Findings
Achieves 91.4% recall@10 on a 6M-document corpus
Outperforms trigram-based methods with 57.7% recall
Demonstrates improved exploration efficiency in production simulations
Abstract
Music search at the scale of Amazon Music presents a unique challenge: queries frequently deviate from indexed metadata due to misspellings, transpositions, and phonetic variations, yet the retrieval system must operate under strict millisecond-level latency constraints. Our existing learning-to-retrieve system, the High Confidence Index (HCI), learns query-entity associations from customer behavior, relying on continual ``exploration'' to choose candidates. Traditional n-gram matching enables this exploration but suffers from poor semantic robustness and high noise, limiting the system's ability to learn from long-tail queries. In this work, we present a \textbf{robust neural sparse retrieval system} designed to maximize exploration efficiency. We adapt a state-of-the-art \textbf{inference-free} sparse retrieval architecture to the music domain, combining it with an effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
