A Survey on Efficient Processing of Similarity Queries over Neural Embeddings
Yifan Wang

TL;DR
This survey reviews recent methods for efficiently processing similarity queries over neural embeddings, highlighting challenges and solutions in indexing and application domains like entity resolution and information retrieval.
Contribution
It provides a comprehensive overview of current approaches to similarity query processing with neural embeddings and compares solutions across different application domains.
Findings
Neural embeddings significantly improve semantic similarity measurement.
Efficient indexing methods are crucial for scalable similarity queries.
Embedding-based solutions benefit applications like entity resolution and retrieval.
Abstract
Similarity query is the family of queries based on some similarity metrics. Unlike the traditional database queries which are mostly based on value equality, similarity queries aim to find targets "similar enough to" the given data objects, depending on some similarity metric, e.g., Euclidean distance, cosine similarity and so on. To measure the similarity between data objects, traditional methods normally work on low level or syntax features(e.g., basic visual features on images or bag-of-word features of text), which makes them weak to compute the semantic similarities between objects. So for measuring data similarities semantically, neural embedding is applied. Embedding techniques work by representing the raw data objects as vectors (so called "embeddings" or "neural embeddings" since they are mostly generated by neural network models) that expose the hidden semantics of the raw…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Bioinformatics · Topic Modeling
