A fast compression-based similarity measure with applications to content-based image retrieval
Daniele Cerra, Mihai Datcu

TL;DR
This paper introduces the Fast Compression Distance (FCD), a new, efficient compression-based similarity measure that improves scalability for content-based image retrieval without sacrificing accuracy.
Contribution
It proposes the FCD method that reduces complexity of compression-based similarity measures and demonstrates its effectiveness on large image datasets.
Findings
FCD outperforms existing methods in large-scale image retrieval
FCD maintains high accuracy comparable to state-of-the-art techniques
Experiments show improved efficiency on medium-to-large datasets
Abstract
Compression-based similarity measures are effectively employed in applications on diverse data types with a basically parameter-free approach. Nevertheless, there are problems in applying these techniques to medium-to-large datasets which have been seldom addressed. This paper proposes a similarity measure based on compression with dictionaries, the Fast Compression Distance (FCD), which reduces the complexity of these methods, without degradations in performance. On its basis a content-based color image retrieval system is defined, which can be compared to state-of-the-art methods based on invariant color features. Through the FCD a better understanding of compression-based techniques is achieved, by performing experiments on datasets which are larger than the ones analyzed so far in literature.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
