Sparse Coding with Earth Mover's Distance for Multi-Instance Histogram Representation
Mohua Zhang, Jianhua Peng, Xuejie Liu, Jim Jing-Yan Wang

TL;DR
This paper introduces SC-EMD, a sparse coding method using Earth Mover's Distance for histogram-based multi-instance learning, showing improved performance over traditional L2 norm methods in medical imaging and bioinformatics tasks.
Contribution
It proposes a novel sparse coding approach that replaces L2 norm with Earth Mover's Distance for better histogram reconstruction in multi-instance learning.
Findings
SC-EMD outperforms traditional methods in image anomaly detection.
SC-EMD improves protein binding site retrieval accuracy.
Earth Mover's Distance enhances histogram representation quality.
Abstract
Sparse coding (Sc) has been studied very well as a powerful data representation method. It attempts to represent the feature vector of a data sample by reconstructing it as the sparse linear combination of some basic elements, and a norm distance function is usually used as the loss function for the reconstruction error. In this paper, we investigate using Sc as the representation method within multi-instance learning framework, where a sample is given as a bag of instances, and further represented as a histogram of the quantized instances. We argue that for the data type of histogram, using norm distance is not suitable, and propose to use the earth mover's distance (EMD) instead of norm distance as a measure of the reconstruction error. By minimizing the EMD between the histogram of a sample and the its reconstruction from some basic histograms, a novel sparse coding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
