Representative Selection for Big Data via Sparse Graph and Geodesic Grassmann Manifold Distance
Chinh Dang, Hayder Radha

TL;DR
This paper introduces a novel representative selection framework for Big Data using sparse graph construction, spectral clustering on Grassmann manifolds, and geodesic distance analysis to identify key data points efficiently.
Contribution
The paper proposes a new SGGM framework combining sparse graph, Grassmann manifold, and geodesic distance for effective representative selection in Big Data.
Findings
Effective in video summarization tasks
Outperforms some state-of-the-art methods
Validated against human-judged ground truth
Abstract
This paper addresses the problem of identifying a very small subset of data points that belong to a significantly larger massive dataset (i.e., Big Data). The small number of selected data points must adequately represent and faithfully characterize the massive Big Data. Such identification process is known as representative selection [19]. We propose a novel representative selection framework by generating an l1 norm sparse graph for a given Big-Data dataset. The Big Data is partitioned recursively into clusters using a spectral clustering algorithm on the generated sparse graph. We consider each cluster as one point in a Grassmann manifold, and measure the geodesic distance among these points. The distances are further analyzed using a min-max algorithm [1] to extract an optimal subset of clusters. Finally, by considering a sparse subgraph of each selected cluster, we detect a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Analysis and Summarization · Advanced Image and Video Retrieval Techniques · Video Surveillance and Tracking Methods
