Clustering Boolean Tensors

Saskia Metzler; Pauli Miettinen

arXiv:1501.00696·cs.NA·September 19, 2016

Clustering Boolean Tensors

Saskia Metzler, Pauli Miettinen

PDF

Open Access

TL;DR

This paper explores the complexity of Boolean tensor factorizations, introduces a new clustering algorithm that handles high-dimensional data efficiently, and provides approximation guarantees for rank-1 factorizations.

Contribution

It analyzes the impact of mode partitioning on Boolean tensor factorization complexity and proposes a scalable clustering algorithm with approximation guarantees.

Findings

01

Algorithm achieves high scalability and similarity.

02

Provides a PTAS for certain factorizations.

03

Effective on synthetic and real-world data.

Abstract

Tensor factorizations are computationally hard problems, and in particular, are often significantly harder than their matrix counterparts. In case of Boolean tensor factorizations -- where the input tensor and all the factors are required to be binary and we use Boolean algebra -- much of that hardness comes from the possibility of overlapping components. Yet, in many applications we are perfectly happy to partition at least one of the modes. In this paper we investigate what consequences does this partitioning have on the computational complexity of the Boolean tensor factorizations and present a new algorithm for the resulting clustering problem. This algorithm can alternatively be seen as a particularly regularized clustering algorithm that can handle extremely high-dimensional observations. We analyse our algorithms with the goal of maximizing the similarity and argue that this is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications