Algorithms for Boolean Matrix Factorization using Integer Programming and Heuristics
Christos Kolomvakis, Thomas Bobille, Arnaud Vandaele, Nicolas Gillis

TL;DR
This paper introduces new algorithms for Boolean matrix factorization using integer programming and heuristics, improving scalability and performance for applications like topic modeling and imaging.
Contribution
It presents novel AO algorithms with IP solutions, heuristics for scalability, and a fast data structure for Boolean matrices, advancing BMF techniques.
Findings
Algorithms outperform existing methods on real datasets
Heuristics improve scalability to large datasets
New data structure accelerates Boolean matrix operations
Abstract
Boolean matrix factorization (BMF) approximates a given binary input matrix as the product of two smaller binary factors. Unlike binary matrix factorization based on standard arithmetic, BMF employs the Boolean OR and AND operations for the matrix product, which improves interpretability and reduces the approximation error. It is also used in role mining and computer vision. In this paper, we first propose algorithms for BMF that perform alternating optimization (AO) of the factor matrices, where each subproblem is solved via integer programming (IP). We then design different approaches to further enhance AO-based algorithms by selecting an optimal subset of rank-one factors from multiple runs. To address the scalability limits of IP-based methods, we introduce new greedy and local-search heuristics. We also construct a new C++ data structure for Boolean vectors and matrices that is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Constraint Satisfaction and Optimization · Tensor decomposition and applications
