GreCon3: Mitigating High Resource Utilization of GreCon Algorithms for Boolean Matrix Factorization
Petr Kraj\v{c}a, Martin Trnecka

TL;DR
GreCon3 is an improved Boolean matrix factorization algorithm that significantly reduces memory usage and computation time, enabling efficient analysis of large datasets by introducing novel data structures and strategies.
Contribution
This paper presents GreCon3, a new version of the GreCon algorithm that enhances efficiency and memory management through innovative data structures and initialization strategies.
Findings
GreCon3 outperforms GreCon2 in speed and memory efficiency.
The new data structure reduces memory consumption significantly.
GreCon3 enables factorization of larger datasets previously infeasible.
Abstract
Boolean matrix factorization (BMF) is a fundamental tool for analyzing binary data and discovering latent information hidden in the data. Formal Concept Analysis (FCA) provides us with an essential insight into BMF and the design of algorithms. Due to FCA, we have the GreCon and GreCon2 algorithms providing high-quality factorizations at the cost of high memory consumption and long running times. In this paper, we introduce GreCon3, a substantial revision of these algorithms, significantly improving both computational efficiency and memory usage. These improvements are achieved with a novel space-efficient data structure that tracks unprocessed data. Further, a novel strategy incrementally initializing this data structure is proposed. This strategy reduces memory consumption and omits data irrelevant to the remainder of the computation. Moreover, we show that the first factors can be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Bayesian Modeling and Causal Inference · Gene expression and cancer classification
