New advances in enumerative biclustering algorithms with online partitioning
Rosana Veroneze, Fernando J. Von Zuben

TL;DR
This paper introduces RIn-Close_CVC3, an improved biclustering algorithm that performs online partitioning, reducing memory and runtime, handling missing data, and working with mixed data types for more informative results.
Contribution
The paper presents RIn-Close_CVC3, an extension of RIn-Close_CVC, with online partitioning, enhanced efficiency, and support for diverse data types, formally proved and experimentally validated.
Findings
Significant reduction in memory usage.
Improved runtime performance.
Effective handling of missing and mixed data types.
Abstract
This paper further extends RIn-Close_CVC, a biclustering algorithm capable of performing an efficient, complete, correct and non-redundant enumeration of maximal biclusters with constant values on columns in numerical datasets. By avoiding a priori partitioning and itemization of the dataset, RIn-Close_CVC implements an online partitioning, which is demonstrated here to guide to more informative biclustering results. The improved algorithm is called RIn-Close_CVC3, keeps those attractive properties of RIn-Close_CVC, as formally proved here, and is characterized by: a drastic reduction in memory usage; a consistent gain in runtime; additional ability to handle datasets with missing values; and additional ability to operate with attributes characterized by distinct distributions or even mixed data types. The experimental results include synthetic and real-world datasets used to perform…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Rough Sets and Fuzzy Logic · Data Management and Algorithms
