A Goodness-of-fit Test on the Number of Biclusters in a Relational Data Matrix
Chihiro Watanabe, Taiji Suzuki

TL;DR
This paper introduces a new statistical test to accurately determine the number of biclusters in relational data matrices without assuming a regular-grid structure, improving interpretability and applicability.
Contribution
The study proposes a novel bicluster number test that relaxes the regular-grid assumption and derives its asymptotic properties for better biclustering inference.
Findings
Effective in synthetic data
Applicable to real relational data
Improves bicluster interpretability
Abstract
Biclustering is a method for detecting homogeneous submatrices in a given observed matrix, and it is an effective tool for relational data analysis. Although there are many studies that estimate the underlying bicluster structure of a matrix, few have enabled us to determine the appropriate number of biclusters in an observed matrix. Recently, a statistical test on the number of biclusters has been proposed for a regular-grid bicluster structure, where we assume that the latent bicluster structure can be represented by row-column clustering. However, when the latent bicluster structure does not satisfy such regular-grid assumption, the previous test requires a larger number of biclusters than necessary (i.e., a finer bicluster structure than necessary) for the null hypothesis to be accepted, which is not desirable in terms of interpreting the accepted bicluster structure. In this study,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
