Chain-linked multiple matrix integration via embedding alignment
Runbing Zheng, Minh Tang

TL;DR
This paper introduces CMMI, a novel method for integrating multiple noisy submatrices with block-wise missing data to accurately reconstruct the full matrix, using embedding alignment and aggregation techniques.
Contribution
The paper proposes the Chain-linked Multiple Matrix Integration (CMMI) method, a new approach for matrix completion from block-wise missing data using embedding alignment and aggregation.
Findings
CMMI achieves accurate matrix recovery with minimal overlaps.
The method provides theoretical error bounds and normal approximations.
Simulation and real data show CMMI is efficient and effective.
Abstract
Motivated by the increasing demand for multi-source data integration in various scientific fields, in this paper we study matrix completion in scenarios where the data exhibits certain block-wise missing structures -- specifically, where only a few noisy submatrices representing (overlapping) parts of the full matrix are available. We propose the Chain-linked Multiple Matrix Integration (CMMI) procedure to efficiently combine the information that can be extracted from these individual noisy submatrices. CMMI begins by deriving entity embeddings for each observed submatrix, then aligns these embeddings using overlapping entities between pairs of submatrices, and finally aggregates them to reconstruct the entire matrix of interest. We establish, under mild regularity conditions, entrywise error bounds and normal approximations for the CMMI estimates. Simulation studies and real data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsManufacturing Process and Optimization · graph theory and CDMA systems · Color Science and Applications
