Local kernel canonical correlation analysis with application to virtual drug screening
Daniel Samarov, J. S. Marron, Yufeng Liu, Christopher Grulke,, Alexander Tropsha

TL;DR
This paper introduces a novel kernel-based canonical correlation analysis method called IKCCA, which significantly improves virtual drug screening accuracy by effectively filtering large compound libraries using spectral learning techniques.
Contribution
The paper proposes the Indefinite Kernel CCA (IKCCA), a new spectral learning-based approach that enhances virtual screening performance over existing methods.
Findings
IKCCA outperforms existing virtual screening methods in predictive accuracy.
The approach demonstrates strong results on both toy and real-world datasets.
Significant improvements in filtering large compound libraries for drug discovery.
Abstract
Drug discovery is the process of identifying compounds which have potentially meaningful biological activity. A major challenge that arises is that the number of compounds to search over can be quite large, sometimes numbering in the millions, making experimental testing intractable. For this reason computational methods are employed to filter out those compounds which do not exhibit strong biological activity. This filtering step, also called virtual screening reduces the search space, allowing for the remaining compounds to be experimentally tested. In this paper we propose several novel approaches to the problem of virtual screening based on Canonical Correlation Analysis (CCA) and on a kernel-based extension. Spectral learning ideas motivate our proposed new method called Indefinite Kernel CCA (IKCCA). We show the strong performance of this approach both for a toy problem as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
