Adapting the Interrelated Two-way Clustering method for Quantitative Structure-Activity Relationship (QSAR) Modeling of a Diverse Set of Chemical Compounds
Subhabrata Majumdar, Subhash C. Basak, Gregory D. Grunwald

TL;DR
This paper adapts the Interrelated Two-way Clustering (ITC) method for QSAR modeling of diverse chemicals, using it for predictor selection before applying ridge regression to predict mutagenicity.
Contribution
It introduces an adaptation of ITC for predictor selection in QSAR modeling of chemical mutagenicity, combining it with ridge regression for improved prediction.
Findings
ITC-based predictor selection yields comparable results to previous methods.
The combined ITC and ridge regression approach effectively classifies mutagenic chemicals.
The method performs well with diverse chemical descriptors.
Abstract
Interrelated Two-way Clustering (ITC) is an unsupervised clustering method developed to divide samples into two groups in gene expression data obtained through microarrays, selecting important genes simultaneously in the process. This has been found to be a better approach than conventional clustering methods like K-means or self-organizing map for the scenarios when number of samples much smaller than number of variables (n<<p). In this paper we used the ITC approach for classification of a diverse set of 508 chemicals regarding mutagenicity. A large number of topological indices (TIs), 3-dimensional, and quantum chemical descriptors, as well as atom pairs (APs) have been used as explanatory variables. In this paper, ITC has been used only for predictor selection, after which ridge regression is employed to build the final predictive model. The proper leave-one-out (LOO) method of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
