QSAR Classification Modeling for Bioactivity of Molecular Structure via SPL-Logsum
Liang-Yong Xia, Qing-Yong Wang

TL;DR
This paper introduces SPL-Logsum, a novel QSAR classification method combining self-paced learning and Logsum regularization to improve bioactivity prediction and descriptor selection in molecular structure analysis.
Contribution
It proposes a new descriptor selection approach using SPL with Logsum penalized logistic regression, enhancing model accuracy and interpretability in QSAR bioactivity prediction.
Findings
Outperforms existing sparse methods in classification accuracy
Selects fewer meaningful molecular descriptors
Demonstrates effectiveness on multiple QSAR datasets
Abstract
Quantitative structure-activity relationship (QSAR) modelling is effective 'bridge' to search the reliable relationship related bioactivity to molecular structure. A QSAR classification model contains a lager number of redundant, noisy and irrelevant descriptors. To address this problem, various of methods have been proposed for descriptor selection. Generally, they can be grouped into three categories: filters, wrappers, and embedded methods. Regularization method is an important embedded technology, which can be used for continuous shrinkage and automatic descriptors selection. In recent years, the interest of researchers in the application of regularization techniques is increasing in descriptors selection , such as, logistic regression(LR) with penalty. In this paper, we proposed a novel descriptor selection method based on self-paced learning(SPL) with Logsum penalized LR for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Bioinformatics · Machine Learning in Materials Science
