Tensor Neyman-Pearson Classification: Theory, Algorithms, and Error Control
Lingchong Liu, Elynn Chen, Yuefeng Han, Lucy Xia

TL;DR
This paper introduces a novel Tensor Neyman-Pearson classification framework that guarantees finite-sample control of type I error in tensor data, with applications to biochemical molecular structure classification.
Contribution
It develops the first Tensor-NP classification method with finite-sample error control, combining tensor geometry analysis, a new estimator, and deep learning techniques for multiway data.
Findings
Maintains type I error at preset levels across datasets
Achieves competitive type II error performance
Provides reliable asymmetric-risk decision tools in biochemistry
Abstract
Biochemical discovery increasingly relies on classifying molecular structures when the consequences of different errors are highly asymmetric. In mutagenicity and carcinogenicity, misclassifying a harmful compound as benign can trigger substantial scientific, regulatory, and health risks, whereas false alarms primarily increase laboratory workload. Modern representations transform molecular graphs into persistence image tensors that preserve multiscale geometric and topological structure, yet existing tensor classifiers and deep tensor neural networks provide no finite-sample guarantees on type I error and often exhibit severe error inflation in practice. We develop the first Tensor Neyman-Pearson (Tensor-NP) classification framework that achieves finite-sample control of type I error while exploiting the multi-mode structure of tensor data. Under a tensor-normal mixture model, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Anomaly Detection Techniques and Applications · Topological and Geometric Data Analysis
