TL;DR
This paper introduces scalable, near-optimal methods for linear predictive clustering in non-separable spaces by reducing complex optimization problems to more efficient formulations, improving accuracy and scalability.
Contribution
It develops two novel approaches that approximate LPC efficiently using MIP and QPBO reductions, with theoretical error bounds and improved scalability.
Findings
Achieves near-optimal solutions with lower regression errors.
Demonstrates superior scalability over existing MIP methods.
Provides effective approximations in synthetic and real-world datasets.
Abstract
Linear Predictive Clustering (LPC) partitions samples based on shared linear relationships between feature and target variables, with numerous applications including marketing, medicine, and education. Greedy optimization methods, commonly used for LPC, alternate between clustering and linear regression but lack global optimality. While effective for separable clusters, they struggle in non-separable settings where clusters overlap in feature space. In an alternative constrained optimization paradigm, Bertsimas and Shioda (2007) formulated LPC as a Mixed-Integer Program (MIP), ensuring global optimality regardless of separability but suffering from poor scalability. This work builds on the constrained optimization paradigm to introduce two novel approaches that improve the efficiency of global optimization for LPC. By leveraging key theoretical properties of separability, we derive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
