An Aggregate and Iterative Disaggregate Algorithm with Proven Optimality in Machine Learning

Young Woong Park; Diego Klabjan

arXiv:1607.01400·stat.ML·January 23, 2017

TL;DR

This paper introduces an iterative clustering-based algorithm that aggregates and disaggregates data to efficiently solve certain machine learning optimization problems, with proven optimality and convergence guarantees.

Contribution

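The paper presents an aggregation-disaggregation algorithm with model-specific procedures for least absolute deviation regression, support vector machines, and semi-supervised support vector machines, and it proves optimality, convergence, and a bound on the optimality gap at each iteration.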

Findings

01

Algorithm achieves optimal solutions in tested cases.

02

Proven convergence and bounds on optimality gap.

03

Applied to least absolute deviation regression, support vector machines, and semi-supervised support vector machines.

Abstract

We propose a clustering-based iterative algorithm to solve certain optimization problems in machine learning, where we start the algorithm by aggregating the original data, solving the problem on aggregated data, and then in subsequent steps gradually disaggregate the aggregated data. We apply the algorithm to common machine learning problems such as the least absolute deviation regression problem, support vector machines, and semi-supervised support vector machines. We derive model-specific data aggregation and disaggregation procedures. We also show optimality, convergence, and the optimality gap of the approximated solution in each iteration. A computational study is provided.
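
The aggregate, solve, disaggregate loop described in the abstract can be illustrated on a deliberately tiny case: intercept-only least absolute deviation, where the task is to find b minimizing the sum of |y_i - b|. The sketch below is not the authors' procedure. The function names (`weighted_median`, `lad_objective`, `aid_intercept_lad`), the initial clustering into contiguous chunks, and the splitting rule (split any cluster whose points lie on both sides of the current solution) are all assumptions made for this toy example.

```python
import math


def weighted_median(values, weights):
    """Return a minimizer of sum_k weights[k] * |values[k] - b| over b."""
    half = sum(weights) / 2.0
    acc = 0.0
    for v, w in sorted(zip(values, weights)):
        acc += w
        if acc >= half:
            return v
    raise ValueError("empty input")


def lad_objective(y, b):
    """Sum of absolute deviations of the original data from b."""
    return sum(abs(v - b) for v in y)


def aid_intercept_lad(y, n_clusters=2):
    """Toy aggregate-and-iterative-disaggregate loop (illustrative only).

    Each cluster is aggregated into its mean, weighted by its size.
    Returns (b, final number of clusters, number of iterations).
    """
    ys = sorted(y)
    size = math.ceil(len(ys) / n_clusters)
    # Assumed initial aggregation: contiguous chunks of the sorted data.
    clusters = [ys[i:i + size] for i in range(0, len(ys), size)]
    iterations = 0
    while True:
        iterations += 1
        # Solve the problem on the aggregated data.
        means = [sum(c) / len(c) for c in clusters]
        weights = [len(c) for c in clusters]
        b = weighted_median(means, weights)
        # Disaggregate clusters whose points straddle the current solution.
        refined, split = [], False
        for c in clusters:
            below = [v for v in c if v < b]
            if below and any(v > b for v in c):
                refined.append(below)
                refined.append([v for v in c if v >= b])
                split = True
            else:
                refined.append(c)
        if not split:
            # Every cluster lies on one side of b, so the aggregated
            # objective equals the original objective at b.
            return b, len(clusters), iterations
        clusters = refined


if __name__ == "__main__":
    data = [1.0, 2.0, 3.0, 4.0, 100.0]
    b, k, its = aid_intercept_lad(data)
    print(b, k, its, lad_objective(data, b))
```

In this toy setting the stopping test is enough for optimality: by the triangle inequality the aggregated objective never exceeds the original one, and the two coincide at b once no cluster straddles it. The loop always terminates because each split strictly increases the number of clusters, which cannot exceed the number of data points.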
