# Accelerated Dual Learning by Homotopic Initialization

**Authors:** Hadi Daneshmand, Hamed Hassani, Thomas Hofmann

arXiv: 1706.03958 · 2017-06-14

## TL;DR

This paper explores how strategic initialization in dual and non-convex optimization accelerates convergence in machine learning, supported by theoretical analysis and experimental validation.

## Contribution

It introduces novel initialization strategies for dual and non-convex problems, leveraging data set properties to improve optimization speed.

## Key findings

- Proper initialization can significantly speed up convergence.
- Data correlation properties influence initial optimization progress.
- Proposed heuristics are experimentally validated.

## Abstract

Gradient descent and coordinate descent are well understood in terms of their asymptotic behavior, but less so in a transient regime often used for approximations in machine learning. We investigate how proper initialization can have a profound effect on finding near-optimal solutions quickly. We show that a certain property of a data set, namely the boundedness of the correlations between eigenfeatures and the response variable, can lead to faster initial progress than expected by commonplace analysis. Convex optimization problems can tacitly benefit from that, but this automatism does not apply to their dual formulation. We analyze this phenomenon and devise provably good initialization strategies for dual optimization as well as heuristics for the non-convex case, relevant for deep learning. We find our predictions and methods to be experimentally well-supported.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.03958/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1706.03958/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/1706.03958/full.md

---
Source: https://tomesphere.com/paper/1706.03958