Generalization Bounds for Dependent Data using Online-to-Batch Conversion
Sagnik Chatterjee, Manuj Mukherjee, Alhad Sethi

TL;DR
This paper derives generalization bounds for batch learning algorithms trained on dependent data from mixing processes, using an online-to-batch conversion framework and a new Wasserstein-based stability notion, applicable without stability assumptions.
Contribution
It introduces a novel online-to-batch conversion method that removes stability requirements, enabling bounds for any batch learner on dependent data, with bounds comparable to the i.i.d. case.
Findings
Bounds depend on the decay rate of the mixing process
EWA algorithm satisfies the new stability notion
Bounds are applicable to any batch learning algorithm
Abstract
In this work, we upper bound the generalization error of batch learning algorithms trained on samples drawn from a mixing stochastic process (i.e., a dependent data source) both in expectation and with high probability. Unlike previous results by Mohri et al. (2010) and Fu et al. (2023), our work does not require any stability assumptions on the batch learner, which allows us to derive upper bounds for any batch learning algorithm trained on dependent data. This is made possible due to our use of the Online-to-Batch ( OTB ) conversion framework, which allows us to shift the burden of stability from the batch learner to an artificially constructed online learner. We show that our bounds are equal to the bounds in the i.i.d. setting up to a term that depends on the decay rate of the underlying mixing stochastic process. Central to our analysis is a new notion of algorithmic stability for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Compression Techniques · Image and Signal Denoising Methods · Neural Networks and Applications
