Operator-Based Generalization Bound for Deep Learning: Insights on Multi-Task Learning
Mahdi Mohammadigohari, Giuseppe Di Fatta, Giuseppe Nicosia, Panos M. Pardalos

TL;DR
This paper introduces new operator-theoretic generalization bounds for multi-task deep learning, combining Koopman and Perron Frobenius operators with sketching techniques to improve performance guarantees and address overfitting and underfitting.
Contribution
It develops a novel operator-based framework for generalization bounds in multi-task deep learning, integrating Koopman and Perron Frobenius operators with sketching methods.
Findings
Tighter generalization guarantees than traditional bounds.
Effective sketching techniques for Koopman-based methods.
Performance guarantees for robust and quantile regression applications.
Abstract
This paper presents novel generalization bounds for vector-valued neural networks and deep kernel methods, focusing on multi-task learning through an operator-theoretic framework. Our key development lies in strategically combining a Koopman based approach with existing techniques, achieving tighter generalization guarantees compared to traditional norm-based bounds. To mitigate computational challenges associated with Koopman-based methods, we introduce sketching techniques applicable to vector valued neural networks. These techniques yield excess risk bounds under generic Lipschitz losses, providing performance guarantees for applications including robust and multiple quantile regression. Furthermore, we propose a novel deep learning framework, deep vector-valued reproducing kernel Hilbert spaces (vvRKHS), leveraging Perron Frobenius (PF) operators to enhance deep kernel methods. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Advanced Graph Neural Networks
