Operator-Based Generalization Bound for Deep Learning: Insights on Multi-Task Learning

Mahdi Mohammadigohari; Giuseppe Di Fatta; Giuseppe Nicosia; Panos M. Pardalos

arXiv:2512.19184·cs.LG·December 23, 2025

Operator-Based Generalization Bound for Deep Learning: Insights on Multi-Task Learning

Mahdi Mohammadigohari, Giuseppe Di Fatta, Giuseppe Nicosia, Panos M. Pardalos

PDF

Open Access

TL;DR

This paper introduces new operator-theoretic generalization bounds for multi-task deep learning, combining Koopman and Perron Frobenius operators with sketching techniques to improve performance guarantees and address overfitting and underfitting.

Contribution

It develops a novel operator-based framework for generalization bounds in multi-task deep learning, integrating Koopman and Perron Frobenius operators with sketching methods.

Findings

01

Tighter generalization guarantees than traditional bounds.

02

Effective sketching techniques for Koopman-based methods.

03

Performance guarantees for robust and quantile regression applications.

Abstract

This paper presents novel generalization bounds for vector-valued neural networks and deep kernel methods, focusing on multi-task learning through an operator-theoretic framework. Our key development lies in strategically combining a Koopman based approach with existing techniques, achieving tighter generalization guarantees compared to traditional norm-based bounds. To mitigate computational challenges associated with Koopman-based methods, we introduce sketching techniques applicable to vector valued neural networks. These techniques yield excess risk bounds under generic Lipschitz losses, providing performance guarantees for applications including robust and multiple quantile regression. Furthermore, we propose a novel deep learning framework, deep vector-valued reproducing kernel Hilbert spaces (vvRKHS), leveraging Perron Frobenius (PF) operators to enhance deep kernel methods. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Model Reduction and Neural Networks · Advanced Graph Neural Networks