Efficiency versus Convergence of Boolean Kernels for On-Line Learning Algorithms
R. Khardon, D. Roth, R. A. Servedio

TL;DR
This paper examines the tradeoff between computational efficiency and learning effectiveness when using Boolean kernels with Perceptron and Winnow algorithms, highlighting limitations in efficiency and mistake bounds.
Contribution
It introduces and analyzes Boolean kernels for Perceptron and Winnow, demonstrating their computational tradeoffs and proving hardness results for kernel-based Winnow learning of DNF.
Findings
Boolean kernels enable efficient Perceptron training over exponential feature spaces
Perceptron with Boolean kernels can make exponentially many mistakes on simple functions
Kernel-based Winnow cannot efficiently simulate its behavior for learning DNF due to computational hardness
Abstract
The paper studies machine learning problems where each example is described using a set of Boolean features and where hypotheses are represented by linear threshold elements. One method of increasing the expressiveness of learned hypotheses in this context is to expand the feature set to include conjunctions of basic features. This can be done explicitly or where possible by using a kernel function. Focusing on the well known Perceptron and Winnow algorithms, the paper demonstrates a tradeoff between the computational efficiency with which the algorithm can be run over the expanded feature space and the generalization ability of the corresponding learning algorithm. We first describe several kernel functions which capture either limited forms of conjunctions or all conjunctions. We show that these kernels can be used to efficiently run the Perceptron algorithm over a feature space of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
