The Learning and Communication Complexity of Subsequence Containment
Mason DiCicco, Daniel Reichman

TL;DR
This paper investigates the learning and communication complexities of determining whether a binary string contains a fixed subsequence, providing tight bounds and highlighting the impact of subsequence contiguity on complexity.
Contribution
It introduces asymptotically tight bounds for both the VC dimension and communication complexity of subsequence containment problems.
Findings
Sample complexity increases with non-contiguous subsequences.
Derived tight bounds for learning and communication complexities.
Subsequence contiguity significantly affects complexity measures.
Abstract
We consider the learning and communication complexity of subsequence containment. In the learning problem, we seek to learn a classifier that positively labels a binary string if it contains a fixed binary string as a subsequence. In the communication problem, and are partitioned between two players, Alice and Bob, who wish to determine if contains as a subsequence using a minimal amount of communication. We devise asymptotically tight bounds for the sample complexity (VC dimension) of the learning problem and the communication complexity of the communication problem. Our results illustrate that the sample complexity of our learning problem can be considerably larger when the subsequence occurs in non-contiguous locations.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputability, Logic, AI Algorithms · semigroups and automata theory · Machine Learning and Algorithms
