Counting Subwords and Regular Languages
Charles J. Colbourn, Ryan E. Dougherty, Thomas F. Lidbetter and, Jeffrey Shallit

TL;DR
This paper investigates conditions under which languages defined by equal or ordered subword counts are regular, providing a precise criterion and an efficient method to verify it.
Contribution
It establishes a necessary and sufficient condition for the regularity of languages based on subword occurrence counts and offers an efficient checking procedure.
Findings
Characterizes when such subword-count languages are regular.
Provides an algorithm to verify the regularity condition.
Clarifies the structure of languages defined by subword occurrence constraints.
Abstract
Let and be words. We consider the languages whose words are those for which the numbers of occurrences of and , as subwords of , are the same (resp., the number of 's is less than the number of 's, resp., is less than or equal). We give a necessary and sufficient condition on and for these languages to be regular, and we show how to check this condition efficiently.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Algorithms and Data Compression · DNA and Biological Computing
