Using Tree Automata and Regular Expressions to Manipulate Hierarchically Structured Data
Nikita Schmidt, Ahmed Patel

TL;DR
This paper introduces a formal data model, built on finite string and tree automata and regular expressions, for manipulating and transforming hierarchically structured data. Its aim is to make data processing applications more flexible and interoperable.
Contribution
It extends classical automata theories to support structured tree-like data, enabling regular expression matching and substitution for hierarchical data processing.
Findings
The model generalizes both classical string automata and classical tree automata
It supports regular expression matching on hierarchically structured data
It provides a unified approach to structural and content transformations
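To make the idea of regular expression matching over a hierarchy concrete, here is a minimal sketch of a bottom-up tree automaton in which each rule constrains the sequence of child states with an ordinary string regular expression. This follows the general hedge-automaton style and is an illustrative assumption, not the paper's own formal definition. The `Node` class, the `RULES` table, the single-character state names, and the `person`/`list` labels are all hypothetical.

```python
import re
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    label: str
    children: List["Node"] = field(default_factory=list)


# Hypothetical rules: (node label, regex over the children's states, resulting state).
# Every state is a single character, so a plain string regex can describe
# the allowed sequence of child states.
RULES = [
    ("name", r"", "n"),       # leaf: no children allowed
    ("email", r"", "e"),      # leaf: no children allowed
    ("person", r"ne*", "p"),  # one name followed by any number of emails
    ("list", r"p+", "L"),     # one or more persons
]


def run(node: Node) -> Optional[str]:
    """Return the state assigned to `node`, or None if no rule applies."""
    child_states = []
    for child in node.children:
        state = run(child)
        if state is None:
            return None
        child_states.append(state)
    word = "".join(child_states)
    for label, pattern, state in RULES:
        if label == node.label and re.fullmatch(pattern, word):
            return state
    return None


def accepts(tree: Node, final_states=frozenset({"L"})) -> bool:
    """A tree is accepted when its root reaches a final state."""
    return run(tree) in final_states


good = Node("list", [
    Node("person", [Node("name"), Node("email"), Node("email")]),
    Node("person", [Node("name")]),
])
bad = Node("list", [Node("person", [Node("email"), Node("name")])])
```

In this sketch `accepts(good)` holds and `accepts(bad)` does not, because the second tree lists an email before the name. States are computed from the leaves upward as in a tree automaton, while the check on each node's children is a string-automaton problem, which is one way the two classical theories can be combined.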
Abstract
Information, stored or transmitted in digital form, is often structured. Individual data records are usually represented as hierarchies of their elements. Together, records form larger structures. Information processing applications have to take account of this structuring, which assigns different semantics to different data elements or records. The wide variety of structural schemata in use today often requires much flexibility from applications, for example to process information coming from different sources. To ensure application interoperability, translators are needed that can convert one structure into another. This paper puts forward a formal data model aimed at supporting hierarchical data processing in a simple and flexible way. The model is based on and extends the results of two classical theories, those of finite string automata and finite tree automata. The concept of finite automata and regular…
Taxonomy
Topics: Algorithms and Data Compression · Semigroups and Automata Theory · Network Packet Processing and Optimization
