A Modular and Flexible Architecture for an Integrated Corpus Query System
Oliver Christ (IMS Stuttgart, Germany)

TL;DR
This paper presents a modular, extensible architecture for an integrated corpus query system that combines multiple knowledge sources and flexible information retrieval methods, enhancing customization and scalability.
Contribution
It introduces a flexible, modular architecture for a corpus query system that integrates diverse knowledge sources and retrieval methods, enabling extensibility.
Findings
Demonstrated modules within the architecture for corpus querying
Supported multiple knowledge sources for query evaluation
Achieved a flexible, extensible system design
Abstract
The paper describes the architecture of an integrated and extensible corpus query system developed at the University of Stuttgart and gives examples of some of the modules realized within this architecture. The modules form the core of a corpus workbench. Within the proposed architecture, information required for the evaluation of queries may be derived from different knowledge sources (the corpus text, databases, on-line thesauri) and by different means: either through direct lookup in a database or by calling external tools which may infer the necessary information at the time of query evaluation. The information available and the method of information access can be stated declaratively and individually for each corpus, leading to a flexible, extensible and modular corpus workbench.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies
