A Document-based Knowledge Discovery with Microservices Architecture
Habtom Kahsay Gidey, Mario Kesseler, Patrick Stangl, Peter Hillmann,, Andreas Karcher

TL;DR
This paper presents a microservices-based approach for knowledge discovery from digitized organizational data, focusing on keyword extraction, document similarity, and natural language queries, with an emphasis on scalability and resilience.
Contribution
It introduces a novel microservices architecture for document-based knowledge discovery, including design guidelines and an implementation demonstrator.
Findings
The approach effectively extracts keywords and calculates document similarities.
The system supports natural language queries and language-independent information provision.
The demonstrator meets scalability and resilience requirements.
Abstract
The first step towards digitalization within organizations lies in digitization - the conversion of analog data into digitally stored data. This basic step is the prerequisite for all following activities like the digitalization of processes or the servitization of products or offerings. However, digitization itself often leads to 'data-rich' but 'knowledge-poor' material. Knowledge discovery and knowledge extraction as approaches try to increase the usefulness of digitized data. In this paper, we point out the key challenges in the context of knowledge discovery and present an approach to addressing these using a microservices architecture. Our solution led to a conceptual design focusing on keyword extraction, similarity calculation of documents, database queries in natural language, and programming language independent provision of the extracted information. In addition, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Service-Oriented Architecture and Web Services · Data Mining Algorithms and Applications
