MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources
Farhad Akhbardeh, Travis Desell, Marcos Zampieri

TL;DR
MaintNet is an open-source library offering domain-specific logbook datasets and tools to advance NLP research in predictive maintenance across aviation, automotive, and facilities sectors.
Contribution
MaintNet introduces a collaborative platform with specialized datasets and processing tools for analyzing maintenance logbooks, addressing the challenges that technical, non-standard text poses for NLP.
Findings
Provides novel logbook datasets from the aviation, automotive, and facilities domains
Includes tools for data preprocessing and clustering
Encourages community sharing and discussion
Abstract
Maintenance record logbooks are an emerging text type in NLP. They typically consist of free-text documents with many domain-specific technical terms, abbreviations, as well as non-standard spelling and grammar, which pose difficulties to NLP pipelines trained on standard corpora. Analyzing and annotating such documents is of particular importance in the development of predictive maintenance systems, which aim to provide operational efficiencies, prevent accidents and save lives. In order to facilitate and encourage research in this area, we have developed MaintNet, a collaborative open-source library of technical and domain-specific language datasets. MaintNet provides novel logbook data from the aviation, automotive, and facilities domains along with tools to aid in their (pre-)processing and clustering. Furthermore, it provides a way to encourage discussion on and sharing of new…
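To make the preprocessing and clustering idea concrete, here is a minimal sketch of how abbreviation-heavy logbook entries might be normalized and then grouped by token overlap. It is not MaintNet's implementation: the abbreviation map, the sample entries, the function names (`normalize`, `jaccard`, `cluster`), and the 0.5 similarity threshold are all assumptions made for illustration, and greedy Jaccard grouping stands in for whatever clustering method the library actually uses.

```python
import re

# Hypothetical abbreviation map for illustration; not taken from MaintNet.
ABBREVIATIONS = {
    "eng": "engine",
    "cyl": "cylinder",
    "repl": "replaced",
    "l/h": "left",
    "r/h": "right",
}


def normalize(entry):
    """Lowercase an entry, split it into tokens, and expand known abbreviations."""
    tokens = re.findall(r"[a-z0-9/#]+", entry.lower())
    return [ABBREVIATIONS.get(token, token) for token in tokens]


def jaccard(a, b):
    """Jaccard similarity between two token lists, treated as sets."""
    set_a, set_b = set(a), set(b)
    if not set_a and not set_b:
        return 1.0
    return len(set_a & set_b) / len(set_a | set_b)


def cluster(entries, threshold=0.5):
    """Greedy grouping: each entry joins the first cluster whose
    representative (its first member) is similar enough, else starts a new one."""
    clusters = []  # list of (representative_tokens, member_entries)
    for entry in entries:
        tokens = normalize(entry)
        for representative, members in clusters:
            if jaccard(tokens, representative) >= threshold:
                members.append(entry)
                break
        else:
            clusters.append((tokens, [entry]))
    return [members for _, members in clusters]


if __name__ == "__main__":
    # Made-up sample entries, not drawn from any MaintNet dataset.
    sample = ["ENG #2 CYL REPL", "engine #2 cylinder replaced", "L/H tire worn"]
    for group in cluster(sample):
        print(group)
```

Expanding abbreviations before comparing entries is what lets the first two sample entries land in the same group even though their raw text shares almost no tokens.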
