Using Artificial Neural Networks to Determine Ontologies Most Relevant   to Scientific Texts

Luk\'a\v{s} Korel; Alexander S. Behr; Norbert Kockmann; Martin; Hole\v{n}a

arXiv:2309.09203·cs.NE·September 19, 2023·1 cites

Using Artificial Neural Networks to Determine Ontologies Most Relevant to Scientific Texts

Luk\'a\v{s} Korel, Alexander S. Behr, Norbert Kockmann, Martin, Hole\v{n}a

PDF

Open Access

TL;DR

This paper explores using neural networks, specifically transformer embeddings and various classifiers, to identify the most relevant ontologies for scientific texts, demonstrating the effectiveness of SVM in catalysis research.

Contribution

It introduces a method combining transformer embeddings with multiple classifiers to determine relevant ontologies for scientific texts, highlighting SVM's superior performance.

Findings

01

Support vector machine achieved the best classification results.

02

Random forest classifier performed the worst.

03

The approach is effective for catalysis research texts.

Abstract

This paper provides an insight into the possibility of how to find ontologies most relevant to scientific texts using artificial neural networks. The basic idea of the presented approach is to select a representative paragraph from a source text file, embed it to a vector space by a pre-trained fine-tuned transformer, and classify the embedded vector according to its relevance to a target ontology. We have considered different classifiers to categorize the output from the transformer, in particular random forest, support vector machine, multilayer perceptron, k-nearest neighbors, and Gaussian process classifiers. Their suitability has been evaluated in a use case with ontologies and scientific texts concerning catalysis research. From results we can say the worst results have random forest. The best results in this task brought support vector machine classifier.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques

MethodsGaussian Process