Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science
Omri Suissa, Avshalom Elmalech, Maayan Zhitomirsky-Geffet

TL;DR
This paper reviews how deep neural networks are transforming digital humanities by enabling advanced text analysis, discusses challenges like data availability, and offers guidance for selecting suitable deep learning methods.
Contribution
It provides a comprehensive analysis of DNN applications in DH, addresses key challenges, and proposes a decision model for researchers to adopt deep learning techniques effectively.
Findings
DNNs achieve high performance in NLP tasks relevant to DH.
Challenges include data scarcity and domain adaptation issues.
A practical decision model aids DH researchers in method selection.
Abstract
Combining computational technologies and humanities is an ongoing effort aimed at making resources such as texts, images, audio, video, and other artifacts digitally available, searchable, and analyzable. In recent years, deep neural networks (DNN) dominate the field of automatic text analysis and natural language processing (NLP), in some cases presenting a super-human performance. DNNs are the state-of-the-art machine learning algorithms solving many NLP tasks that are relevant for Digital Humanities (DH) research, such as spell checking, language detection, entity extraction, author detection, question answering, and other tasks. These supervised algorithms learn patterns from a large number of "right" and "wrong" examples and apply them to new examples. However, using DNNs for analyzing the text resources in DH research presents two main challenges: (un)availability of training data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Computational and Text Analysis Methods
