Reflexivity in Issues of Scale and Representation in a Digital   Humanities Project

Annie T. Chen; Camille Lyans Cole

arXiv:2109.14184·cs.CL·September 30, 2021·1 cites

Reflexivity in Issues of Scale and Representation in a Digital Humanities Project

Annie T. Chen, Camille Lyans Cole

PDF

Open Access

TL;DR

This paper discusses challenges and considerations in developing a digital humanities pipeline that integrates NLP, data analysis, and visualization for a longitudinal personal diary corpus, emphasizing issues of scale and representation.

Contribution

It highlights the conceptual and practical challenges of representing and visualizing a single-person, multi-decade diary corpus in digital humanities research.

Findings

01

Addressed issues of data representation and visualization in a longitudinal diary corpus

02

Identified conceptual challenges in scale and historical interpretation

03

Explored team-based approaches to data analysis in digital humanities

Abstract

In this paper, we explore issues that we have encountered in developing a pipeline that combines natural language processing with data analysis and visualization techniques. The characteristics of the corpus - being comprised of diaries of a single person spanning several decades - present both conceptual challenges in terms of issues of representation, and affordances as a source for historical research. We consider these issues in a team context with a particular focus on the generation and interpretation of visualizations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Humanities and Scholarship · Computational and Text Analysis Methods · Data Visualization and Analytics