Temporal Analysis of Literary and Programming Prose
Brian Michalski, Mukkai Krishnamoorthy, Tsz-Yam Lau

TL;DR
This paper explores temporal patterns in literary and programming prose using Google n-gram viewer and commit logs, revealing societal insights and differences between literary works and code repositories.
Contribution
It introduces a method for analyzing time-based properties in both literature and software repositories, highlighting differences and providing a comparative framework.
Findings
Sunday is disproportionately referenced in literature.
Thursday is notably less popular as a reference day.
Temporal analysis can inform understanding of societal and software development patterns.
Abstract
Literary works reference a variety of globally shared themes including well-known people, events, and time periods. It is particularly interesting to locate patterns that are either invariant across time or exhibit a characteristic change across time, as they could imply something important about society that those works record. This paper suggests the use of Google n-gram viewer as a fast prototyping method for examining time-based properties over a rich sample of literary prose. Using this method, we find that some repeating periods of time, like Sunday, are referenced disproportionally, allowing us to pose questions such as why a day like Thursday is so unpopular. Furthermore, by treating software as a work of prose, we can apply a similar analysis to open-source software repositories and explore time-based relations in commit logs. Doing a simple statistical analysis on a few…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques · Language and cultural evolution · Digital Humanities and Scholarship
