Critical biblical studies via word frequency analysis: unveiling text authorship
Shira Faigenbaum-Golovin, Alon Kipnis, Axel B\"uhler, Eli Piasetzky,, Thomas R\"omer, Israel Finkelstein

TL;DR
This study employs statistical word frequency analysis to differentiate authorship in biblical texts, revealing distinct linguistic patterns and improving attribution accuracy without prior assumptions.
Contribution
It introduces a sensitive statistical method for authorship attribution in biblical texts, highlighting differences among authors and aligning with expert assessments.
Findings
D and DtrH authors are more closely related than P.
High accuracy in authorship attribution based on word frequency similarities.
Statistically significant evidence of distinct linguistic features among biblical authors.
Abstract
The Bible, a product of an extensive and intricate process of oral-written transmission spanning centuries, obscures the contours of its earlier recensions. Debate rages over determining the existing layers and identifying the date of composition and historical background of the biblical texts. Traditional manual methodologies have grappled with authorship challenges through scrupulous textual criticism, employing linguistic, stylistic, inner-biblical, and historical criteria. Despite recent progress in computer-assisted analysis, many patterns still need to be uncovered in Biblical Texts. In this study, we address the question of authorship of biblical texts by employing statistical analysis to the frequency of words using a method that is particularly sensitive to deviations in frequencies associated with a few words out of potentially many. We aim to differentiate between three…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Spam and Phishing Detection
