A Text Mining Discovery of Similarities and Dissimilarities Among Sacred Scriptures
Younous Mofenjou Peuriekeu, Victoire Djimna Noyum, Cyrille Feudjio,, Alkan Goktug, Ernest Fokoue

TL;DR
This paper employs NLP and machine learning to analyze and classify sacred scriptures, revealing similarities and differences among texts from different religious traditions with high accuracy.
Contribution
It introduces a novel approach combining NLP and machine learning to compare sacred texts and classify them accurately based on their linguistic features.
Findings
Multinomial Naive Bayes achieved 85.84% accuracy in classifying sacred texts.
The study demonstrates the effectiveness of NLP and machine learning in religious text analysis.
Different machine learning models show varying success in classifying sacred scriptures.
Abstract
The careful examination of sacred texts gives valuable insights into human psychology, different ideas regarding the organization of societies as well as into terms like truth and God. To improve and deepen our understanding of sacred texts, their comparison, and their separation is crucial. For this purpose, we use our data set has nine sacred scriptures. This work deals with the separation of the Quran, the Asian scriptures Tao-Te-Ching, the Buddhism, the Yogasutras, and the Upanishads as well as the four books from the Bible, namely the Book of Proverbs, the Book of Ecclesiastes, the Book of Ecclesiasticus, and the Book of Wisdom. These scriptures are analyzed based on the natural language processing NLP creating the mathematical representation of the corpus in terms of frequencies called document term matrix (DTM). After this analysis, machine learning methods like supervised and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
