Six Llamas: Comparative Religious Ethics Through LoRA-Adapted Language Models
Chad Coleman, W. Russell Neuman, Manan Shah, Ali Dasdan, Matthew Crispi, Morris Chiang, Zack Leitman, Mustafa Poonawala

TL;DR
This study investigates whether language models fine-tuned on religious texts encode distinct ethical reasoning patterns aligned with their training traditions, using a comprehensive probing and stability analysis across multiple models and prompts.
Contribution
It introduces a novel comparative method employing LoRA-adapted language models trained on religious corpora to analyze cultural and ethical differences systematically.
Findings
LoRA models produce ethically differentiated reasoning patterns.
Models' moral logics align with their training traditions.
High response consistency for core dilemmas like the Trolley Problem.
Abstract
We present Six Llamas, a comparative study examining whether large language models fine-tuned on distinct religious corpora encode systematically different patterns of ethical reasoning. Six variants of Meta-Llama-3.1-8B are constructed: one unmodified control and five LoRA-adapted models trained exclusively on the sacred and theological texts of Christianity, Islam, Judaism, Hinduism, or Buddhism. All six models are probed with an identical battery of 17 standardized ethical prompts spanning moral dilemmas, game-theoretic scenarios, public policy questions, and moral-psychological self-assessments. To assess robustness and reproducibility, we implement a multi-temperature sampling design spanning ten temperature settings. We compute response consistency metrics, pairwise inter-model agreement rates, temperature sensitivity coefficients across four prompt domains, and run-to-run…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
